Ji-Ha (Ji-Ha)

upvoted a paper 4 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 52

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 20 days ago • 636

upvoted a collection 7 months ago

MatMulfree LM

Collection

Pre-trined models for Matmulfree LM. • 4 items • Updated Jun 10 • 25

upvoted a paper 7 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23 • 36

upvoted a collection 8 months ago

DeepSeek-Math

Collection

DeepSeek Math series • 4 items • Updated Aug 16 • 12

upvoted a paper 8 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 119

upvoted a collection 8 months ago

WizardLM

Collection

0 items • Updated Jul 11 • 102

upvoted 10 papers 9 months ago

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21 • 51

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Paper • 2403.14520 • Published Mar 21 • 33

Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 50

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20 • 78

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 63

upvoted 3 papers 10 months ago

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 75

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 125

Video Editing via Factorized Diffusion Distillation

Paper • 2403.09334 • Published Mar 14 • 21

Ji-Ha

AI & ML interests

Organizations

Ji-Ha's activity

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Llama 3.1

MatMulfree LM

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

DeepSeek-Math

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

WizardLM

DiJiang: Efficient Large Language Models through Compact Kernelization

GAIA: a benchmark for General AI Assistants

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Exponentially Faster Language Modelling

DreamReward: Text-to-3D Generation with Human Preference

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Evolutionary Optimization of Model Merging Recipes

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

ORPO: Monolithic Preference Optimization without Reference Model

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Video Editing via Factorized Diffusion Distillation