Hieu Ngo

hiieu

AI & ML interests

Applied, Post-Training LLM

Recent Activity

updated a model about 1 month ago

hiieu/R1_tool_call_Distill-Qwen-1.5B

updated a model about 1 month ago

hiieu/R1_tool_call_Distill-Qwen-7B

hiieu's activity

upvoted a collection about 1 month ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 55

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 95

upvoted a collection about 2 months ago

HuatuoGPT-o1

4 items • Updated Dec 30, 2024 • 16

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a collection about 2 months ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

upvoted 5 papers 4 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 53

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 24

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 84

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 20

upvoted 2 articles 5 months ago

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Article • Oct 20, 2024 • 36

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • 2 authors • Oct 14, 2024 • 77

upvoted a paper 6 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 48

upvoted a collection 6 months ago

Gemma 2 ChatQA RAG finetuned

1 item • Updated Sep 2, 2024 • 1

upvoted an article 7 months ago

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Article • Aug 21, 2024 • 31

upvoted 2 papers 7 months ago

Synthesizing Text-to-SQL Data from Weak and Strong LLMs

Paper • 2408.03256 • Published Aug 6, 2024 • 11

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 25

upvoted a collection 7 months ago

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Dec 13, 2024 • 11

upvoted a paper 8 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a collection 8 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated Jan 17 • 60