Dmitrii Stoianov

heylimon

DimaStoyanov

AI & ML interests

None yet

heylimon's activity

upvoted a paper 17 days ago

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published 21 days ago • 15

upvoted 2 papers about 1 month ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 58

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 51

upvoted a collection about 1 month ago

RL/Alignment

Collection

197 items • Updated Jun 18, 2024 • 25

upvoted a paper about 1 month ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

New activity in t-tech/T-lite-it-1.0 3 months ago

How to run the quantized model?

#6 opened 3 months ago by

Without69

liked 2 datasets 3 months ago

google/trueteacher

Viewer • Updated Dec 26, 2023 • 1.38M • 157 • 20

lytang/LLM-AggreFact

Viewer • Updated Dec 20, 2024 • 59.7k • 1.3k • 21

upvoted a paper 3 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

commented on a paper 3 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

New activity in t-tech/T-lite-it-1.0 3 months ago

Special Tokens

#4 opened 3 months ago by

strangelex42

New activity in t-tech/T-pro-it-1.0 3 months ago

Context size

#5 opened 3 months ago by

deksden

New activity in AnatoliiPotapov/T-lite-instruct-0.1 3 months ago

Fix ignored 'add_generation_prompt' in the chat template

#12 opened 6 months ago by

heylimon

liked a dataset 3 months ago

O1-OPEN/OpenO1-SFT

Viewer • Updated Dec 17, 2024 • 77.7k • 1.44k • 358

liked 2 datasets 5 months ago

nyuuzyou/EMERCOM-questions

Viewer • Updated Feb 23, 2024 • 25.7k • 59 • 1

nyuuzyou/9111-questions

Preview • Updated Feb 19, 2024 • 49 • 6

upvoted a paper 7 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 90

commented on a paper 7 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 90

New activity in AnatoliiPotapov/T-lite-0.1 7 months ago

Further fine-tuning

#1 opened 7 months ago by

alamacra

liked a Space 9 months ago

853

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training