Anton Lozhkov's picture

Anton Lozhkov

anton-l

·

AI & ML interests

Generative Models, Distributed Training, Photo and Video Enhancement

Recent Activity

new activity about 6 hours ago

HuggingFaceTB/finemath:Create ？

new activity 2 days ago

HuggingFaceTB/finemath:[bot] Conversion to Parquet

updated a dataset 2 days ago

HuggingFaceTB/math_tasks

View all activity

Articles

SmolLM - blazingly fast and remarkably powerful

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

StarCoder2 and The Stack v2

Organizations

anton-l's activity

upvoted a paper 4 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 124

upvoted an article 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 292

upvoted an article 6 months ago

Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

Jun 24

• 33

upvoted a paper 6 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 87

upvoted a collection 7 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 34

upvoted a paper 10 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

upvoted a paper about 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123

upvoted a paper over 1 year ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 32