Maziyar Panahi's picture

Maziyar Panahi PRO

MaziyarPanahi

·

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

upvoted a collection about 2 hours ago

updated a dataset about 5 hours ago

MaziyarPanahi/llama-3.1-tulu-3-70b-preference-mixture

published a dataset about 5 hours ago

MaziyarPanahi/llama-3.1-tulu-3-70b-preference-mixture

View all activity

Organizations

MaziyarPanahi's activity

upvoted a collection about 2 hours ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated about 22 hours ago • 194

upvoted an article about 16 hours ago

Article

We now support VLMs in smolagents!

4 days ago

• 49

upvoted a paper 6 days ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 80

upvoted an article 6 days ago

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Dec 16, 2024

• 89

upvoted a paper 11 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 18 days ago • 80

upvoted an article 12 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

13 days ago

• 125

upvoted a collection 13 days ago

InternLM3

6 items • Updated 11 days ago • 21

upvoted an article 15 days ago

Article

Mastering Tensor Dimensions in Transformers

By

•

15 days ago

• 40

upvoted a collection 19 days ago

Phi-4

Phi-4 small language model. • 2 items • Updated 19 days ago • 45

upvoted a collection 27 days ago

GIANTS

Frankenstein and giant models merged! • 11 items • Updated 26 days ago • 4

upvoted a collection about 1 month ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 18 days ago • 26

upvoted an article about 1 month ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 26

upvoted a paper about 1 month ago

NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Paper • 2402.15343 • Published Feb 23, 2024 • 13

upvoted a collection about 1 month ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127

upvoted 3 collections about 2 months ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated 14 days ago • 65

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 21 days ago • 75

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28

upvoted an article about 2 months ago

Article

They Said It Couldn’t Be Done

By

•

Dec 5, 2024

• 77

upvoted a paper about 2 months ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21, 2024 • 30

upvoted a collection about 2 months ago

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8, 2024 • 21