Philipp Schmid's picture

Philipp Schmid

philschmid

·

https://www.philschmid.de

AI & ML interests

None yet

Recent Activity

updated a dataset 4 days ago

philschmid/gretel-synthetic-text-to-sql

published a dataset 4 days ago

philschmid/gretel-synthetic-text-to-sql

liked a dataset 6 days ago

GeneralReasoning/GeneralThought-195K

View all activity

Organizations

Posts 2

Post

8009

New state-of-the-art open LLM! 🚀 Databricks just released DBRX, a 132B MoE trained on 12T tokens. Claiming to surpass OpenAI GPT-3.5 and is competitive with Google Gemini 1.0 Pro. 🤯

TL;DR
🧮 132B MoE with 16 experts with 4 active in generation
🪟 32 000 context window
📈 Outperforms open LLMs on common benchmarks, including MMLU
🚀 Up to 2x faster inference than Llama 2 70B
💻 Trained on 12T tokens
🔡 Uses the GPT-4 tokenizer
📜 Custom License, commercially useable

Collection: databricks/dbrx-6601c0852a0cdd3c59f71962
Demo: https://huggingface.co./spaces/databricks/dbrx-instruct

Kudos to the Team at Databricks and MosaicML for this strong release in the open community! 🤗

Articles 48

Article

42

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

View all Articles

Collections 4

Papers 1

arxiv:2109.02846

spaces 7

Llm Pricing

Generate React TypeScript App

Pdf To Structured Data

PDF to Structured Data powered by Google DeepMind Gemini 2.0

Can I run on TGI

Igel Playground

Furiosa Ai Ocr

No application file

Test Gradio

models 147

philschmid/qwen-2.5-3b-r1-countdown

Text Generation • Updated Jan 30 • 282 • 7

philschmid/qwen-3b-r1-aha-moment

philschmid/dpo-llama-3-1-8b-math

Text Generation • Updated Jan 23 • 10

philschmid/dpo-llama-3-1-8b-math-ep3-merged

Updated Jan 23 • 9

philschmid/dpo-llama-3-1-8b-math-ep3

philschmid/modernbert-llm-router

Text Classification • Updated Dec 25, 2024 • 50 • 1

philschmid/llama-3-1-8b-math-orca-spectrum-10k-ep1

Text Generation • Updated Dec 19, 2024 • 436

philschmid/llama-3-1-8b-math-orca-qlora-10k-ep1-merged

Updated Nov 29, 2024 • 7

philschmid/llama-3-1-8b-math-orca-qlora-10k-ep1

Updated Nov 29, 2024

philschmid/meta-llama-3.1-8b-instruct-trt-l4

Updated Nov 4, 2024

datasets 49

philschmid/gretel-synthetic-text-to-sql

Viewer • Updated 4 days ago • 106k • 43

philschmid/pdf-samples

Viewer • Updated Feb 6 • 3 • 245

philschmid/philschmid-llama-3-1-8b-math-orca-spectr-philschmid-DMath-candidates

Viewer • Updated Jan 22 • 1.98k • 172

philschmid/DMath

Viewer • Updated Jan 21 • 7.94k • 166 • 1

philschmid/AIME_1983_2024

Viewer • Updated Jan 21 • 933 • 61

philschmid/ocr-invoice-data

Viewer • Updated Jan 15 • 2.24k • 67

philschmid/open-orca-10k-guidellm

Viewer • Updated Oct 9, 2024 • 10k • 70 • 1

philschmid/amazon-product-descriptions-vlm

Viewer • Updated Sep 30, 2024 • 1.35k • 797 • 9

philschmid/slimorca-deduped-cleaned-corrected-chatml

Viewer • Updated Sep 17, 2024 • 182k • 54

philschmid/open-orca-250-guidellm

Viewer • Updated Sep 17, 2024 • 250 • 71