25 167 578

Florent Daudens

fdaudens

AI & ML interests

AI & Journalism

Recent Activity

posted an update 3 days ago

AI will bring us "a country of yes-men on servers" instead of one of "Einsteins sitting in a data center" if we continue on current trends. Must-read by @thomwolf deflating overblown AI promises and explaining what real scientific breakthroughs require. https://thomwolf.io/blog/scientific-ai.html

liked a model 4 days ago

Qwen/QwQ-32B

upvoted an article 4 days ago

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

View all activity

Organizations

fdaudens's activity

posted an update 3 days ago

Post

3517

AI will bring us "a country of yes-men on servers" instead of one of "Einsteins sitting in a data center" if we continue on current trends.

Must-read by @thomwolf deflating overblown AI promises and explaining what real scientific breakthroughs require.

https://thomwolf.io/blog/scientific-ai.html

2 replies

liked a model 4 days ago

Qwen/QwQ-32B

Text Generation • Updated 2 days ago • 103k • • 1.66k

upvoted an article 4 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

20 days ago

• 93

liked a model 4 days ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 11 days ago • 186k • • 949

liked a model 5 days ago

THUDM/CogView4-6B

Text-to-Image • Updated 5 days ago • 6.84k • 166

liked a Space 9 days ago

310

AI Deadlines

⚡

Schedule tasks efficiently using AI-generated deadlines

posted an update 9 days ago

Post

3397

What if AI becomes as ubiquitous as the internet, but runs locally and transparently on our devices?

Fascinating TED talk by @thomwolf on open source AI and its future impact.

Imagine this for AI: instead of black box models running in distant data centers, we get transparent AI that runs locally on our phones and laptops, often without needing internet access. If the original team moves on? No problem - resilience is one of the beauties of open source. Anyone (companies, collectives, or individuals) can adapt and fix these models.

This is a compelling vision of AI's future that solves many of today's concerns around AI transparency and centralized control.

Watch the full talk here: https://www.ted.com/talks/thomas_wolf_what_if_ai_just_works

1 reply

liked a Space 10 days ago

1.03k

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

updated a Space 10 days ago

The Essential AI Toolkit

🧰

A curated collection of AI tools for journalists & creators

liked a Space 10 days ago

PhineSpeechTranslator

👀

Break the language barrier

liked a model 10 days ago

microsoft/Phi-4-mini-instruct

Text Generation • Updated 4 days ago • 102k • 320

liked a Space 10 days ago

662

Open ASR Leaderboard

🏆

Request evaluation of a speech recognition model

upvoted a collection 10 days ago

olmOCR

Collection

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 11 days ago • 89

liked a model 10 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 231k • 1.03k

liked a Space 10 days ago

Phi4 Multimodal

🦀

Space demoing Phi4 MultiModal

posted an update 11 days ago

Post

3060

Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet?

Open source olmOCR just dropped and the results are impressive.

Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. The speed is impressive: 3000 tokens/second on your own GPU - that's 1/32 the cost of GPT-4o ($190/million pages). Game-changer for content extraction and digital archives.

To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images.

Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up.

👉 Try the demo: https://olmocr.allenai.org

Going right into the AI toolkit: JournalistsonHF/ai-toolkit