Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

upvoted an article 4 days ago
view article
Article

Introducing the SQL Console on Datasets

11
upvoted an article 25 days ago
view article
Article

Scaling robotics datasets with video encoding

31
upvoted an article 26 days ago
view article
Article

Deep Learning over the Internet: Training Language Models Collaboratively

4
upvoted 2 articles about 1 month ago
view article
Article

⭐ PySpark and 🤗 Hugging Face Parquet Files

By asoria
5
view article
Article

XetHub is joining Hugging Face!

76
upvoted an article 2 months ago
view article
Article

WWDC 24: Running Mistral 7B with Core ML

54
upvoted 6 articles 2 months ago
view article
Article

Docmatix - a huge dataset for Document Visual Question Answering

64
view article
Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

17
view article
Article

Enhancing Search Capabilities for Non-English Datasets in the Dataset Viewer

By asoria
4
view article
Article

Experimenting with Automatic PII Detection on the Hub using Presidio

23
upvoted 2 articles 3 months ago
view article
Article

Announcing New Dataset Search Features

22
upvoted 3 articles 3 months ago
view article
Article

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

12
view article
Article

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted an article 4 months ago
upvoted an article 5 months ago
view article
Article

Synthetic data: save money, time and carbon with open source

45
upvoted an article 6 months ago
view article
Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

23