https://huggingface.co./papers/2501.03006
A sample demonstration of building with thinking LLMs
Erase any object just by naming it!
3D Generation from text prompts
automated video and sound synthesis from images
WebGPU text-to-Speech powered by OuteTTS and Transformers.js