Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

updated a collection about 7 hours ago
✨ MoE models
liked a model about 7 hours ago
deepseek-ai/DeepSeek-V3-Base
liked a model about 15 hours ago
THUDM/cogagent-9b-20241220
View all activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Chinese Localization's profile picture Huggingface Projects's profile picture Blog-explorers's profile picture ICCV2023's profile picture Open LLM Leaderboard's profile picture huggingPartyParis's profile picture Qwen's profile picture Journalists on Hugging Face's profile picture Women on Hugging Face's profile picture Social Post Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Hugging Face for Legal's profile picture

AdinaY's activity

posted an update 1 day ago
view post
Post
1370
QvQ-72B-PreviewπŸŽ„ an open weight model for visual reasoning just released by Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
posted an update 10 days ago
view post
Post
496
Megrez-3B-Omni πŸ”₯ an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
✨Supports analysis of image, text, and audio modalities
✨Leads in bilingual speech ( English & Chinese ) input, multi-turn conversations, and voice-based queries
✨Outperforms in scene understanding and OCR across major benchmarks
reacted to qq8933's post with πŸ”₯πŸ‘€ 14 days ago
view post
Post
2521
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
Β·
reacted to julien-c's post with ❀️πŸ”₯ 15 days ago
view post
Post
7612
After some heated discussion πŸ”₯, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co./docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in Machine learning, to the benefit of the community πŸ”₯

cc: @reach-vb @pierric @victor and the HF team
Β·
reacted to m-ric's post with πŸ”₯ 16 days ago
view post
Post
2221
Last week was crazy in OS AI, with important models and datasets releases every day.

Here are the most important ones I've pinned:

🌎 Cohere relased GLobal-MMLU, a multilingual version of MMLU, to evaluate AI models' world knowledge in many languages!

πŸ¦™ Meta released Llama-3.3-70B-Instruct, a 70B model that's on par with Llama-3.1-405B-Instruct, GPT-4o and Claude. Probably my new go-to for agentic workflows.

πŸ”‰ FishAudio released fish-speech-1.5, multilingual text to speech model

🎨 Microsoft Research released TRELLIS, an extremely impressive image-to-3D model, which you can try here: JeffreyXiang/TRELLIS

πŸ“š Yesterday, Hugging Face release FineWeb 2, a new version that extends the previous FineWeb to over 1000 languages, including extended coverage in Russina, Mandarin, German, Japanese, Spanish, French, so a huge, high-quality dataset of > 3 trillion words! HuggingFaceFW/fineweb-2

Now let's go build to make this week as productive as last one!
reacted to davidberenstein1957's post with πŸ”₯ 16 days ago
view post
Post
2047
Open Preference Dataset for Text-to-Image Generation by the πŸ€— Community

Open Image Preferences is an Apache 2.0 licensed dataset for text-to-image generation. This dataset contains 10K text-to-image preference pairs across common image generation categories, while using different model families and varying prompt complexities.

https://huggingface.co./blog/image-preferences
reacted to thomwolf's post with πŸš€ 17 days ago
view post
Post
4336
We are proud to announce HuggingFaceFW/fineweb-2: A sparkling update to HuggingFaceFW/fineweb with 1000s of πŸ—£οΈlanguages.

We applied the same data-driven approach that led to SOTA English performance in🍷 FineWeb to thousands of languages.

πŸ₯‚ FineWeb2 has 8TB of compressed text data and outperforms other multilingual datasets in our experiments.

The dataset is released under the permissive πŸ“œ ODC-By 1.0 license, and the πŸ’» code to reproduce it and our evaluations is public.

We will very soon announce a big community project, and are working on a πŸ“ blogpost walking you through the entire dataset creation process. Stay tuned!

In the mean time come ask us question on our chat place: HuggingFaceFW/discussion

H/t @guipenedo @hynky @lvwerra as well as @vsabolcec Bettina Messmer @negar-foroutan and @mjaggi
  • 2 replies
Β·
posted an update 17 days ago
view post
Post
868
Updates from the Chinese community last week πŸ”₯

LLM:
✨ Sailor 2 , multilingual model supporting 10+ South Asian languages by Sea AI Lab. https://huggingface.co./sailor2

MLLM:
✨InternVL 2.5 , new open multimodal LLM by OpenGVLab
https://huggingface.co./collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c
✨Qwen2-VL 2B/7B/72B base model, the latest iteration of our Qwen-VL model by Alibaba Qwen
Qwen/qwen2-vl-66cee7455501d7126940800d

Video model:
✨HunyuanVideo , 13B open video model by Tencent
tencent/HunyuanVideo

Reasoning model:
✨ LLaMA-O1 πŸ¦™ base & supervised model; pretrain & finetune datasets and demo all released
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

Audio model:
✨Fish Speech 1.5, Text-to-speech in 13 languages, trained on 1M+ hours of audio by FishAudio
fishaudio/fish-speech-1.5
✨ClearVoice, An advanced voice processing framework by Alibaba Tongyi SpeechAI https://huggingface.co./alibabasglab

More details πŸ‘‰ https://huggingface.co./zh-ai-community
posted an update 22 days ago
view post
Post
1572
Sailor 2 🚒 open multilingual model for Southeast Asia by Sea AI LabπŸ”₯
https://huggingface.co./sailor2
sail/Sailor2-20B-Chat

✨ Fully open code & ALL datasets πŸ™Œ
✨ 1B/ 8B/20B base & chat expanded on Qwen2.5
✨ Apache 2.0
✨ Supports 15 languages including English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and WarayπŸ‡¬πŸ‡§πŸ‡¨πŸ‡³πŸ‡±πŸ‡¦πŸ‡²πŸ‡ΎπŸ‡²πŸ‡²πŸ‡»πŸ‡³πŸ‡ΉπŸ‡­
posted an update 22 days ago
view post
Post
1465
2023 & 2024 Top Downloaded (all time) Open Models on the hub are both from the Chinese community πŸ‘€

2023 πŸ‘‰ Bge base by BAAI
BAAI/bge-base-en-v1.5
2024 πŸ‘‰ Qwen 2.5 by Alibaba Qwen
Qwen/Qwen2.5-1.5B-Instruct

Can’t wait to see what incredible models the Chinese community will bring in 2025πŸš€

✨ Follow https://huggingface.co./zh-ai-community to get the latest updates from the Chinese community
✨ Explore the 2024 Year in Review huggingface/open-source-ai-year-in-review-2024
replied to qq8933's post 23 days ago
view reply

Congrats on the release πŸŽ‰πŸ”₯

reacted to qq8933's post with πŸ€—πŸš€ 23 days ago
view post
Post
3037
  • 3 replies
Β·
posted an update 23 days ago
view post
Post
1331
HunyuanVideo πŸ“Ή The new open video generation model by Tencent!
πŸ‘‰ tencent/HunyuanVideo
zh-ai-community/video-models-666afd86cfa4e4dd1473b64c
✨ 13B parameters: Probably the largest open video model to date
✨ Unified architecture for image & video generation
✨ Powered by advanced features: MLLM Text Encoder, 3D VAE, and Prompt Rewrite
✨ Delivers stunning visuals, diverse motion, and unparalleled stability
πŸ”“ Fully open with code & weights
posted an update 27 days ago
view post
Post
1104
Zhipu AI, the Chinese generative AI startup behind CogVideo, just launched their first productized AI Agent - AutoGLM πŸ”₯
πŸ‘‰ https://agent.aminer.cn

With simple text or voice commands, it:
✨ Simulates phone operations effortlessly
✨ Autonomously handles 50+ step tasks
✨ Seamlessly operates across apps

Powered by Zhipu's "Decoupled Interface" and "Self-Evolving Learning Framework" to achieve major performance gains in Phone Use and Web Browser Use!

Meanwhile, GLM4-Edge is now on Hugging Face hubπŸš€
πŸ‘‰ THUDM/glm-edge-6743283c5809de4a7b9e0b8b
Packed with advanced dialogue + multimodal models:
πŸ“± 1.5B / 2B models: Built for mobile & in-car systems
πŸ’» 4B / 5B models: Optimized for PCs
replied to qq8933's post 27 days ago
reacted to qq8933's post with πŸš€ 27 days ago
view post
Post
1343
LLaMA-O1 Base and SFT model will be uploaded to HF today.
RLHF pipeline already ready, still waiting for data sampling.
  • 1 reply
Β·
posted an update 28 days ago
view post
Post
1586
🌊 The wave of reasoning models from the Chinese community has arrived!

πŸš€ Marco-o1 by AIDC, Alibaba
πŸ‘‰ AIDC-AI/Marco-o1

✨ QwQ by Qwen, Alibaba
πŸ‘‰ Qwen/qwq-674762b79b75eac01735070a

🌟 Skywork-o1 by Kunlun Tech
πŸ‘‰ Skywork/skywork-o1-open-67453df58e12f6c3934738d0

πŸ”₯ Xkev/Llama-3.2V-11B-cot by PKU Yuan group
πŸ‘‰ Xkev/Llama-3.2V-11B-cot

πŸ’‘ DeepSeek-R1-Lite-Preview by DeepSeek AI
πŸ‘‰ https://chat.deepseek.com/

πŸ” InternThinker Preview by Shanghai AI Lab
πŸ‘‰ https://sso.openxlab.org.cn/login?redirect=https://internlm-chat.intern-ai.org.cn/&clientId=ebmrvod6yo0nlzaek1yp

πŸ“˜ k0-math by Moonshot AI
πŸš€ https://kimi.moonshot.cn/ ( coming soon! )

Who's next? πŸ‘€
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7