Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

updated a collection about 4 hours ago
2025 January
replied to their post about 6 hours ago
🔥So many exciting releases coming from the Chinese community this month! https://huggingface.co./collections/zh-ai-community/2025-january-6786b054f492fb223591269e LLMs: ✨ Qwen2.5 -1M by Alibaba https://huggingface.co./collections/Qwen/qwen25-1m-679325716327ec07860530ba ✨ InternLM3-8B-Instruct by Shanghai AI Lab https://huggingface.co./internlm/internlm3-8b-instruct ✨ MiniMax-Text-01 by MiniMax AI https://huggingface.co./MiniMaxAI/MiniMax-Text-01 ✨ RWKV-7 by BlinkDL -- RNN + Transformer 👀 https://huggingface.co./BlinkDL/rwkv-7-world ✨ DeepSeek-R1 by DeepSeek -- THE ONE 🙌 https://huggingface.co./deepseek-ai ✨ Baichuan-M1-14B by Baichuan - Medical 🩺 https://huggingface.co./baichuan-inc/Baichuan-M1-14B-Base ✨ Qwen2.5-Math-PRM by Alibaba - Math 🔢 https://huggingface.co./Qwen/Qwen2.5-Math-PRM-7B Code: ✨ Tare by Bytedance https://trae.ai TTS: ✨ T2A-01-HD by MiniMax AI https://hailuo.ai/audio ✨ LLaSA by HKUST Audio https://huggingface.co./HKUSTAudio/Llasa-3B MLLM: ✨ Kimi k1.5 by Moonshot AI https://kimi.ai ✨ MiniCPM-o-2_6 by OpenBMB https://huggingface.co./openbmb/MiniCPM-o-2_6 ✨ Sa2VA-4B by ByteDance https://huggingface.co./ByteDance/Sa2VA-4B ✨ VideoLLaMA 3 by Alibaba DAMO https://huggingface.co./collections/DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15 ✨ LLaVA-Mini by Chinese Academy of Sciences https://huggingface.co./ICTNLP/llava-mini-llama-3.1-8b ✨Hunyuan-7B by Tencent https://huggingface.co./tencent/Hunyuan-7B-Instruct ✨ Hunyuan 3D 2.0 by Tencent https://huggingface.co./tencent/Hunyuan3D-2 ✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM 👀 https://huggingface.co./MiniMaxAI/MiniMax-VL-01 Agent: ✨ UI-TARS by Bytedance https://huggingface.co./bytedance-research/UI-TARS-7B-SFT ✨ GLM-PC by Zhipu AI https://cogagent.aminer.cn Dataset: ✨ Fineweb-Edu-Chinese by Opencsg https://huggingface.co./datasets/opencsg/Fineweb-Edu-Chinese-V2.1 ✨ Multimodal_textbook by Alibaba https://huggingface.co./datasets/DAMO-NLP-SG/multimodal_textbook ✨ MME-Finance by Hithink AI
View all activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Chinese Localization's profile picture Huggingface Projects's profile picture Blog-explorers's profile picture ICCV2023's profile picture Open LLM Leaderboard's profile picture huggingPartyParis's profile picture Qwen's profile picture Women on Hugging Face's profile picture Journalists on Hugging Face's profile picture Social Post Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Hugging Face for Legal's profile picture

AdinaY's activity

replied to their post about 6 hours ago
reacted to clem's post with 🤗🔥 about 6 hours ago
view post
Post
1091
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!
replied to their post about 9 hours ago
view reply

Good catch! Qwen team also mentioned there'll be some surprise. 👀

posted an update about 18 hours ago
view post
Post
1031
🔥So many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
✨ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
✨ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
✨ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
✨ RWKV-7 by BlinkDL -- RNN + Transformer 👀
BlinkDL/rwkv-7-world
✨ DeepSeek-R1 by DeepSeek -- THE ONE 🙌
https://huggingface.co./deepseek-ai
✨ Baichuan-M1-14B by Baichuan - Medical 🩺
baichuan-inc/Baichuan-M1-14B-Base
✨ Qwen2.5-Math-PRM by Alibaba - Math 🔢
Qwen/Qwen2.5-Math-PRM-7B

Code:
✨ Tare by Bytedance
https://trae.ai

TTS:
✨ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
✨ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
✨ Kimi k1.5 by Moonshot AI
https://kimi.ai
✨ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
✨ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
✨ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
✨Hunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
✨ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM 👀
MiniMaxAI/MiniMax-VL-01

Agent:
✨ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
✨ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
✨ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
✨ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
✨ MME-Finance by Hithink AI
  • 3 replies
·
reacted to Kseniase's post with 🔥 1 day ago
view post
Post
1861
7 Open-source Methods to Improve Video Generation and Understanding

AI community is making great strides toward achieving the full potential of multimodality in video generation and understanding. Last week studies showed that working with videos is now one of the main focuses for improving AI models. Another highlight of the week is that open source, once again, proves its value. For those who were impressed by DeepSeek-R1, we’re with you!

Today, we’re combining these two key focuses and bringing you a list of open-source methods for better video generation and understanding:

1. VideoLLaMA 3 model: Excels in various video and image tasks thanks to vision-centric training approach. VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (2501.13106)

2. FILMAGENT framework assigns roles to multiple AI agents, like a director, screenwriter, actor, and cinematographer, to automate the filmmaking process in 3D virtual environments. FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces (2501.12909)

3. Improving Video Generation with Human Feedback (2501.13918) proposes a new VideoReward Model and approach that uses human feedback to refine video generation models.

4. DiffuEraser video inpainting model, based on stable diffusion, is designed to fill in missing areas with detailed, realistic content and to ensure consistent structures across frames. DiffuEraser: A Diffusion Model for Video Inpainting (2501.10018)

5. MAGI is a hybrid video gen model that combines masked and casual modeling. Its key innovation, Complete Teacher Forcing (CTF), conditions masked frames on fully visible frames. Taming Teacher Forcing for Masked Autoregressive Video Generation (2501.12389)

6. Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise (2501.08331) proposes motion control, allowing users to guide how objects or the camera move in generated videos. Its noise warping algorithm replaces random noise in videos with structured noise based on motion info.

7. Video Depth Anything model estimates depth consistently in super-long videos (several minutes or more) without sacrificing quality or speed. Video Depth Anything: Consistent Depth Estimation for Super-Long Videos (2501.12375)