221 537 415

Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

updated a collection about 4 hours ago

2025 January

replied to their post about 6 hours ago

reacted to clem's post with 🤗 about 6 hours ago

AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!

View all activity

Articles

Organizations

AdinaY's activity

updated a collection about 4 hours ago

2025 January

Collection

29 items • Updated about 4 hours ago • 10

replied to their post about 6 hours ago

What a month! 🤯
Let’s add the two more major releases to the list!

✨Qwen2.5-VL by Alibaba
https://huggingface.co./collections/Qwen/qwen25-vl-6795ffac22b334a837c0f9a5

✨Janus-pro by DeepSeek
https://huggingface.co./deepseek-ai/Janus-Pro-7B

reacted to clem's post with 🤗🔥 about 6 hours ago

Post

1091

AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!

updated a collection about 6 hours ago

2025 January

Collection

29 items • Updated about 4 hours ago • 10

upvoted a collection about 6 hours ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 3 items • Updated about 22 hours ago • 195

replied to their post about 9 hours ago

Good catch! Qwen team also mentioned there'll be some surprise. 👀

updated a collection about 9 hours ago

2025 January

Collection

29 items • Updated about 4 hours ago • 10

liked a model about 9 hours ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated about 10 hours ago • 935

updated a collection about 9 hours ago

2025 January

Collection

29 items • Updated about 4 hours ago • 10

liked a model about 9 hours ago

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated about 10 hours ago • 144

upvoted a collection about 17 hours ago

2025 January

Collection

29 items • Updated about 4 hours ago • 10

posted an update about 18 hours ago

Post

1031

🔥So many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
✨ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
✨ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
✨ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
✨ RWKV-7 by BlinkDL -- RNN + Transformer 👀
BlinkDL/rwkv-7-world
✨ DeepSeek-R1 by DeepSeek -- THE ONE 🙌
https://huggingface.co./deepseek-ai
✨ Baichuan-M1-14B by Baichuan - Medical 🩺
baichuan-inc/Baichuan-M1-14B-Base
✨ Qwen2.5-Math-PRM by Alibaba - Math 🔢
Qwen/Qwen2.5-Math-PRM-7B

Code:
✨ Tare by Bytedance
https://trae.ai

TTS:
✨ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
✨ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
✨ Kimi k1.5 by Moonshot AI
https://kimi.ai
✨ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
✨ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
✨ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
✨Hunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
✨ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM 👀
MiniMaxAI/MiniMax-VL-01

Agent:
✨ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
✨ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
✨ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
✨ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
✨ MME-Finance by Hithink AI

3 replies

updated a collection 1 day ago

🖼️ 2025 MLLMs

Collection

6 items • Updated 1 day ago

reacted to Kseniase's post with 🔥 1 day ago

Post

1861

7 Open-source Methods to Improve Video Generation and Understanding

AI community is making great strides toward achieving the full potential of multimodality in video generation and understanding. Last week studies showed that working with videos is now one of the main focuses for improving AI models. Another highlight of the week is that open source, once again, proves its value. For those who were impressed by DeepSeek-R1, we’re with you!

Today, we’re combining these two key focuses and bringing you a list of open-source methods for better video generation and understanding:

1. VideoLLaMA 3 model: Excels in various video and image tasks thanks to vision-centric training approach. VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (2501.13106)

2. FILMAGENT framework assigns roles to multiple AI agents, like a director, screenwriter, actor, and cinematographer, to automate the filmmaking process in 3D virtual environments. FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces (2501.12909)

3. Improving Video Generation with Human Feedback (2501.13918) proposes a new VideoReward Model and approach that uses human feedback to refine video generation models.

4. DiffuEraser video inpainting model, based on stable diffusion, is designed to fill in missing areas with detailed, realistic content and to ensure consistent structures across frames. DiffuEraser: A Diffusion Model for Video Inpainting (2501.10018)

5. MAGI is a hybrid video gen model that combines masked and casual modeling. Its key innovation, Complete Teacher Forcing (CTF), conditions masked frames on fully visible frames. Taming Teacher Forcing for Masked Autoregressive Video Generation (2501.12389)

6. Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise (2501.08331) proposes motion control, allowing users to guide how objects or the camera move in generated videos. Its noise warping algorithm replaces random noise in videos with structured noise based on motion info.

7. Video Depth Anything model estimates depth consistently in super-long videos (several minutes or more) without sacrificing quality or speed. Video Depth Anything: Consistent Depth Estimation for Super-Long Videos (2501.12375)

Adina Yakefu

AI & ML interests

Recent Activity

Articles

A Short Summary of Chinese AI Global Expansion

A Short Summary of Chinese AI Global Expansion

Exploring the Daily Papers Page on Hugging Face

Organizations

AdinaY's activity