Tiezhen WANG's picture

Tiezhen WANG

xianbao

AI & ML interests

This is my personal account

Recent Activity

updated a collection 2 days ago
πŸ”Š Audio Models
reacted to merve's post with πŸ”₯ 2 days ago
Oof, what a week! πŸ₯΅ So many things have happened, let's recap! https://huggingface.co./collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal πŸ’¬ - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG πŸ’— - UI-TARS are new models by ByteDance to unlock agentic GUI control 🀯 in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs πŸ“– - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🀯 - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio πŸ—£οΈ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation ⏯️ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images
liked a model 2 days ago
tencent/Hunyuan3D-2
View all activity

Articles

Organizations

Hugging Face's profile picture 🧨Diffusers's profile picture Peking University's profile picture Multimodal Art Projection's profile picture OpenDILab's profile picture RWKV's profile picture Fengshenbang-LM's profile picture Zhejiang University's profile picture Luotuo Chinese Language Model's profile picture paddle-diffusion-hackathon's profile picture ChallengeHub's profile picture DreamBooth Hackathon's profile picture baixing's profile picture Hugging Face Chinese Localization's profile picture Webhooks Explorers (BETA)'s profile picture OpenMMLab's profile picture EuroPython 2022's profile picture Fish Audio's profile picture BigCode's profile picture Hack Engine's profile picture Hugging Face OSS Metrics's profile picture Blog-explorers's profile picture Qwen's profile picture agi-hackathon's profile picture My Test Org's profile picture Social Post Explorers's profile picture Zhejiang Gongshang University's profile picture Dev Mode Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Tencent Hunyuan's profile picture FunAudioLLM's profile picture tiezhen test's profile picture ZP's profile picture Hugging Face Party @ PyTorch Conference's profile picture Rhymes.AI's profile picture MiniMax's profile picture

Posts 7

view post
Post
1874
With the open-weight release of CogVideoX-5B from THUDM, i.e. GLM team, the Video Generation Model (how about calling it VGM) field has officially became the next booming "LLM"

What does the landscape look like? What are other video generation models? This collection below is all your need.

xianbao/video-generation-models-66c350163c74f60f5c412af6

The above video is generated by @a-r-r-o-w with CogVideoX-5B, taken from a nice lookout for the field!
view post
Post
1918
Why Apache 2.0 Matters for LLMs πŸ€”

@01AI_Yi recently switched from a permissive & commercially friendly license, to Apache 2.0. And the community loved it! πŸš€

@JustinLin610 also had a poll on model license and the majority votes for Apache 2.0.

Why it is a Big Deal? ⬇️

πŸ“š Legal Simplicity: Custom licenses need costly & time-consuming legal review. Apache 2.0 is well-known & easier for legal teams to handle.

πŸ‘©β€πŸ’» Developer-Friendly: Legal docs are a pain for devs! Apache 2.0 is well-known and tech-friendly, making it easier for non-native developers to understand the implications too.

πŸ”— Easier Integration: Apache 2.0 is compatible with many other licenses, simplifying tasks like model merging with models of different licensing requirements.

🚫 No Permission Needed: Custom licenses often require explicit permission and additional documentation work of filling forms, creating barriers. Apache 2.0 removes this hurdle, letting devs focus on innovation.

There are a lot interesting discussions from
@JustinLin610 's poll: https://x.com/JustinLin610/status/1793559737482764375 which inspired this thread.

Any other thoughts? Let me know ^^