5 74 19

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper 6 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

upvoted a paper 18 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

upvoted a paper 20 days ago

Region-Adaptive Sampling for Diffusion Transformers

View all activity

Organizations

None yet

Ha0's activity

upvoted a paper 6 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 6 days ago • 65

upvoted a paper 18 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 20 days ago • 76

upvoted 2 papers 20 days ago

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published 23 days ago • 52

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99

upvoted a paper about 2 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 84

liked a model about 2 months ago

kyutai/helium-1-preview-2b

Text Generation • Updated Jan 14 • 906 • 139

upvoted 3 papers 3 months ago

liked a model 3 months ago

google-bert/bert-large-uncased

Fill-Mask • Updated Feb 19, 2024 • 2.48M • • 128

upvoted 2 papers 3 months ago

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 35

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 26

liked a Space 3 months ago

2.02k

Anychat

🏢

Display code snippets for different chat providers

upvoted 3 papers 4 months ago

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Paper • 2411.08033 • Published Nov 12, 2024 • 22

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 68

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51

upvoted a paper 5 months ago

FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors

Paper • 2410.16271 • Published Oct 21, 2024 • 81

liked a model 5 months ago

meta-llama/Llama-3.2-90B-Vision-Instruct

Image-Text-to-Text • Updated 5 days ago • 22.5k • • 326

upvoted 2 papers 5 months ago

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Paper • 2410.13830 • Published Oct 17, 2024 • 25

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14, 2024 • 55