arxiv:2409.18216
Xinyi Bai
wwwbxy123
AI & ML interests
LLM, GenAI, Speech, SVC
Recent Activity
authored
a paper
2 days ago
Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in
New Paradigm
authored
a paper
2 days ago
Towards Rationality in Language and Multimodal Agents: A Survey
authored
a paper
2 days ago
MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following
Benchmark
Organizations
models
None public yet
datasets
None public yet