NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Paper • 2403.03100 • Published Mar 5
MooER: LLM-based Speech Recognition and Translation Models from Moore Threads Paper • 2408.05101 • Published Aug 9