LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 3 days ago • 50
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 3 days ago • 50
Word Form Matters: LLMs' Semantic Reconstruction under Typoglycemia Paper • 2503.01714 • Published 6 days ago • 5
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 13 items • Updated 9 days ago • 8
GeoPixel Collection Pixel Grounding Large Multimodal Model in Remote Sensing • 5 items • Updated 12 days ago • 1
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection Paper • 2305.14902 • Published May 24, 2023 • 1