Text-to-Speech dataset Collection Malay Text-to-Speech dataset, gathered from crawled audiobooks, online TTS and synthetic cloning. • 14 items • Updated 3 days ago • 1
mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3-timestamp Preview • Updated 6 days ago • 299
Multi-Lingual Malaysian Embedding: Leveraging Large Language Models for Semantic Representations Paper • 2402.03053 • Published Feb 5, 2024 • 2
Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding Paper • 2401.13565 • Published Jan 24, 2024 • 4