Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
hexgradΒ 
posted an update 6 days ago
Post
5096
πŸ“£ Looking for labeled, high-quality synthetic audio/TTS data πŸ“£ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❀️

More details at hexgrad/Kokoro-82M#21

TLDR: 🚨 Trade Offer 🚨
I receive: Synthetic Audio w/ Text Labels
You receive: Trained Voicepacks for an 82M Apache TTS model
Join https://discord.gg/QuGxSWBfQy to discuss

Β·

In what kind of format do you want this?

Hi, i test it today. Nice work. Will be ther german to in future?

Β·

It's simple: what you put in is what you get out. πŸ˜„ German support in the future depends mostly on how much German data (synthetic audio + text labels) is contributed.

tell me about quantum machanic

If you are looking for Arabic data, There are Common Voice data , SADA, MASC , MGB-2 , MGB-3 and MGB-5

δ½ ε₯½οΌŒζˆ‘ζ˜―θ…Ύηš‡