Statistical speech synthesis
Generate speech quality score from audio
Transcribe spoken Japanese to text
Evaluate audio quality with MOS score