Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
espnet
's Collections
OWSM: Fully Open Speech Recognition and Translation Models
OWLS: Scaling Laws for Speech Recognition and Translation
OWSM-CTC: Ultra-Fast Speech Foundation Models
Neural Codecs
XEUS Model and Data
Neural Codecs
updated
2 days ago
Collection of neural codecs trained in ESPnet for speech tokenization
Upvote
-
espnet/dac_16k_music_survey
Updated
Jan 7
•
8
espnet/dac_44k_audio_single_survey
Updated
Jan 7
•
7
espnet/dac_16k_speech_single_survey
Updated
Jan 2
•
7
espnet/dac_16k_all_single_survey
Updated
Jan 2
•
6
ftshijt/espnet_codec_dac_large_v1.4_360epoch
Updated
Nov 21, 2024
•
14
ftshijt/espnet_codec_soundstream_large_v1.8
Updated
Oct 22, 2024
•
8
ftshijt/espnet_codec_dac_large_v1.6_240epoch
Updated
Nov 21, 2024
•
8
ftshijt/espnet_codec_dac_large_v1.4_120epoch
Updated
Oct 27, 2024
•
4
ftshijt/espnet_codec_encodec_large_v1.2
Updated
Oct 22, 2024
•
2
ftshijt/espnet_codec_dac_large_v1.11_120epoch
Updated
Nov 11, 2024
•
4
ftshijt/espnet_codec_soundstream_large_v1.6
Updated
Oct 22, 2024
•
2
ftshijt/espnet_codec_soundstream_large_v1.4
Updated
Oct 22, 2024
•
1
espnet/mls-audioset_soundstream_16k_360epoch
Updated
Sep 16, 2024
•
7
espnet/mls-multi_soundstream_16k
Updated
Sep 4, 2024
•
4
espnet/mls-english_encodec_16k_360epoch
Updated
Sep 16, 2024
•
4
espnet/mls-audioset_encodec_16k_360epoch
Updated
Sep 16, 2024
•
3
espnet/mls-english_soundstream_16k_360epoch
Updated
Sep 26, 2024
•
4
espnet/mls-audioset_soundstream_16k
Updated
Sep 4, 2024
•
3
espnet/mls-audioset_encodec_16k
Updated
Sep 4, 2024
•
2
espnet/mls-english_encodec_16k
Updated
Sep 4, 2024
•
3
espnet/mls-multi_encodec_16k
Updated
Sep 4, 2024
•
1
espnet/mls-multi_encodec_16k_360epoch
Updated
Sep 26, 2024
•
2
espnet/mls-multi_soundstream_16k_360epoch
Updated
Sep 16, 2024
•
1
espnet/mls-english_soundstream_16k
Updated
Sep 4, 2024
•
1
Upvote
-
Share collection
View history
Collection guide
Browse collections