nvidia/mamba2-8b-3t-4k
Tags: Text Generation, English, Megatron-LM, nvidia, Mamba, Mamba-2, SSM, 8B
arxiv: 2406.07887
arxiv: 2405.21060
License: apache-2.0
Branch: main
1 contributor
History: 2 commits
Latest commit: rwaleffe, "Upload model" (b915550), 7 months ago
Name | Size | Last commit | Updated
release | - | Upload model | 7 months ago
.gitattributes | 1.52 kB | initial commit | 7 months ago
README.md | 2.16 kB | Upload model | 7 months ago
latest_checkpointed_iteration.txt | 8 Bytes | Upload model | 7 months ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model | 4.57 MB (LFS) | Upload model | 7 months ago