AlejandroOlmedo
/

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

1 contributor

History: 11 commits

AlejandroOlmedo's picture

AlejandroOlmedo

Update README.md

99b083a verified about 1 hour ago

.gitattributes

1.57 kB

Upload tokenizer.json with huggingface_hub 3 days ago
README.md

2.85 kB

Update README.md about 1 hour ago
config.json

917 Bytes

Upload config.json with huggingface_hub 3 days ago
model.safetensors

4.28 GB
LFS

Upload model.safetensors with huggingface_hub 3 days ago
model.safetensors.index.json

51.7 kB

Upload model.safetensors.index.json with huggingface_hub 3 days ago
special_tokens_map.json

485 Bytes

Upload special_tokens_map.json with huggingface_hub 3 days ago
tokenizer.json

11.4 MB
LFS

Upload tokenizer.json with huggingface_hub 3 days ago
tokenizer_config.json

6.86 kB

Upload tokenizer_config.json with huggingface_hub 3 days ago