AlejandroOlmedo
/

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

8-bit precision

Model card Files Files and versions

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx

Ctrl+K

Ctrl+K

1 contributor

History: 16 commits

AlejandroOlmedo's picture

AlejandroOlmedo

Update README.md

8419a26 verified 7 months ago

.gitattributes

1.57 kB

Upload tokenizer.json with huggingface_hub 7 months ago
README.md

2.88 kB

Update README.md 7 months ago
config.json

917 Bytes

Upload config.json with huggingface_hub 7 months ago
model-00001-of-00002.safetensors

5.32 GB
xet

Upload model-00001-of-00002.safetensors with huggingface_hub 7 months ago
model-00002-of-00002.safetensors

2.78 GB
xet

Upload model-00002-of-00002.safetensors with huggingface_hub 7 months ago
model.safetensors.index.json

62.7 kB

Upload model.safetensors.index.json with huggingface_hub 7 months ago
special_tokens_map.json

485 Bytes

Upload special_tokens_map.json with huggingface_hub 7 months ago
tokenizer.json

11.4 MB
xet

Upload tokenizer.json with huggingface_hub 7 months ago
tokenizer_config.json

6.86 kB

Upload tokenizer_config.json with huggingface_hub 7 months ago