tyfeng1997/Qwen2.5-Math-1.5B-Open-R1-Distill

Tags: Text Generation, Generated from Trainer, text-generation-inference, Inference Endpoints

1 contributor

History: 3 commits

Latest commit: End of training (7a54dc2, verified, about 23 hours ago)

File                       Size        LFS   Last commit       Updated
.gitattributes             1.57 kB     -     Model save        about 23 hours ago
README.md                  1.9 kB      -     End of training   about 23 hours ago
added_tokens.json          605 Bytes   -     Model save        about 23 hours ago
all_results.json           381 Bytes   -     End of training   about 23 hours ago
config.json                750 Bytes   -     End of training   about 23 hours ago
eval_results.json          166 Bytes   -     End of training   about 23 hours ago
generation_config.json     122 Bytes   -     Model save        about 23 hours ago
merges.txt                 1.67 MB     -     Model save        about 23 hours ago
model.safetensors          3.09 GB     LFS   Model save        about 23 hours ago
special_tokens_map.json    502 Bytes   -     Model save        about 23 hours ago
tokenizer.json             11.4 MB     LFS   Model save        about 23 hours ago
tokenizer_config.json      7.35 kB     -     Model save        about 23 hours ago
train_results.json         217 Bytes   -     Model save        about 23 hours ago
trainer_state.json         22.7 kB     -     Model save        about 23 hours ago
training_args.bin          6.01 kB     LFS   Model save        about 23 hours ago
vocab.json                 2.78 MB     -     Model save        about 23 hours ago

training_args.bin: Detected Pickle imports (10)
- "open_r1.configs.SFTConfig",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SaveStrategy",
- "torch.device"
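
The pickle imports reported for training_args.bin above can be reproduced locally without unpickling the file. The following is a minimal sketch using only the Python standard library, not the scanner the Hub itself runs. The function names `list_pickle_imports` and `list_checkpoint_imports` are hypothetical. It assumes the file is either a raw pickle stream or a zip archive containing one or more `.pkl` members, and it resolves `STACK_GLOBAL` opcodes with a simple heuristic (the two most recent string arguments), which can miss names fetched from the pickle memo.

```python
import collections
import io
import pickle
import pickletools
import zipfile


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream references, without loading it."""
    found = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in ("GLOBAL", "INST"):
            # pickletools reports these as "module name", separated by one space
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Heuristic: module and name are usually the last two strings pushed
            found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


def list_checkpoint_imports(path) -> set:
    """Scan a file that is either a raw pickle or a zip archive holding .pkl members."""
    if zipfile.is_zipfile(path):
        found = set()
        with zipfile.ZipFile(path) as zf:
            for member in zf.namelist():
                if member.endswith(".pkl"):
                    found |= list_pickle_imports(zf.read(member))
        return found
    with open(path, "rb") as fh:
        return list_pickle_imports(fh.read())


# Small self-contained demonstration on a pickle built in memory
sample = pickle.dumps(collections.OrderedDict(), protocol=4)
demo = list_pickle_imports(sample)
```

Calling `list_checkpoint_imports("training_args.bin")` on a downloaded copy should give a set to compare against the ten names listed above. Reading opcodes this way never executes code from the file, unlike `pickle.load`.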