Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
tyfeng1997
/
Qwen2.5-Math-1.5B-Open-R1-Distill
like
0
Text Generation
Transformers
Safetensors
HuggingFaceH4/Bespoke-Stratos-17k
qwen2
Generated from Trainer
open-r1
trl
sft
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5-Math-1.5B-Open-R1-Distill
1 contributor
History:
3 commits
tyfeng1997
End of training
7a54dc2
verified
about 23 hours ago
.gitattributes
1.57 kB
Model save
about 23 hours ago
README.md
1.9 kB
End of training
about 23 hours ago
added_tokens.json
605 Bytes
Model save
about 23 hours ago
all_results.json
381 Bytes
End of training
about 23 hours ago
config.json
750 Bytes
End of training
about 23 hours ago
eval_results.json
166 Bytes
End of training
about 23 hours ago
generation_config.json
122 Bytes
Model save
about 23 hours ago
merges.txt
1.67 MB
Model save
about 23 hours ago
model.safetensors
3.09 GB
LFS
Model save
about 23 hours ago
special_tokens_map.json
502 Bytes
Model save
about 23 hours ago
tokenizer.json
11.4 MB
LFS
Model save
about 23 hours ago
tokenizer_config.json
7.35 kB
Model save
about 23 hours ago
train_results.json
217 Bytes
Model save
about 23 hours ago
trainer_state.json
22.7 kB
Model save
about 23 hours ago
training_args.bin
pickle
Detected Pickle imports (10)
"open_r1.configs.SFTConfig"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.training_args.OptimizerNames"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_utils.SaveStrategy"
,
"torch.device"
How to fix it?
6.01 kB
LFS
Model save
about 23 hours ago
vocab.json
2.78 MB
Model save
about 23 hours ago