Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-7B
like
0
Follow
AXERA
9
Transformers
Inference Endpoints
arxiv:
2501.12948
License:
bsd-3-clause
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
738a450
DeepSeek-R1-Distill-Qwen-7B
1 contributor
History:
12 commits
qqc1989
Upload 2 files
738a450
verified
10 days ago
deepseek-r1-7b-ax650
Upload 2 files
13 days ago
deepseek-r1_tokenizer
Upload 5 files
13 days ago
.gitattributes
Safe
4.19 kB
Upload 2 files
10 days ago
README.md
Safe
19.5 kB
Update README.md
13 days ago
config.json
Safe
23 Bytes
Create config.json
13 days ago
deepseek-r1_tokenizer.py
Safe
4.27 kB
Upload 5 files
13 days ago
main_prefill
Safe
3 MB
LFS
Upload 5 files
13 days ago
main_prefill_postprocess
Safe
3.06 MB
LFS
Upload 2 files
10 days ago
post_config.json
Safe
277 Bytes
Upload 2 files
10 days ago
run_deepseek-r1_7b_ax650.sh
Safe
497 Bytes
Upload 5 files
13 days ago