Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-7B
like
0
Follow
AXERA
9
Transformers
Inference Endpoints
arxiv:
2501.12948
License:
bsd-3-clause
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
18d03a5
DeepSeek-R1-Distill-Qwen-7B
1 contributor
History:
7 commits
qqc1989
Upload 5 files
18d03a5
verified
13 days ago
deepseek-r1-7b-ax650
Upload 5 files
13 days ago
deepseek-r1_tokenizer
Upload 5 files
13 days ago
.gitattributes
2.54 kB
Upload 5 files
13 days ago
README.md
33 Bytes
initial commit
13 days ago
config.json
23 Bytes
Create config.json
13 days ago
deepseek-r1_tokenizer.py
4.27 kB
Upload 5 files
13 days ago
main_prefill
3 MB
LFS
Upload 5 files
13 days ago
run_deepseek-r1_7b_ax650.sh
497 Bytes
Upload 5 files
13 days ago