Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AXERA-TECH
/
DeepSeek-R1-Distill-Qwen-1.5B
like
3
Follow
AXERA
9
Transformers
Inference Endpoints
arxiv:
2501.12948
License:
bsd-3-clause
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-1.5B
1 contributor
History:
22 commits
qqc1989
Upload 3 files
139dcbd
verified
9 days ago
deepseek-r1-1.5b-ax630c
Upload 30 files
about 1 month ago
deepseek-r1-1.5b-ax650
Upload 30 files
about 1 month ago
deepseek-r1_tokenizer
Upload 6 files
about 1 month ago
figures
Rename figures/figures_benchmark.jpg to figures/benchmark.jpg
11 days ago
.gitattributes
Safe
7.12 kB
Upload 2 files
10 days ago
README.md
Safe
19.4 kB
Update README.md
15 days ago
config.json
Safe
20 Bytes
Create config.json
13 days ago
deepseek-r1_tokenizer.py
Safe
4.27 kB
Upload 6 files
about 1 month ago
main_axcl_aarch64
Safe
999 kB
LFS
Upload 3 files
9 days ago
main_axcl_x86
Safe
1.02 MB
LFS
Upload 3 files
9 days ago
main_prefill
Safe
954 kB
LFS
Upload 3 files
9 days ago
post_config.json
Safe
277 Bytes
Upload 2 files
9 days ago
run_deepseek-r1_1.5B_ax630c.sh
Safe
512 Bytes
Upload 6 files
about 1 month ago
run_deepseek-r1_1.5B_ax650.sh
Safe
509 Bytes
Upload 6 files
about 1 month ago
run_deepseek-r1_1.5b_axcl_aarch64.sh
Safe
508 Bytes
Upload 2 files
9 days ago
run_deepseek-r1_1.5b_axcl_x86.sh
Safe
504 Bytes
Upload 2 files
9 days ago