ddh0
PRO
144 followers · 95 following
AI & ML interests
None yet
Recent Activity
Reacted to eaddario's post with 🤗 about 17 hours ago:
Experimental global target bits-per-weight quantization of allenai/Olmo-3-7B-Instruct and allenai/Olmo-3-7B-Think

Unlike standard llama.cpp quantizations that rely on fixed type heuristics (e.g., Q4_K_M), the Target BPW approach optimizes per-tensor precision where it matters the most, and produces high quality models that meet a precise global file size target.

Key Advantages:
- VRAM Maximization: Can generate high quality models sized exactly to fit hardware constraints (e.g., fitting the model into exactly 24GB VRAM).
- Data-Driven Precision: Quantization mix is determined by actual weight error sensitivity rather than hardcoded rules, often yielding better PPL/KLD size trade-offs.

Full benchmarks (PPL, KLD, ARC, MMLU, etc.) and methodology are in the models' cards:
https://huggingface.co/eaddario/Olmo-3-7B-Instruct-GGUF
https://huggingface.co/eaddario/Olmo-3-7B-Think-GGUF
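To make the idea of a global bits-per-weight target concrete, here is a minimal sketch of a greedy per-tensor allocation under a size budget. It is not eaddario's method and not llama.cpp code: the precision levels in `LEVELS`, the `Tensor` fields, the `sensitivity` values, and the error model (error falling about 4x per extra bit) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical candidate precisions in bits per weight.
# These are illustrative levels, not real llama.cpp quantization types.
LEVELS = [3.0, 4.0, 5.0, 6.0, 8.0]


@dataclass
class Tensor:
    name: str
    n_weights: int
    sensitivity: float  # hypothetical measure of how much quantization error hurts


def est_error(t: Tensor, bpw: float) -> float:
    # Assumed error model: error shrinks about 4x per extra bit,
    # scaled by the tensor's sensitivity and its size.
    return t.sensitivity * t.n_weights * 2.0 ** (-2.0 * bpw)


def allocate(tensors: list[Tensor], target_bpw: float):
    """Greedily raise per-tensor precision while staying within the global budget."""
    total = sum(t.n_weights for t in tensors)
    budget_bits = target_bpw * total
    level = {t.name: 0 for t in tensors}  # index into LEVELS, start at the lowest
    used = LEVELS[0] * total
    if used > budget_bits:
        raise ValueError("target is below the lowest available precision")

    while True:
        best = None
        for t in tensors:
            i = level[t.name]
            if i + 1 >= len(LEVELS):
                continue  # already at the highest level
            extra = (LEVELS[i + 1] - LEVELS[i]) * t.n_weights
            if used + extra > budget_bits:
                continue  # this upgrade would exceed the size target
            # Error reduction per extra bit spent on this tensor.
            gain = (est_error(t, LEVELS[i]) - est_error(t, LEVELS[i + 1])) / extra
            if best is None or gain > best[0]:
                best = (gain, t, extra)
        if best is None:
            break  # no affordable upgrade remains
        _, t, extra = best
        level[t.name] += 1
        used += extra

    return {name: LEVELS[i] for name, i in level.items()}, used / total


# Two equally sized tensors, one assumed to be ten times more sensitive.
tensors = [Tensor("a", 1000, 10.0), Tensor("b", 1000, 1.0)]
bits, achieved_bpw = allocate(tensors, target_bpw=4.5)
print(bits, achieved_bpw)
```

In this toy setup the more sensitive tensor ends up at a higher precision than the less sensitive one while the average stays at the requested target. A real implementation would need measured error statistics per tensor and the actual storage cost of each quantization type.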
Liked a dataset about 17 hours ago: nbeerbower/hemlock-sft-v0.2
Upvoted a paper 2 days ago: Latent Implicit Visual Reasoning
Organizations
ddh0's models (79)
Sort: Recently updated
ddh0/MiniMax-M2.1-GGUF • 229B • Updated 3 days ago • 79
ddh0/GLM-4.7-GGUF • 358B • Updated 6 days ago • 42
ddh0/soft-prompts-data • Updated 8 days ago
ddh0/Q4_K_X.gguf • 71B • Updated 8 days ago • 347 • 2
ddh0/imatrices • Updated 8 days ago • 31 • 1
ddh0/GLM-4.5V-GGUF • 107B • Updated 14 days ago • 461 • 1
ddh0/GLM-4.6V-GGUF • 107B • Updated 14 days ago • 499
ddh0/GLM-4.5-Air-Derestricted-GGUF • 110B • Updated 29 days ago • 710 • 2
ddh0/INTELLECT-3-GGUF • 107B • Updated Nov 27 • 21 • 1
ddh0/simple-tokenizer-2048 • Updated Nov 25
ddh0/Qwen3-235B-A22B-Thinking-2507-GGUF • 235B • Updated Nov 16 • 30
ddh0/simple-tokenizer-1024 • Updated Nov 14
ddh0/MiniMax-M2-GGUF • 229B • Updated Nov 5 • 9
ddh0/GLM-4.5-Iceblink-v2-106B-A12B-GGUF • 110B • Updated Nov 3 • 1.65k • 6
ddh0/GLM-4.5-Air-GGUF • 110B • Updated Nov 3 • 1.26k • 16
ddh0/simple-tokenizer-5120 • Updated Oct 29
ddh0/Ling-flash-2.0-Q8_0.gguf • 103B • Updated Oct 21 • 8
ddh0/Qwen3-0.6B-intermediate-tensor-data-npy • Updated Oct 8
ddh0/GLM-4.5-3.34bpw.gguf • 358B • Updated Sep 13 • 3 • 1
ddh0/GLM-Steam-106B-A12B-v1b-Q8_0.gguf • 110B • Updated Aug 27 • 8
ddh0/gemma-3-it-GGUF • 12B • Updated Aug 18 • 148
ddh0/Andromeda-70B • 71B • Updated Jul 30 • 7 • 2
ddh0/Cassiopeia-70B • 71B • Updated Jul 29 • 22 • 9
ddh0/fallen-glimmer-27b • 27B • Updated Jul 17 • 9
ddh0/AI21-Jamba-Mini-1.7-GGUF • 52B • Updated Jul 10 • 5
ddh0/ay-oh-three • Updated Jul 4
ddh0/dots.llm1.inst-GGUF-Q4_0-EXPERIMENTAL • 143B • Updated Jun 11 • 4 • 1
ddh0/StrawberryLemonade-L3-70B-v1.0-GGUF • 71B • Updated Jun 10 • 20
ddh0/Qwen3-4B • Text Generation • 4B • Updated May 26 • 60
ddh0/Qwen2.5-14B-All-Variants-q8_0-q6_K-GGUF • 15B • Updated Apr 29 • 74 • 2