ddh0's picture

ddh0 PRO

ddh0

·

AI & ML interests

None yet

Recent Activity

reacted to eaddario's post with 🤗 about 17 hours ago

Experimental global target bits‑per‑weight quantization of allenai/Olmo-3-7B-Instruct and allenai/Olmo-3-7B-Think Unlike standard llama.cpp quantizations that rely on fixed type heuristics (e.g., Q4_K_M), the Target BPW approach optimizes per-tensor precision where it matters the most, and produces high quality models that meet a precise global file size target. Key Advantages: - VRAM Maximization: Can generate high quality models sized exactly to fit hardware constraints (e.g., fitting the model into exactly 24GB VRAM). - Data-Driven Precision: Quantization mix is determined by actual weight error sensitivity rather than hardcoded rules, often yielding better PPL/KLD size trade-offs. Full benchmarks (PPL, KLD, ARC, MMLU, etc.) and methodology in the models' cards https://huggingface.co/eaddario/Olmo-3-7B-Instruct-GGUF https://huggingface.co/eaddario/Olmo-3-7B-Think-GGUF

liked a dataset about 17 hours ago

nbeerbower/hemlock-sft-v0.2

upvoted a paper 2 days ago

Latent Implicit Visual Reasoning

View all activity

Organizations

ddh0 's models 79

ddh0/MiniMax-M2.1-GGUF

229B • Updated 3 days ago • 79

ddh0/GLM-4.7-GGUF

358B • Updated 6 days ago • 42

ddh0/soft-prompts-data

Updated 8 days ago

ddh0/Q4_K_X.gguf

71B • Updated 8 days ago • 347 • 2

ddh0/imatrices

Updated 8 days ago • 31 • 1

ddh0/GLM-4.5V-GGUF

107B • Updated 14 days ago • 461 • 1

ddh0/GLM-4.6V-GGUF

107B • Updated 14 days ago • 499

ddh0/GLM-4.5-Air-Derestricted-GGUF

110B • Updated 29 days ago • 710 • 2

ddh0/INTELLECT-3-GGUF

107B • Updated Nov 27 • 21 • 1

ddh0/simple-tokenizer-2048

ddh0/Qwen3-235B-A22B-Thinking-2507-GGUF

235B • Updated Nov 16 • 30

ddh0/simple-tokenizer-1024

ddh0/MiniMax-M2-GGUF

229B • Updated Nov 5 • 9

ddh0/GLM-4.5-Iceblink-v2-106B-A12B-GGUF

110B • Updated Nov 3 • 1.65k • 6

ddh0/GLM-4.5-Air-GGUF

110B • Updated Nov 3 • 1.26k • 16

ddh0/simple-tokenizer-5120

ddh0/Ling-flash-2.0-Q8_0.gguf

103B • Updated Oct 21 • 8

ddh0/Qwen3-0.6B-intermediate-tensor-data-npy

ddh0/GLM-4.5-3.34bpw.gguf

358B • Updated Sep 13 • 3 • 1

ddh0/GLM-Steam-106B-A12B-v1b-Q8_0.gguf

110B • Updated Aug 27 • 8

ddh0/gemma-3-it-GGUF

12B • Updated Aug 18 • 148

ddh0/Andromeda-70B

71B • Updated Jul 30 • 7 • 2

ddh0/Cassiopeia-70B

71B • Updated Jul 29 • 22 • 9

ddh0/fallen-glimmer-27b

27B • Updated Jul 17 • 9

ddh0/AI21-Jamba-Mini-1.7-GGUF

52B • Updated Jul 10 • 5

ddh0/ay-oh-three

ddh0/dots.llm1.inst-GGUF-Q4_0-EXPERIMENTAL

143B • Updated Jun 11 • 4 • 1

ddh0/StrawberryLemonade-L3-70B-v1.0-GGUF

71B • Updated Jun 10 • 20

ddh0/Qwen3-4B

Text Generation • 4B • Updated May 26 • 60

ddh0/Qwen2.5-14B-All-Variants-q8_0-q6_K-GGUF

15B • Updated Apr 29 • 74 • 2