In a Training Loop 🔄

Karsten Kuhnke PRO

mindchain

https://www.linkedin.com/in/jankarstenkuhnke/

AI & ML interests

Mechanistic Interpretability, Sparse Autoencoders, JumpReLU, Reward Modeling, RLHF, AI Alignment, Function Calling, Gemma, Nemotron

Recent Activity

updated a collection 15 minutes ago

Bread&Butter

liked a model 16 minutes ago

upstage/Solar-Open-100B

upvoted a collection 29 minutes ago

NemoGuard

upvoted a collection 29 minutes ago

NemoGuard

Collection

Essential datasets and models for content safety, topic-following, and security guardrails • 13 items • Updated 11 days ago • 16

upvoted a paper about 5 hours ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 125

upvoted a collection about 7 hours ago

Cerebras REAP

Collection

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 15 days ago • 77

upvoted a paper about 9 hours ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 3 days ago • 148

upvoted a paper about 13 hours ago

PaddleOCR 3.0 Technical Report

Paper • 2507.05595 • Published Jul 8, 2025 • 19

upvoted 2 collections about 13 hours ago

OCR

Collection

3 items • Updated about 13 hours ago • 1

PP-StructureV3

Collection

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and document images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 12

upvoted 6 collections 5 days ago

upvoted 4 papers 5 days ago

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published 17 days ago • 14

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28, 2025 • 36

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 125

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 17 days ago • 90

upvoted an article 5 days ago

Diffusers welcomes FLUX-2

Article • Nov 25, 2025 • 167

upvoted a paper 5 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 11 days ago • 59

upvoted a collection 5 days ago

— Awesome RL datasets 📈 —

Collection

3 items • Updated Sep 23, 2025 • 1
