gn00029914

AI & ML interests

None yet

Recent Activity

liked a model about 9 hours ago

mradermacher/Qwen2.5-DeepHyper-GGUF

liked a model about 9 hours ago

CultriX/Qwen2.5-DeepHyper

liked a model about 10 hours ago

mradermacher/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-i1-GGUF

View all activity

Organizations

None yet

gn00029914's activity

liked 2 models about 9 hours ago

mradermacher/Qwen2.5-DeepHyper-GGUF

Updated about 17 hours ago • 233 • 2

CultriX/Qwen2.5-DeepHyper

Text Generation • Updated 2 days ago • 4 • 2

liked a model about 10 hours ago

mradermacher/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-i1-GGUF

Updated 17 days ago • 2.15k • 2

upvoted a paper about 10 hours ago

Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series

Paper • 2401.03955 • Published Jan 8, 2024 • 8

upvoted a collection about 11 hours ago

Cognitive Architecture

Collection

9 items • Updated about 19 hours ago • 2

upvoted a paper about 11 hours ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 1 day ago • 83

liked 2 models about 16 hours ago

mradermacher/RombO1-Fuse-i1-GGUF

Updated 7 days ago • 567 • 1

valoomba/RombO1-Fuse

Text Generation • Updated 8 days ago • 14 • 1

upvoted a collection about 19 hours ago

interesting stuff

Collection

115 items • Updated 5 days ago • 3

upvoted a collection about 20 hours ago

Papers

Collection

Large Language Model (LLM) and NLP related papers. • 190 items • Updated 3 days ago • 9

upvoted 5 papers about 20 hours ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 63

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 21 days ago • 315

upvoted an article about 20 hours ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

and 4 others •

22 days ago

• 13

liked a model about 20 hours ago

FuseAI/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview

Updated 18 days ago • 489 • 22

liked 2 Spaces about 23 hours ago

Quantizes a GGUF model

🔥

Run web apps with Streamlit

Quant

💻

Display interactive data visualizations and apps

liked a model 2 days ago

simplescaling/step-conditional-control

Text Generation • Updated 8 days ago • 108 • 1