30 238

Vivek Rp

vivekrp · https://vivekrp.com

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

sentence-transformers/all-mpnet-base-v2

upvoted an article 4 days ago

Mastering Long Contexts in LLMs with KVPress

liked a Space 11 days ago

codelion/optillm

vivekrp's activity

upvoted an article 4 days ago

Mastering Long Contexts in LLMs with KVPress

Article • 20 days ago • 62

upvoted an article 12 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 15 days ago • 322

upvoted a paper 13 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 21 days ago • 91

upvoted an article 15 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 15 days ago • 710

upvoted a collection 16 days ago

FuseChat 3.0

Preference Optimization for Implicit Model Fusion • 13 items • Updated 5 days ago • 11

upvoted an article 16 days ago

FuseChat-3.0: Preference Optimization for Implicit Model Fusion

Article • Dec 18, 2024 • 5

upvoted a collection 16 days ago

Multimodal Research

9 items • Updated 19 days ago • 1

upvoted 2 articles 16 days ago

Visual Document Retrieval Goes Multilingual

Article • Jan 10 • 68

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • 20 days ago • 124

upvoted a collection 16 days ago

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 20 days ago • 68

upvoted an article 17 days ago

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Article • 22 days ago • 13

upvoted 2 collections 17 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 12 days ago • 17

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 6 days ago • 228

upvoted a paper 17 days ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 91

upvoted 2 papers 19 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 20 days ago • 315

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 21 days ago • 23

upvoted an article 21 days ago

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Article • 22 days ago • 60

upvoted a collection 21 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 4 days ago • 168

upvoted a collection 22 days ago

DeepSeek-R1

8 items • Updated 22 days ago • 473

upvoted a paper 22 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 26 days ago • 105