87 557

jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 1 hour ago

agents-course/notebooks

liked a model about 3 hours ago

FireRedTeam/FireRedASR-AED-L

liked a model about 3 hours ago

FireRedTeam/FireRedTTS

real-jiakai's activity

upvoted an article about 14 hours ago

Article

The Open Arabic LLM Leaderboard 2

2 days ago • 19

upvoted a paper 1 day ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 5 days ago • 60

upvoted a collection 2 days ago

DeepSeek-R1-abliterated

7 items • Updated 12 days ago • 76

upvoted a collection 3 days ago

Xwen-Chat

6 items • Updated 9 days ago • 10

upvoted an article 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago • 919

upvoted an article 9 days ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14, 2024 • 69

upvoted a collection 12 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 1 day ago • 90

upvoted a paper 12 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 14 days ago • 51

upvoted a paper 15 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 19 days ago • 48

upvoted an article 15 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

28 days ago • 142

upvoted a collection 15 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 16 days ago • 337

upvoted a paper 19 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 21 days ago • 315

upvoted 2 papers 28 days ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 22

upvoted 6 papers about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published Jan 8 • 33

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 84

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 48

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published Jan 4 • 28

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

Paper • 2412.20005 • Published Dec 28, 2024 • 17