simonholm (Simon Holm )

upvoted an article about 7 hours ago

Article

🌁#87: Why DeepResearch Should Be Your New Hire

By

•

1 day ago

• 3

liked a dataset about 20 hours ago

open-r1/OpenR1-Math-220k

Viewer • Updated about 18 hours ago • 225k • 260 • 156

liked a model 1 day ago

tomg-group-umd/huginn-0125

Text Generation • Updated 2 days ago • 4.84k • 74

upvoted an article 1 day ago

Article

Open R1: Update #2

By

and 6 others •

1 day ago

• 126

reacted to Kseniase's post with 🔥🤗 3 days ago

Post

6870

8 New Types of RAG

RAG techniques continuously evolve to enhance LLM response accuracy by retrieving relevant external data during generation. To keep up with current AI trends, new RAG types incorporate deep step-by-step reasoning, tree search, citations, multimodality and other effective techniques.

Here's a list of 8 latest RAG advancements:

1. DeepRAG -> DeepRAG: Thinking to Retrieval Step by Step for Large Language Models (2502.01142)
Models retrieval-augmented reasoning as a Markov Decision Process, enabling strategic retrieval. It dynamically decides when to retrieve external knowledge and when rely on parametric reasoning.

2. RealRAG -> RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning (2502.00848)
Enhances novel object generation by retrieving real-world images and using self-reflective contrastive learning to fill knowledge gap, improve realism and reduce distortions.

3. Chain-of-Retrieval Augmented Generation (CoRAG) -> Chain-of-Retrieval Augmented Generation (2501.14342)
Retrieves information step-by-step and adjusts it, also deciding how much compute power to use at test time. If needed it reformulates queries.

4. VideoRAG -> VideoRAG: Retrieval-Augmented Generation over Video Corpus (2501.05874)
Enables unlimited-length video processing, using dual-channel architecture that integrates graph-based textual grounding and multi-modal context encoding.

5. CFT-RAG -> CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter (2501.15098)
A tree-RAG acceleration method uses an improved Cuckoo Filter to optimize entity localization, enabling faster retrieval.

6. Contextualized Graph RAG (CG-RAG) -> CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs (2501.15067)
Uses Lexical-Semantic Graph Retrieval (LeSeGR) to integrate sparse and dense signals within graph structure and capture citation relationships

7. GFM-RAG -> GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation (2502.01113)
A graph foundation model that uses a graph neural network to refine query-knowledge connections

8. URAG -> URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT (2501.16276)
A hybrid system combining rule-based and RAG methods to improve lightweight LLMs for educational chatbots

1 reply

·

upvoted an article 5 days ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

5 days ago

• 18

upvoted an article 6 days ago

Article

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

By

•

10 days ago

• 11

liked a model 6 days ago

deepseek-ai/deepseek-vl2-small

Image-Text-to-Text • Updated Dec 18, 2024 • 21.7k • 122

liked a Space 6 days ago

296

Chat with DeepSeek-VL2-small

🌍

Generate text based on images and prompts

upvoted an article 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 919

reacted to m-ric's post with 🚀🔥 7 days ago

Post

9211

Introducing 𝗼𝗽𝗲𝗻 𝗗𝗲𝗲𝗽-𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 by Hugging Face! 💥

OpenAI's latest agentic app Deep Research seems really good... But it's closed, as usual.

⏱️ So with a team of cracked colleagues, we set ourselves a 24hours deadline to replicate and open-source Deep Research! ⏱️

➡️ We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...

We aimed for the best performance: are the agent's answers really rigorous?

On GAIA benchmark, Deep Research had 67% accuracy on the validation set.
➡️ open Deep Research is at 55% (powered by o1), it is:
- the best pass@1 solution submitted
- the best open solution 💪💪

And it's only getting started ! Please jump in, drop PRs, and let's bring it to the top !

Read the blog post 👉 https://huggingface.co/blog/open-deep-research

liked a dataset 7 days ago

cognitivecomputations/dolphin-r1

Viewer • Updated 12 days ago • 814k • 3.52k • 228

upvoted an article 8 days ago

Article

Welcome to Inference Providers on the Hub 🔥

15 days ago

• 323

liked a Space 8 days ago

38

GOT OCR Transformers

📷

Demo of GOT-OCR 2.0's Transformers implementation

upvoted an article 9 days ago

Article

Open-R1: Update #1

By

and 7 others •

10 days ago

• 270

reacted to Kseniase's post with 🤗 9 days ago

Post

4792

8 Free Sources on Reinforcement Learning

With the phenomenon of DeepSeek-R1's top reasoning capabilities, we all saw the true power of RL. At its core, RL is a type of machine learning where a model/agent learns to make decisions by interacting with an environment to maximize a reward. RL learns through trial and error, receiving feedback in the form of rewards or penalties.

Here's a list of free sources that will help you dive into RL and how to use it:

1. "Reinforcement Learning: An Introduction" book by Richard S. Sutton and Andrew G. Barto -> https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf

2. Hugging Face Deep Reinforcement Learning Course -> https://huggingface.co/learn/deep-rl-course/unit0/introduction
You'll learn how to train agents in unique environments, using best libraries, share your results, compete in challenges, and earn a certificate.

3. OpenAI Spinning Up in Deep RL -> https://spinningup.openai.com/en/latest/index.html
A comprehensive overview of RL with many useful resources

4. "Reinforcement Learning and Optimal Control" books, video lectures and course material by Dimitri P. Bertsekas from ASU -> https://web.mit.edu/dimitrib/www/RLbook.html
Explores approximate Dynamic Programming (DP) and RL with key concepts and methods like rollout, tree search, and neural network training for RL and more.

5. RL Course by David Silver (Google DeepMind) -> https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PLqYmG7hTraZDM-OYHWgPeb
Many recommend these video lectures as a good foundation

6. RL theory seminars -> https://sites.google.com/view/rltheoryseminars/home?authuser=0
Provides virtual seminars from different experts about RL advancements

7. "Reinforcement Learning Specialization" (a 4-course series on Coursera) -> https://www.coursera.org/learn/fundament

8. Concepts: RLHF, RLAIF, RLEF, RLCF -> https://www.turingpost.com/p/rl-f
Our flashcards easily explain what are these four RL approaches with different feedback

upvoted 2 articles 9 days ago

Article

🦸🏻#8: Rewriting the Rules of Knowledge: How Modern Agents Learn to Adapt

By

•

12 days ago

• 5

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

27 days ago

• 66

Simon Holm

AI & ML interests

Recent Activity

Organizations

simonholm's activity

🌁#87: Why DeepResearch Should Be Your New Hire

open-r1/OpenR1-Math-220k

tomg-group-umd/huginn-0125

Open R1: Update #2

What is test-time compute and how to scale it?

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

deepseek-ai/deepseek-vl2-small

Chat with DeepSeek-VL2-small

Open-source DeepResearch – Freeing our search agents

cognitivecomputations/dolphin-r1

Welcome to Inference Providers on the Hub 🔥

GOT OCR Transformers

Open-R1: Update #1

🦸🏻#8: Rewriting the Rules of Knowledge: How Modern Agents Learn to Adapt

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference