Collections

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 61
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1, 2024 • 88
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

Paper • 2412.20005 • Published Dec 28, 2024 • 18
Understanding Alignment in Multimodal LLMs: A Comprehensive Study

Paper • 2407.02477 • Published Jul 2, 2024 • 23

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 52
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2, 2024 • 32
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108
EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

stepfun-ai/GOT-OCR2_0

Midi Music Generator

OpenGVLab/InternVL2_5-78B-MPO

OpenGVLab/InternVL2_5-38B-MPO-AWQ

VILA^2: VILA Augmented VILA

Octopus v4: Graph of language models

Octo-planner: On-device Language Model for Planner-Action Agents

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Human-like Episodic Memory for Infinite Context LLMs

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Symbolic Learning Enables Self-Evolving Agents

Agent Laboratory: Using LLM Agents as Research Assistants

MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Spectrally Pruned Gaussian Fields with Neural Compensation

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Training Software Engineering Agents and Verifiers with SWE-Gym

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

Communicative Agents for Software Development

Self-Refine: Iterative Refinement with Self-Feedback

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

ReAct: Synergizing Reasoning and Acting in Language Models

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable