-
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Paper • 2405.07990 • Published • 18 -
Large Language Models as Planning Domain Generators
Paper • 2405.06650 • Published • 11 -
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation
Paper • 2404.12753 • Published • 43 -
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Paper • 2404.07972 • Published • 48
Collections
Discover the best community collections!
Collections including paper arxiv:2404.06209
-
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Paper • 2305.02549 • Published • 6 -
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Paper • 2203.08411 • Published • 1 -
More efficient manual review of automatically transcribed tabular data
Paper • 2306.16126 • Published • 1 -
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
Paper • 2004.12629 • Published • 2
-
Noise-Aware Training of Layout-Aware Language Models
Paper • 2404.00488 • Published • 10 -
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Paper • 2203.08411 • Published • 1 -
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Paper • 2305.02549 • Published • 6 -
ETC: Encoding Long and Structured Inputs in Transformers
Paper • 2004.08483 • Published • 1
-
Can large language models explore in-context?
Paper • 2403.15371 • Published • 32 -
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper • 2403.19655 • Published • 19 -
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper • 2404.00656 • Published • 11 -
Enabling Memory Safety of C Programs using LLMs
Paper • 2404.01096 • Published • 1
-
End-to-End Object Detection with Transformers
Paper • 2005.12872 • Published • 5 -
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Paper • 2404.06209 • Published • 5 -
TabReD: A Benchmark of Tabular Machine Learning in-the-Wild
Paper • 2406.19380 • Published • 47 -
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper • 2407.09025 • Published • 133