Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2404.06209

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Paper • 2405.07990 • Published May 13, 2024 • 20
Large Language Models as Planning Domain Generators

Paper • 2405.06650 • Published Apr 2, 2024 • 13
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19, 2024 • 43
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 48

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Paper • 2404.06209 • Published Apr 9, 2024 • 5

Audio Reading - 2404.06209 - Elephants Never Forget

Read by Bark: https://drive.google.com/file/d/13IlbhKh71vxLpdYJ6mkIiiJZOUsf7XFv/view?usp=sharing

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Paper • 2404.06209 • Published Apr 9, 2024 • 5

Papers - Training - Noisy or Unseen Data Drops Accuracy 6%

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Paper • 2404.06209 • Published Apr 9, 2024 • 5

Papers - University of Tubingen

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

Paper • 2404.04125 • Published Apr 4, 2024 • 29
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Paper • 2404.06209 • Published Apr 9, 2024 • 5

Papers - Documents - Tabular

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
More efficient manual review of automatically transcribed tabular data

Paper • 2306.16126 • Published Jun 28, 2023 • 1
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2

Papers - Document - OCR

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30, 2024 • 10
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
ETC: Encoding Long and Structured Inputs in Transformers

Paper • 2004.08483 • Published Apr 17, 2020 • 1

Papers - Microsoft

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 33
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28, 2024 • 19
WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31, 2024 • 11
Enabling Memory Safety of C Programs using LLMs

Paper • 2404.01096 • Published Apr 1, 2024 • 1

Papers - Tabular

Converted the Elephants Never Forget paper to audio with Bark: https://drive.google.com/file/d/13IlbhKh71vxLpdYJ6mkIiiJZOUsf7XFv/view?usp=sharing

End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 5
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Paper • 2404.06209 • Published Apr 9, 2024 • 5
TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

Paper • 2406.19380 • Published Jun 27, 2024 • 49
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 135

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs