Collections including paper arxiv:2204.08387

LayoutLM and Document Intelligence

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2
microsoft/layoutlm-base-uncased

Updated Apr 16, 2024 • 1.45M • 50
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
microsoft/layoutlmv2-base-uncased

Updated Sep 16, 2022 • 810k • 64

Papers - Document AI

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR - Tesseract for Text Location

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - Table Structure Recognition

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
Text Role Classification in Scientific Charts Using Multimodal Transformers

Paper • 2402.14579 • Published Feb 8, 2024 • 1
An inclusive review on deep learning techniques and their scope in handwriting recognition

Paper • 2404.08011 • Published Apr 10, 2024 • 1

Papers - Image - Tabular

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Documents - Tabular

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
More efficient manual review of automatically transcribed tabular data

Paper • 2306.16126 • Published Jun 28, 2023 • 1
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2

Papers - Document - OCR

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30, 2024 • 10
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
ETC: Encoding Long and Structured Inputs in Transformers

Paper • 2004.08483 • Published Apr 17, 2020 • 1

Papers - Documents - LayoutLM

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30, 2024 • 10
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2

Papers - Microsoft

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 33
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28, 2024 • 19
WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31, 2024 • 11
Enabling Memory Safety of C Programs using LLMs

Paper • 2404.01096 • Published Apr 1, 2024 • 1
