- LayoutLM: Pre-training of Text and Layout for Document Image Understanding
  Paper • 1912.13318 • Published • 2
- microsoft/layoutlm-base-uncased
  Updated • 1.45M • 50
- LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
  Paper • 2012.14740 • Published • 1
- microsoft/layoutlmv2-base-uncased
  Updated • 810k • 64

Collections
Collections including paper arxiv:2204.08387
- LayoutLM: Pre-training of Text and Layout for Document Image Understanding
  Paper • 1912.13318 • Published • 2
- LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
  Paper • 2012.14740 • Published • 1
- LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
  Paper • 2204.08387 • Published • 2

- CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
  Paper • 2004.12629 • Published • 2
- LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
  Paper • 2204.08387 • Published • 2
- Text Role Classification in Scientific Charts Using Multimodal Transformers
  Paper • 2402.14579 • Published • 1
- An inclusive review on deep learning techniques and their scope in handwriting recognition
  Paper • 2404.08011 • Published • 1

- FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
  Paper • 2305.02549 • Published • 6
- FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
  Paper • 2203.08411 • Published • 1
- More efficient manual review of automatically transcribed tabular data
  Paper • 2306.16126 • Published • 1
- CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
  Paper • 2004.12629 • Published • 2

- Noise-Aware Training of Layout-Aware Language Models
  Paper • 2404.00488 • Published • 10
- FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
  Paper • 2203.08411 • Published • 1
- FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
  Paper • 2305.02549 • Published • 6
- ETC: Encoding Long and Structured Inputs in Transformers
  Paper • 2004.08483 • Published • 1

- Noise-Aware Training of Layout-Aware Language Models
  Paper • 2404.00488 • Published • 10
- LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
  Paper • 2204.08387 • Published • 2
- LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
  Paper • 2012.14740 • Published • 1
- LayoutLM: Pre-training of Text and Layout for Document Image Understanding
  Paper • 1912.13318 • Published • 2

- Can large language models explore in-context?
  Paper • 2403.15371 • Published • 33
- GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
  Paper • 2403.19655 • Published • 19
- WavLLM: Towards Robust and Adaptive Speech Large Language Model
  Paper • 2404.00656 • Published • 11
- Enabling Memory Safety of C Programs using LLMs
  Paper • 2404.01096 • Published • 1