MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training • Paper • 2303.13510 • Published Mar 23, 2023
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions • Paper • 2303.17597 • Published Mar 30, 2023
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction • Paper • 2310.01403 • Published Oct 2, 2023
Evaluating Hallucinations in Chinese Large Language Models • Paper • 2310.03368 • Published Oct 5, 2023
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans • Paper • 2305.04790 • Published May 8, 2023
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models • Paper • 2306.09347 • Published Jun 15, 2023
CLIM: Contrastive Language-Image Mosaic for Region Representation • Paper • 2312.11376 • Published Dec 18, 2023
T-Eval: Evaluating the Tool Utilization Capability Step by Step • Paper • 2312.14033 • Published Dec 21, 2023
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI • Paper • 2312.16170 • Published Dec 26, 2023
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest • Paper • 2307.03601 • Published Jul 7, 2023
Unified Human-Scene Interaction via Prompted Chain-of-Contacts • Paper • 2309.07918 • Published Sep 14, 2023
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation • Paper • 2402.13013 • Published Feb 20, 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data • Paper • 2405.19265 • Published May 29, 2024
ANAH: Analytical Annotation of Hallucinations in Large Language Models • Paper • 2405.20315 • Published May 30, 2024
RTMDet: An Empirical Study of Designing Real-Time Object Detectors • Paper • 2212.07784 • Published Dec 14, 2022