Yale NLP Lab

university

https://yale-nlp.github.io/

yalenlp

yale-nlp

Activity Feed Request to join this org

AI & ML interests

Natural Language Processing at Yale

Recent Activity

Simeng updated a dataset 12 days ago

yale-nlp/P-FOLIO

Simeng published a dataset 12 days ago

yale-nlp/P-FOLIO

yilunzhao new activity 16 days ago

yale-nlp/MMVU:Add task category

View all activity

yale-nlp's activity

Simeng

updated a dataset 12 days ago

yale-nlp/P-FOLIO

Preview • Updated 12 days ago • 34 • 3

Simeng

published a dataset 12 days ago

yale-nlp/P-FOLIO

Preview • Updated 12 days ago • 34 • 3

yilunzhao

in yale-nlp/MMVU 16 days ago

Add task category

#2 opened 20 days ago by

nielsr

yilunzhao

authored 2 papers 20 days ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published Dec 30, 2024 • 13

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 21 days ago • 81

armanc

authored a paper 21 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 21 days ago • 81

ChuhanLi

authored a paper 21 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 21 days ago • 81

ziyaosg

authored a paper 21 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 21 days ago • 81

yilunzhao

updated a dataset 21 days ago

yale-nlp/MMVU

Viewer • Updated 16 days ago • 1k • 6.92k • 54

armanc

authored a paper 29 days ago

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published Jan 11 • 9

yilunzhao

authored a paper 29 days ago

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published Jan 11 • 9

ChuhanLi

updated a dataset 30 days ago

yale-nlp/M3SciQA

Viewer • Updated 30 days ago • 1.45k • 168 • 6

pybeebee

updated a collection about 2 months ago

MDCure

Collection

Models and datasets for our work "MDCure: A Scalable Pipeline for Multi-Document Instruction-Following" (https://arxiv.org/abs/2410.23463) • 11 items • Updated Dec 23, 2024 • 5

shrutisingh

updated a dataset about 2 months ago

yale-nlp/SciDQA

Viewer • Updated Dec 17, 2024 • 2.94k • 138 • 1

yilunzhao

authored 6 papers 2 months ago

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization

Paper • 2311.09184 • Published Nov 15, 2023 • 1

Investigating Data Contamination in Modern Benchmarks for Large Language Models

Paper • 2311.09783 • Published Nov 16, 2023 • 2

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning

Paper • 2311.10537 • Published Nov 16, 2023 • 3

AI & ML interests

Recent Activity

Team members 13

yale-nlp's activity

Add task category