Publications

MEDS — An Emerging Data Standard and Ecosystem for Health AI Research

McDermott MBA. et al, (2026), NEJM AI, 3

A benchmark of expert-level academic questions to assess AI capabilities

Phan L. et al, (2026), Nature, 649, 1139 - 1146

MEDS: Building Models and Tools in a Reproducible Health AI Ecosystem

McDermott MBA. et al, (2025), Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2, 6243 - 6244

Humanity's Last Exam

Phan L. et al, (2025)

Tree-of-Quote Prompting Improves Factuality and Attribution in Multi-Hop and Medical Reasoning

Xu J. et al, (2025), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 5605 - 5622

RadEval: A framework for radiology text evaluation

Xu J. et al, (2025), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 546 - 557

Automated Structured Radiology Report Generation

Delbrouck J-B. et al, (2025), Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 26813 - 26829

CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback

Hein D. et al, (2025), PROCEEDINGS OF THE 63RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 27679 - 27702

ACES: AUTOMATIC COHORT EXTRACTION SYSTEM FOR EVENT-STREAM DATASETS

Xu J. et al, (2025), 13th International Conference on Learning Representations Iclr 2025, 66701 - 66716

CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback

Hein D. et al, (2024)

Overview of the First Shared Task on Clinical Text Generation: RRG24 and "Discharge Me!"

Xu J. et al, (2024)

ACES: Automatic Cohort Extraction System for Event-Stream Datasets

Xu J. et al, (2024)

GREEN: Generative Radiology Report Evaluation and Error Notation

Ostmeier S. et al, (2024)

GREEN: Generative Radiology Report Evaluation and Error Notation

Ostmeier S. et al, (2024), Findings of the Association for Computational Linguistics: EMNLP 2024, 374 - 390

Overview of the First Shared Task on Clinical Text Generation: RRG24 and “Discharge Me!”

Xu J. et al, (2024), Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 85 - 98

Cookies on this website

Search results (17)