Search results (16)
A benchmark of expert-level academic questions to assess AI capabilities
Journal article
Phan L. et al, (2026), Nature, 649, 1139 - 1146
Tree-of-Quote Prompting Improves Factuality and Attribution in Multi-Hop and Medical Reasoning
Conference paper
Xu J. et al, (2025), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 5605 - 5622
Automated Structured Radiology Report Generation
Conference paper
Delbrouck J-B. et al, (2025), Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 26813 - 26829
RadEval: A framework for radiology text evaluation
Conference paper
Xu J. et al, (2025), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 546 - 557
CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback
Conference paper
Hein D. et al, (2025), Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 27679 - 27702
MEDS: Building Models and Tools in a Reproducible Health AI Ecosystem
Conference paper
McDermott MBA. et al, (2025), Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2, KDD 2025, 6243 - 6244
ACES: Automatic Cohort Extraction System for Event-Stream Datasets
Conference paper
Xu J. et al, (2025), 13th International Conference on Learning Representations, ICLR 2025, 66701 - 66716
CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback
Preprint
Hein D. et al, (2024)
Overview of the First Shared Task on Clinical Text Generation: RRG24 and "Discharge Me!"
Preprint
Xu J. et al, (2024)
GREEN: Generative Radiology Report Evaluation and Error Notation
Conference paper
Ostmeier S. et al, (2024), Findings of the Association for Computational Linguistics: EMNLP 2024, 374 - 390
Overview of the First Shared Task on Clinical Text Generation: RRG24 and “Discharge Me!”
Conference paper
Xu J. et al, (2024), Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 85 - 98
AnnoDash, a clinical terminology annotation dashboard
Journal article
Xu J. et al, (2023), JAMIA Open, 6
