A NOVEL diagnostic approach integrating a host immune biomarker with large language model (LLM) analysis of electronic health records significantly improves the diagnosis of lower respiratory tract infection in critically ill adults, according to new research. The findings highlight a potential strategy to reduce diagnostic uncertainty and inappropriate antibiotic use in intensive care settings. 

Diagnosing lower respiratory tract infection remains a major clinical challenge, particularly in the ICU, where infectious and non-infectious respiratory conditions often present similarly. Traditional microbiological testing frequently fails to identify a causative pathogen, leading clinicians to rely on empiric antibiotics, with well-recognised risks including Clostridioides difficile infection and antimicrobial resistance. 

Lower Respiratory Tract Infection Diagnosis Enhanced by Host Biomarkers 

Host transcriptional biomarkers offer a complementary approach by reflecting the body’s immune response rather than pathogen detection alone. In this study, researchers focused on FABP4, a single-gene pulmonary transcriptomic biomarker suitable for rapid molecular testing platforms already used in clinical practice. FABP4 expression was combined with LLM-based analysis of free-text electronic medical record (EMR) data using GPT-4. 

In a cohort of critically ill adults, the combined diagnostic classifier achieved an area under the receiver operating characteristic curve (AUC) of 0.93 and an overall accuracy of 84%. This performance exceeded that of FABP4 testing alone (AUC 0.84) and LLM-based EMR analysis alone (AUC 0.83). By comparison, the admitting clinical team’s diagnosis had an accuracy of 72%, highlighting the added value of the integrated approach. 

Validation Confirms Diagnostic Accuracy 

Importantly, the model’s performance was replicated in an independent validation cohort, where diagnostic accuracy rose to 96% with an AUC of 0.98. These findings suggest that combining structured molecular data with unstructured clinical narratives can meaningfully enhance diagnostic confidence in complex ICU cases. 

For clinicians, this approach may be particularly valuable when microbiological results are inconclusive or delayed. By improving early identification of true infection, and potentially distinguishing infectious from non-infectious causes of respiratory failure, the model could support more judicious antibiotic prescribing. 

While the study does not establish causality or directly assess patient outcomes, it provides proof of concept that integrating host biomarkers with AI can outperform either modality alone. Further prospective studies will be needed to evaluate real-world implementation, turnaround time, and impact on antimicrobial stewardship. 

Reference 

Phan HV et al. Integrating a host biomarker with a large language model for diagnosis of lower respiratory tract infection. Nat Commun. 2025;16(1):10882.