Announcement

Collapse
No announcement yet.

Sci Rep . Computable phenotypes to identify respiratory viral infections in the All of Us research program

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Sci Rep . Computable phenotypes to identify respiratory viral infections in the All of Us research program

    Sci Rep


    . 2025 May 28;15(1):18680.
    doi: 10.1038/s41598-025-02183-9. Computable phenotypes to identify respiratory viral infections in the All of Us research program

    Bennett J Waxse 1 2 , Fausto Andres Bustos Carrillo 2 , Tam C Tran 1 , Huan Mo 1 , Emily E Ricotta 3 , Joshua C Denny 4 5



    AffiliationsAbstract

    Electronic health records (EHRs) contain rich temporal data about respiratory viral infections, but methods to identify these infections from EHR data vary widely and lack robust validation. We developed computable phenotypes by integrating virus-specific International Classification of Diseases (ICD) billing codes, prescriptions, and laboratory results within 90-day episodes. Analysis of 265,222 participants with EHR data from the All of Us Research Program yielded national cohorts of varied size: large cohorts for SARS-CoV-2 (n = 28,729) and influenza (n = 19,784); medium cohorts for rhinovirus, human coronavirus, and respiratory syncytial virus (n = 1,161-1,620); and smaller cohorts for the other viruses (n = 238-486). Using laboratory results as a reference standard, phenotypes using virus-specific ICD codes and medications had variable sensitivity (8-67%) but high positive predictive value (PPV, 90-97%) for most viruses, while influenza virus and SARS-CoV-2 phenotypes had lower PPV (69-70%) that improved with the inclusion of additional ICD codes. Identified infections exhibited expected seasonal patterns matching CDC data. This integrated approach identified infections more effectively than individual components alone and demonstrated utility for severe infections in hospital settings. This method enables large-scale studies of host genetics, health disparities, and clinical outcomes across episodic diseases, with flexibility to optimize sensitivity or PPV depending on the specific research question.

    Keywords: Computable phenotype; Electronic health records; Influenza virus; Precision medicine; Reproducibility of results; Respiratory tract infections.

Working...
X