AUTHOR=Swerdel Joel N., Conover Mitchell M.

TITLE=Comparing broad and narrow phenotype algorithms: differences in performance characteristics and immortal time incurred

JOURNAL=Journal of Pharmacy & Pharmaceutical Sciences

VOLUME=Volume 26 - 2023

YEAR=2024

URL=https://www.frontierspartnerships.org/journals/journal-of-pharmacy-pharmaceutical-sciences/articles/10.3389/jpps.2023.12095

DOI=10.3389/jpps.2023.12095

ISSN=1482-1826

ABSTRACT=Introduction
When developing phenotype algorithms for observational research, there is usually a trade-off between definitions that are sensitive and definitions that are specific. The objective of this study was to estimate the performance characteristics of phenotype algorithms designed to increase specificity and to estimate the immortal time associated with each algorithm.
Materials and Methods
We examined algorithms for 11 chronic health conditions. The analyses used data from five databases. For each health condition, we created five algorithms to examine differences in performance (sensitivity and positive predictive value (PPV)): one broad algorithm using a single code for the health condition and four narrow algorithms in which a second diagnosis code was required 1-30 days, 1-90 days, 1-365 days, or 1 to all remaining days in a subject’s continuous observation period after the first code.
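The broad and narrow definitions above can be sketched as a single membership check. This is a minimal illustration, not the authors' implementation: the function name `qualifies`, its parameters, and the representation of dates as integer day numbers are all assumptions made for the example.

```python
import math
from typing import Optional, Sequence


def qualifies(code_days: Sequence[int], obs_end_day: int,
              window: Optional[float] = None) -> bool:
    """Hypothetical check of whether a subject meets a phenotype algorithm.

    code_days   -- days on which the condition code was recorded
    obs_end_day -- last day of the continuous observation period
    window      -- None for the broad (single-code) algorithm; otherwise the
                   number of days after the first code within which a second
                   code is required (math.inf means "all remaining days")
    """
    days = sorted(d for d in code_days if d <= obs_end_day)
    if not days:
        return False
    if window is None:
        # Broad algorithm: one code is enough.
        return True
    index_day = days[0]
    last_day = min(index_day + window, obs_end_day)
    # Narrow algorithm: a second code 1..window days after the first code.
    return any(index_day + 1 <= d <= last_day for d in days[1:])


# One subject with codes on days 10 and 50, observed through day 400.
subject = [10, 50]
results = {w: qualifies(subject, 400, w) for w in (None, 30, 90, 365, math.inf)}
```

In this example the subject meets the broad algorithm and every narrow algorithm except the 1-30 day one, because the second code arrives 40 days after the first.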
We also examined the proportion of immortal time relative to time-at-risk (TAR) for four outcomes. The TARs were: 0-30 days after the first condition occurrence (the index date), 0-90 days post-index, 0-365 days post-index, and 0-1095 days post-index. Performance of algorithms for chronic health conditions was estimated using PheValuator (V2.1.4) from the OHDSI toolstack. Immortal time was calculated as the time from the index date until the first of the following: 1) the outcome; 2) the end of the outcome TAR; 3) the occurrence of the second code for the chronic health condition.
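The immortal time rule above can be sketched for a single subject as follows. This is an illustrative sketch under stated assumptions: the names `immortal_days` and `immortal_proportion` are hypothetical, days are counted relative to the index date (day 0), and the proportion is taken against the nominal TAR length for one subject, which may differ from how the study aggregated across subjects.

```python
from typing import Optional


def immortal_days(tar_end_day: int,
                  outcome_day: Optional[int] = None,
                  second_code_day: Optional[int] = None) -> int:
    """Days from index (day 0) until the first of: the outcome, the end of
    the TAR, or the second condition code. None means the event never occurred."""
    candidates = [tar_end_day]
    if outcome_day is not None:
        candidates.append(outcome_day)
    if second_code_day is not None:
        candidates.append(second_code_day)
    return max(0, min(candidates))


def immortal_proportion(tar_end_day: int,
                        outcome_day: Optional[int] = None,
                        second_code_day: Optional[int] = None) -> float:
    """Assumption: proportion is immortal days divided by nominal TAR length."""
    return immortal_days(tar_end_day, outcome_day, second_code_day) / tar_end_day


# A subject whose second code arrives on day 45 and who has no outcome.
short_tar = immortal_proportion(30, second_code_day=45)    # whole TAR is immortal
long_tar = immortal_proportion(1095, second_code_day=45)   # 45 of 1095 days
```

For this subject the entire 30-day TAR is immortal, whereas only 45 of the 1095 days (about 4%) are immortal in the longest TAR.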
Results
In the first analysis, the narrow phenotype algorithms, i.e., those requiring a second condition code, produced higher estimates for PPV and lower estimates for sensitivity compared to the single code algorithm. For all conditions, increasing the time allowed for the required second code increased the sensitivity of the algorithm.
In the second analysis, the amount of immortal time increased as the window used to identify the second diagnosis code increased. The proportion of TAR that was immortal was higher in the 30-day TAR analyses than in the 1095-day TAR analyses.
Conclusion
Requiring a second code is a potentially valid approach to increasing the specificity of a health condition algorithm, albeit at the cost of incurring immortal time.