Abstract
Background SLE is difficult to diagnose because of the diverse manifestations occurring over time and across care sites. Electronic health records (EHR) present a rich source of patient information which can be mined for diagnosis and identification to improve quality of care or to enrol patients in studies.
Aim Build a rule-based algorithm for each revised 1982/1997 ACR classification criteria for SLE using EHR data.
Methods We included patients from the Chicago Lupus Database (CLD) fulfilling 4 or more of the ACR classification criteria for SLE who also had records in the Northwestern Medicine Electronic Data Warehouse (NMEDW) EHR. ICD-9 codes and lab test results for each ACR SLE criterion were ascertained. We queried patient diagnoses, lab results and used a simple chart string for lab test results from physician notes.
Results Data from 515/783 patients in CLD and the NMEDW EHR were included. When using ICD 9 codes only 8.8% of patients from CLD/NMEDW were identified. With the addition of lab results to the query concordance increased to 54.7%, and a simple text string query to search physician notes for additional lab results increased identification to 57.5%.
Conclusion Using ICD codes plus laboratory data from NMEDW increased fulfilment of classification criteria but is still not optimal for patient identification. Additional strategies such as using natural language processing (NLP) or examining fulfilment of SLICC classification criteria for SLE which includes more lab results than ACR may yield an improved rule-based algorithm for the identification of SLE patients in EHR data.