Abstract
Background Telemedicine (TM) became central to rheumatology care during the COVID-19 pandemic. Accumulating evidence suggests high acceptance, satisfaction, and feasibility of TM. There is a paucity of data on the use of TM in systemic lupus erythematosus (SLE). Due to the complexity of SLE outcome measures, clinicians and clinical trialists have raised concerns about the accuracy of TM-derived disease activity measures. This study aims to evaluate the level of agreement between physician-assessed virtual and face-to-face SLE outcome measures. Here we describe the study design and data on the first 50 participants evaluated.
Purpose To investigate whether physician assessments of SLE disease activity obtained during TM visits are comparable with those obtained during face-to-face (F2F) visits.
Methods This is an observational, longitudinal study of 200 SLE participants with varying levels of disease activity from 4 academic lupus centers serving diverse populations. The study is supported by the US Department of Defense. Each study participant is evaluated at 2 visits (baseline and a follow-up visit) as dictated by usual care. Virtual physical exam guidelines were established, and rely on physician-directed patient self-examination of major organ systems. At each visit, participants are evaluated by the same physician first via videoconference-based TM immediately followed by a F2F encounter. SLE disease activity measures (BILAG, hybrid SLEDAI, PGA, LFA-REAL™, CLASI, Swollen and Tender Joint Count [TSJC] and CGIC) are completed after the TM encounter and repeated after the F2F encounter. Tandem physician and participant feedback tools for TM and F2F encounters assess satisfaction, comfort, and which portion of the physical exam was difficult to evaluate virtually. In a pre-planned interim analysis of data from the first 50 participants, the degree of agreement between TM and F2F disease activity measures was analyzed using the paired-T-test and intra-class correlations (ICC). Bland-Altman plots of the differences between TM and F2F and scatter plots were also generated.
Results 50 participants were enrolled, 25 completed the follow-up visit. The baseline characteristics are summarized in table 1, 82% women, mean age 38.9 ± 13. The current enrollment spans a wide range of physician determined categories of disease activity (25% inactive, 56% mild/moderate, 18% severe). The study population is racially and ethnically diverse. The mean differences between TM and F2F in various disease activity measures showed that TM tended to slightly underestimate disease activity, but the differences were not statistically significant (table 2).
Estimated ICC were between 0.87 and 0.99, showing a high level of correlation between TM and F2F measures. There were 10 SLEDAI item discrepancies (5 arthritis; 3 rash; 1 alopecia; 1 pleurisy) and 11 BILAG domain discrepancies (3 constitutional; 1 mucocutaneous; 6 musculoskeletal; 1 cardio). The Bland Altman plots of TM-F2F differences and scatter plots also indicate substantial agreement, although a few outliers were observed (figure 1). Differences were largest for swollen joint counts. Physicians reported high levels of satisfaction (highly satisfied or satisfied) for 42 (84%) of the telemedicine visits. In 44 (88%) of the TM visits, physicians felt they were able to satisfactorily address the issues and concerns that prompted the visit. The physicians were unambiguous that they could adequately assess the participant’s disease activity in 30 (60%) of the TM visits, were unsure for 17 (34%), and reported not being able to assess the participant’s disease activity in 3(6%) of visits. Following the F2F encounter, the physicians confirmed that their virtual encounter assessments were accurate for 47 (94%) of visits.
Conclusion These interim data show a high level of agreement between the virtual and F2F disease activity measures. Discrepancies will be further probed to better understand potential areas for improvement. An additional 150 participants will be enrolled in this study to provide the rigorous quantitative and qualitative data on the comparability between virtual and F2F disease activity measures needed to promote confidence in and acceptance of TM in lupus clinical care and research.