Discussion
This retrospective study investigated the diagnostic efficacy of three classification criteria, that is, ACR-1997, SLICC-2012 and EULAR/ACR-2019, for SLE in a Chinese single-centre cohort. The findings in the current study demonstrated great classification abilities of the three criteria; and the SLICC-2012 and EULAR/ACR-2019 had the highest accuracy, followed by ACR-1997. Of note, the SLICC-2012 and EULAR/ACR-2019 could achieve earlier diagnosis, especially the latter one. In addition, by adjusting the classification threshold of EULAR/ACR-2019 from 10 to 12, the specificity and accuracy were improved.
In our study, there were statistically significant differences in gender composition and age distribution between the groups. The reasons for it were manifold and varied, but the one that dominated was gender and age difference of high-risk population in different diseases. Namely, the incidence of SLE in the case group was women of childbearing age.1 19–23 And in contrast, the proportion of patients with rheumatoid arthritis in the control group was large, and the incidence of rheumatoid arthritis was in middle-aged and elderly women24–28; furthermore, the control group included ankylosing spondylitis which was commonly seen in young men.29–31 In order to compare the performance of classification criteria in different gender, age and course of disease, and to eliminate the impact of differences between the case and control group, the groups were further stratified according to gender and age, and then the sensitivity and specificity of classification criteria were compared. We found the changes on sensitivity and specificity of the three sets of classification criteria in different subgroups had the same upward or downward trend. Therefore, the impact on the differences of gender composition and age distribution caused by the selection of the control group was considered little in the overall analysis.
Our study demonstrated that the clinical item of discoid erythema in ACR-1997 and chronic cutaneous lupus in SLICC-2012 was not significantly different between the case group and the control group. The finding rests chiefly on the fact that the number of patients with discoid erythema in the case group and the control group was too small to obtain statistically significant results. In the follow-up study, comparison of this item can be achieved by expanding the sample size.
Previous studies noted that the EULAR/ACR-2019 criteria had great operating characteristics (sensitivity and specificity) in patients diagnosed as SLE.12 15 32 Johnson et al reported that the sensitivity and sensitivity of EULAR/ACR-2019 were 96% and 90%.12 A Malaysian study published recently reported that the sensitivity of EULAR/ACR-2019 classification standard was 91% and the specificity was 94%.15 In the present study, EULAR/ACR-2019 classification also presented the great diagnostic performance for SLE, with sensitivity, specificity, and accuracy of 97.4%, 91.4%, and 94.3%, respectively.
According to the results of our study, the AUCs (95% CI) of ACR-1997, SLICC-2012 and EULAR/ACR-2019 were 0.972 (0.962 to 0.982), 0.986 (0.979 to 0.993) and 0.983 (0.973 to 0.992), respectively. This proved that the diagnostic results of the three sets of classification criteria were highly similar to the results of clinical diagnosis (the gold standard adopted in our study). In the case group of this study, rheumatologists often referred to ACR-1997 in the process of clinical diagnosis. Therefore, the high AUC score of SLICC-2012 and EULAR/ACR-2019 had little to do with the reference use in the diagnosis process.
The qualitative results of ANA in our study came from immunology laboratory of the Second Affiliated Hospital of Suzhou University, where indirect Hep-2 immunofluorescence was used to calibrate ANA titre ≥1:100 as positive results, which was consistent with the corresponding item requirements of ACR-19975 and SLICC-2012,6 but the entry criterion of EULAR/ACR-2019 is ‘ANA at a titre of ≥1:80’.7 Theoretically, the definition of positive ANA following EULAR/ACR-2019 (ANA ≥1:80) may make the sensitivity of EULAR/ACR-2019 higher than the actual value in our study (ANA ≥1:100).
Notably, given that consensus has not yet been reached on standardised definition of ‘early’ SLE, the patients with SLE in the case group were stratified according to the course of disease (≤1 year, >1 year and ≤3 years, >3 years and ≤5 years, >5 years).Our study found that among the patients with a course of disease of no more than 5 years, the sensitivities of SLICC-2012 and EULAR/ACR-2019 were better, reaching 100%. But in the subgroup with longer course of disease, the sensitivities of the two sets of classification criteria decreased, and still higher than 95%. The sensitivities of SLICC-2012 and EULAR/ACR-2019 in the diagnosis of patients with early SLE were higher than that of late SLE. It may be that some patients with late SLE cannot accurately recall the specific situation in the early stage of onset at the last follow-up, resulting in the loss of some data and the inconsistency of classification criteria. These important data could lay the foundation for the performance of clinical trials and observational studies on SLE.
Given that the disease status of SLE may have lasted for a period of time before classification, it is very important to accurately identify patients in the early stage of onset, which is beneficial to early intervention.33–38 As such, the present study also compared the classification time of the three sets of classification criteria and the time of clinical diagnosis. The results showed that EULAR/ACR-2019 and SLICC-2012 could achieve earlier diagnosis than ACR-1997; especially EULAR/ACR-2019 was the best one (median time to classification: 0.13 vs 0.50 vs 1.00 months, respectively, p=0.001).
Previous studies also pointed out that SLICC-2012 and EULAR/ACR-2019 performed well in identifying early SLE.7 15 39 Adamichou et al recruited patients with SLE with a course of <3 years to compare the early diagnostic ability of the classification criteria. In the early stage of the disease, compared with the ACR-1997, the sensitivity of SLICC-2012 and EULAR/ACR-2019 was significantly increased (79.9% vs 91.4% vs 87.3%); and the median classification times were 12.1, 9.1 and 9.1 months, respectively.10 Selvananda and Kan included patients with SLE with a course of no more than 1 year when investigating the early manifestations of the disease. The sensitivity of ACR-1997, SLICC-2012 and EULAR/ACR-2019 was 86%, 98%, and 94%, respectively; and the sensitivity of SLICC-2012 was significantly higher than that of ACR-1997.15
In a multicentre study evaluating the causes of death in 222 patients with SLE, 31% of the deaths were attributed to active organ involvement and 33% to infection, of which the involvement of the kidney and central nervous system was the most common major cause of death.40–43 Previous research data showed that even before the classification criteria were fully met, a certain proportion of patients with SLE may have organ-dominant or organ-localised diseases, especially the nervous system, kidney and haematological.44 45 In view of this, in order to better diagnose renal-dominant SLE, SLICC-2012 first introduced renal involvement as a sufficient condition for independent diagnosis,6 and EULAR/ACR-2019 increased the weight of item with renal involvement.7 Our study showed that in all subgroups, the number of patients whose classification time of SLICC-2012 or EULAR/ACR-2019 was earlier than that of clinical diagnosis was greater than that of ACR-1997. Whether it is skin mucosal lesions, serositis, arthritis, or the involvement of important organs such as haematological and kidney, SLICC-2012 and EULAR/ACR-2019 showed better early diagnosis ability.
In addition, this study improved the accuracy of EULAR/ACR-2019 by adjusting the classification threshold. When the classification threshold was set to 12, the accuracy of the classification criteria was the highest (95.0%), which was the same as that of SLICC-2012 (95.0%). The reason why the EULAR/ACR-2019 classification threshold at the highest accuracy of this classification in this study was inconsistent with the formulated classification threshold may be ethnic differences. The majority of patients in the case group and control group analysed in the formulation of EULAR/ACR-2019 classification criteria were white, with low proportion of Asian population.7 12 Previous studies involving multiethnic differences have shown that the characteristics of SLE in Asian people were different from those in other ethnic groups indeed. A meta-analysis published in 2021 reported that Asian patients with SLE were more likely to develop Malar rash, leucopenia and thrombocytopenia than non-Asian patients. There was no difference in renal and nervous system performance between Asians and non-Asians. Photosensitivity and discoid rash were less observed in Asian patients with SLE. Compared with Indian patients with SLE, Chinese patients with SLE had a lower incidence of oral ulcer.46 What is more, DeQuattro et al pointed that the disease severity index of SLE in Asian patients was significantly higher than that in white patients, but similar to that in black and Hispanic patients.47 By adjusting the classification threshold of the criteria, it may also improve the diagnosis accuracy of patients with SLE in China. We also look forward to the further verification and comparison of the classification criteria from other centres in Asia.
There are some limitations in the present study. This is a single-centre retrospective study with its inherent defect, including selection bias. A large multicentre prospective study is required for further study. Remarkably, studies pertinent to Chinese patients are still scarce to the best of our knowledge. As such, the current study compensated for a lack of data on the clinical utility of three classification criteria, that is, ACR-1997, SLICC-2012 and EULAR/ACR-2019, for SLE in the Chinese cohort.
In conclusion, both the SLICC-2012 and EULAR/ACR-2019 had optimal diagnostic efficacy in identifying Chinese patients with SLE, whether in the overall patients or the subgroup stratified by age, gender and course of disease. Furthermore, SLICC-2012 and EULAR/ACR-2019 showed better early diagnosis ability, especially the latter one. By adjusting the classification threshold of EULAR/ACR-2019 from 10 to 12, the diagnostic accuracy and specificity can be improved.