PT - JOURNAL ARTICLE AU - Melissa J Fazzari AU - Marta M Guerra AU - Jane Salmon AU - Mimi Y Kim TI - Adverse pregnancy outcomes in women with systemic lupus erythematosus: can we improve predictions with machine learning? AID - 10.1136/lupus-2022-000769 DP - 2022 Sep 01 TA - Lupus Science & Medicine PG - e000769 VI - 9 IP - 1 4099 - http://lupus.bmj.com/content/9/1/e000769.short 4100 - http://lupus.bmj.com/content/9/1/e000769.full SO - Lupus Sci Med2022 Sep 01; 9 AB - Objectives Nearly 20% of pregnancies in patients with SLE result in an adverse pregnancy outcome (APO). We previously developed an APO prediction model using logistic regression and data from Predictors of pRegnancy Outcome: bioMarkers In Antiphospholipid Antibody Syndrome and Systemic Lupus Erythematosus (PROMISSE), a large multicentre study of pregnant women with mild/moderate SLE and/or antiphospholipid antibodies. Our goal was to determine whether machine learning (ML) approaches improve APO prediction and identify other risk factors.Methods The PROMISSE data included 41 predictors from 385 subjects; 18.4% had APO (preterm delivery due to placental insufficiency/pre-eclampsia, fetal/neonatal death, fetal growth restriction). Logistic regression with stepwise selection (LR-S), least absolute shrinkage and selection operator (LASSO), random forest (RF), neural network (NN), support vector machines (SVM-RBF), gradient boosting (GB) and SuperLearner (SL) were compared by cross-validated area under the ROC curve (AUC) and calibration.Results Previously identified APO risk factors, antihypertensive medication use, low platelets, SLE disease activity and lupus anticoagulant (LAC), were confirmed as important with each algorithm. LASSO additionally revealed potential interactions between LAC and anticardiolipin IgG, among others. SL performed the best (AUC=0.78), but was statistically indistinguishable from LASSO, SVM-RBF and RF (AUC=0.77 for all). LR-S, NN and GB had worse AUC (0.71–0.74) and calibration scores.Conclusions We predicted APO with reasonable accuracy using variables routinely assessed prior to the 12th week of pregnancy. LASSO and some ML methods performed better than a standard logistic regression approach. Substantial improvement in APO prediction will likely be realised, not with increasingly complex algorithms but by the discovery of new biomarkers and APO risk factors.Data are available on reasonable request. Clinical datasets from PROMISSE will be made available to the community pursuant to written request in the form of a one page protocol as well as an NIH CV. Such inquiries will be reviewed by the Steering Committee. If the majority of the Steering Committee considers the project meritorious, approved investigators will be provided with de-identified data in Excel-compatible and SAS-compatible formats. Investigators with whom data are shared will be encouraged to acknowledge the source of the data.