Skip to main content

Peripheral blood cytokine profiles predict the severity of SARS-CoV-2 infection: an EPIC3 study analysis

Abstract

Background

Predicting which patients will develop severe COVID-19 complications could improve clinical care. Peripheral blood cytokine profiles may predict the severity of SARS-CoV-2 infection, but none have been identified in US Veterans.

Methods

We analyzed peripheral blood cytokine profiles from 202 participants in the EPIC3 study, a prospective observational cohort of US Veterans tested for SARS-CoV-2 across 15 VA medical centers. Illness severity was assessed based on the highest level documented during the first 60 days after recruitment. We correlated cytokine levels with illness severity using LASSO logistic regression, random forest, and XGBoost models on a 70% training set and calculated the AUC on a 30% test set.

Results

LASSO regression identified 6 cytokines as predictors of SARS-CoV-2 severity with 77.3% AUC in the test set. Random forest and XGBoost models achieved an AUC of 80.4% and 80.7% in the test set, respectively. All models assigned a feature importance to each cytokine, with IP-10, MCP-1, and HGF consistently identified as key markers.

Conclusions

Cytokine profiles are predictive of SARS-CoV-2 severity in US Veterans and may guide tailored interventions for improved patient management.

Peer Review reports

Introduction

COVID-19, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is an infectious disease that has achieved global reach. The COVID-19 pandemic has had a significant impact on the United States, resulting in over 1.1 million fatalities across the country [1]. Within the Veterans Health Administration (VHA), which is among the largest integrated health care systems in the U.S., over 870,000 infections with SARS-CoV-2 and approximately 24,000 deaths were attributed to COVID-19 [2,3,4].

The clinical manifestations of COVID-19 are diverse. Although most patients experience symptoms that range from mild to moderate, approximately 15% of patients develop severe pneumonia, and of these, around 5% require admission to the intensive care unit (ICU) due to acute respiratory distress syndrome (ARDS), septic shock, or multiple organ dysfunction, often resulting in high fatality rates [5]. Two significant prognostic challenges are the insufficient identification of key cytokines associated with lethal outcomes and the difficulty in predicting which patients are at increased risk of developing severe illness and death [6].

Individuals suffering from severe SARS-CoV-2 infection often develop cytokine storm, which is a prominent feature that is linked with poor clinical outcomes and multiple organ dysfunction syndrome [7]. Recent research has identified cytokines such as MCP-3, IP-10, and IL-6 as reliable indicators for the advancement of COVID-19 [6]. Although current hospital diagnostics and variables like comorbidities and age help assess COVID-19 severity, they have limited ability to capture immune response heterogeneity, which is a key driver of disease progression. Severe SARS-CoV-2 infection marked by hyperinflammation and immune exhaustion can precede clinical deterioration, such as hypoxemia and organ failure. Cytokine profiling enables earlier risk detection and may allow for timely intervention. Moreover, unlike static risk factors, cytokine-based models can offer more personalized risk assessment and help guide targeted treatments. Identifying cytokine biomarkers for severe SARS-CoV-2 infection may enhance clinical decision-making and patient care by enabling early identification and tailored intervention for those at greatest risk [8].

The EPIC3 (Epidemiology, Immunology, and Clinical Characteristics of COVID-19) study, conducted within the Veterans Health Administration (VHA), is dedicated to detailing the epidemiological patterns and the natural progression of SARS-CoV-2 among the Veteran population. It further seeks to evaluate the relationship between host and viral elements and the intensity of the infection, as well as the emergence of immunity over time. In this manuscript, we present findings concerning the highest level of illness severity within the initial 60 days after recruitment. Participating Veterans contributed data by completing questionnaires, either conducted as interviews or filled out by the Veterans themselves, and by providing biospecimens; clinical data assessment was conducted accessing comprehensive VHA Electronic Health Records (EHR) [9]. Utilizing a subset of the broader EPIC3 study participants and leveraging the combination of the questionnaire, EHR, and biospecimen data, we identified a host cytokine profile that independently predicted SARS-CoV-2 infection severities.

Methods

Registration

The EPIC3 study is registered on ClinicalTrials.gov (NCT number: NCT05764083). More details regarding the registration can be found at https://clinicaltrials.gov/study/NCT05764083#more-information.

Study design

EPIC3 is a prospective, observational cohort study. From July 2020 to September 2022, it enrolled Veteran inpatient and outpatient participants across 15 Veterans Affairs medical facilities. The EPIC3 data, including questionnaires and biospecimens, were systematically gathered at baseline (day 0) and then, when possible, on days 3, 7, 14, 21, and 28, followed by the 3rd, 6th, 12th, 18th, and 24th months post-enrollment. This analysis uses risk prediction modeling to identify cytokine biomarkers for illness severity among participants who tested positive for COVID-19.

Study population

The inclusion criteria for the study stipulate that participants must be aged 18 or older, classified as an inpatient or outpatient at one of the participating Veterans Affairs medical centers from June 2020 to September 2022, and have undergone a SARS-CoV-2 RT-PCR test within three weeks prior to recruitment. From the pool of invited Veterans, 60% agreed to enroll, and 21% of these participants had research blood drawn to undergo a comprehensive panel of 45 cytokines assessed using the Luminex platform. Participants or their authorized representatives provided informed consent. The study was reviewed and approved by the VA Central IRB. The criteria for our analysis were not based on a target number of samples, rather the sample size was limited by the number of samples available in the biorepository that fit the pre-defined criteria. Thus, our study sample included those with a positive SARS-CoV-2 RT-PCR test result at enrollment and baseline measurements of 45 cytokines.

Exposures

Participants with positive test outcomes of SARS-CoV-2 RT-PCR tests at recruitment were chosen for this analysis. Their classification as either inpatients or outpatients was established at the baseline. Demographic data, including age, sex, and race/ethnicity, were gleaned from the initial questionnaires or EHR. We calculated participants’ Charlson comorbidity index (CCI), a composite measure of medical comorbidities, using the method detailed by Quan and colleagues, with data from the EHR from the 2 years before enrollment [10].

A comprehensive panel of 45 cytokines was assessed using the Luminex platform (Vendor: Luminex Corporation) at PHRL. These cytokines included BDNF, EGF, Eotaxin, FGF-2, GM-CSF, GRO-α, HGF, IFN-α, IFN-γ, IL-1α, IL-1β, IL-1RA, IL-2, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12(p70), IL-13, IL-15, IL-17A, IL-18, IL-21, IL-22, IL-23, IL-27, IL-31, IP-10, LIF, MCP-1, MIP-1-α, MIP-1-β, NGF-β, PDGF-BB, PlGF-1, RANTES, SCF, SDF-1-α, TNF-α, TNF-β, VEGF-A, and VEGF-D.

Outcome

One outcome assessed for all study participants was the degree of illness severity. This metric was determined by the highest level of severity a participant experienced within the first 60 days following their entry into the study. Severity was measured using the Veterans Affairs Severity Index for COVID-19 (VASIC), which is an exclusive 4-tier scale (mild, moderate, severe, or death) derived from the World Health Organization’s COVID-19 severity scale. This scale was applied to EHR data, and its accuracy was confirmed through medical record review [11]. Mild severity encompassed participants who were in the hospital for 24 h or less; moderate severity was for those hospitalized for more than 24 h and included cases requiring low-flow oxygen therapy; severe cases were those needing high-flow oxygen, intubation, mechanical ventilation, extracorporeal membrane oxygenation, vasopressors, or initiation of renal dialysis within 60 days post-diagnosis; and any deaths occurring within 60 days, irrespective of cause, were classified under the death category.

In this analysis, the participants were stratified based on their SARS-CoV-2 infection severity into two categories: mild/moderate and severe/death. This approach addresses the limited sample size and simplifies the task of differentiating levels of COVID-19 severity. Moreover, combining'severe'and'death'categories addresses the issue of class imbalance due to the rarity of death events, enhancing the models’ ability to identify patterns predicting severe outcomes.

Statistical analyses

The count and percentage of the participants’ baseline characteristics in each age, CCI, race/ethnicity, cohort (inpatient vs. outpatient), and sex categories were calculated, with each category further stratified by the severity of SARS-CoV-2 infection. The age groups were defined as: < 30, 30–39, 40–49, 50–59, 60–69, 70–79, and > = 80 years. In this study, participants were categorized into four CCI groups: 0, 1–2, 3–4, and 5+. The race and ethnicity of the participants were combined into a single race/ethnicity variable, that included the categories: Hispanic, Non-Hispanic Black, Non-Hispanic White, and Other. The cohort category identified a participant as either inpatient or outpatient, and the sex of participants was classified as female or male.

We also compared the baseline cytokine responses between the mild/moderate and severe/death groups. The results were presented as median and interquartile range (IQR). We calculated the percentage of out-of-detection-limit values for each cytokine. We divided participants into a 70% training set and a 30% test set using randomization stratified by the severity of their SARS-CoV-2 infection. We assessed whether the training and test sets were comparable in terms of the severity of SARS-CoV-2 infection and baseline characteristics. Then we conducted univariate analysis in the training set to examine the ability of each cytokine response to differentiate between mild/moderate vs severe/death cases by using the Receiver Operating Characteristic Curve (ROC) and the area under the ROC curve (AUC). We also calculated the sensitivity, specificity, Youden index, and best cut-off value for each model. After the univariate analysis, we chose the cytokines with AUC’s 95% confidence interval (CI) lower boundary exceeding 0.5 (random guess) to be included in the risk prediction models. Additionally, we evaluated the ability of CCI alone to distinguish between mild/moderate and severe/death cases using ROC and AUC analysis.

We leveraged the Pearson’s correlation coefficient (r) to investigate the pairwise correlations between the cytokine responses selected to be included in the risk prediction model. Correlations were visualized using a heatmap, with stronger correlations (r closer to 1) represented in shades closer to red and weaker correlations (r closer to 0) appearing closer to blue. We hoped to discern the intercorrelation patterns and potential clustering among the cytokine responses.

We aimed to construct prediction models using the selected cytokine responses to predict the risk of having SARS-CoV-2 infection severity of severe/death as opposed to mild/moderate. We did not incorporate other variables such as age, sex, or comorbidities/CCI into the prediction models because our objective was to assess cytokines as independent predictors of SARS-CoV-2 severity. Additionally, we also hoped to identify which variables contribute the most to the prediction of the severity of SARS-CoV-2 infection. However, given the intercorrelated nature of the cytokine responses, building prediction models using all the selected cytokine responses may lead to overfitting problems. Therefore, we decided to employ a penalized regression method, the least absolute shrinkage and selection operator (LASSO) logistic regression, to perform variable selection and regularization to achieve better model performance [12].

To construct the LASSO model, we logarithmically transformed the cytokine concentrations. The optimal penalty parameter was determined via tenfold cross-validation and then used to construct the LASSO model in the training set [13]. The discriminative ability of the LASSO model in the test set was evaluated using the ROC and AUC [14]. We assessed the calibration of the model using the Hosmer–Lemeshow tests. LASSO regression was conducted using the GLMNET package in R. Cytokine responses chosen through LASSO regression were presented with the absolute coefficient value for each variable indicated.

To offer an alternative and complementary analysis to the LASSO regression, we also leveraged a random forest model [15, 16] and the eXtreme Gradient Boosting (XGBoost) [17] using the selected cytokine responses to predict the severity of SARS-CoV-2 infection. The same training and test sets were used. We ranked the importance of the selected cytokine responses based on their ability to discriminate between patients having severe/death outcome as opposed to mild/moderate outcome. In the random forest model, feature importance was determined by the mean decrease in Gini [15], while in the XGBoost model, it was determined by the mean absolute SHapley Additive exPlanations (SHAP) value [18]. Similarly, we used ROC and AUC to evaluate the discriminative ability of the random forest model and XGBoost model in the test set and presented the most significant cytokine predictors and their feature importance in a table. We also assessed the calibration of the models using the Hosmer–Lemeshow tests.

Finally, we visualized the feature importance of each cytokine in the Lasso, random forest, and XGBoost models to assess the consistency of variable importance between the three methods and examine whether there is evidence that these cytokines were genuine predictors or selected by chance.

Results

Descriptive analysis

The baseline characteristics of the study subjects stratified by the severity of SARS-CoV-2 Infection are presented in Table 1. Age distributions were overall comparable between the mild/moderate and severe/death groups. However, the severe/death group had a higher proportion of participants aged 80 and above (16.7%) and fewer below 40 years (2.4%) than the mild/moderate group (aged 80 and above: 5.6%, below 40 years: 8.8%). A greater percentage of participants in the severe/death group had a CCI of 3 or higher (69.1%), indicating higher disease morbidity at the time of SARS-CoV-2 infection, compared to the mild/moderate group (48.7%). A higher proportion of the severe/death group was inpatients (97.6%) compared to the mild/moderate group (82.5%). The distribution of sexes was consistent across both groups.

Table 1 Baseline characteristics of study population stratified by the severity of SARS-CoV-2 infection

The baseline cytokine responses of the study subjects stratified by the severity of SARS-CoV-2 Infection are presented in Table 2. Several cytokines have more than 40% of their values out of detection limits, including FGF-2, IL-1β, IL-4, IL-5, IL-6, IL-9, IL-12p70, IL-21, IL-22, IL-23, IL-27, IL-31, NGF-β, TNF-β, and VEGF-D. Overall, these data suggest that individuals with different illness severity display distinct cytokine expression patterns at the time of presentation.

Table 2 Cytokine levels of study population stratified by the severity of SARS-CoV-2 infection

The training and test sets were comparable in terms of SARS-CoV-2 severity and baseline characteristics, as evidenced by similar distributions across groups (Supplementary Table 1).

Using these data, we identified 10 cytokines that differentiated illness severity in a univariate analysis (Table 3). We selected cytokines with AUC 95% CI lower bound > 0.5, which indicates significant predictive power over chance. IP-10 has the highest AUC (0.707, 95% CI: 0.622–0.791), followed by HGF (0.649, 95% CI: 0.554–0.744) and MCP-1 (0.635, 95% CI: 0.534–0.736). The 10 cytokines in Table 3 were selected to be included in the risk prediction models. These data suggest that certain cytokines have significant predictive power in distinguishing between different severities of SARS-CoV-2 infection. In comparison to these cytokines, using CCI alone achieved an AUC of 0.603 (95% CI: 0.511–0.696), which is less than or at most comparable to the predictive performance of individual cytokines such as IP-10, HGF, and MCP-1.

Table 3 ROC curve analysis by cytokines to predict the severity of SARS-CoV-2 infection

We also compared the correlation between selected cytokines of interest from the univariate analysis (Fig. 1). We identified two clusters of intercorrelated cytokines, as indicated by the two predominant red blocks in the heatmap. In the larger red block, IL-27, IL-1RA, IL-15, MCP-1, VEGF-A, HGF, and IP-10 exhibited strong correlations with each other. Meanwhile, in the smaller red block, IL-2 and GM-CSF were strongly correlated.

Fig. 1
figure 1

Heatmap of correlations between 10 cytokines with individual AUC 95% CI lower bounds exceeding 0.5

LASSO regression

Using LASSO regression, we identified a set of cytokines effective for outcome prediction. Six cytokines were selected by the LASSO model, based on their absolute regression coefficient values, with the strongest predictive power provided by HGF (0.28), followed by MCP-1 (0.19), IP-10 (0.16), IL-2 (0.12), VEGF-A (0.04), and IL-17A (0.04) (Fig. 3a). These cytokines predict outcomes in this initial model with an AUC for the test set of 0.773 (Fig. 2, 95%CI: 0.640–0.967). The Hosmer–Lemeshow test p-value was 0.263 (Table 4).

Fig. 2
figure 2

ROC curves for the LASSO, RF, and XGBoost models

Table 4 ROC curve analysis and Hosmer–Lemeshow p-value by models to predict the severity of SARS-CoV-2 infection

Random forest model and XGBoost model

To further bolster our statistical conclusions, we performed random forest and XGBoost modeling to predict outcomes using the same data. The random forest model identified a set of cytokines with similar predicted test effectiveness as in LASSO regression (Fig. 2, AUC 0.804, 95%CI: 0.601–0.944). The feature importance of each cytokine is presented as the mean decrease in Gini (Fig. 3b). Likewise, XGBoost identified a cytokine set with similar performance (Fig. 2, AUC 0.807, 95%CI: 0.656–0.959). Here, the feature importance of each cytokine is presented as mean absolute SHAP value (Fig. 3c). In the random forest model, IP-10 has the highest feature importance, followed by VEGF-A, MCP-1, HGF, IL-1-RA, IL-15, IL-17A, GM-CSF, IL-27, and IL-2. In the XGBoost model, VEGF-A has the highest feature importance, followed by IP-10, MCP-1, HGF, IL-1-RA, IL-17A, IL-15, IL-27, IL-2, and GM-CSF. The Hosmer–Lemeshow test p-values for the random forest and XGBoost models were 0.155 and 0.218, respectively (Table 4). The AUC, sensitivity, specificity, Youden index, and Hosmer–Lemeshow test p-values for the LASSO, RF, and XGBoost models are detailed in Table 4.

Fig. 3
figure 3

Feature importance of each cytokine in the risk prediction models. A LASSO model. B RF model. C XGBoost model

Discussion

In this study, we identified a set of peripheral blood cytokines that effectively predict the severity of SARS-CoV-2 infection in a Veteran population. The AUC for the LASSO model is greater than 0.7, which is considered acceptable, while the AUC values for the random forest and XGBoost models exceed 0.8, indicating excellent discriminative ability. In addition, the models also demonstrate good calibration, as reflected by their Hosmer–Lemeshow p-values being greater than 0.05. The consistency of the feature importance of each cytokine in the three models is particularly noteworthy. Specifically, cytokines such as Interferon gamma-induced protein 10 (IP-10), Vascular Endothelial Growth Factor A (VEGF-A), Monocyte Chemoattractant Protein 1 (MCP-1), and Hepatocyte Growth Factor (HGF) were repeatedly highlighted as important predictors, suggesting their robust role in the predictive models. This consistency across multiple models suggests that cytokine profiles may predict the severity of SARS-CoV-2 infection after validation in larger and more diverse cohorts.

The peripheral blood cytokines identified to be key risk predictors for the severity of SARS-CoV-2 infection in our study align with observations from several studies that also identified many of the same cytokines. Yang et al. reported that plasma IP-10 and MCP-3 levels are highly associated with illness severity and predict the progression of COVID-19 [6]. Similarly, Chen et al. identified IP-10 and MCP-1 as key biomarkers for COVID-19 severity [19]. Furthermore, Perreau et al. demonstrated that the cytokines HGF and CXCL13 are predictive of both the severity and mortality in COVID-19 patients [20]. Understanding these immune patterns may provide improved clinical decision-making at the individual level and improved resource allocation at a population level. Of the 10 cytokines identified as individual predictors, several (IP-10, IL-2, IL-15, IL-17A) are dependent on effective adaptive T cell responses to reduce inflammation and severity of disease. These echo multiple studies, including predictive models [21], observations that T cell memory responses to SARS-CoV-2 reduce the severity of disease in convalescent individuals [22] and after vaccination across serotypes [23, 24]. These data provide further evidence of the key importance of SARS-CoV-2 vaccination in preventing mortality from SARS-CoV-2 in our US Veteran population.

This study has several strengths. First, we utilized a comprehensive panel of 45 cytokines, allowing for a thorough investigation of potential biomarkers. This broad approach increases the likelihood of identifying key cytokines for predicting the risk of severe SARS-CoV-2 infection. Second, our study employed a robust methodological framework, utilizing LASSO logistic regression, random forest, and XGBoost models to analyze a complex dataset with multiple predictors. This multi-method approach not only provided a comprehensive analysis but also served to cross-validate our findings, enhancing their reliability. The discriminative ability observed in the test set (77.3%, 80.4%, and 80.7%, respectively) is indicative of the effectiveness of our models. In addition, the study draws data from 15 different VA medical centers across the United States, providing a geographically and racially diverse sample. Enrollment took place from June 2020 to September 2022, and so participants were exposed to a wide range of SARS-CoV-2 variants. Moreover, the VASIC is a validated tool specifically tailored to assess COVID-19 severity, which reduces misclassification and enhances the accuracy of severity assessments. Finally, we had comprehensive data from the medical records on participant characteristics and outcomes, and the multi-point follow-up in the EPIC3 study enables a detailed capture of disease progression over time.

While the study offers valuable insights, certain limitations must be acknowledged. Firstly, the focus on the US Veteran population, while providing detailed insights for this demographic, might limit the generalizability of our findings to the wider population. The diverse health profiles and experiences of Veterans may not fully represent the broader spectrum of patients affected by COVID-19, especially female populations. Additionally, our study population also differs in terms of baseline characteristics from the participants of the broader EPIC3 study. Among those who participated in EPIC3 as inpatient or outpatient participants and had a positive RT-PCR test at baseline, only 25.2% were inpatients, whereas in our study population, where baseline multiplex cytokine measurements were available, 85.6% were inpatients. This discrepancy arises because inpatients are more likely to undergo comprehensive biomarker assessments. In the future, expanding cytokine measurements to a larger and more diverse participant pool will be essential for enhancing the generalizability of our findings. Moreover, the simplification of the Veterans Affairs Severity Index for COVID-19 into a binary outcome may have obscured more subtle gradations in illness severity. This could impact the applicability of our findings to clinical scenarios where such nuances are critical. Another limitation is the dependency on the Luminex assay platform for cytokine measurement, which could pose challenges in replicating our results in settings where different technologies or assays are used. Furthermore, this study only leverages baseline cytokine measurements, and we are not capturing the dynamic changes in cytokine levels that occur over time or at the time of symptom onset, which may affect the performance of risk prediction models.

To address these limitations and build on our findings, future research should focus on a more diverse and representative sample of the population, which would enhance the external validity and applicability of the results. Longitudinal studies would provide valuable insights into the temporal dynamics of cytokine profiles and their correlation with disease progression and recovery. Additionally, incorporating other biomarkers and clinical parameters into the analysis could offer a more comprehensive understanding of COVID-19 and its myriad presentations. Such integrative studies could further refine the predictive models and potentially uncover new therapeutic targets or diagnostic markers.

In conclusion, our study demonstrates that peripheral blood cytokine profiles are effective predictors of SARS-CoV-2 infection severity among US Veterans. Using multiple methods, we identified key cytokines correlated with severe outcomes. Future research should focus on validating these results in larger cohorts and exploring the underlying mechanisms of these cytokines in COVID-19 progression, paving the way for targeted treatment approaches.

Data availability

Participating Veterans contributed data by completing questionnaires, either conducted as interviews or filled out by the Veterans themselves, and by providing biospecimens; clinical data assessment was conducted accessing comprehensive VHA Electronic Health Records (EHR).

References

  1. (CDC) CfDCaP. COVID Data Tracker. Accessed Aug 31, 2024. https://covid.cdc.gov/covid-data-tracker/#datatracker-home

  2. Ioannou GN, Liang PS, Locke E, et al. Cirrhosis and Severe Acute Respiratory Syndrome Coronavirus 2 Infection in US Veterans: Risk of Infection, Hospitalization, Ventilation, and Mortality. Hepatology. 2021;74(1):322–35. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/hep.31649.

    Article  CAS  PubMed  Google Scholar 

  3. Fan VS, Dominitz JA, Eastment MC, et al. Risk Factors for Testing Positive for Severe Acute Respiratory Syndrome Coronavirus 2 in a National United States Healthcare System. Clin Infect Dis. 2021;73(9):e3085–94. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/cid/ciaa1624.

    Article  CAS  PubMed  Google Scholar 

  4. Ioannou GN, Locke E, Green P, et al. Risk Factors for Hospitalization, Mechanical Ventilation, or Death Among 10 131 US Veterans With SARS-CoV-2 Infection. JAMA Netw Open. Sep 1 2020;3(9):e2022310. https://doiorg.publicaciones.saludcastillayleon.es/10.1001/jamanetworkopen.2020.22310

  5. Hu B, Guo H, Zhou P, Shi ZL. Characteristics of SARS-CoV-2 and COVID-19. Nat Rev Microbiol. 2021;19(3):141–54. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41579-020-00459-7.

    Article  CAS  PubMed  Google Scholar 

  6. Yang Y, Shen C, Li J, et al. Plasma IP-10 and MCP-3 levels are highly associated with disease severity and predict the progression of COVID-19. J Allergy Clin Immunol. 2020;146(1):119-127.e4. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jaci.2020.04.027.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Mehta P, McAuley DF, Brown M, Sanchez E, Tattersall RS, Manson JJ. COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet. 2020;395(10229):1033–4. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/s0140-6736(20)30628-0.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. McElvaney OJ, McEvoy NL, McElvaney OF, et al. Characterization of the Inflammatory Response to Severe COVID-19 Illness. Am J Respir Crit Care Med. 2020;202(6):812–21. https://doiorg.publicaciones.saludcastillayleon.es/10.1164/rccm.202005-1583OC.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. (CSP) VCSPEC. CSP #2028: EPIC3 Study. https://www.vacsp.research.va.gov/CSPEC/Studies/INVESTD-R/CSP-2028-EPIC3.asp

  10. Quan H, Sundararajan V, Halfon P, et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005;43(11):1130–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/01.mlr.0000182534.19832.83.

    Article  PubMed  Google Scholar 

  11. Galloway A, Park Y, Tanukonda V, et al. Impact of Coronavirus Disease 2019 (COVID-19) Severity on Long-term Events in United States Veterans Using the Veterans Affairs Severity Index for COVID-19 (VASIC). J Infect Dis. 2022;226(12):2113–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/infdis/jiac182.

    Article  CAS  PubMed  Google Scholar 

  12. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010;33(1):1–22.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Albert A, Lesaffre E. Multiple group logistic discrimination. Statistical Methods of Discrimination and Classification. Elsevier; 1986:209–224.

  14. Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10(3):e0118432.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Breiman L. Random forests Machine learning. 2001;45:5–32.

    Article  Google Scholar 

  16. Qi Y. Random forest for bioinformatics. Ensemble machine learning: Methods and applications. 2012:307–323.

  17. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. presented at: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016; San Francisco, California, USA. https://doiorg.publicaciones.saludcastillayleon.es/10.1145/2939672.2939785

  18. Vimbi V, Shaffi N, Mahmud M. Interpreting artificial intelligence models: a systematic review on the application of LIME and SHAP in Alzheimer’s disease detection. Brain Informatics. 2024/04/05 2024;11(1):10. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s40708-024-00222-1

  19. Chen Y, Wang J, Liu C, et al. IP-10 and MCP-1 as biomarkers associated with disease severity of COVID-19. Molecular Medicine. 2020/10/29 2020;26(1):97. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s10020-020-00230-x

  20. Perreau M, Suffiotti M, Marques-Vidal P, et al. The cytokines HGF and CXCL13 predict the severity and the mortality in COVID-19 patients. Nature Communications. 2021/08/09 2021;12(1):4888. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41467-021-25191-5

  21. Zhang Z, Ai G, Chen L, et al. Associations of immunological features with covid-19 severity: A systematic review and meta-analysis. BMC Infectious Diseases. 2021;21(1). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12879-021-06457-1

  22. Kared H, Redd AD, Bloch EM, et al. SARS-COV-2–specific CD8+ T cell responses in convalescent COVID-19 individuals. Journal of Clinical Investigation. 2021;131(5). https://doiorg.publicaciones.saludcastillayleon.es/10.1172/jci145476

  23. Brasu N, Elia I, Russo V, et al. Memory CD8+ T cell diversity and B cell responses correlate with protection against SARS-COV-2 following mrna vaccination. Nat Immunol. 2022;23(10):1445–56. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41590-022-01313-z.

    Article  CAS  PubMed  Google Scholar 

  24. Shi J, Zheng J, Zhang X, et al. A T cell–based SARS-COV-2 spike protein vaccine provides protection without antibodies. JCI Insight. 2024;9(5). https://doiorg.publicaciones.saludcastillayleon.es/10.1172/jci.insight.155789

Download references

Acknowledgements

The authors of this manuscript would like to thank and acknowledge the following individuals on the CSP #2028/EPIC3 Study for their contribution towards this manuscript:

Study Chairs Jennifer S. Lee, MD, PhD, CSP #2028 Co-Chair Jennifer M. Ross, MD, MPH, CSP #2028 Co-Chair Javeed A. Shah, MD, CSP #2028 Co-Chair

Study Co-Is Mihaela Aslan, PhD, CSP #2028 Co-Investigator Kelly Cho, PhD, MPH, CSP #2028 Co-Investigator J. Michael Gaziano, MD, MPH, CSP #2028 Co-Investigator Mark Holodniy, MD, CSP #2028 Co-Investigator Christine M. Hunt, MD, MPH, CSP #2028 Co-Investigator Anna M. Korpak, PhD, CSP #2028 Co-Investigator Dawn T. Provenzale, MD, MS, CSP #2028 Co-Investigator (former) Christina Williams, PhD, MPH, CSP #2028 Co-Investigator

Baltimore Scientific Mary-Claire Roghmann, MD, MS, Local Site Investigator Karen (KC) Coffey, MD, MPH, Co-Local Site Investigator Leslie (Les) Katzel, MD, PhD, Co-Local Site Investigator Operations Michelle Newman, BSN, Research Coordinator Gwen L. Robinson, MPH, Research Coordinator

Boston Scientific Eric Garshick, MD, MOH, Local Site Investigator Emily Wan, MD, MPH, Co-Local Site Investigator Operations Emma Busenkell, BS, Research Coordinator (former) Selena Chom, MPH, Research Coordinator (former) Christina Collins, MPH, Research Coordinator (former) Colleen Hynes, RN, Research Nurse (former) Demerise Johnston, MPH, Research Coordinator Erin McHugh, BS, Research Assistant (former) Peter Rivoira, BA, NODES Operation Manager Olivia Sterns, BS, Research Assistant (former) John (Jack) Sweeney, BS, Research Assistant (former) Caroline Truland, RN, BSN, BSBA, NODES Research Nurse Makaila Wall, BS, NODES Associate Director of Operations Cathy Zhang, BS, Research Assistant (former)

Cleveland Scientific Federico Perez, MD, MS, Local Site Investigator Robin L.P. Jump, MD, PhD, Co-Local Site Investigator

Robert Bonomo, MD, Co-Investigator David Canaday, MD, Co-Investigator Margaret Tiktin, RN, NP, DNP, Co-Investigator Operations Sara Abdelrahim, MBBS, Research Coordinator (former) Taissa A. Bej, MS, Research Coordinator Janet Briggs, RN, BSN, MSN, Research Coordinator (former) Elizabeth Delancey-Niksa, RN, BSN, Research Nurse (former) Oteshia Hicks, BA, Research Coordinator Corinne Kowal, BS, Research Coordinator Alexandria (Alex) Nguyen, MS, Research Coordinator Lisa Padro, BSN, PMH-BC, Research Coordinator

Dallas Scientific Roger Bedimo, MD, MS, Local Site Investigator Rohit Manaktala, MD, Co-Local Site Investigator Operations Erik Guajardo, BA, CCRP, NODES Quality Assurance Manager Antoinette Hamilton, BS, Research Coordinator (former) Lisa Jones, MS, NODES Quality Assurance Manager (former) Marcia Keller-Ray, Research Coordinator Angela Dela Llana, BSN, RN, Research Coordinator (former) Jacob Mathew, Research Coordinator (former) Jennifer (Jen) McClure, BSN, RN, NODES Associate Director of Operations Erick Meermans, BS, Research Coordinator (former) Erin Messick, MS, Research Coordinator Dindi Moore-Matthews, MS, Research Coordinator (former) Van Nguyen, BS, Research Coordinator (former) Abeer Zein, BS, Research Coordinator

Denver Scientific Lindsay Nicholson, MD, Local Site Investigator Mary Bessesen, MD, Co-Local Site Investigator Operations Rosa Cunningham, LPN, BS, MHA, Research Coordinator Teresa Derian, RN, Research Coordinator (former) Theresa Dunn, MS, Research Coordinator (former) Camila Hanson, BS, Research Coordinator (former) Kelsey Moore, RN, Research Coordinator (former) Kimberly Owens, MPH, CCRC, NODES Associate Director of Operations Cameron Rogowski, BS, Research Coordinator Janel Vigil, RN, BSN, Research Coordinator (former) Anna Wyrwa, RN, BSN, Research Coordinator

Durham Scientific Micah McClain, MD, PhD, Local Site Investigator Ephraim Tsalik, MD, PhD, Local Site Investigator (former) Christopher Woods, MD, MPH, Co-Local Site Investigator James Everhart, DO, Co-Investigator (former) Christopher Hostler, MD, MPH, Co-Investigator Maria Joyce, MD, PhD, Co-Investigator Operations Jack Anderson, BS, Research Assistant (former) Marline (Marlena) Brown, BS, Research Technician Lynette Gehlhausen, RN, BSN, Research Nurse (former) Amanda Hittinger, BSN, RN, Research Nurse (former) Sara Hoffman, RN, BSN, Research Nurse (former) Tyffany (Evans) Locklear, BS, BA, Research Coordinator (former) Maria Miggs, BS, Research Coordinator Deborah Murray, BS, Research Coordinator (former) Bradly (Brad) Nicholson, PhD, Lab Manager Ashlyn Press, MPH, Program Manager (former) Jaspreet Reen, MPH, Program Manager (former) Delisa Robinson, BS, Research Coordinator (former)

Gainesville Scientific Gary Wang, MD, PhD, Local Site Investigator Amy Vittor, MD, PhD, Co-Local Site Investigator Asmita Gupte, MD, Co-Investigator Alaina Ritter, MD, Co-Investigator Operations Leslie Brown, BA, Research Coordinator (former) Tempa Curry, RN, Research Coordinator Laura Dixon, BSN, Research Assistant (former) Jennifer Gollwitzer, MSN, Research Coordinator Rebecca Kokot, Research Assistant (former) Debra Robertson, RN, Research Coordinator (former) Taylor Simon, BS, Research Assistant (former) Juliana Venetucci, MS, Research Assistant (former) Elizabeth Vo, Research Assistant (former)

Little Rock Scientific John Theus, MD, Local Site Investigator Ryan Dare, MD, Co-Investigator Operations Jesse Byrd, BA, Research Coordinator (former) Adam Lallier, CRC, Research Coordinator (former) Kristin Miller, BSN, Research Coordinator (former) Betty Ussery, CCRC, Research Coordinator

Milwaukee Scientific Sheran Mahatme, DO, MPH, Local Site Investigator Nathan Gundacker, MD, Co-Local Site Investigator Javeria Haque, MD, Co-Local Site Investigator Operations Kasey Kallio, MSN, RN, Research Coordinator Julie Rieder, CMA (AAMA), CCRC, NODES Associate Director of Operations Colleen Veenendaal, RN, Research Coordinator Aprille Walker, BA, Research Coordinator

Palo Alto Scientific Harman Paintal, MBBS, Local Site Investigator Elizabeth (Lisa) Le, MD, Co-Local Site Investigator Matthew (Matt) Stevenson, MD, Co-Local Site Investigator Operations Sadaf Ahmed, MPH, Research Coordinator (former) Karen Bratcher, MSN, RN, NODES Associate Director of Operations (former) Ashley Langston, MS, MA, CRC, Research Coordinator (former) Olga Livingston, Research Coordinator Edgardo A. Gamarra Monteverde, MBA, MPH, NODES Associate Director of Operations Elena Nikolaev, NODES Quality Assurance Manager (former) James Quinn, Research Coordinator (former) Ann Roseman, BA, Research Coordinator

Philadelphia Scientific Stuart Isaacs, MD, Local Site Investigator Joshua (Josh) Baker, MD, MSCE, Co-Local Site Investigator Kyong-Mi Chang, MD, Co-Local Site Investigator Jeffrey Doyon, MD, PhD, Co-Investigator (former) Katherine Gardner, MD, Co-Investigator (former) Mary Hofmann, MD, RN, Co-Investigator

Darshana Jhala, MD, Co-Investigator David Stern, MD, Co-Investigator (former) Laura Su, MD, PhD, Co-Investigator Operations David Azizi, BA, Research Coordinator Juliana Bonilla, BA, Research Coordinator (former) Caleigh Doherty, BS, Research Coordinator (former) Rachel Gillcrist, BA, Research Coordinator (former) Criswell Lavery, MA, Research Coordinator (former) Will Leach, MA, Research Coordinator (former) Lynne Mancini, RN, MSN, BSN, Research Coordinator (former) Lizbeth Novelo, BA, Research Coordinator (former) Mariana Olave, BA, Research Coordinator Mary Valiga, RN, Research Coordinator (former) Sarah Wetzel, MPH, BS, Research Coordinator Muhammad Zahid, MD, Research Coordinator (former)

Portland Scientific Christopher (Chris) Pfeiffer, MD, MHS, Local Site Investigator Marissa Maier, MD, Co-Investigator Angela (Holly) Villamagna, MD, Sub-Investigator (former) Operations Antwan Baker, MS, Research Coordinator (former) Alexandra (Pitts) Bennett, BS, Research Coordinator (former) Hannah Flegal, BA, Research Assistant (former) Jennifer Green, BA, Research Coordinator (former) Tawni Kenworthy-Heinige, BS, NODES Associate Director of Operations (former) Erik Mauk, BS, Research Assistant Laura Onstad, RN, BS, Research Coordinator (former) Kevin Osborn, BS, BA, Research Coordinator Ginger Sullivan, AS, CMA, Research Assistant Michael Tanaka, BA, Research Coordinator Deanna Ternes, BS, Research Coordinator (former) Senta Wiederholt, BA, Research Assistant (former) Lorrinda Zahl, AA, CPT, Research Assistant

Salt Lake City Scientific Patrick (Pat) Powers, MD, Local Site Investigator Julia Lewis, DO, Co-Local Site Investigator Emily Beck, MD, Co-Investigator (former) Sean Callahan, MD, Co-Investigator Laura Certain, MD, PhD, Co-Investigator (former) Barbara Jones, MD, Co-Investigator Mustafa Mir Kasimov, MD, Co-Investigator (former) Lynn Keenan, MD, Co-Investigator Robert Paine III, MD, Co-Investigator Gregory Radin, MD, Co-Investigator Karl Sanders, MD, Co-Investigator Operations Jean Brooks, MSN, RN, CCRC, ACRP-PM, NODES Nurse Manager (former) Brenda Hernandez, MBA, BA, Research Coordinator Craig High, MS, Research Coordinator (former) Vinay Kumaran, MBBS, MPH, CCRC, Research Coordinator (former) Adam Nehls, BS, Research Coordinator (former) Christina Nessler, MS, CCRC, NODES Operations Manager Haleisha Power, BS, Research Coordinator (former) Jason Ray, BBA, Research Assistant Valentino Rodriguez, BS, Research Coordinator (former) Kaylene Russell, MPH, Research Assistant (former) Kandi Velarde, MPH, CCRC, NODES Associate Director of Operations

San Antonio Scientific Patrick Danaher, MD, Local Site Investigator Antonio Anzueto, MD, Co-Local Site Investigator Operations Joanne Holloway, RN, CCRC, Research Coordinator Michele Paprocki, RN, Research Coordinator (former)

Seattle (site) Scientific Kristina Crothers, MD, Local Site Investigator McKenna Eastment, MD, MPH, Co-Local Site Investigator Javeed Shah, MD, Co-Local Site Investigator Arti Tayade, MD, MBBS, Co-Investigator Luis Tulloch-Palomino, MD, Co-Investigator Operations SueAnn Brickle, Research Coordinator Joseph (Joe) Gylys-Colwell, BS, Research Coordinator (former) Neelab (Amina) Kamiab, BS, BA, Research Assistant John Kundzins, BS, Research Coordinator Troy Layouni, MPH, Research Coordinator Jacob Martin, BA, Research Coordinator (former) Hasanah McCauley, BS, Research Coordinator (former) Cassandra (Cassie) Stubbe, MSc, NODES Quality Assurance Manager Rachel Tesoro, BS, Research Assistant Pandora Lucrezia (Luke) Wander, MD, MS, FACP, Staff Physician Kristin Wojtowicz, BS, Research Coordinator

West Haven Scientific Shaili Gupta, MBBS, Local Site Investigator Richard Sutton, MD, PhD, Co-Local Site Investigator Operations David Ardito, Research Coordinator Jessica O’Donovan, BA, Research Coordinator Patricia Pelham, RN, Research Nurse Danielle Plank, Research Coordinator Alicia Roy, BA, Research Coordinator Gary Stack, MD, Lab Manager Christine Summers, MA, Research Coordinator

Seattle Coordinating Center Scientific Nicholas L. Smith, PhD, Coordinating Center Director Jonathan Sugimoto, PhD, Project Director (former) Anna M. Korpak, PhD, Lead Biostatistician Aaron Baraff, PhD, Biostatistician Operations Jonathan Adams, PhD, National Study Coordinator (former) Morgan Bergerud, BS, Research Assistant (former) Christopher (Chris) Bromberg, MA, Research Coordinator Alexandra Fox, MSIS, Data Analyst Helen Haile, BS, Research Specialist (former) Tess Harpur, MPH, Research Coordinator (former) Liuye Huang, MHS, Research Specialist Heidi Hummel, PhD, Project Manager (former) Samin Kamal, BA, Research Specialist Gabrielle LaBazzo, MPH, Research AssistantSpecialist (former) Xumin Li, MS, Research Specialist Cindy Liu, BA, Program Manager Jordanna Midthun, MPH, Research Coordinator Kathryn Moore, PhD, Data Manager (former) Daniel (Dan) Morelli, BA, Program Manager Kytlan Morgan, BA, Research Assistant Geun-woo Oh, BA, Research Assistant Vivek Pakanati, MPH, Research Coordinator Rachel Sanders, BS, BA, Research Specialist (former) Katie Schroeder, BS, Research Specialist Nicholas (Nick) Simeti, MPH, Research Coordinator Chad Sisemore, MS, Data Analyst (former) Jennifer (Jen) Sporleder, BS, Associate Center Director, Research Operations Jonathan Sugimoto Adrienne Tanus, MPH, Project Manager, Research Operations Sarah Thiel, Research Assistant Tija Tippett, BS, Research Assistant (former) Tracy Wang, MAS, Data Analyst Gabriela Webb, BS, Research Assistant (former) Katrina Wicks, MPH, Data Manager (former) Deanna Wilson, MPS, National Study Coordinator (former) Sarah Yarborough, MPH, Research Assistant Specialist (former)

Executive Committee Michael Boeckh, MD, PhD, CSP #2028 Executive Committee Member Kyong-Mi Chang, MD, CSP #2028 Executive Committee Member Elizabeth (Lisa) Le, MD, CSP #2028 Executive Committee Member Yoselin Ordonez Suarez, PharmD, CSP #2028 Executive Committee Member Julie Parsonnet, MD, CSP #2028 Executive Committee Member Jonathan Sugimoto, PhD, CSP #2028 Executive Committee Member Christopher (Chris) W. Woods, MD, MPH, CSP #2028 Executive Committee Member

VA/US Government Disclaimer

All statements and opinions presented in this manuscript are solely of the authors and do not necessarily reflect the position or policy of the United States department of Veterans Affairs (VA), the VA Cooperative Studies Program (CSP), or United States Government.

Funding

This study was funded by the Cooperative Studies Program (CSP) of the United States Department of Veterans Affairs (VA) Office of Research and Development.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

X.L., V.P., and J.S. wrote the main manuscript text. X.L. prepared all figures and tables for the manuscript. The remaining authors (CL, TW, DM, AK, AB, AV, KMC, EL, NLS, JSL, JMR, EPIC3 Investigators) were responsible for reviewing the manuscript prior to submission.

Corresponding author

Correspondence to Javeed A. Shah.

Ethics declarations

Ethics approval and consent to participate

All study participants provided their individual written, informed consent to participate in the EPIC3 study. Our study is approved by the VA Central Institutional Review Board (reference 20–14).

Consent for publication

The authors and participants provided their consent to have this manuscript and the data it contains published.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

A list of authors and their affiliations appears at the end of the paper.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, X., Pakanati, V., Liu, C. et al. Peripheral blood cytokine profiles predict the severity of SARS-CoV-2 infection: an EPIC3 study analysis. BMC Infect Dis 25, 677 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12879-025-10914-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12879-025-10914-6

Keywords