Published on in Vol 10 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/55697, first published .
Associations Between Acute COVID-19 Symptom Profiles and Long COVID Prevalence: Population-Based Cross-Sectional Study

Associations Between Acute COVID-19 Symptom Profiles and Long COVID Prevalence: Population-Based Cross-Sectional Study

Associations Between Acute COVID-19 Symptom Profiles and Long COVID Prevalence: Population-Based Cross-Sectional Study

1Department of Epidemiology, University of Michigan School of Public Health, , Ann Arbor, MI, , United States

2Advocate Aurora Research Institute, Advocate Health, , 3075 Highland Parkway, Downers Grove, IL, , United States

3Department of Biostatistics, University of Michigan School of Public Health, , Ann Arbor, MI, , United States

4Survey Research Center, Institute for Social Research, , Ann Arbor, MI, , United States

5Centers for Disease Control and Prevention Foundation, , Atlanta, GA, , United States

6Michigan Department of Health and Human Services, , Lansing, MI, , United States

Corresponding Author:

Jana L Hirschtick, MPH, PhD


Background: Growing evidence suggests that severe acute COVID-19 illness increases the risk of long COVID (also known as post–COVID-19 condition). However, few studies have examined associations between acute symptoms and long COVID onset.

Objective: This study aimed to examine associations between acute COVID-19 symptom profiles and long COVID prevalence using a population-based sample.

Methods: We used a dual mode (phone and web-based) population-based probability survey of adults with polymerase chain reaction–confirmed SARS-CoV-2 between June 2020 and May 2022 in the Michigan Disease Surveillance System to examine (1) how acute COVID-19 symptoms cluster together using latent class analysis, (2) sociodemographic and clinical predictors of symptom clusters using multinomial logistic regression accounting for classification uncertainties, and (3) associations between symptom clusters and long COVID prevalence using modified Poisson regression.

Results: In our sample (n=4169), 15.9% (n=693) had long COVID, defined as new or worsening symptoms at least 90 days post SARS-CoV-2 infection. We identified 6 acute COVID-19 symptom clusters resulting from the latent class analysis, with flu-like symptoms (24.7%) and fever (23.6%) being the most prevalent in our sample, followed by nasal congestion (16.4%), multi-symptomatic (14.5%), predominance of fatigue (10.8%), and predominance of shortness of breath (10%) clusters. Long COVID prevalence was highest in the multi-symptomatic (39.7%) and predominance of shortness of breath (22.4%) clusters, followed by the flu-like symptom (15.8%), predominance of fatigue (14.5%), fever (6.4%), and nasal congestion (5.6%) clusters. After adjustment, females (vs males) had greater odds of membership in the multi-symptomatic, flu-like symptom, and predominance of fatigue clusters, while adults who were Hispanic or another race or ethnicity (vs non-Hispanic White) had greater odds of membership in the multi-symptomatic cluster. Compared with the nasal congestion cluster, the multi-symptomatic cluster had the highest prevalence of long COVID (adjusted prevalence ratio [aPR] 6.1, 95% CI 4.3‐8.7), followed by the predominance of shortness of breath (aPR 3.7, 95% CI 2.5‐5.5), flu-like symptom (aPR 2.8, 95% CI 1.9‐4.0), and predominance of fatigue (aPR 2.2, 95% CI 1.5‐3.3) clusters.

Conclusions: Researchers and clinicians should consider acute COVID-19 symptom profiles when evaluating subsequent risk of long COVID, including potential mechanistic pathways in a research context, and proactively screen high-risk patients during the provision of clinical care.

JMIR Public Health Surveill 2024;10:e55697

doi:10.2196/55697

Keywords



Although SARS-CoV-2 will continue to circulate for the foreseeable future, we are now entering a new phase focused on the resulting health and social impacts of the COVID-19 pandemic. COVID-19 illness was devasting for many, resulting in over 1.1 million deaths in the United States alone [1]. Additionally, many COVID-19 survivors have not returned to their usual state of health for months or years after their COVID-19 illness. This prolonged period of recovery, commonly known as long COVID (also as post–COVID-19 condition), may affect 5% of the US adult population [2] with important implications for the health care system, workforce, and society more broadly.

There continues to be considerable uncertainty regarding long COVID, including its definition, risk factors, and mechanistic pathways. Long COVID is broadly defined as persistent new or worsening symptoms lasting for a minimum of 4 weeks (per the Centers for Disease Control and Prevention [3]) or 3 months (per the World Health Organization [4]) after a SARS-CoV-2 infection. Although there is not yet a comprehensive understanding of risk factors, a growing body of evidence reports higher long COVID prevalence among females [5-10] and individuals who had severe acute COVID-19 illness [5,6,11,12].

Acute illness course may be indicative of mechanistic pathways that operate in the development of long COVID. Relatedly, specific acute symptoms may be an important and understudied predictor of long COVID. While several studies have explored associations between acute symptoms [13-15] and long COVID prevalence, there are over 200 documented symptoms of long COVID [16] across multiple organ systems [17,18] complicating the investigatory process. Examining the clustering of acute symptoms, however, may provide more tangible information on the profiles of acute illness associated with long COVID.

We are not aware of any studies examining associations between acute symptoms and long COVID using a population-based sample. Therefore, our objectives are to examine (1) how acute COVID-19 symptoms cluster together; (2) sociodemographic and clinical predictors of symptom clusters; and (3) associations between symptom clusters and long COVID prevalence using a population-based probability survey of adults with polymerase chain reaction (PCR)–confirmed SARS-CoV-2. Findings can inform our understanding of how acute symptom profiles may predispose individuals to long COVID.


Study Population

Our data came from the Michigan COVID-19 Recovery Surveillance Study (MI CReSS), a population-based study of adults in Michigan with a PCR-confirmed SARS-CoV-2 infection in the Michigan Disease Surveillance System. We drew 14 sequential stratified samples of 1000 adults with COVID-19 onset between June 1, 2020, and May 31, 2022, from 13 geographic regions (6 public health preparedness regions [19], 6 counties in southeast Michigan, and Detroit). A base sample of 50 was drawn from each region, with the rest of the sample drawn proportionally to case numbers in each region during each time frame.

Eligible adults were alive at the time of sampling, not institutionalized, and had a valid phone number and zip code or county information (n=1,668,938). Of the 14,584 adults selected for the sample, 4628 provided consent and completed our survey over the phone with an interviewer in English, Spanish, or Arabic or online in English. All interviewers completed training on interviewing, the responsible conduct of research, and human subject protections prior to conducting interviews.

Of all eligible individuals in our sample, 32.2% completed the survey (American Association for Public Opinion Research response rate #6) [20]. Respondents completed the survey a median of 137 days (IQR 105‐176 days) after their COVID-19 onset date. The final sample of respondents was weighted to match the age and sex distribution within each region and the total sampling frame, simultaneously accounting for unequal probabilities of selection and nonresponse.

We excluded survey responses collected by proxy for reasons other than language translation or hearing assistance (n=19) as well as 21 phone interviews for which interviewers lacked confidence about data validity. Additionally, although there are some reports of long COVID among individuals with asymptomatic acute infection [13], we restricted our sample to individuals with acute symptoms since only 2 asymptomatic individuals out of 189 met our criteria for long COVID.

Acute Symptom Clusters

The MI CReSS survey collected information on 27 COVID-19 symptoms. For each symptom, respondents were asked “During your illness, did you ever experience [symptom]?” If yes, respondents were asked “Did you experience this symptom during the first two weeks of your illness?” We considered an acute symptom present if the respondent experienced it in the first 2 weeks of their illness [21].

Several symptoms were collapsed into single indicators, including fever (feeling feverish or having a fever of 100.4 °F [38 °C] or above); chills (chills or repeated shaking with chills); nasal congestion (nasal congestion or runny nose); gastrointestinal symptoms (nausea or vomiting, abdominal pain, or diarrhea); and weakness (muscle weakness or general weakness). We excluded a single symptom, trouble sleeping, due to its lack of specificity. The resulting 20 symptoms were as follows: fatigue; fever; chills; muscle aches; weakness; joint pain; shortness of breath; cough; nasal congestion; sore throat; gastrointestinal; loss of appetite; headache; lightheaded or dizzy; brain fog, memory loss, or disorientation; loss of sense of smell or taste; chest pain or tightness; heart rate or heart rhythm issues; hair loss; and rash or skin discoloration. For collapsed symptoms, we coded an acute symptom present if a respondent reported at least 1 contributing symptom in the first 2 weeks of their illness. We coded an acute symptom as absent if a respondent never experienced it, did not experience it in the first 2 weeks, or did not know whether they experienced it during their illness.

Long COVID Outcome

We defined long COVID as new or worsening symptoms lasting at least 90 days post COVID-19 onset to align with the 3-month threshold recommended by the World Health Organization [4]. We factored the time between COVID onset and the survey date into our definition since it varied across participants. Some of our respondents had less than 90 days between their COVID-19 onset and survey date. We used 2 methods (described below) to address this matter. All respondents were asked if they had recovered to their usual state of health at the time of the survey. We considered long COVID present if respondents had not recovered at least 90 days after COVID-19 onset. For respondents who had not recovered and were surveyed less than 90 days after their COVID onset, we set the long COVID variable to missing (n=56), since it is unclear what proportion of these individuals with persistent symptoms would recover by 90 days. The group identified as having long COVID remained constant in our analyses, as all of them, by definition, had 90+ days elapse between COVID-19 onset and the survey date. Alternatively, we used 2 approaches to define the group without long COVID. For the main analysis, we considered long COVID absent if respondents reported a recovery period of 90 days or less, including respondents who reported recovery who were surveyed less than 90 days after their COVID onset. We ran a sensitivity analysis excluding all individuals with less than 90 days between their COVID onset and survey date, regardless of their self-reported recovery status (n=610), to test the impact of these coding decisions.

Covariates

We included sociodemographic and clinical factors as potential confounders, including sex at birth (male and female); age group (18‐34, 35‐44, 45‐54, 55‐64, and 65+ years); race and ethnicity (Hispanic, Non-Hispanic White, Non-Hispanic Black, another race or ethnicity); annual household income (<US $35,000, US $35,000‐74,999, and US $75,000+); and BMI (underweight/healthy weight [BMI<25], overweight [BMI 25 to <30], moderately obese [BMI 30 to <35], and severely obese [BMI 35+]). We also included a binary indicator for the presence of any of the following pre-existing diagnosed conditions: chronic obstructive pulmonary disease, asthma, diabetes, heart disease, hypertension, liver disease, kidney disease, cerebrovascular disease, cancer, immunosuppressive conditions, autoimmune conditions, or a physical disability. We coded each pre-existing condition as absent if the respondent did not know if they had ever been diagnosed with the condition. We also coded the combined pre-existing condition indicator as “none” if a respondent had missing data for 1 condition but reported no previous diagnoses for the remaining 11 conditions. We included an additional indicator for a pre-existing diagnosed psychological condition separately because prior research has found an association between a pre-existing psychological condition and long COVID [7,12,22].

Missing Data

Data missingness was generally minimal. However, income was missing for 10.9% of respondents. To account for this, we imputed income using the hot deck imputation method under the missing at random assumption [23]. Additionally, to account for missingness in race and ethnicity (4.3%) and BMI (2%), we included an “unknown” category for each variable. The remaining missingness (4% for long COVID outcome; ≤1% for additional covariates) was handled using listwise deletion.

Statistical Analysis

We used latent class analysis (LCA) to identify acute symptom clusters among symptomatic participants, accounting for the complex sampling design. Using the 3-step method [24,25], we (1) estimated an unconditional LCA model for acute symptom clusters, (2) assigned respondents to classes using modal posterior probabilities and calculated classification uncertainties (ie, measurement errors), and (3) explored predictors of latent class membership while accounting for these measurement errors using multinomial logistic regression. To determine the adequate number of classes, we applied unconditional LCA iteratively, beginning with a 2-class model, and sequentially increased the number of latent classes while examining the fit statistics. We evaluated goodness of fit indices by considering the Bayesian information criterion, Vuong-Lo-Mendell-Rubin likelihood ratio test, and adjusted Vuong-Lo-Mendell-Rubin likelihood ratio test. All LCA analyses were conducted using MPLUS (version 8.6) [26] and used the full information maximum likelihood method to account for missing data in latent class indicator variables.

Finally, we examined associations between the acute symptom clusters and long COVID prevalence using modified Poisson regression, adjusting for the sociodemographic and clinical factors enumerated under covariates. Poisson regression analyses accounted for the complex survey design and were performed using Stata SE (version 17.0; StataCorp) [27].

Ethical Considerations

The MI CReSS study was deemed public health surveillance by the University of Michigan institutional review board. Respondents were mailed informed consent documents and subsequently provided consent either verbally over the phone or via the web-based survey tool.

This study, a secondary analysis of deidentified MI CReSS data, was deemed exempt by the University of Michigan institutional review board, which did not require additional consent from study subjects.


Study Population

The analytic sample (n=4169) was predominantly female (n=2489, 54.6%) and non-Hispanic White (n=3059, 70.7%), with one-third (n=1384, 36.6%) between the ages of 18‐34 years (Table 1). Over 70% (n=2921) were overweight or obese and almost half (n=2091, 48.3%) had a pre-existing physical condition. Nearly 1 in 6 (n=693, 15.9%) had long COVID, defined as new or worsening symptoms lasting at least 90 days post SARS-CoV-2 infection.

Table 1. Description of the analytic sample (n=4169) from the Michigan COVID-19 Recovery Surveillance Study (June 1, 2020, to May 31, 2022).
CharacteristicsWeighted percentage (%)N
Sex
Male45.41680
Female54.62489
Age group (years)
18‐3436.61384
35‐4418.5741
45‐5416.5699
55‐6416.0711
65+12.4634
Race and ethnicity
Hispanic6.6243
Non-Hispanic Black7.7292
Another race or ethnicity11.2425
Non-Hispanic White70.73059
Unknown3.7150
Annual household income (US $)
<35,00028.71203
35,000-74,99930.41272
75,00040.91694
Current smoking prior to illness9.3402
BMI
Underweight/normal weight (BMI<25)27.81194
Overweight (BMI 25 to <30)31.41293
Moderately obese (BMI 30 to <35)20.2833
Severely obese (BMI 35+)19.2795
Unknown1.454
Any pre-existing physical condition (excluding psychological)48.32091
A pre-existing psychological condition13.6576
Long COVID15.9693
Phase
June 1, 2020, to September 30, 202013.2535
October 1, 2020, to February 28, 202122.7924
March 1, 2021, to May 31, 202113.4534
June 1, 2021, to September 30, 202114.0561
October 1, 2021, to February 28, 202219.6795
March 1, 2022, to May 31, 202217.1820
Survey mode
Phone32.51336
Web-based67.52833

Acute Symptom Clusters

We identified 6 mutually exclusive acute COVID-19 symptom clusters based on the Vuong-Lo-Mendell-Rubin likelihood ratio and adjusted Vuong-Lo-Mendell-Rubin likelihood ratio tests and the meaningfulness of the latent classes (Figure 1). While models with 7 classes yielded the lowest Bayesian information criterion values, the goodness-of-fit tests showed no significant difference in fit between models with 6 and 7 classes. We thus selected 6 classes based on interpretability.

Given the high probability of fatigue in 5 of the 6 classes, we did not consider it a distinguishing symptom in classes with a high probability of other symptoms. Cluster 1, representing 14.5% of the sample, was multi-symptomatic, with a high probability of flu-like (eg, fever, chills, muscle aches), pulmonary (eg, shortness of breath), gastrointestinal (eg, nausea, diarrhea), and neurological symptoms (eg, headache, dizziness). Cluster 2 (10% of the sample) had a predominance of shortness of breath. Cluster 3 (24.7% of the sample) was characterized by flu-like symptoms. Cluster 4 (10.8% of the sample) had a predominance of fatigue, without a high probability of additional symptoms. Cluster 5 (23.6% of the sample) was characterized by fever, with a lower probability of other flu-like symptoms than clusters 1 or 2. Cluster 6 (16.4% of the sample) had a low probability of all symptoms except nasal congestion.

The unadjusted (Table S1 in Multimedia Appendix 1) and adjusted results (Table 2) from the multinomial logistic regression models further characterize the acute symptom clusters. Compared with the nasal congestion cluster, females had greater odds of membership in the multi-symptomatic (adjusted odds ratio [aOR] 2.7, 95% CI 2.0‐3.8), flu-like (aOR 2.1, 95% CI 1.6‐2.8), and predominance of fatigue (aOR 1.8, 95% CI 1.2‐2.6) symptom clusters than males. Adults aged 65 years and older had lower odds of membership in the multi-symptomatic, predominance of shortness of breath, flu-like, and fever symptom clusters than adults aged 18‐34 years. However, adults aged 35‐54 years had greater odds of predominance of fatigue than adults aged 18‐34 years. Compared with Non-Hispanic White adults, adults identifying as Hispanic (aOR 2.0, 95% CI 1.1‐3.5) or another race or ethnicity (aOR 2.0, 95% CI 1.2‐3.2) had greater odds of membership in the multi-symptomatic cluster versus the nasal congestion cluster.

Figure 1. Acute COVID-19 symptom clusters in the Michigan COVID-19 Recovery Surveillance Study (June 1, 2020, to May 31, 2022). Values for symptoms represent the probability of individuals in each cluster reporting each symptom. Values with a probability of 0.5 or greater have been highlighted in blue to aid visual interpretation of findings, with darker shades of blue reflecting higher probability values.
Table 2. Adjusted multinomial logistic regression examining odds of belonging to specific acute COVID-19 symptom clusters (referent category: nasal congestion; n=4169) in the Michigan COVID-19 Recovery Surveillance Study (June 1, 2020, to May 31, 2022). All models mutually adjusted for variables listed in the table. Italicized values denote statistical significance at P<.05.
Multi-symptomaticPredominance of shortness of breathFlu-likePredominance of fatigueFever
95% CIP value95% CIP value95% CIP value95% CIP value95% CIP value
aORaLBbUBcaORLBUBaORLBUBaORLBUBaORLBUB
Sex
Male1.0d1.01.01.01.0
Female2.72.03.8<.0011.10.71.6.642.11.62.8<.0011.81.22.6.0020.80.61.1.23
Age group (years)
18‐341.01.01.0 —1.01.0
35‐441.10.71.7.630.90.51.5.601.10.71.6.692.51.44.4.0011.00.61.5.95
45‐541.20.81.9.430.90.51.6.741.10.71.7.641.91.03.6.0491.40.92.2.19
55‐640.60.41.0.070.70.41.3.270.70.51.1.091.71.03.1.071.00.61.6.99
65+0.30.20.5<.0010.30.20.6.0010.30.20.5<.0011.60.92.8.120.50.30.8.003
Race and ethnicity
Hispanic2.01.13.5.0190.70.31.6.350.70.41.3.211.00.52.0.940.90.51.7.68
Non-Hispanic Black0.90.51.6.760.90.51.8.860.80.51.3.380.50.21.1.070.60.31.2.14
Another race or ethnicity2.01.23.2.0071.30.72.5.351.10.71.8.690.80.41.7.591.20.72.0.52
Non-Hispanic White1.01.01.01.01.0
Unknown1.70.74.0.201.30.53.4.621.00.52.1.981.10.43.0.840.90.42.1.82
Annual income (US $)
<35,0002.20.74.0<.0012.51.54.1<.0011.30.91.9.111.61.02.6.0461.30.91.9.23
35,000-74,9991.51.02.2.031.50.92.5.081.10.81.5.621.30.82.0.251.00.71.4.83
75,0001.01.01.01.01.0
Smoking prior to illness1.71.02.8.040.90.41.9.821.40.82.2.231.30.72.5.431.00.61.8.97
BMI
Underweight/normal weight1.01.01.01.01.0
Overweight1.20.81.9.301.50.92.7.121.61.12.3.0091.20.71.9.461.20.81.8.28
Moderately obese1.61.02.5.052.01.13.6.021.51.02.3.041.40.82.3.211.00.71.6.96
Severely obese1.61.02.5.051.60.92.9.101.20.81.9.331.10.61.8.861.00.61.6.93
Unknown1.10.34.6.872.00.48.9.381.20.34.8.811.40.36.5.711.60.45.9.51
Pre-existing physical conditione2.31.63.2<.0012.11.43.2<.0011.61.22.2.0031.61.12.4.021.10.81.5.77
Pre-existing psychological condition3.32.15.2<.0011.60.93.0.121.91.22.9.0071.40.72.6.310.90.51.6.69
Survey phase
June 1, 2020, to September 30, 20201.01.01.01.01.0
October 1, 2020, to February 28, 20211.10.71.9.680.50.31.0.041.50.92.3.120.90.51.6.640.90.51.5.63
March 1, 2021, to May 31, 20211.70.93.0.090.90.51.7.661.50.92.7.130.80.41.5.441.00.61.9.93
June 1, 2021, to September 30, 20211.20.62.2.610.70.41.4.361.50.92.5.161.00.52.0.941.40.82.5.21
October 1, 2021, to February 28, 20221.50.92.5.150.60.31.2.141.40.92.3.180.80.41.5.521.20.72.1.50
March 1, 2022, to May 31, 20220.80.41.5.480.50.31.0.061.30.82.2.270.80.41.4.391.61.02.7.07
Survey mode
Phone1.30.91.8.141.00.71.6.950.90.71.3.661.10.81.7.560.70.51.0.07
Web-based1.01.01.01.01.0

aaOR: adjusted odds ratio.

bLB: lower bound of 95% CI.

cUB: upper bound of 95% CI.

dNot applicable (referent value).

eExcluding psychological conditions.

Annual household income and health status indicators, including BMI and pre-existing conditions, were strong predictors of symptom cluster membership. Compared with the nasal congestion cluster, adults with a household income less than US $35,000 (vs US $75,000+) had between 1.6‐2.5 times greater odds of membership in the multi-symptomatic, predominance of shortness of breath, and predominance of fatigue symptom clusters in adjusted models. In terms of health status, adults who were obese had greater odds of membership in the predominance of shortness of breath and flu-like symptom clusters. Additionally, having a pre-existing physical condition (vs none) was associated with greater odds of membership in the multi-symptomatic, predominance of shortness of breath, flu-like, and predominance of fatigue symptom clusters compared with the nasal congestion cluster. Lastly, adults with a pre-existing psychological condition (vs none) had greater odds of membership in the multi-symptomatic (aOR 3.3, 95% CI 2.1‐5.2) and flu-like symptom (aOR 1.9, 95% 1.2‐2.9) clusters.

Long COVID Outcome

Long COVID prevalence varied across the symptom clusters, from 39.7% in the multi-symptomatic cluster to 5.6% in the nasal congestion cluster (Figure 1). In adjusted models (Table 3), the multi-symptomatic cluster had the highest prevalence of long COVID compared with the nasal congestion cluster (adjusted prevalence ratio [aPR] 6.1, 95% CI 4.3‐8.7), followed by predominance of shortness of breath (aPR 3.7, 95% CI 2.5‐5.5), flu-like symptoms (aPR 2.8, 95% CI 1.9‐4.0), and predominance of fatigue (aPR 2.2, 95% CI 1.5‐3.3). The prevalence of long COVID was not statistically different between the clusters characterized by fever and nasal congestion (aPR 1.3, 95% CI 0.8‐1.9). Sociodemographic and health-related predictors of long COVID included female sex, older age, lower income, obesity, and pre-existing conditions. Responding via telephone (vs online) was also statistically associated with higher long COVID prevalence, likely due to associations between survey mode and respondent demographics (data not shown) and potential differences in symptoms reported based on mode.

Finally, results from the sensitivity analysis excluding all individuals with less than 90 days between their COVID onset and survey date closely aligned with the primary analysis, including the 6 acute symptom clusters identified (Table S2 in Multimedia Appendix 1) and their relationship with long COVID prevalence (Table S3 in Multimedia Appendix 1).

Table 3. Modified Poisson regression examining prevalence of long COVID (n=4169) in the Michigan COVID-19 Recovery Surveillance Study (June 1, 2020, to May 31, 2022). Italicized values denote statistical significance at P<.05.
UnadjustedAdjusteda
95% CIP value95% CIP value
PRbLBcUBdaPReLBUB
Acute COVID-19 symptom clusters
Multi-symptomatic7.15.010.2<.0016.14.38.7<.001
Predominance of shortness of breath4.02.76.0<.0013.72.55.5<.001
Flu-like2.82.04.1<.0012.81.94.0<.001
Predominance of fatigue2.61.73.9<.0012.21.53.3<.001
Fever1.20.81.8.511.30.81.9.27
Nasal congestion1.0f1.0
Sex
Male1.01.0
Female1.71.42.0<.0011.41.21.7<.001
Age group (years)
18‐341.01.0
35‐441.81.42.3<.0011.71.42.2<.001
45‐541.81.42.3<.0011.71.42.2<.001
55‐641.91.52.4<.0012.11.72.6<.001
65+2.21.72.8<.0012.51.93.1<.001
Race and ethnicity
Hispanic1.51.11.9.0031.10.91.4.44
Non-Hispanic Black1.51.21.9.0021.10.91.5.28
Another race or ethnicity0.90.71.2.360.80.61.1.15
Non-Hispanic White1.01.0
Unknown1.00.61.5.851.00.61.5.88
Annual income (US $)
<35,0001.71.42.1<.0011.31.01.5.015
35,000-74,9991.41.21.7.0011.21.01.5.047
75,0001.01.0
Smoking prior to illness1.31.01.6.051.10.81.3.65
BMI
Underweight/normal weight1.01.0
Overweight1.31.01.6.031.10.91.4.33
Moderately obese1.61.32.0<.0011.20.91.5.16
Severely obese2.01.62.5<.0011.31.11.7.01
Unknown1.20.62.5.601.00.52.1.94
Pre-existing physical conditiong versus none1.81.52.1<.0011.21.01.4.05
Pre-existing psychological condition versus none1.51.21.8<.0011.10.91.3.29
Survey phase
June 1, 2020, to September 30, 20201.01.0
October 1, 2020, to February 28, 20211.00.81.2.780.90.71.1.37
March 1, 2021, to May 31, 20210.90.71.1.260.80.61.0.025
June 1, 2021, to September 30, 20210.70.50.9.0050.70.50.9.002
October 1, 2021, to February 28, 20220.80.61.0.040.70.50.8<.001
March 1, 2022, to May 31, 20220.30.20.4<.0010.40.30.5<.001
Survey mode
Phone1.61.41.9<.0011.31.11.5.004
Web-based1.01.0—

aAll models mutually adjusted for variables listed in table.

bPR: prevalence ratio.

cLB: lower bound of 95% CI.

dUB: upper bound of 95% CI.

eaPR: adjusted prevalence ratio.

fNot applicable (referent value).

gExcluding psychological conditions.


Using a population-based sample of adults with PCR-confirmed SARS-CoV-2 in Michigan, we examined associations between acute COVID-19 symptom clusters and long COVID. Generally, we found that long COVID prevalence increased as the number of acute symptoms, or probability of severe symptoms, increased. Of the 6 acute symptom clusters we identified, long COVID prevalence was lowest in clusters with a low number of symptoms (nasal congestion, fever), and highest in clusters with more symptoms (multi-symptomatic) or a high probability of severe symptoms (shortness of breath). These findings are consistent with a growing body of research linking severe acute illness to long COVID [5,6,11,12].

Long COVID prevalence was greater among respondents with multi-symptomatic acute illness with a high probability of fever, chills, and muscle aches. This is consistent with a study of patients at a hospital in Toronto, Canada, in which patients with acute constitutional (eg, fever, chills) or rheumatologic symptoms (eg, muscle aches) had greater odds of persistent symptoms 90 days after illness onset than patients without these acute symptoms [15]. Other studies have used a different approach, examining the progression of specific symptoms during and after COVID-19 illness [16,28], rather than associations between acute symptoms and long COVID (defined as the presence of any symptom after a specified time period). For example, a study using an international convenience sample of individuals with persistent symptoms following COVID-19 illness reported 3 clusters of COVID-19 symptoms, including symptoms most prevalent during the acute phase (eg, diarrhea and fever), symptoms consistently present over time (eg, muscle aches, changes to sense of taste and smell, and fatigue), and symptoms increasingly prevalent over time (eg, brain fog and postexertional malaise) [16].

In terms of specific acute COVID-19 symptoms, our findings may provide clues to potential mechanisms leading to long COVID. The multi-symptomatic cluster had the highest probability of acute gastrointestinal symptoms. Although few studies have examined associations between acute symptoms and long COVID, one small study in India found an association between diarrhea during acute illness and long COVID [14]. There is also preliminary evidence that the SARS-CoV-2 virus persists in the gastrointestinal tract for months in a subset of infected individuals [29], potentially prolonging gastrointestinal symptoms or leading to symptoms outside the gastrointestinal tract [30].

The multi-symptomatic acute symptom cluster also had a high probability of neurological symptoms and, along with the shortness of breath cluster, respiratory symptoms. In a study of military health system beneficiaries, individuals with neurological and respiratory/systemic symptoms were more likely to report persistent symptoms 6 months following illness onset than participants with nasal symptoms [31]. Neurological and respiratory symptoms are common among individuals with long COVID [32,33], suggesting the persistence of these symptoms over time. However, neurological symptoms associated with acute COVID-19 illness and long COVID may differ. While loss of taste and smell may persist from acute to chronic illness, other neurological long COVID symptoms, such as absent-mindedness, may have a delayed onset [16,34,35], suggesting a separate mechanistic pathway.

We also identified sociodemographic disparities in acute COVID-19 symptom class membership. Since long COVID may be underdiagnosed among people who are from racial or ethnic minority groups or have low income [36], it is important to identify high-risk groups when attempting to quantify disparities in long COVID. Respondents who were female, younger, members of racial or ethnic minority groups, low income, and diagnosed with pre-existing physical or psychological conditions had greater odds of membership in the multi-symptomatic cluster, which had the highest long COVID prevalence (39.7%). Similarly, respondents who were younger, low income, and diagnosed with a pre-existing physical condition had greater odds of membership in the predominance of shortness of breath cluster, which had a long COVID prevalence of 22.4%. On the other hand, respondents who were female and middle-aged had greater odds of membership in the predominance of fatigue cluster, which had a long COVID prevalence of 14.5%.

There are also alternative explanations for these associations. First, residual confounding due to sociodemographic and health status differences may account for some of the association between the acute symptom clusters and long COVID. Second, we did not adjust for hospital admission since it likely mediates the relationship between acute symptoms and long COVID. However, post-hospital or post-ICU syndrome, whereby hospitalized individuals have prolonged recovery irrespective of admitting illness [37,38], may partially explain associations between the more severe acute symptom clusters and long COVID.

Additionally, these findings should be interpreted within the context of several limitations. First, our sample is based on adults with PCR-confirmed SARS-CoV-2 in Michigan and therefore only representative of individuals who obtained a SARS-CoV-2 PCR test in Michigan. Given the limited availability of PCR testing early on, and the wide availability of at-home testing in later stages of the pandemic, our sample is not generalizable to all individuals with SARS-CoV-2 in Michigan during our study period. Second, our response rate (32.2%), although fairly high for a population-based survey, may be subject to nonresponse bias. However, population-based samples are more generalizable than clinical or convenience samples and we have previously shown that the impact of nonresponse on our study findings is likely minimal [39]. Third, our study relies on self-reported data, which is subject to recall and reporting biases. Fourth, we did not adjust for acute illness severity, including hospital admission, as we viewed these factors as mediators between acute symptoms and long COVID. Fifth, we did not examine the persistence of specific symptoms over time, an important avenue for future research.

A growing body of research links acute COVID-19 severity to long COVID. In our population-based sample of adults with PCR-confirmed SARS-CoV-2, we found that acute COVID-19 symptoms clustered into 6 classes ranging in symptom type and severity and that long COVID prevalence increased as the number of acute symptoms, or probability of severe symptoms, increased. Researchers and clinicians should consider acute COVID-19 symptom profiles when evaluating subsequent risk of long COVID, including potential mechanistic pathways in a research context, and proactively screen high-risk patients during the provision of clinical care.

Acknowledgments

We thank the Michigan COVID-19 Recovery Surveillance Study participants and interviewers for making this study possible. This work was supported by The Michigan Department of Health and Human Services, Michigan Public Health Institute, University of Michigan (UM) Institute for Data Science, UM Rogel Cancer Center, UM Epidemiology Department, and Centers for Disease Control and Prevention (CDC) of the US Department of Health and Human Services (HHS) (grant number 6NU50CK000510-02-04). The contents are those of the authors and do not necessarily represent the official views of, nor an endorsement by, CDC/HHS, or the US Government.

Data Availability

The data sets analyzed during this study are not publicly available due to the requirements of our data use agreement with the Michigan Department of Health and Human Services. However, a deidentified version of the data sets will soon be available via the Inter-university Consortium for Political and Social Research at the University of Michigan.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Supplementary tables containing the unadjusted results from the multinomial logistic regression models and the results from the sensitivity analysis excluding all individuals with less than 90 days between their COVID onset and survey date.

DOCX File, 44 KB

  1. COVID data tracker weekly review. Centers for Disease Control and Prevention. URL: https://www.cdc.gov/coronavirus/2019-ncov/covid-data/covidview/index.html [Accessed 2023-07-10]
  2. Long COVID - household pulse survey. Centers for Disease Control and Prevention. URL: https://www.cdc.gov/nchs/covid19/pulse/long-covid.htm [Accessed 2023-11-17]
  3. Long COVID basics. Centers for Disease Control and Prevention. URL: https://www.cdc.gov/coronavirus/2019-ncov/long-term-effects/index.html [Accessed 2024-01-24]
  4. Soriano JB, Murthy S, Marshall JC, Relan P, Diaz JV, WHO Clinical Case Definition Working Group on Post-COVID-19 Condition. A clinical case definition of post-COVID-19 condition by a Delphi consensus. Lancet Infect Dis. Apr 2022;22(4):e102-e107. [CrossRef] [Medline]
  5. Global Burden of Disease Long COVID Collaborators, Wulf Hanson S, Abbafati C, et al. Estimated global proportions of individuals with persistent fatigue, cognitive, and respiratory symptom clusters following symptomatic COVID-19 in 2020 and 2021. JAMA. Oct 25, 2022;328(16):1604-1615. [CrossRef] [Medline]
  6. Sudre CH, Murray B, Varsavsky T, et al. Attributes and predictors of long COVID. Nat Med. Apr 2021;27(4):626-631. [CrossRef] [Medline]
  7. Sneller MC, Liang CJ, Marques AR, et al. A longitudinal study of COVID-19 sequelae and immunity: baseline findings. Ann Intern Med. Jul 2022;175(7):969-979. [CrossRef] [Medline]
  8. Lindahl A, Aro M, Reijula J, et al. Women report more symptoms and impaired quality of life: a survey of Finnish COVID-19 survivors. Infect Dis (Auckl). Jan 2, 2022;54(1):53-62. [CrossRef]
  9. Seeßle J, Waterboer T, Hippchen T, et al. Persistent symptoms in adult patients 1 year after coronavirus disease 2019 (COVID-19): a prospective cohort study. Clin Infect Dis. Apr 9, 2022;74(7):1191-1198. [CrossRef]
  10. Blomberg B, Mohn KGI, Brokstad KA, et al. Long COVID in a prospective cohort of home-isolated patients. Nat Med. Sep 2021;27(9):1607-1613. [CrossRef] [Medline]
  11. Petersen MS, Kristiansen MF, Hanusson KD, et al. Long COVID in the Faroe Islands: a longitudinal study among nonhospitalized patients. Clin Infect Dis. Dec 6, 2021;73(11):e4058-e4063. [CrossRef]
  12. Hirschtick JL, Titus AR, Slocum E, et al. Population-based estimates of post-acute sequelae of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection (PASC) prevalence and characteristics. Clin Infect Dis. Dec 6, 2021;73(11):2055-2064. [CrossRef]
  13. Huang Y, Pinto MD, Borelli JL, et al. COVID symptoms, symptom clusters, and predictors for becoming a long-hauler looking for clarity in the haze of the pandemic. Clin Nurs Res. Nov 2022;31(8):1390-1398. [CrossRef] [Medline]
  14. Chopra N, Chowdhury M, Singh AK, et al. Clinical predictors of long COVID-19 and phenotypes of mild COVID-19 at a tertiary care centre in India. Drug Discov Ther. 2021;15(3):156-161. [CrossRef] [Medline]
  15. Estrada-Codecido J, Chan AK, Andany N, et al. Prevalence and predictors of persistent post-COVID-19 symptoms. J Assoc Med Microbiol Infect Dis Can. Sep 2022;7(3):208-219. [CrossRef] [Medline]
  16. Davis HE, Assaf GS, McCorkell L, et al. Characterizing long COVID in an international cohort: 7 months of symptoms and their impact. EClinicalMedicine. Aug 2021;38:101019. [CrossRef] [Medline]
  17. Montani D, Savale L, Noel N, et al. Post-acute COVID-19 syndrome. Eur Respir Rev. Mar 31, 2022;31(163):210185. [CrossRef] [Medline]
  18. Davis HE, McCorkell L, Vogel JM, Topol EJ. Long COVID: major findings, mechanisms and recommendations. Nat Rev Microbiol. Mar 2023;21(3):133-146. [CrossRef] [Medline]
  19. Contact information. Michigan Department of Health and Human Services. URL: https://www.michigan.gov/mdhhs/safety-injury-prev/publicsafety/ophp/contact [Accessed 2023-07-10]
  20. Standard definitions. American Association for Public Opinion Research. 2016. URL: https://aapor.org/standards-and-ethics/standard-definitions/ [Accessed 2024-09-17]
  21. Coronavirus disease (COVID-19). World Health Organization. URL: https://www.who.int/news-room/fact-sheets/detail/coronavirus-disease-(covid-19) [Accessed 2024-01-30]
  22. Wang S, Quan L, Chavarro JE, et al. Associations of depression, anxiety, worry, perceived stress, and loneliness prior to infection with risk of post-COVID-19 conditions. JAMA Psychiatry. Nov 1, 2022;79(11):1081-1091. [CrossRef] [Medline]
  23. Cox BG. The weighted sequential hot deck imputation procedure. Res Methods. 1980:721-726.
  24. Vermunt JK. Latent class modeling with covariates: two improved three-step approaches. Polit Anal. 2010;18(4):450-469. [CrossRef]
  25. Asparouhov T, Muthén B. Auxiliary variables in mixture modeling: three-step approaches using Mplus. Struct Equ Modeling: Multi J. Jul 3, 2014;21(3):329-341. [CrossRef]
  26. Muthén L, Muthén B. Mplus User’s Guide Eighth Edition. Muthén & Muthén; 2017.
  27. StataCorp. Stata Statistical Software: Release 17. StataCorp LLC; 2021.
  28. Mariani C, Borgonovo F, Capetti AF, et al. Persistence of long-COVID symptoms in a heterogenous prospective cohort. J Infect. May 2022;84(5):722-746. [CrossRef] [Medline]
  29. Natarajan A, Zlitni S, Brooks EF, et al. Gastrointestinal symptoms and fecal shedding of SARS-CoV-2 RNA suggest prolonged gastrointestinal infection. Med. Jun 10, 2022;3(6):371-387. [CrossRef] [Medline]
  30. Wong AC, Devason AS, Umana IC, et al. Serotonin reduction in post-acute sequelae of viral infection. Cell. Oct 26, 2023;186(22):4851-4867. [CrossRef] [Medline]
  31. Epsi NJ, Powers JH, Lindholm DA, et al. A machine learning approach identifies distinct early-symptom cluster phenotypes which correlate with hospitalization, failure to return to activities, and prolonged COVID-19 symptoms. PLoS One. 2023;18(2):e0281272. [CrossRef] [Medline]
  32. Thaweethai T, Jolley SE, Karlson EW, et al. Development of a definition of postacute sequelae of SARS-CoV-2 infection. JAMA. Jun 13, 2023;329(22):1934-1946. [CrossRef] [Medline]
  33. Aiyegbusi OL, Hughes SE, Turner G, et al. Symptoms, complications and management of long COVID: a review. J R Soc Med. Sep 2021;114(9):428-442. [CrossRef] [Medline]
  34. Apple AC, Oddi A, Peluso MJ, et al. Risk factors and abnormal cerebrospinal fluid associate with cognitive symptoms after mild COVID-19. Ann Clin Transl Neurol. Feb 2022;9(2):221-226. [CrossRef] [Medline]
  35. Jason LA, Islam M, Conroy K, et al. COVID-19 symptoms over time: comparing long-haulers to ME/CFS. Fatigue Biomed Health Behav. 2021;9(2):59-68. [CrossRef] [Medline]
  36. Pfaff ER, Madlock-Brown C, Baratta JM, et al. Coding long COVID: characterizing a new disease through an ICD-10 lens. BMC Med. Feb 16, 2023;21(1):58. [CrossRef] [Medline]
  37. Krumholz HM. Post-hospital syndrome--an acquired, transient condition of generalized risk. N Engl J Med. Jan 10, 2013;368(2):100-102. [CrossRef] [Medline]
  38. Marra A, Pandharipande PP, Girard TD, et al. Co-occurrence of post-intensive care syndrome problems among 406 survivors of critical illness. Crit Care Med. Sep 2018;46(9):1393-1401. [CrossRef] [Medline]
  39. Hirschtick JL, Xie Y, Slocum E, et al. A statewide population-based approach to examining long COVID symptom prevalence and predictors in michigan. Prev Med. Dec 2023;177:107752. [CrossRef] [Medline]


aOR: adjusted odds ratio
aPR: adjusted prevalence ratio
LCA: latent class analysis
MI CReSS: Michigan COVID-19 Recovery Surveillance Study
PCR: polymerase chain reaction


Edited by Amaryllis Mavragani; submitted 20.12.23; peer-reviewed by Fang Zhu, Pouyan Ebrahimi; final revised version received 09.05.24; accepted 16.05.24; published 01.10.24.

Copyright

© Jana L Hirschtick, Elizabeth Slocum, Yanmei Xie, Laura E Power, Michael R Elliott, Robert C Orellana, Nancy L Fleischer. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 1.10.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.