Published on in Vol 10 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/48825, first published .
Using Social Vulnerability Indices to Predict Priority Areas for Prevention of Sudden Unexpected Infant Death in Cook County, IL: Cross-Sectional Study

Using Social Vulnerability Indices to Predict Priority Areas for Prevention of Sudden Unexpected Infant Death in Cook County, IL: Cross-Sectional Study

Using Social Vulnerability Indices to Predict Priority Areas for Prevention of Sudden Unexpected Infant Death in Cook County, IL: Cross-Sectional Study

1Center for Health Equity & Innovation, Cook County Health, , 1950 W Polk St, Suite 5807, Chicago, IL, , United States

2Program in Public Health, Feinberg School of Medicine, Northwestern University, , Chicago, IL, , United States

Corresponding Author:

Daniel P Riggins, MD, MPH


Background: The incidence of sudden unexpected infant death (SUID) in the United States has persisted at roughly the same level since the mid-2000s, despite intensive prevention efforts around safe sleep. Disparities in outcomes across racial and socioeconomic lines also persist. These disparities are reflected in the spatial distribution of cases across neighborhoods. Strategies for prevention should be targeted precisely in space and time to further reduce SUID and correct disparities.

Objective: We sought to aid neighborhood-level prevention efforts by characterizing communities where SUID occurred in Cook County, IL, from 2015 to 2019 and predicting where it would occur in 2021‐2025 using a semiautomated, reproducible workflow based on open-source software and data.

Methods: This cross-sectional retrospective study queried geocoded medical examiner data from 2015‐2019 to identify SUID cases in Cook County, IL, and aggregated them to “communities” as the unit of analysis. We compared demographic factors in communities affected by SUID versus those unaffected using Wilcoxon rank sum statistical testing. We used social vulnerability indicators from 2014 to train a negative binomial prediction model for SUID case counts in each given community for 2015‐2019. We applied indicators from 2020 to the trained model to make predictions for 2021‐2025.

Results: Validation of our query of medical examiner data produced 325 finalized cases with a sensitivity of 95% (95% CI 93%‐97%) and a specificity of 98% (95% CI 94%‐100%). Case counts at the community level ranged from a minimum of 0 to a maximum of 17. A map of SUID case counts showed clusters of communities in the south and west regions of the county. All communities with the highest case counts were located within Chicago city limits. Communities affected by SUID exhibited lower median proportions of non-Hispanic White residents at 17% versus 60% (P<.001) and higher median proportions of non-Hispanic Black residents at 32% versus 3% (P<.001). Our predictive model showed moderate accuracy when assessed on the training data (Nagelkerke R2=70.2% and RMSE=17.49). It predicted Austin (17 cases), Englewood (14 cases), Auburn Gresham (12 cases), Chicago Lawn (12 cases), and South Shore (11 cases) would have the largest case counts between 2021 and 2025.

Conclusions: Sharp racial and socioeconomic disparities in SUID incidence persisted within Cook County from 2015 to 2019. Our predictive model and maps identify precise regions within the county for local health departments to target for intervention. Other jurisdictions can adapt our coding workflows and data sources to predict which of their own communities will be most affected by SUID.

JMIR Public Health Surveill 2024;10:e48825

doi:10.2196/48825

Keywords



Background

Sudden unexpected infant death (SUID) exerts a severe burden on affected families and society at large. The National Institute of Child Health and Human Development (NICHD) defines SUID as “the death of an infant younger than 1 year of age that occurs suddenly and unexpectedly.” Sudden infant death syndrome (SIDS) is a specific type of SUID where a cause cannot be identified after a full medical investigation has been completed [1]. Other types of SUID where a cause can be identified include but are not limited to “suffocation, mechanical asphyxia, entrapment, infection, ingestions, metabolic diseases, or trauma” [2]. SIDS as a single subtype was ranked as the third leading cause of infant death in the United States in 2020 [3]. In terms of years of life lost (YLL), SIDS accounted for a higher burden of disease than congenital heart anomalies for infants younger than 1 year of age in the United States in 2019 (3600 YLL per 100,000 infants vs 3134) [4].

SUID trends in the US have changed dynamically over time. The respective rates of SUID and SIDS fell from 155 and 130 (cases per 100,000 live births) in 1990 to 90 and 52 in 2010 [5]. Epidemiologists attribute these drops in incidence to a series of programs started in the 1990s to target safe sleep practices. The programs included the release of specific guidelines by the American Academy of Pediatrics (AAP) and an educational campaign called “Safe to Sleep” [5]. The Safe to Sleep program continues to this day and centers on the idea of ABCs: a baby should sleep Alone on their Back in a separate Crib [6]. This is due to evidence that babies are at increased risk of (A) rebreathing their expired gases when lying in a prone or side-lying position and (B) falling victim to accidental suffocation when sharing a sleep surface with caregivers and bulky bed materials [2].

Despite the success of Safe to Sleep and associated programming, trends from the last decade suggest that novel approaches are needed to make further progress in the prevention of SUID. In 2020, the respective rates for SUID and SIDS remained relatively stagnant at 93 and 38 [5]. Such trends hold true in the local context of Illinois, the home state in which Cook County is located. The rate of SUID remained roughly even in Illinois at ~100 (cases per 100,000 live births) between 2000 and 2018. Furthermore, racial disparities were stark. In the Black infant population, rates were actually observed to increase from a nadir of ~200 in 2009 to ~300 in 2018 [7]. Reducing SIDS in a just, effective manner will require addressing these disparities.

There is active discussion in the scientific literature on how best to adjust prevention strategies. Some researchers argue there remains untapped potential in addressing external factors that are named by AAP guidelines but get less attention than the ABCs of safe sleep. For example, use of a pacifier, breastfeeding, and smoking and alcohol avoidance are all recommended protective practices against SUID that get relatively less attention in educational materials [8]. Pretorius and Rew [9] argue that targeting such factors would be particularly effective if implemented outside of the hospital in the clinic or community, where caregivers have greater time and mental capacity to internalize risk reduction strategies.

An example of such an approach in Chicago, IL, was demonstrated by Rasinski et al [10], when they targeted educational campaigns to Black communities estimated to be most at risk for SIDS. They found mixed results from their quasi-experimental design, where some risk behaviors like usage of an improper sleep surface were decreased but other behaviors like placement of extraneous materials in the sleep environment actually increased.

In response to such mixed results, Matoba and Collins [11] argue that to improve high infant mortality rates—and their racially disparate trends—public health practitioners must broaden their focus beyond individual behaviors to neighborhood factors like poverty, air quality, and crime. Indeed, within Chicago, Guest et al [12] demonstrated that geographic variation in infant mortality was significantly associated at the neighborhood level with racial segregation and unemployment.

To counteract these negative neighborhood influences and reduce the community incidence of SUID in a just manner, public health practitioners must precisely target their interventions in space and time. Without spatiotemporal precision, practitioners risk exacerbating inequity by bolstering communities already well resourced while neglecting the communities with the highest need [13].

Objectives

In this study of Cook County, IL, we sought to enable a neighborhood-focused prevention approach by creating a semiautomated method to precisely describe where SUID occurred in the recent past (2015‐2019) and to predict where SUID would occur in the near future (2021‐2025) while pointing to social vulnerability indicators as explanatory variables. Previous analyses have not provided sufficient geospatial detail to name specific places within counties to target for highest priority [14-17], so we also sought to afford greater detail by using “communities” as our primary unit of analysis.

While these analytic efforts were specific to Cook County, we aimed for our methods to be as adaptable as possible to other jurisdictions by using open-source software and data whenever possible [18,19]. We intended for these tools to enable any interested jurisdiction to get assistance to communities most afflicted by SUID in a timely, targeted manner.


Study Design

We conducted a cross-sectional retrospective study on case counts of SUID in “communities” of Cook County, IL.

We used the concept of communities in order to increase the geographic catchment area for case counts (incidence in smaller census tracts was too low to make robust case count predictions) as well as to lend semantic meaning to the geographic unit of analysis, that is, being able to name each unit rather than using a numeric identifier.

Since census tracts and existing community boundaries do not exactly align, we designed a method for resolving differences. First, for each census tract, we calculated its centroid (the geographic center of an irregular shape). Second, we sent each census tract’s centroid to the OpenCage service [20,21], which uses open data to assign labels to geographic points (eg, the corresponding country, state, city, suburb, neighborhood, etc, for a given point). Third, we used a decision rule to resolve the collection of multiple labels for a given census tract into a single “community.” A community roughly corresponded to one of the 77 formally defined Chicago Community Areas (if within city limits) [22] or one of Suburban Cook County’s independent municipalities [23]. Finally, we performed a spatial join of all census tracts with the same community label (ie, combine all census tracts together by their shared borders into a common external border).

Although aggregating to “communities” lent the benefits listed above, it came at the cost of less precision for corresponding social vulnerability covariates, which were aggregated into community-level values by taking simple sums of their estimates at the census tract level without accounting for variations in their margins of error.

Primary Outcome

The primary outcome of interest was the SUID case count in each community. We translated the NICHD’s definition of SUID [1] into a structured query language (SQL) query of the Cook County Medical Examiner Office’s Archive (Figure 1). Each case identified by the query was validated by team members from the SUID Case Registry for Cook County, who also shared cases that were missed by the query process. Each validated case was geocoded confidentially with latitude and longitude by using ArcGIS Pro geocoding tools behind Cook County’s firewall. Geocoded cases occurring in each community were aggregated into case counts.

Figure 1. SQL query of the medical examiner’s archive for identifying cases of sudden unexpected infant death (SUID) in communities of Cook County, IL, from 2015 to 2019. The “WHERE CASENUMBER IN” clause is censored here for confidentiality, but it contained a list of additional case numbers recommended for inclusion by team members from the SUID Case Registry for Cook County.

Temporal Setting

The year of origin for each data variable was context-specific. SUID case counts were observed from 2015 to 2019. A descriptive comparison of census tracts with and without SUID present was performed on variables from contemporaneous years. We retrospectively trained a model predicting SUID case counts from 2015 to 2019 using predictor covariates from 2014. Using the trained model, we made prospective predictions for SUID case counts from 2021 to 2025 with covariates from 2020.

Coding Pipeline

We performed all steps in the data pipeline downstream from the SQL query in the R computing environment [24] using the “tidyverse” suite of packages [25] (The R Foundation). We used additional data cleaning convenience functions from the “janitor” and “RSocrata” packages [26,27]. The data pipeline was orchestrated using the “targets” package [28]. We generated data tables for publication using the “gt” package and its companion {gtsummary} [29,30]. We performed geospatial manipulation and mapping with the “sf,” “leaflet,” “tmap,” “opencage,” and “tigris” packages [31-34] along with their underlying computational infrastructure [35-41].

Statistical Analysis

Descriptive Comparison

We compared communities with and without observed cases of SUID using median values (and IQRs) of demographic variables. Median values were reported because most of the variables did not approximate normal distributions. Comparisons between communities with and without observed SUID were statistically conducted with the Wilcoxon rank sum test.

Predictive Modeling

We sourced all predictive covariates from US Census’ “American Community Survey” (ACS) and from the Agency for Toxic Substances and Disease Registry’s associated “Social Vulnerability Index” (SVI), which has 4 thematic domains: Socioeconomic (eg, estimate of people living below the poverty line), Household Composition and Disability (eg, estimate of single-parent households with children under 18), Minority Status and Language (eg, estimate of all non-White, non-Hispanic people), and Housing Type and Transportation (eg, estimate of households with more occupants than rooms) [42,43].

We modeled SUID case counts in each census tract using maximum likelihood estimation via the “MASS” R package [44]. We used the negative binomial family of generalized linear models instead of the Poisson family because overdispersion was detected [45]. A random forest model was considered as an alternative to negative binomial, but stakeholders expressed preference for a model where results could be explained by explicit predictor variables and their covariates.

The selection of model forms and predictors is described in detail in Multimedia Appendix 1. Briefly, we started with the total population as an offset variable for use in the model. Next, to consider other predictor candidates, we calculated the Pearson correlation coefficients between the SUID case count and each underlying variable in the SVI [43]. One to two promising variables from each SVI thematic domain were assessed in the model using an additive step-wise strategy. The performance parameters assessed were Akaike information criteria and Bayesian information criteria, root mean squared error, and Nagelkerke R2 using the “EasyStats” suite of R packages [46]. All predictor variables were log-transformed in the final model in order to reduce the influence of extreme values.

Ethical Considerations

To validate our query of the Cook County medical examiner’s database, infant mortality in the medical examiner data was shared with the county’s infant mortality review panel, who already had access to these data. As a population health study using publicly available data sources on decedents, this study was exempt from institutional review board review and informed consent processes. The analytic dataset was deidentified, and for publication, data were aggregated to protect affected families.

Data Availability

An export of the analytic dataset is available in Multimedia Appendix 2 or in an open web-based SQL database on DoltHub [19].


Case Identification

Our SQL query identified 333 prospective cases of SUID from the medical examiner archives. Reviewers from the SUID Case Registry of Cook County recommended adding 16 cases and removing 7 cases. Using all cases of death in children under 1 year of age as the denominator, our electronic query achieved a sensitivity of 326/342 (95%; 95% CI 93%‐97%) and a specificity of 102/109 (98%; 95% CI 94% to 100%). One case could not be used in subsequent spatial analyses because there was no associated address, which led to a final case count of 325.

Geospatial Aggregation

We spatially aggregated cases into counts within 199 communities; 49% of communities (97/199) observed at least one case of SUID from 2015 to 2019. Table 1 shows the full distribution of case counts.

Table 1. Cross-sectional distribution of sudden unexpected infant death (SUID) case counts in communities of Cook County, IL, from 2015 to 2019.
SUID case countFrequency, n (%)
0102 (51.3)
136 (18.1)
224 (12.1)
38 (4.0)
47 (3.5)
53 (1.5)
64 (2.0)
75 (2.5)
81 (0.5)
92 (1.0)
103 (1.5)
111 (0.5)
120 (0)
130 (0)
141 (0.5)
150 (0)
160 (0)
172 (1.0)

Mapping SUID Case Counts

We generated an interactive map of SUID case counts (Figure 2), which can be viewed online [47]. The map showed subjective clusters of cases on the West and South Sides of Chicago as well as the South Suburbs of Cook County. The top 10 communities with the largest case counts were all located within Chicago city limits: Austin (17 cases), Englewood (17 cases), West Englewood (14 cases), West Pullman (11 cases), Humboldt Park (10 cases), New City (10 cases), Roseland (10 cases), North Lawndale (9 cases), South Shore (9 cases), and the Near North Side (8 cases). The community outside of Chicago city limits with the largest case count was Chicago Heights (7 cases).

Figure 2. Snapshot of an interactive map of cross-sectional case counts of sudden unexpected infant death (SUID) in communities of Cook County, IL, from 2015 to 2019.

Comparing Communities Affected Versus Unaffected by SUID

Table 2 compares the demographics of communities observed with at least one case of SUID versus those without. There were statistically significant differences in age and sex composition, although the absolute differences were slight. In terms of race, communities with SUID present exhibited a lower composition of Asian residents at 2% versus 5% (P<.001) and non-Hispanic White residents at 17% versus 60% (P<.001; Figure 3). Conversely, communities with SUID present exhibited a higher composition of non-Hispanic Black residents at 32% versus 3% (P<.001; Figure 4). There was no statistically significant difference in Hispanic composition. Variables algorithmically selected for use in the predictive model all showed statistically significant differences as well.

Table 2. A cross-sectional comparison of demographics in communities of Cook County, IL, affected versus unaffected by sudden unexpected infant death (SUID) from 2015 to 2019.
VariableSUID present (n=97), median (IQR)No SUID (n=102), median (IQR)P valuea
Age
Median age (years)37 (34-40)40 (36-43)<.001
Sex
Sex ratio (males per 100 females)91 (84-97)96 (91-101).001
Race or ethnicity
American Indian and Alaska Native, any (%)1 (0-1)1 (0-1).05
Asian, any (%)2 (1-6)5 (2-11)<.001
Non-Hispanic Black, alone (%)32 (6-79)3 (2-10)<.001
Non-Hispanic White, alone (%)17 (4-49)60 (31-76)<.001
Hispanic, any (%)14 (4-28)14 (7-33).20
Modeling variables
Total population23,810 (13,811-47,181)14,426 (8446-26,577)<.001
Total people living below poverty4,354 (1873-8421)1252 (578-2692)<.001
Total crowded households284 (116-639)116 (38-330)<.001

aWilcoxon rank sum test.

Figure 3. A cross-sectional comparison of White racial composition in communities of Cook County, IL, affected versus unaffected by sudden unexpected infant death (SUID) from 2015 to 2019.
Figure 4. A cross-sectional comparison of Black racial composition in communities of Cook County, IL, affected versus unaffected by sudden unexpected infant death (SUID) from 2015 to 2019.

Modeling SUID Case Counts

We fit a negative binomial regression model to predict SUID case counts for each community of Cook County based on total population, total people living below the poverty line, and total households with more occupants than rooms (“crowded” households; Table 3). The model was retrospectively trained on SUID case counts for 2015‐2019 with predictor variables from the 2014 SVI and ACS. Figure 5 depicts goodness of fit for the model on the training data, showing that it captured mid-range counts well, but underpredicted the number of communities with zero cases, overpredicted those with one case, and underpredicted those with counts over 6. Evaluation of the model’s predictive performance showed a Nagelkerke R2 value of 70.2% and a root mean squared error of 17.49. See Multimedia Appendix 1 for further details on evaluation of performance.

The median difference between predicted and observed case counts in the training data was 0.24 (IQR −0.57 to 0.60). All 5 communities with the highest observed case counts (Austin, Englewood, Humboldt Park, West Englewood, and West Pullman) were also among the communities with the most significant individual underpredictions (ranging from 4.05 to 7.61 less than that observed). On the other end of the spectrum, the 5 communities with the most significant individual overpredictions were Chicago Lawn, Douglas, Grand Boulevard, Kenwood, and Rogers Park (ranging from 2.94 to 4.77 more than that observed; data available in Multimedia Appendix 2).

Table 3. Exponentiated parameters for a negative binomial regression model predicting case counts of sudden unexpected infant death in communities of Cook County, IL.a
ParameterRRbSE95% CIz-scoreP value
Intercept0.001340.001651.12e−04 to 0.01−4.30<.001
log(Total crowded households)0.687920.079170.55 to 0.87−2.59.001
log(Total people living below poverty)6.065471.017104.42 to 8.448.58<.001
log(Total population)0.551810.093390.40 to 0.77−2.80<.001

aTraining data used outcome case counts from 2015 to 2019 and predictor variables from 2014.

bRR: risk ratio. Interpreted as the magnitude by which you would multiply the risk with a unit increase of 1 for each covariate.

Figure 5. Rootogram depicting goodness of fit for a negative binomial regression model predicting case counts of sudden unexpected infant death (SUID) in communities of Cook County, IL. Training data used outcome case counts from 2015 to 2019 and predictor variables from 2014. Observed case counts are the bars “hanging” from predicted case counts “pinned” at the dots. For aggregate underpredictions, the bar hangs below zero; for aggregate overpredictions, the bar hangs above zero.

Prospective Model Predictions

We applied predictor variables from the 2020 SVI and ACS to our trained model to predict case counts for 2021‐2025. Most of the top 10 communities predicted to have the highest case counts for 2021‐2025 were the same as those observed in the top 10 for 2015‐2019 except for Chicago Lawn, Auburn Gresham, and South Lawndale with predicted changes of +8, +5, and +3, respectively (Table 4). Figure 6 depicts these communities in spatial relation to each other.

Table 4. Top 10 predicted case counts for a negative binomial regression model predicting case counts of sudden unexpected infant death in communities of Cook County, IL.a
2021‐20252015‐2019
CommunityPredicted countPredicted changebPredicted countObserved countResidualc
Austin17+01117−6
Englewood14−3917−8
Auburn Greshamd12+577+0
Chicago Lawnd12+874+3
South Shore11+289−1
Humboldt Park10+0610−4
North Lawndale10+189−1
South Lawndaled10+377+0
New City9−1510−5
West Englewood9−5714−7

aTraining data used outcome case counts from 2015 to 2019 and predictor variables from 2014. Forward predictions were made for the period from 2021 to 2025 using predictor variables from 2020.

b(Prediction from 2021 to 2025) − (Observation from 2015 to 2019).

c(Prediction from 2015 to 2019) − (Observation from 2015 to 2019).

dNew to the top 10 (not in the top 10 for observed case counts from 2015 to 2019).

Figure 6. A map of top 10 predicted case counts for a negative binomial regression model predicting case counts of sudden unexpected infant death (SUID) in communities of Cook County, IL, from 2021 to 2025. Labels: community name, predicted case count, predicted change from 2015‐2019.

Principal Results

Our process for querying data from the Cook County Medical Examiner was able to identify cases of SUID from 2015 to 2019 with a high degree of fidelity to those identified by the county’s official registry. We aggregated these cases to the “community” level and found that those containing a case of SUID during the study period also had higher proportions of Black residents and lower proportions of Asian and White residents. We built a negative binomial regression model to predict SUID case counts and achieved moderate accuracy, especially within the mid-range of case counts. As an aide for deciding where to target preventive services, we mapped the communities most affected by SUID. Our results suggest that intervention programs should focus efforts on the north and west sides of Chicago and in the south suburbs of Cook County. The top 10 communities predicted to have the highest case counts were also in these general areas. Chicago Lawn was predicted to emerge with the largest predicted change of +8 cases.

Implications

We suggest our analytic outputs should be used in different ways based on their temporal context. Within the model training period of 2014‐2019, residuals between predictions and observations represent promising routes of inquiry for expanding understanding of community-level ecological factors correlating with SUID incidence. For example, public health practitioners might consider intensive qualitative inquiry in the communities most underpredicted (like Austin and Englewood) to identify new risk factors and in communities most overpredicted (like Douglas and Grand Boulevard) to identify new protective factors unaccounted for by the model. After the model training period in 2020‐2025, our prospective predictions represent opportunities to anticipate shifting dynamics in the communities most affected by SUID.

Regardless of the temporal context, all results from this study can be used to guide a targeted, local approach to SUID prevention. Turman and Swigonski [48] proposed a framework for such an approach. Both high-level goals of their framework hinged on identifying specific zip codes with the highest infant mortality rates in Central Indiana. The first goal focused on developing infrastructure in those zip codes to support healthy pregnancies and infants. For example, they increased the capacity of local early childhood education programs, which both provided childcare for mothers seeking employment and served as vehicles for education on safe sleep practices. The second goal focused on training women from those zip codes as grass roots maternal child health leaders. This framework is just as applicable to Cook County as it is to Central Indiana. Many of the key institutional partners like a local Fetal Infant Mortality Review Board are already in place and could use the maps and predictions from this study to identify targeted communities most at risk.

For a variety of reasons, more work must be done before designing interventions to apply within the framework above. First, because our model is designed for prediction rather than causal inference, differences in model covariates should not be interpreted as directly leading to differences in SUID incidence. Hence, we would caution against trying to act on these covariates in isolation. Second, the SVI on its own does not capture many population-level factors that would be directly relevant to modifiable, individual-level factors specific to SUID (eg, safe sleep practice, smoking, alcohol use, illicit drug use, and engagement in prenatal care) [49]. Additional work must be done to acquire data on these factors in our jurisdiction. Finally, one Cochrane synthesis of evidence suggests that a key source of data for health interventions is generated by direct conversation with local stakeholders in communities identified as most affected [50], which has yet to be performed. If local coalitions express preference for a menu of potential options, they might consider deploying targeted digital assessments of in-home sleep environments [51], smoking cessation campaigns, or educational programming in community-based organizations.

Comparison With Prior Work

Our findings are consistent with the broader body of research on sociodemographics related to infant mortality. Several other studies have also identified a correlation between infant mortality and community-level factors like alcohol or drug use, education, employment, immigration, insurance, involvement in child protective services, poverty, racism, and racial segregation [12,15,16,52-55]. Some studies have posited direct causal relationships between these factors and infant mortality, but the evidence is still equivocal. Both Hearst et al [55] and Johnson et al [54] used propensity score matching as a means of isolating such causal effects, the former for residential segregation on Black infant mortality and the latter for neighborhood poverty on American Indian infant mortality. Both were unable to detect an influence but emphasized that their inability to detect effects may have been due to limitations in the size of sampling pools to achieve adequate counterfactual comparisons. Our study does not attempt to identify causal effects and instead focuses on helping local health departments target precise regions of their jurisdictions for intervention.

Our study advances geospatial research on SUID in 3 major ways. First, we added finer detail to the available Chicago-based maps by using census tracts and “communities” as areal units, with the added benefit that these can be linked to census-derived demographics. Second, we added interactive capability to our maps for greater utility to practitioners wanting to explore the data first-hand. Third, to our knowledge, this is the first study to have made prospective predictions on the spatial incidence of SUID, helping to anticipate changing dynamics in regions of interest. In comparison, Briker et al [14] performed a different but complementary analysis of Cook County data, visualizing SUID incidence in 2015‐2016 by kernel density estimation (KDE). Relative to our method of displaying counts per “community” areal unit, KDE has the advantage of smoothing out random variation observed in the data to make clusters more apparent. With this method, the authors also found strong clusters of SUID on the West and South Sides of Chicago. The comparative disadvantages of KDE are that the visualized kernels do not have one-to-one matches with real-world administrative boundaries, and the units of intensity are less interpretable than counts. Also, because the map in Briker et al was not interactive, it was more difficult to ascertain specific high-risk communities. In another study, Drake et al [15] used KDE to visualize SUID incidence in Harris County, TX, and this approach had the same advantages and disadvantages as described above. Fee and Tarrell [16] used analogous techniques to those used in our study by visualizing incidence in administrative areal units of Douglas County, NE. These authors were also able to correlate incidence with other variables, although their variables were more relevant to individual-level risk (eg prenatal care and tobacco use) rather than the population-level approach used in our study. In an older seminal study, Grimson et al [17] focused on demonstrating statistical methods for identifying geospatial clusters. Their work is relevant to our study because the authors used SUID incidence as their example use-case, but our study was not focused on identifying statistically significant clusters and was more finely detailed on a smaller geographic scale.

Limitations

Our study has limitations. First, a Durbin-Watson test of our model for autocorrelated residuals rejected the null hypothesis, suggesting that observations were not independent from each other. This was at least partially due to the interconnected nature of census tracts in physical space. Indeed, a Moran I test also rejected the null hypothesis, suggesting the presence of spatial autocorrelation. Second, our model did not account for measurement error in our covariates, which are estimated by the ACS using subsamples of the population. Failure to account for the margins of error and for spatial autocorrelation risked introducing systematic bias into our model predictions [56]. This was likely compounded by aggregating estimates for census tracts into the “community” areal unit. Trade-offs for this approach are described in the Methods section.

One means of addressing these first two limitations would be to implement a hierarchical Bayesian model of spatial measurement error [57]. We attempted to do so using the “geostan” R package [58], but preliminary attempts yielded warnings from the software that sampling chains did not converge--significantly raising the risk of inaccurate parameter estimates. An additional limitation, which may have contributed to nonconvergence, was that we did not have access to estimates of the true denominator for the incidence of SUID (the count of live births in each areal unit during the study period). The model implemented in geostan requires an offset denominator variable, so we used population counts of children under 5 as a proxy, but this may have contributed to additional imprecision that hampered the sampling algorithm’s ability to converge.

A final limitation was that not enough time had passed to empirically assess our model’s prospective predictions. Decision makers at local health departments might be more confident in the model’s performance if they could see how it fared at predicting outside of the training dataset. We suggest mitigating this limitation by comparing prospective predictions with retrospective observations, both of which together make a more compelling argument for risk than either piece of information alone.

Conclusions

Like many public health jurisdictions throughout the United States, Cook County, IL, is mired in persistent rates of SUID tied to racial and socioeconomic disparities. Our semiautomated process for compiling, analyzing, and predicting cases may allow intervention programs to more quickly and effectively address such disparities with efforts targeted at communities most in need of prevention services.

Acknowledgments

This work was supported by a grant to Cook County Health from the US Department of Justice's Office for Victims of Crime: Reducing Child Fatalities and Recurring Child Injuries Caused by Crime Victimization (award #2019-V3-GX-K004). The authors thank Kyran Quinlan and his team at the SUID Case Registry for Cook County for reviewing cases identified by our querying method. The authors also thank Joseph Feinglass, Phil Fontanarosa, and Sijia Wei for early feedback on this manuscript.

Authors' Contributions

DPR contributed to conceptualization, primary analysis, original draft preparation, and review/editing. HZ contributed to data curation, analysis, and review/editing. WET contributed to conceptualization, supervision, and review/editing.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Model selection and evaluation.

PDF File, 365 KB

Multimedia Appendix 2

Data export.

CSV File, 2477 KB

  1. Common SIDS and SUID terms and definitions. Safe to Sleep. URL: https://safetosleep.nichd.nih.gov/safesleepbasics/SIDS/Common [Accessed 2022-10-25]
  2. Moon RY, Carlin RF, Hand I, The Task Force on Sudden Infant Death Syndrome, The Committee on Fetus and Newborn. Evidence base for 2022 updated recommendations for a safe infant sleeping environment to reduce the risk of sleep-related infant deaths. Pediatrics. 2022;150(1):e2022057991. [CrossRef]
  3. Murphy SL, Kochanek KD, Xu J, Arias E. Mortality in the United States, 2020. National Center for Health Statistics. Dec 2021. URL: https://www.cdc.gov/nchs/products/databriefs/db427.htm [Accessed 2024-07-26]
  4. VizHub GBD Compare. Institute for Health Metrics and Evaluation/University of Washington. URL: http://vizhub.healthdata.org/gbd-compare [Accessed 2021-11-19]
  5. Data and statistics. Centers for Disease Control and Prevention. 2023. URL: https://www.cdc.gov/sids/data.htm [Accessed 2023-09-07]
  6. Leong T, Billaud M, Agarwal M, et al. As easy as ABC: evaluation of safe sleep initiative on safe sleep compliance in a freestanding pediatric hospital. Inj Epidemiol. May 2019;6(Suppl 1):26. [CrossRef] [Medline]
  7. Illinois infant mortality report. Illinois Department of Public Health, Office of Women’s Health and Family Services; Dec 2020. URL: https:/​/dph.​illinois.gov/​content/​dam/​soi/​en/​web/​idph/​files/​publications/​illinois-infant-mortality-data-report-2020-december-0.​pdf [Accessed 2024-07-26]
  8. Schiraldi V. Can we eliminate the youth prison? (and what should we replace it with?). The Square One Project. 2020. URL: https:/​/squareonejustice.​org/​wp-content/​uploads/​2020/​06/​CJLJ8234-Square-One-Youth-Prisons-Paper-200616-WEB.​pdf [Accessed 2020-10-01]
  9. Pretorius K, Rew L. Outpatient- or community-based interventions to prevent SIDS and sleep-related deaths. J Spec Pediatr Nurs. Apr 2020;25(2):e12279. [CrossRef] [Medline]
  10. Rasinski KA, Kuby A, Bzdusek SA, Silvestri JM, Weese-Mayer DE. Effect of a sudden infant death syndrome risk reduction education program on risk factor compliance and information sources in primarily black urban communities. Pediatrics. Apr 2003;111(4 Pt 1):e347-e354. [CrossRef] [Medline]
  11. Matoba N, Collins JW. Racial disparity in infant mortality. Semin Perinatol. Oct 2017;41(6):354-359. [CrossRef] [Medline]
  12. Guest AM, Almgren G, Hussey JM. The ecology of race and socioeconomic distress: infant and working-age mortality in Chicago. Demography. Feb 1998;35(1):23-34. [Medline]
  13. Horton R. Offline: in defence of precision public health. Lancet. Oct 2018;392(10157):1504. [CrossRef]
  14. Briker A, McLone S, Mason M, Matoba N, Sheehan K. Modifiable sleep-related risk factors in infant deaths in Cook County, Illinois. Inj Epidemiol. May 2019;6(Suppl 1):24. [CrossRef] [Medline]
  15. Drake SA, Wolf DA, Yang Y, et al. A descriptive and geospatial analysis of environmental factors attributing to sudden unexpected infant death. Am J Forensic Med Pathol. Jun 2019;40(2):108-116. [CrossRef] [Medline]
  16. Fee CE, Tarrell A. Geographical analysis of sudden infant death syndrome (SIDS) and associated risk factors in Douglas County, Nebraska. EMET Projects. 2017. URL: https://digitalcommons.unmc.edu/cgi/viewcontent.cgi?article=1001&context=emet_posters [Accessed 2022-08-22]
  17. Grimson RC, Wang KC, Johnson PWC. Searching for hierarchical clusters of disease: spatial patterns of sudden infant death syndrome. Soc Sci Med D: Med Geogr. May 1981;15(2):287-293. [CrossRef]
  18. Riggins DP. Cook_county_sids_mortality. GitHub. URL: https://github.com/andtheWings/cook_county_sids_mortality [Accessed 2024-07-26]
  19. Riggins DP. Suid_in_cook_county_illinois. GitHub. URL: https://www.dolthub.com/repositories/danriggins/suid_in_cook_county_illinois/doc/main [Accessed 2024-07-26]
  20. Possenriede D, Sadler J, Salmon M. Opencage. URL: https://docs.ropensci.org/opencage/index.html [Accessed 2024-07-26]
  21. OpenCage geocoding API. OpenCage. URL: https://opencagedata.com/ [Accessed 2024-07-26]
  22. Boundaries - community areas (current). Chicago Data Portal. URL: https:/​/data.​cityofchicago.org/​Facilities-Geographic-Boundaries/​Boundaries-Community-Areas-current-/​cauq-8yn6 [Accessed 2023-05-23]
  23. Cook County municipalities. Cook County Government. 2024. URL: https://datacatalog.cookcountyil.gov/Economic-Development/Cook-County-Municipalities/65nw-e4gp [Accessed 2024-07-26]
  24. R programming language. The R Foundation. URL: https://www.r-project.org/ [Accessed 2024-07-26]
  25. Tidyverse. URL: https://www.tidyverse.org/ [Accessed 2024-07-26]
  26. Firke S. Janitor. URL: https://sfirke.github.io/janitor/ [Accessed 2024-07-26]
  27. Devlin H, Schenk T, Leynes G, et al. RSocrata. URL: https://cran.r-project.org/web/packages/RSocrata/index.html [Accessed 2024-07-26]
  28. Landau WM. The targets R package: a dynamic make-like function-oriented pipeline toolkit for reproducibility and high-performance computing. rOpenSci. URL: https://docs.ropensci.org/targets/ [Accessed 2024-08-07]
  29. Iannone R, Cheng J, Schloerke B, et al. gt: easily create presentation-ready display tables. RStudio. URL: https://gt.rstudio.com/ [Accessed 2024-08-07]
  30. Sjoberg DD, Larmarange J, Curry M, et al. gtsummary. RStudio. URL: https://www.danieldsjoberg.com/gtsummary/ [Accessed 2024-08-07]
  31. Pebesma E. Simple features for R. GitHub. URL: https://r-spatial.github.io/sf/index.html [Accessed 2024-08-07]
  32. Cheng J, Schloerke B, Karambelkar B, Xie Y. An R interface to leaflet maps. GitHub. URL: https://rstudio.github.io/leaflet/ [Accessed 2024-08-07]
  33. Tennekes M. tmap: thematic maps in R. GitHub. URL: https://r-tmap.github.io/tmap/ [Accessed 2024-08-07]
  34. Walker K. tigris: load census TIGER/line shapefiles. GitHub. URL: https://cran.rstudio.com/web/packages/tigris/ [Accessed 2024-08-07]
  35. CARTO basemap styles. GitHub. URL: https://github.com/CartoDB/basemap-styles [Accessed 2024-08-07]
  36. Warmerdam F, Rouault EG. GDAL. URL: https://gdal.org/ [Accessed 2024-08-07]
  37. GEOS. URL: https://libgeos.org/ [Accessed 2024-08-07]
  38. Agafonkin V. Leaflet. URL: https://leafletjs.com/ [Accessed 2024-08-07]
  39. OpenStreetMap. URL: https://www.openstreetmap.org/ [Accessed 2024-08-07]
  40. PROJ. URL: https://proj.org/index.html [Accessed 2024-08-07]
  41. TIGER data products guide. United States Census Bureau. URL: https://www.census.gov/programs-surveys/geography/guidance/tiger-data-products-guide.html [Accessed 2024-08-07]
  42. American Community Survey data products. United States Census Bureau. URL: https://www.census.gov/programs-surveys/acs/ [Accessed 2024-08-07]
  43. Social Vulnerability Index data products. Agency for Toxic Substances and Disease Registry. URL: https://www.atsdr.cdc.gov/placeandhealth/svi/index.html [Accessed 2024-08-07]
  44. Ripley B, Venables B, Bates DM, et al. MASS package. The R Project for Statistical Computing. URL: https://cran.r-project.org/web/packages/MASS/index.html [Accessed 2024-08-07]
  45. Gelman A, Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press; 2006.
  46. Lüdecke D, Makowski D, Ben-Shachar S, et al. easystats: an R framework for easy statistical modeling, visualization, and reporting. GitHub. URL: https://easystats.github.io/easystats/ [Accessed 2024-08-07]
  47. Riggins DP. SUID case counts, 2015-2019, communities of Cook County, IL. Danielriggins.com. URL: http://danielriggins.com/widgets/suid_counts_cook_county_communities.html [Accessed 2024-08-07]
  48. Turman JE, Swigonski NL. Changing systems that influence birth outcomes in marginalized zip codes. Pediatrics. Jul 2021;148(1):e2020049651. [CrossRef] [Medline]
  49. Perrone S, Lembo C, Moretti S, et al. Sudden infant death syndrome: beyond risk factors. Life (Basel). Feb 26, 2021;11(3):184. [CrossRef] [Medline]
  50. Anderson LM, Adeney KL, Shinn C, Safranek S, Buckner-Brown J, Krause LK. Community coalition-driven interventions to reduce health disparities among racial and ethnic minority populations. Cochrane Database Syst Rev. Jun 15, 2015;2015(6):CD009905. [CrossRef] [Medline]
  51. Nabaweesi R, Whiteside-Mansell L, Mullins SH, Rettiganti MR, Aitken ME. Field assessment of a safe sleep instrument using smartphone technology. J Clin Transl Sci. Dec 19, 2019;4(5):451-456. [CrossRef] [Medline]
  52. Chambers BD, Arabia SE, Arega HA, et al. Exposures to structural racism and racial discrimination among pregnant and early post-partum black women living in Oakland, California. Stress Health. Apr 2020;36(2):213-219. [CrossRef] [Medline]
  53. Bandoli G, Baer RJ, Owen M, et al. Maternal, infant, and environmental risk factors for sudden unexpected infant deaths: results from a large, administrative cohort. J Matern Fetal Neonatal Med. Dec 2022;35(25):8998-9005. [CrossRef] [Medline]
  54. Johnson PJ, Oakes JM, Anderton DL. Neighborhood poverty and American Indian infant death: are the effects identifiable? Ann Epidemiol. Jul 2008;18(7):552-559. [CrossRef] [Medline]
  55. Hearst MO, Oakes JM, Johnson PJ. The effect of racial residential segregation on black infant mortality. Am J Epidemiol. Dec 1, 2008;168(11):1247-1254. [CrossRef] [Medline]
  56. Bazuin JT, Fraser JC. How the ACS gets it wrong: the story of the American community survey and a small, inner city neighborhood. Appl Geogr. Dec 2013;45:292-302. [CrossRef]
  57. Bernardinelli L, Pascutto C, Best NG, Gilks WR. Disease mapping with errors in covariates. Stat Med. Apr 15, 1997;16(7):741-752. [CrossRef] [Medline]
  58. Donegan C. geostan: bayesian spatial analysis. GitHub. URL: https://connordonegan.github.io/geostan/ [Accessed 2024-08-07]


AAP: American Academy of Pediatrics
ABCs: An infant should sleep Alone on their Back in a Crib to promote safe sleep
ACS: American Community Survey
KDE: kernel density estimation
NICHD: National Institute of Child Health and Human Development
SIDS: sudden infant death syndrome
SQL: structured query language
SUID: sudden unexpected infant death
SVI: Social Vulnerability Index


Edited by Amaryllis Mavragani, Travis Sanchez; submitted 23.05.23; peer-reviewed by Carolyn Ahlers-Schmidt, Colin Rogerson, Godwin Osei-Poku; final revised version received 01.02.24; accepted 05.03.24; published 20.08.24.

Copyright

© Daniel P Riggins, Huiyuan Zhang, William E Trick. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 20.8.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.