Digital SARS-CoV-2 Detection Among Hospital Employees: Participatory Surveillance Study

doi:10.2196/33576

Original Paper

¹Department of Economics, University of Zurich, Zurich, Switzerland

²Clinic for Infectious Diseases and Hospital Epidemiology, Cantonal Hospital St. Gallen, St Gallen, Switzerland

³Medical Research Center, Cantonal Hospital St. Gallen, St Gallen, Switzerland

⁴Federal Office of Public Health, Bern, Switzerland

⁵Department of Infectious Diseases and Hospital Epidemiology, Children’s Hospital of Eastern Switzerland, St Gallen, Switzerland

*these authors contributed equally

Corresponding Author:

Onicio Leal-Neto, MSc, PhD

Department of Economics

University of Zurich

Schönberggasse 1

Zurich, 8001

Switzerland

Phone: 41 783242116

Email: onicio@gmail.com

Background: The implementation of novel techniques as a complement to traditional disease surveillance systems represents an additional opportunity for rapid analysis.

Objective: The objective of this work is to describe a web-based participatory surveillance strategy among health care workers (HCWs) in two Swiss hospitals during the first wave of COVID-19.

Methods: A prospective cohort of HCWs was recruited in March 2020 at the Cantonal Hospital of St. Gallen and the Eastern Switzerland Children’s Hospital. For data analysis, we used a combination of the following techniques: locally estimated scatterplot smoothing (LOESS) regression, Spearman correlation, anomaly detection, and random forest.

Results: From March 23 to August 23, 2020, a total of 127,684 SMS text messages were sent, generating 90,414 valid reports among 1004 participants, achieving a weekly average of 4.5 (SD 1.9) reports per user. The symptom showing the strongest correlation with a positive polymerase chain reaction test result was loss of taste. Symptoms like red eyes or a runny nose were negatively associated with a positive test. The area under the receiver operating characteristic curve showed favorable performance of the classification tree, with an accuracy of 88% for the training data and 89% for the test data. Nevertheless, while the prediction matrix showed good specificity (80.0%), sensitivity was low (10.6%).

Conclusions: Loss of taste was the symptom that was most aligned with COVID-19 activity at the population level. At the individual level—using machine learning–based random forest classification—reporting loss of taste and limb/muscle pain as well as the absence of runny nose and red eyes were the best predictors of COVID-19.

JMIR Public Health Surveill 2021;7(11):e33576

doi:10.2196/33576

Keywords

digital epidemiology (31); SARS-CoV-2 (444); COVID-19 (3091); health care workers (66)

The COVID-19 pandemic is one of the greatest health challenges that societies around the globe have ever experienced. A range of instruments and ways to measure factors related to COVID-19 and the pandemic have been described [Wirth FN, Johns M, Meurers T, Prasser F. Citizen-Centered Mobile Health Apps Collecting Individual-Level Spatial Data for Infectious Disease Management: Scoping Review. JMIR mHealth uHealth 2020 Nov 10;8(11):e22594 [FREE Full text] [CrossRef] [Medline]1-Parker MJ, Fraser C, Abeler-Dörner L, Bonsall D. Ethics of instantaneous contact tracing using mobile phone apps in the control of the COVID-19 pandemic. J Med Ethics 2020 Jul 04;46(7):427-431 [FREE Full text] [CrossRef] [Medline]7]. COVID-19 presents a challenge for public health in general, while health care workers (HCWs) are at particular risk of acquiring COVID-19 [Adams JG, Walls RM. Supporting the Health Care Workforce During the COVID-19 Global Epidemic. JAMA 2020 Apr 21;323(15):1439-1440. [CrossRef] [Medline]8]. Several studies using online forms have found they can be useful for tracking disease activity in different locations, including workplaces [Sim JXY, Conceicao EP, Wee LE, Aung MK, Wei Seow SY, Yang Teo RC, et al. Utilizing the electronic health records to create a syndromic staff surveillance system during the COVID-19 outbreak. Am J Infect Control 2021 Jun;49(6):685-689 [FREE Full text] [CrossRef] [Medline]9,Kohler PP, Kahlert CR, Sumer J, Flury D, Güsewell S, Leal-Neto OB, et al. Prevalence of SARS-CoV-2 antibodies among Swiss hospital workers: Results of a prospective cohort study. Infect Control Hosp Epidemiol 2021 May;42(5):604-608 [FREE Full text] [CrossRef] [Medline]10]. However, these technological platforms require timely, persistent, and ongoing engagement to generate valid and representative surveillance data [Wirth FN, Johns M, Meurers T, Prasser F. Citizen-Centered Mobile Health Apps Collecting Individual-Level Spatial Data for Infectious Disease Management: Scoping Review. JMIR mHealth uHealth 2020 Nov 10;8(11):e22594 [FREE Full text] [CrossRef] [Medline]1]. In the context of collaboration and the collection of collective health information, digital epidemiology and participatory surveillance techniques have been demonstrated to be tools with great potential for helping to detect health threats [Bach M, Jordan S, Hartung S, Santos-Hövener C, Wright MT. Participatory epidemiology: the contribution of participatory research to epidemiology. Emerg Themes Epidemiol 2017 Feb 10;14(1):2-15 [FREE Full text] [CrossRef] [Medline]11-Wójcik OP, Brownstein JS, Chunara R, Johansson MA. Public health for the people: participatory infectious disease surveillance in the digital age. Emerg Themes Epidemiol 2014 Jun 20;11(1):7-7 [FREE Full text] [CrossRef] [Medline]16]. Many strategies that involve daily reporting of symptoms through the voluntary participation of individuals have reported successful results [Leal Neto O, Dimech GS, Libel M, de Souza WV, Cesse E, Smolinski M, et al. Saúde na Copa: The World's First Application of Participatory Surveillance for a Mass Gathering at FIFA World Cup 2014, Brazil. JMIR Public Health Surveill 2017 May 04;3(2):e26 [FREE Full text] [CrossRef] [Medline]17,Leal Neto O, Cruz O, Albuquerque J, Nacarato de Sousa M, Smolinski M, Pessoa Cesse EÂ, et al. Participatory Surveillance Based on Crowdsourcing During the Rio 2016 Olympic Games Using the Guardians of Health Platform: Descriptive Study. JMIR Public Health Surveill 2020 Apr 07;6(2):e16119 [FREE Full text] [CrossRef] [Medline]18]. Participatory surveillance by patients has been shown to have a complementary role in detecting syndromic clusters for several epidemiological challenges, such as COVID-19, seasonal influenza, or high-risk mass gatherings [Leal Neto O, Dimech GS, Libel M, de Souza WV, Cesse E, Smolinski M, et al. Saúde na Copa: The World's First Application of Participatory Surveillance for a Mass Gathering at FIFA World Cup 2014, Brazil. JMIR Public Health Surveill 2017 May 04;3(2):e26 [FREE Full text] [CrossRef] [Medline]17-Leal-Neto O, Santos F, Lee J, Albuquerque J, Souza W. Prioritizing COVID-19 tests based on participatory surveillance and spatial scanning. Int J Med Inform 2020 Nov;143:104263 [FREE Full text] [CrossRef] [Medline]22]. The implementation of novel techniques represents an additional opportunity for the rapid analysis of big data based on machine learning, thereby acting as a complement to traditional disease surveillance systems.

The objective of this work is to describe a web-based participatory surveillance strategy among HCWs in two Swiss hospitals during the first wave of the COVID-19 pandemic.

Study Design

A prospective cohort of HCWs was recruited in March 2020 at the Cantonal Hospital of St. Gallen and the Eastern Switzerland Children’s Hospital, Switzerland. Individuals aged 16 years and older were eligible. HCWs were enrolled in the study after accepting the electronic informed consent form. The anonymization of participants was carried out by using a management ID system with three levels; we anonymized the participants (user ID), surveys (survey ID), and their samples (order ID). No compensation was provided and participation was voluntary. A copy of the informed consent with all details about privacy and confidentiality is provided in

Multimedia Appendix 1

Informed consent (in German).

PDF File (Adobe PDF File), 206 KB Multimedia Appendix 1. The study was approved by the local ethics committee (Ethikkommission Ostschweiz; #2020-00502). All participants received a link via email to fill in a baseline questionnaire collecting data on pre-existing conditions at the start of the study. To improve the data quality and reduce reporting bias, mobile number validation was required; participants could only move forward if they input a token sent to their mobile phone. After completing the baseline form, participants became eligible to receive the daily SMS text message with an individualized link redirecting them to a secure web platform where they could fill in their symptom diary. To encourage participant engagement through the entire period, an SMS text message reminder was sent to those that did not fill in the symptom diary the day before. In the symptom diary, participants were asked about the type and severity of COVID-19 symptoms according to . Those that met SARS-CoV-2 testing criteria (ie, fever/feverishness, cough, shortness of breath, sore throat, or anosmia/ageusia) according to the Swiss Federal Office of Public Health (FOPH) were asked to schedule an appointment for a nasopharyngeal swab [].

For validation purposes, the positivity rate of the online survey was compared to the positivity rate of HCWs undergoing SARS-CoV-2 polymerase chain reaction (PCR) testing at the study institutions (independent of study participation). We tested both isolated symptoms and various combinations, including the FOPH testing criteria.

Table 1. List of symptoms and consequences.

Survey question topic	Type
Sore throat	Symptom
Cough	Symptom
Shortness of breath	Symptom
Runny nose	Symptom
Headache	Symptom
Diarrhea	Symptom
Anorexia/nausea	Symptom
Fever	Symptom
Chills	Symptom
Limb/muscle pain	Symptom
Loss of taste	Symptom
Itchy red eyes	Symptom
Feeling weak	Symptom
Fever-related muscle pain	Symptom
Took medicines	Consequence
Sought health care	Consequence
Missed work	Consequence
Hospitalized	Consequence

Data Analysis

For the analysis of time trends of symptoms, we used a locally weighted running line smoother (locally estimated scatterplot smoothing [LOESS]) [Garimella R. A Simple Introduction to Moving Least Squares and Local Regression Estimation. Los Alamos National Lab. 2017. URL: https://www.osti.gov/biblio/1367799-simple-introduction-moving-least-squares-local-regression-estimation [accessed 2021-11-17] 24], which is a nonparametric smoother with Gaussian noise added in the sine wave. This algorithm estimates the latent function in a pointwise fashion. This method is a supervised machine learning approach and was carried out to generate a moving average for scatterplot smoothing among the data points. Its function can be expressed as the following:

ω (χ)=(1–|d|³)³

where d is the distance of the data point from the point on the fitter curve, scaled to lie in the range from 0-1. We then used a moving average with 7 days as the window size, aligned on the right.

The Spearman rank correlation coefficient was used to verify the statistical dependence between symptoms and test positivity, using a monotonic function described by the following formula [Cleff T. Exploratory Data Analysis in Business and Economics: An Introduction Using SPSS, Stata, and Excel. Cham: Springer International Publishing; 2014.25]:

It is critical to identify significant temporal deviations throughout the period including the impact of seasonality in such high frequency data inputs. Therefore, we applied the Seasonal-Hybrid Extreme Studentized Deviate (S-H-ESD) algorithm [Hochenbaum J, Vallis OS, Kejariwal A. Automatic anomaly detection in the cloud via statistical learning. arXiv Preprint posted online on April 24, 2017 [FREE Full text]26], which uses a modified Seasonal-Trend decomposition procedure based on LOESS [Cleveland RB, Cleveland WS, McRae JE, Terpenning I. STL: A Seasonal-Trend Decomposition Procedure Based on Loess. Journal of Official Statistics 1990;6(1):3-73 [FREE Full text]27]. This technique allows for the identification of change points over time, recognizing when the signal frequency (FOPH classification) was positive (increasing) or negative (decreasing). The missing value was handled by spline interpolation, the maximal anomaly ratio was 0.1, and a piecewise median time window of 2 weeks was chosen.

Finally, to classify participants according to the probability of having symptoms compatible with COVID-19, we used the random forest algorithm. This is an ensemble learning method based on decision trees, which increases the accuracy of classification for both training and test data [Ho TK. Random decision forests. In: Proceedings of 3rd International Conference on Document Analysis and Recognition. 1995 Presented at: 3rd International Conference on Document Analysis and Recognition; August 14-16, 1995; Montreal, QC p. 278-282. [CrossRef]28]. Specifically, this algorithm is a predictor consisting of an assembly of randomized base regression trees {r_n(x,Θ_m,D_n),m ≥ 1}, where Θ₁,Θ₂,... are independent and identically distributed (IID) outputs of a randomizing variable Θ. These random trees are pooled to form the following aggregated regression estimate [Biau G. Analysis of a random forests model. The Journal of Machine Learning Research 2012;13(1):1063-1095 [FREE Full text]29]:

where denotes expectation with respect to the random parameter, conditionally on X and the data set D_n.

To explain how the random forest technique was used in this study, a summary of its parameters along with a prediction matrix for the model was generated. In addition, a receiver operating characteristic (ROC) curve was created to evaluate the binary classification of the model. We split the data, using 70% of entries for model training and 30% of entries for the test set. To determine which variables were more or less important for predicting the outcome, we used a boxplot chart.

Algorithms and techniques were programmed and deployed in R language, using the Exploratory [Nishida K. Exploratory. 2020. URL: https://exploratory.io [accessed 2021-04-14] 30] framework. The data collection system was developed using JotForm [Jotform. 2020. URL: https://jotform.com [accessed 2021-04-10] 31] as well as a proprietary solution and was hosted at Amazon Web Services, using EC2 and S3 instances. The SMS text messaging system used Twilio’s [Twilio - Communication APIs for SMS, Voice, Video and Authentication. URL: https://www.twilio.com/ [accessed 2021-01-01] 32] application programming interface to send out the messages.

From March 23, 2020, to August 23, 2020, a total of 127,684 SMS text messages were sent, generating 90,414 valid reports among 1004 participants, achieving a weekly average of 4.5 (SD 1.9) reports per user. Female gender (n=755, 75.2%) was more prevalent than male (n=249, 24.8%) among participants, reflecting the general HCW population in these hospitals. The median age was 39 years, with a mean of 40.2 (SD 11.3) years. Figure 1 shows the temporal distribution of symptoms of respiratory infection over the study period, using LOESS regression. In total, 1.49% (n=15) of participants reported a positive PCR result during the study period. The first peak of the bimodal curves clearly parallels the reference curve of individuals in the hospital who tested positive, representing the first COVID-19 wave in the region. The second peak appears between July 2020 and August 2020, with a much lower signal in the reference curve of individuals who tested positive.

Regarding anomaly detection over time, Figure 2 shows whether a signal of symptoms was expected (based in the past trends) or if it represented a positive or negative anomaly, meaning a significant increase or decrease in the frequency of recorded symptoms. Table 2 indicates the change points that were statistically significant, including the difference observed when compared with the expected amount. The positive anomalies happened in three different periods; two of them occurred during the highest activity of the first wave and the third occurred between July and August, representing a possible second wave. However, as mentioned above, no second (or third) wave was seen in the reference curve.

Figure 1. Temporal distribution and LOESS regression of symptoms related to acute respiratory infection in health care workers at two hospitals in Switzerland. FOPH: cases documented by the Federal Office of Public Health; LOESS: locally estimated scatterplot smoothing.

Figure 2. Temporal distribution of the FOPH proportion of positives, indicating which types of anomalies occurred in health care workers in two hospitals in Switzerland. FOPH: Federal Office of Public Health.

Table 2. Significant (P<.05) timepoints for anomaly detection in health care workers, Switzerland.

Date	Federal Office of Public Health proportion of positives	Expected	Difference from expected	Anomaly type
05/04/2020	.324675325	1	–.675324675	Negative
08/04/2020	1.83982684	1	.83982684	Positive
20/04/2020	1.677489177	1	.677489177	Positive
30/04/2020	.91991342	0	.91991342	Negative
12/05/2020	.162337662	0	.162337662	Negative
09/07/2020	.703463203	1	–.296536797	Negative
15/07/2020	.91991342	0	.91991342	Positive
02/08/2020	.216450216	0	.216450216	Negative

A correlation matrix between symptoms and a positive PCR test result for SARS-CoV-2 is shown in Figure 3, while in Figure 4, the significance matrix shows the positive and negative correlations, as well as the nonsignificant ones. The symptom with the strongest correlation with a positive PCR result was loss of taste. Conversely, symptoms such as red eyes or runny nose were negatively associated with a positive test (Table 3).

Figure 3. Correlation matrix using the Spearman method for symptoms and positive results in health care workers in two hospitals in Switzerland during the study period. FOPH: Federal Office of Public Health.

Figure 4. Significance matrix showcasing the positive and negative correlations between variables in health care workers in two hospitals in Switzerland during the study period. A larger dot represents a higher correlation.

Table 3. Correlation between symptoms and positive cases in health care workers in Switzerland for the period of the study.

Symptoms	Correlation	Pairs	P value
Loss of taste	0.5274	Positive	<.001
Federal Office of Public Health definition	0.2189	Positive	<.001
Anorexia/nausea	0.1698	Positive	<.001
Limb/muscle pain	0.1103	Positive	<.001
Cough	0.1032	Positive	<.001
Chills	0.0731	Positive	.002
Headache	0.0279	Positive	.37
Red itchy eyes	–0.1560	Negative	.01
Runny nose	–0.1508	Negative	.001
Fever	–0.1025	Negative	.10
Diarrhea	–0.0770	Negative	.001

Finally, Table 4 shows the summary results from a random forest algorithm that was used to classify participants into SARS-CoV-2 positive and negative cases based on their indicated symptoms. The area under the ROC curve shows reasonable performance of the classification tree, with an accuracy of 88% for the training data and 89% for the test data (Figure 5). Nevertheless, while the prediction matrix showed good specificity (80.0%), sensitivity was low (10.6%; Table 5). Figure 6 shows the importance of symptoms and their capacity to predict the expected outcome based on the random forest algorithm, considering a P value of <.05. Loss of taste and limb/muscle pain were the most important variables for prediction of a positive result, while runny nose and red eyes were negatively correlated with the same outcome. Fever was a very weak predictor of a positive result.

Table 4. Summary of the parameters of the random forest model.

Data set	Area under the curve	F₁ score	Accuracy rate	Misclassification rate	Precision	Recall
Training	.90375	.68027	.8839	.11604	.87719	.5555
Test	.87576	.66331	.89438	.10561	.91304	.5206

Figure 5. Receiver operating characteristic curve for the random forest model.

Table 5. Prediction matrix for the random forest model.

Data set and type (actual)		Data type (predicted)
		TRUE, %		FALSE, %
Test
	TRUE		10.4		9.57
	FALSE		.99		79.04
Training
	TRUE		12.35		9.88
	FALSE		1.73		76.05

Figure 6. Boxplot of the importance of symptoms and their capacity to predict the expected outcome based on the random forest algorithm (P<.05). Loss of taste, limb/muscle pain, FOPH (Federal Office of Public Health), sore throat, cough, and shortness of breath were positively associated with the outcome. Runny nose and red itchy eyes were negatively associated with the outcome. Fever was neither positively nor negatively associated with the outcome.

This study demonstrates the use of digital surveillance to monitor COVID-19 activity among HCWs. Loss of taste was the symptom that was most aligned with COVID-19 activity at the population level. At the individual level, using machine learning–based random forest classification, reporting loss of taste and limb/muscle pain as well as absence of runny nose and red eyes were the best predictors of COVID-19. The main strengths of the study are its high response rate and the comparison to a reference curve, which was based on documented PCR results in the same population.

Syndromic surveillance through participatory surveillance has been shown to be a feasible strategy to monitor COVID-19 activity [Lapointe-Shaw L, Rader B, Astley CM, Hawkins JB, Bhatia D, Schatten WJ, et al. Web and phone-based COVID-19 syndromic surveillance in Canada: A cross-sectional study. PLoS One 2020 Oct 2;15(10):e0239886 [FREE Full text] [CrossRef] [Medline]33], and is considered an important measure to inform the public health response to this pandemic [Budd J, Miller BS, Manning EM, Lampos V, Zhuang M, Edelstein M, et al. Digital technologies in the public-health response to COVID-19. Nat Med 2020 Aug;26(8):1183-1192 [FREE Full text] [CrossRef] [Medline]34]. Considering that engagement is a key element of a successful platform, our study—with an average response of 4.5 answers per week—has an excellent basis to produce valid and representative results. This high rate of engagement and participation is extraordinary when compared to other platforms [Smolinski MS, Crawley AW, Olsen JM, Jayaraman T, Libel M. Participatory Disease Surveillance: Engaging Communities Directly in Reporting, Monitoring, and Responding to Health Threats. JMIR Public Health Surveill 2017 Oct 11;3(4):e62 [FREE Full text] [CrossRef] [Medline]13,Leal Neto O, Dimech GS, Libel M, de Souza WV, Cesse E, Smolinski M, et al. Saúde na Copa: The World's First Application of Participatory Surveillance for a Mass Gathering at FIFA World Cup 2014, Brazil. JMIR Public Health Surveill 2017 May 04;3(2):e26 [FREE Full text] [CrossRef] [Medline]17,Leal Neto O, Cruz O, Albuquerque J, Nacarato de Sousa M, Smolinski M, Pessoa Cesse EÂ, et al. Participatory Surveillance Based on Crowdsourcing During the Rio 2016 Olympic Games Using the Guardians of Health Platform: Descriptive Study. JMIR Public Health Surveill 2020 Apr 07;6(2):e16119 [FREE Full text] [CrossRef] [Medline]18,Lapointe-Shaw L, Rader B, Astley CM, Hawkins JB, Bhatia D, Schatten WJ, et al. Web and phone-based COVID-19 syndromic surveillance in Canada: A cross-sectional study. PLoS One 2020 Oct 2;15(10):e0239886 [FREE Full text] [CrossRef] [Medline]33], especially over a period of 5 months [Brownstein JS, Chu S, Marathe A, Marathe MV, Nguyen AT, Paolotti D, et al. Combining Participatory Influenza Surveillance with Modeling and Forecasting: Three Alternative Approaches. JMIR Public Health Surveill 2017 Nov 01;3(4):e83 [FREE Full text] [CrossRef] [Medline]35]. The easy-to-use survey, the defined population of HCWs from two different hospitals, and the regular interaction with study participants are potential reasons for this high response rate. It remains to be seen if these engagement indexes can be maintained when the study is scaled up to larger communities.

The temporal distribution of symptoms followed the trends represented in the first wave of COVID-19 in Switzerland [COVID-19 Situation Updates. European Centre for Disease Prevention and Control. 2020. URL: https://www.ecdc.europa.eu/en/COVID-19-pandemic [accessed 2020-04-24] 36,Micallef S, Piscopo TV, Casha R, Borg D, Vella C, Zammit M, et al. The first wave of COVID-19 in Malta; a national cross-sectional study. PLoS One 2020;15(10):e0239389 [FREE Full text] [CrossRef] [Medline]37]. However, the signals detected in July were not due to COVID-19, as shown by the reference curve. Interestingly, several HCWs tested positive for rhinovirus during this time period, suggesting that this was the reason for this wave. Of note, loss of taste, the most specific symptom of COVID-19, did not increase during this second wave.

Several other studies have shown that loss of taste is a good proxy for COVID-19 [Spinato G, Fabbris C, Polesel J, Cazzador D, Borsetto D, Hopkins C, et al. Alterations in Smell or Taste in Mildly Symptomatic Outpatients With SARS-CoV-2 Infection. JAMA 2020 May 26;323(20):2089-2090 [FREE Full text] [CrossRef] [Medline]38-Menni C, Valdes AM, Freidin MB, Sudre CH, Nguyen LH, Drew DA, et al. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat Med 2020 Jul 11;26(7):1037-1040 [FREE Full text] [CrossRef] [Medline]41]. Although the specificity of this symptom is excellent, only about 20% of patients report loss of taste [Bénézit F, Le Turnier P, Declerck C, Paillé C, Revest M, Dubée V, RAN COVID Study Group. Utility of hyposmia and hypogeusia for the diagnosis of COVID-19. Lancet Infect Dis 2020 Sep;20(9):1014-1015 [FREE Full text] [CrossRef] [Medline]42]. We conclude that the detection of loss of taste is very helpful to interpret findings at the population level, but less so at the individual patient level because of its low prevalence. The second most important positively associated symptom in our analysis was limb/muscle pain, which has also been noted by others [Nepal G, Rehrig JH, Shrestha GS, Shing YK, Yadav JK, Ojha R, et al. Neurological manifestations of COVID-19: a systematic review. Crit Care 2020 Jul 13;24(1):421-411 [FREE Full text] [CrossRef] [Medline]43]. Remarkably, runny nose and red eyes were very important negative predictors of COVID-19; this finding is particularly useful for when surveillance is performed during allergy season. However, both the sensitivity and specificity of a symptom depend on the background activity of other infections and allergies and might therefore be subject to change. The validity of a symptom may also change due to genetic adaptations in the dominant SARS-CoV-2 strain. During the study period, none of the new variants of SARS-CoV-2 (eg, B.1.1.7/Alpha) were circulating in Switzerland. Therefore, the symptoms described here cannot necessarily be extrapolated to a different circulating SARS-CoV-2 variant. However, syndromic surveillance through participatory surveillance may allow for the detection or validation of a different clinical presentation emerging from a new circulating strain. Indeed, a recent study describes small differences in COVID-19 symptoms in the general population in the United Kingdom depending on the variant [Vihta K, Pouwels K, Peto T, Pritchard E, Eyre DW, House T, COVID-19 Infection Survey team. Symptoms and SARS-CoV-2 positivity in the general population in the UK. Clin Infect Dis 2021 Nov 08:ciab945. [CrossRef] [Medline]44].

Our study has a number of limitations. First, it was performed outside influenza season. Because influenza more often presents with constitutional symptoms than other respiratory viruses, distinguishing influenza from COVID-19 by analysis of symptoms is difficult. Second, we relied on participants self-reporting their symptoms, a method that is prone to bias. Third, generalizability of our data is limited because only one-fifth of the HCWs from our hospitals participated in the study; in addition, the spatial component could not be explored due to these same reasons. At the same time, this would be a very important parameter for evaluating whether SARS-CoV-2 is being regionally distributed, which would be useful to form a complete picture for disease surveillance purposes. The application of classification techniques based on machine learning, such as random forest classification, has its own limitations, as a large number of trees can make the algorithm too slow and ineffective for real-time predictions. In general, these algorithms are fast to train, but quite slow to create predictions once they are trained. A more accurate prediction requires more trees, which results in a slower model.

Nevertheless, we deem the presented surveillance tool highly useful in monitoring and predicting COVID-19 activity among our HCWs. Currently, we have expanded our HCW cohort to include over 5000 participants from over 20 institutions [Kahlert CR, Persi R, Güsewell S, Egger T, Leal-Neto OB, Sumer J, et al. Non-occupational and occupational factors associated with specific SARS-CoV-2 antibodies among hospital workers - A multicentre cross-sectional study. Clin Microbiol Infect 2021 Sep;27(9):1336-1344 [FREE Full text] [CrossRef] [Medline]45]. The analysis of data from different institutions will allow us to detect the clustering of cases in certain institutions, which might trigger targeted intervention measures in affected health care institutions. Additionally, these data allow for the detection of symptomatic HCWs who were either not tested or had a false-negative PCR result, and also for the discrimination of symptoms caused by SARS-CoV-2 from symptoms caused by other viruses, such as influenza. Further questions, which we aim to answer with the surveillance data generated in this larger cohort, include how long HCWs with documented SARS-CoV-2 infection (or vaccination) are protected against reinfection or how the emergence of viral variants might change the symptomatology of COVID-19.

Acknowledgments

This work was supported by the Swiss National Sciences Foundation (grants 31CA30_196544 and PZ00P3_179919 to PK), the Federal Office of Public Health (grant 20.008218/421-28/1), and the research fund of the Cantonal Hospital of St. Gallen. OLN acknowledges support from Rodrigo Paiva in the development of the technological platform.

Authors' Contributions

OLN and PK conceived of the presented idea and wrote the manuscript with support from TE, CK, MS, DF, WA, and PV. OLN carried out the analysis. All authors revised the final version of the manuscript. PK supervised the project.

Conflicts of Interest

None declared.

‎

Multimedia Appendix 1

Informed consent (in German).

PDF File (Adobe PDF File), 206 KB

Wirth FN, Johns M, Meurers T, Prasser F. Citizen-Centered Mobile Health Apps Collecting Individual-Level Spatial Data for Infectious Disease Management: Scoping Review. JMIR mHealth uHealth 2020 Nov 10;8(11):e22594 [FREE Full text] [CrossRef] [Medline]
Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, Imperial College COVID-19 Response Team, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 2020 Aug;584(7820):257-261 [FREE Full text] [CrossRef] [Medline]
Howell O'Neill P, Ryan-Mosley T, Johnson B. A flood of coronavirus apps are tracking us. Now it's time to keep track of them. MIT Technology Review. 2020. URL: https://www.technologyreview.com/2020/05/07/1000961/launching-mittr-covid-tracing-tracker/ [accessed 2021-01-15]
Altmann S, Milsom L, Zillessen H, Blasone R, Gerdon F, Bach R, et al. Acceptability of app-based contact tracing for COVID-19: Cross-country survey evidence. JMIR mHealth uHealth 2020 Jul 24:1-6 [FREE Full text] [CrossRef] [Medline]
Ye Q, Zhou J, Wu H. Using Information Technology to Manage the COVID-19 Pandemic: Development of a Technical Framework Based on Practical Experience in China. JMIR Med Inform 2020 Jun 08;8(6):e19515 [FREE Full text] [CrossRef] [Medline]
Abeler J, Bäcker M, Buermeyer U, Zillessen H. COVID-19 Contact Tracing and Data Protection Can Go Together. JMIR mHealth uHealth 2020 Apr 20;8(4):e19359 [FREE Full text] [CrossRef] [Medline]
Parker MJ, Fraser C, Abeler-Dörner L, Bonsall D. Ethics of instantaneous contact tracing using mobile phone apps in the control of the COVID-19 pandemic. J Med Ethics 2020 Jul 04;46(7):427-431 [FREE Full text] [CrossRef] [Medline]
Adams JG, Walls RM. Supporting the Health Care Workforce During the COVID-19 Global Epidemic. JAMA 2020 Apr 21;323(15):1439-1440. [CrossRef] [Medline]
Sim JXY, Conceicao EP, Wee LE, Aung MK, Wei Seow SY, Yang Teo RC, et al. Utilizing the electronic health records to create a syndromic staff surveillance system during the COVID-19 outbreak. Am J Infect Control 2021 Jun;49(6):685-689 [FREE Full text] [CrossRef] [Medline]
Kohler PP, Kahlert CR, Sumer J, Flury D, Güsewell S, Leal-Neto OB, et al. Prevalence of SARS-CoV-2 antibodies among Swiss hospital workers: Results of a prospective cohort study. Infect Control Hosp Epidemiol 2021 May;42(5):604-608 [FREE Full text] [CrossRef] [Medline]
Bach M, Jordan S, Hartung S, Santos-Hövener C, Wright MT. Participatory epidemiology: the contribution of participatory research to epidemiology. Emerg Themes Epidemiol 2017 Feb 10;14(1):2-15 [FREE Full text] [CrossRef] [Medline]
Koppeschaar CE, Colizza V, Guerrisi C, Turbelin C, Duggan J, Edmunds WJ, et al. Influenzanet: Citizens Among 10 Countries Collaborating to Monitor Influenza in Europe. JMIR Public Health Surveill 2017 Sep 19;3(3):e66 [FREE Full text] [CrossRef] [Medline]
Smolinski MS, Crawley AW, Olsen JM, Jayaraman T, Libel M. Participatory Disease Surveillance: Engaging Communities Directly in Reporting, Monitoring, and Responding to Health Threats. JMIR Public Health Surveill 2017 Oct 11;3(4):e62 [FREE Full text] [CrossRef] [Medline]
Leal-Neto OB, Dimech GS, Libel M, Oliveira W, Ferreira JP. Digital disease detection and participatory surveillance: overview and perspectives for Brazil. Rev Saude Publica 2016;50:17 [FREE Full text] [CrossRef] [Medline]
Salathé M. Digital epidemiology: what is it, and where is it going? Life Sci Soc Policy 2018 Jan 04;14(1):1-5 [FREE Full text] [CrossRef] [Medline]
Wójcik OP, Brownstein JS, Chunara R, Johansson MA. Public health for the people: participatory infectious disease surveillance in the digital age. Emerg Themes Epidemiol 2014 Jun 20;11(1):7-7 [FREE Full text] [CrossRef] [Medline]
Leal Neto O, Dimech GS, Libel M, de Souza WV, Cesse E, Smolinski M, et al. Saúde na Copa: The World's First Application of Participatory Surveillance for a Mass Gathering at FIFA World Cup 2014, Brazil. JMIR Public Health Surveill 2017 May 04;3(2):e26 [FREE Full text] [CrossRef] [Medline]
Leal Neto O, Cruz O, Albuquerque J, Nacarato de Sousa M, Smolinski M, Pessoa Cesse EÂ, et al. Participatory Surveillance Based on Crowdsourcing During the Rio 2016 Olympic Games Using the Guardians of Health Platform: Descriptive Study. JMIR Public Health Surveill 2020 Apr 07;6(2):e16119 [FREE Full text] [CrossRef] [Medline]
Drew D, Nguyen LH, Steves CJ, Menni C, Freydin M, Varsavsky T, COPE Consortium. Rapid implementation of mobile technology for real-time epidemiology of COVID-19. Science 2020 Jun 19;368(6497):1362-1367 [FREE Full text] [CrossRef] [Medline]
Garg S, Bhatnagar N, Gangadharan N. A Case for Participatory Disease Surveillance of the COVID-19 Pandemic in India. JMIR Public Health Surveill 2020 Apr 16;6(2):e18795 [FREE Full text] [CrossRef] [Medline]
Luo H, Lie Y, Prinzen FW. Surveillance of COVID-19 in the General Population Using an Online Questionnaire: Report From 18,161 Respondents in China. JMIR Public Health Surveill 2020 Apr 27;6(2):e18576 [FREE Full text] [CrossRef] [Medline]
Leal-Neto O, Santos F, Lee J, Albuquerque J, Souza W. Prioritizing COVID-19 tests based on participatory surveillance and spatial scanning. Int J Med Inform 2020 Nov;143:104263 [FREE Full text] [CrossRef] [Medline]
Infektionskrankheiten melden. URL: https://www.bag.admin.ch/bag/de/home/krankheiten/infektionskrankheiten-bekaempfen/meldesysteme-infektionskrankheiten/meldepflichtige-ik/meldeformulare.html [accessed 2021-11-16]
Garimella R. A Simple Introduction to Moving Least Squares and Local Regression Estimation. Los Alamos National Lab. 2017. URL: https://www.osti.gov/biblio/1367799-simple-introduction-moving-least-squares-local-regression-estimation [accessed 2021-11-17]
Cleff T. Exploratory Data Analysis in Business and Economics: An Introduction Using SPSS, Stata, and Excel. Cham: Springer International Publishing; 2014.
Hochenbaum J, Vallis OS, Kejariwal A. Automatic anomaly detection in the cloud via statistical learning. arXiv Preprint posted online on April 24, 2017 [FREE Full text]
Cleveland RB, Cleveland WS, McRae JE, Terpenning I. STL: A Seasonal-Trend Decomposition Procedure Based on Loess. Journal of Official Statistics 1990;6(1):3-73 [FREE Full text]
Ho TK. Random decision forests. In: Proceedings of 3rd International Conference on Document Analysis and Recognition. 1995 Presented at: 3rd International Conference on Document Analysis and Recognition; August 14-16, 1995; Montreal, QC p. 278-282. [CrossRef]
Biau G. Analysis of a random forests model. The Journal of Machine Learning Research 2012;13(1):1063-1095 [FREE Full text]
Nishida K. Exploratory. 2020. URL: https://exploratory.io [accessed 2021-04-14]
Jotform. 2020. URL: https://jotform.com [accessed 2021-04-10]
Twilio - Communication APIs for SMS, Voice, Video and Authentication. URL: https://www.twilio.com/ [accessed 2021-01-01]
Lapointe-Shaw L, Rader B, Astley CM, Hawkins JB, Bhatia D, Schatten WJ, et al. Web and phone-based COVID-19 syndromic surveillance in Canada: A cross-sectional study. PLoS One 2020 Oct 2;15(10):e0239886 [FREE Full text] [CrossRef] [Medline]
Budd J, Miller BS, Manning EM, Lampos V, Zhuang M, Edelstein M, et al. Digital technologies in the public-health response to COVID-19. Nat Med 2020 Aug;26(8):1183-1192 [FREE Full text] [CrossRef] [Medline]
Brownstein JS, Chu S, Marathe A, Marathe MV, Nguyen AT, Paolotti D, et al. Combining Participatory Influenza Surveillance with Modeling and Forecasting: Three Alternative Approaches. JMIR Public Health Surveill 2017 Nov 01;3(4):e83 [FREE Full text] [CrossRef] [Medline]
COVID-19 Situation Updates. European Centre for Disease Prevention and Control. 2020. URL: https://www.ecdc.europa.eu/en/COVID-19-pandemic [accessed 2020-04-24]
Micallef S, Piscopo TV, Casha R, Borg D, Vella C, Zammit M, et al. The first wave of COVID-19 in Malta; a national cross-sectional study. PLoS One 2020;15(10):e0239389 [FREE Full text] [CrossRef] [Medline]
Spinato G, Fabbris C, Polesel J, Cazzador D, Borsetto D, Hopkins C, et al. Alterations in Smell or Taste in Mildly Symptomatic Outpatients With SARS-CoV-2 Infection. JAMA 2020 May 26;323(20):2089-2090 [FREE Full text] [CrossRef] [Medline]
Sudre C, Lee K, Lochlainn M, Varsavsky T, Murray B, Graham MS, et al. Symptom clusters in COVID-19: A potential clinical prediction tool from the COVID Symptom Study app. Sci Adv 2021 Mar;7(12):eabd4177 [FREE Full text] [CrossRef] [Medline]
Eliezer M, Hautefort C, Hamel A, Verillaud B, Herman P, Houdart E, et al. Sudden and Complete Olfactory Loss of Function as a Possible Symptom of COVID-19. JAMA Otolaryngol Head Neck Surg 2020 Jul 01;146(7):674-675. [CrossRef] [Medline]
Menni C, Valdes AM, Freidin MB, Sudre CH, Nguyen LH, Drew DA, et al. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat Med 2020 Jul 11;26(7):1037-1040 [FREE Full text] [CrossRef] [Medline]
Bénézit F, Le Turnier P, Declerck C, Paillé C, Revest M, Dubée V, RAN COVID Study Group. Utility of hyposmia and hypogeusia for the diagnosis of COVID-19. Lancet Infect Dis 2020 Sep;20(9):1014-1015 [FREE Full text] [CrossRef] [Medline]
Nepal G, Rehrig JH, Shrestha GS, Shing YK, Yadav JK, Ojha R, et al. Neurological manifestations of COVID-19: a systematic review. Crit Care 2020 Jul 13;24(1):421-411 [FREE Full text] [CrossRef] [Medline]
Vihta K, Pouwels K, Peto T, Pritchard E, Eyre DW, House T, COVID-19 Infection Survey team. Symptoms and SARS-CoV-2 positivity in the general population in the UK. Clin Infect Dis 2021 Nov 08:ciab945. [CrossRef] [Medline]
Kahlert CR, Persi R, Güsewell S, Egger T, Leal-Neto OB, Sumer J, et al. Non-occupational and occupational factors associated with specific SARS-CoV-2 antibodies among hospital workers - A multicentre cross-sectional study. Clin Microbiol Infect 2021 Sep;27(9):1336-1344 [FREE Full text] [CrossRef] [Medline]

‎

FOPH: Federal Office of Public Health

HCW: health care worker

LOESS: locally estimated scatterplot smoothing

PCR: polymerase chain reaction

ROC: receiver operating characteristic

S-H-ESD: Seasonal-Hybrid Extreme Studentized Deviate

Edited by T Sanchez; submitted 14.09.21; peer-reviewed by X Dong, A Ardekani; comments to author 01.10.21; revised version received 05.10.21; accepted 05.10.21; published 22.11.21

©Onicio Leal-Neto, Thomas Egger, Matthias Schlegel, Domenica Flury, Johannes Sumer, Werner Albrich, Baharak Babouee Flury, Stefan Kuster, Pietro Vernazza, Christian Kahlert, Philipp Kohler. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 22.11.2021.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Digital SARS-CoV-2 Detection Among Hospital Employees: Participatory Surveillance Study