Background: Since the outbreak of the novel coronavirus disease (COVID-19) in December 2019, the coronavirus has spread all over the world at an unprecedented rate. The transmissibility of the coronavirus from asymptomatic patients to healthy individuals has received enormous attention. An important study using COVID-19 data from the city of Ningbo, China, was carried out to estimate and compare the transmission rates of the coronavirus by the symptomatic and asymptomatic patients. However, in the original analysis, the usual chi-square tests were unduly used for some contingency tables with small cell counts including zero, which may violate the assumptions for the chi-square test.
Objective: We reanalyze the data from the city of Ningbo with more appropriate statistical methods to draw more reliable and sound conclusions on the transmission rates of the coronavirus by the symptomatic and asymptomatic patients.
Methods: We excluded the cases associated with the super-spreader and adopted a more appropriate statistical method, including the permutation test and the Fisher exact test, to reanalyze the COVID-19 data from the city of Ningbo.
Results: After excluding the cases related to the super-spreader, the Fisher exact test yields a P value of .84, which indicates stronger evidence of no difference in the transmission rates compared with the original analysis. The odds ratio of the coronavirus transmission rates between the symptomatic and asymptomatic patients is 1.2 with a 95% confidence interval 0.5-2.8.
Conclusions: Through a more in-depth and comprehensive statistical analysis of the Ningbo data, we concluded that there is no difference in the transmission rates of coronavirus between symptomatic and asymptomatic patients.
Since the outbreak of the novel coronavirus disease (COVID-19) in December 2019, the coronavirus has spread all over the world at an unprecedented rate. By May 21, 2020, more than 200 countries and territories have been affected by COVID-19, with a total of more than 5 million confirmed cases and over 330,000 deaths . In addition, both the numbers of cases and deaths continue to climb up quickly. On March 11, 2020, COVID-19 was declared an international public health emergency by the World Health Organization [ ]. Many countries have taken the most restrictive travel bans and quarantine policies in an attempt to stop the coronavirus from infecting their healthy populations. The worldwide economy has also been greatly set back.
During the disease incubation period, a percentage of coronavirus carriers may have no symptoms or minimal symptoms and thus often go undetected. These covert coronavirus carriers may not even be aware of the infection themselves but would be confirmed as positive cases if tested using the reverse transcriptase polymerase chain reaction (RT-PCR). If the percentage of asymptomatic carriers is large and if their transmissibility of coronavirus is as high as the symptomatic cases, this would pose a great threat to the public health worldwide. Therefore, it is critical to determine the percentage and the transmissibility of asymptomatic coronavirus carriers in the population.
There has been some work in the literature on the estimation of the asymptomatic proportion of COVID-19 cases. Based on the infected cases on the Diamond Princess cruise ship, the asymptomatic ratio was estimated to be 0.179 with a 95% Bayesian credible interval of 0.155-0.202 . Another study [ ] indicated that the asymptomatic ratio could be as high as 0.416 by using the information on Japanese nationals who were evacuated from Wuhan, China on charter flights. An analysis on the COVID-19 infected cases from Tibetan Autonomous Prefecture [ ] found that the proportion of asymptomatic carriers was 0.217. In a study with 36 children with COVID-19 in Zhejiang, China [ ], it was found that there were 10 asymptomatic cases out of 36 infections (27.7%). Another investigation in a skilled nursing facility in King County, Washington identified that, out of 48 residents that tested positive for COVID-19, 27 (56%) were asymptomatic at the time of testing [ ]. The aforementioned studies indicate that the proportion of asymptomatic carriers in the total infected cases is considerably high, but the sample sizes of these studies are rather small.
There has been evidence for transmission of coronavirus from asymptomatic carriers. It was reported that the viral load detected in the asymptomatic patients was similar to that in the symptomatic patients, which suggests the potential transmissibility of asymptomatic carriers . A familial cluster of 5 patients in Anyang, China demonstrated transmission of the coronavirus from an asymptomatic carrier with normal chest computed tomography but tested positive after all 5 contacted family members had shown symptoms and confirmed positive RT-PCR test results [ ]. A similar case of the familial cluster of 5 members associated with COVID-19 in Luzhou, China also suggested that coronavirus can be transmitted by asymptomatic carriers [ ]. Another example of coronavirus infection by an asymptomatic patient was a German case through the usual contact in business meetings [ ]. Moreover, the mathematical model developed to estimate the basic reproductive number of COVID-19 and quantify the contribution of different transmission routes also indicated the transmissibility of the asymptomatic individuals [ ]. In a study on a cluster of 22 close contacts of a male 22 years of age with COVID-19 [ ], the asymptomatic patient showed the rapid human-to-human transmissibility. Via a detailed literature review conducted at the Centers for Disease Control and Prevention [ ], it was demonstrated from the epidemiologic, virologic, and modeling studies that COVID-19 is transmittable by persons with presymptomatic or asymptomatic infection.
Chen et al  carried out an important study using the COVID-19 data from Ningbo, China to estimate the transmission rates of the coronavirus by the symptomatic and asymptomatic cases. The estimated transmission rates for the symptomatic and asymptomatic patients were 0.063 and 0.041, respectively, and the chi-square test yielded a P value of .29, which indicates that there is no statistically significant difference between the two transmission rates. They further investigated the transmission rates for different relationships and different types of contact with the infected patients including both symptomatic and asymptomatic cases. The conclusions were that there are statistically significant differences in the transmission rates across different relationships and different types of contact. As expected, the closer the contact is with the infected patients, the higher the chance of infection.
The following is the permutation test algorithm:
However, in their original statistical analysis , the chi-square tests were unduly used because the counts in some cells of the contingency tables were rather small and sometimes even zero, which violates the assumptions of a chi-square test and thus casts doubt on the validity of the hypothesis test. Moreover, when comparing the transmission rates of symptomatic and asymptomatic cases, Chen et al [ ] included the cases associated with a super-spreader who mainly transmitted the disease in an air-conditioned bus and a Buddhism activity gathering. However, this may reduce the generalization of the findings, as the super-spreader should be regarded as an outlier and removed from the primary analysis.
We adopted a permutation test to determine the difference in the average numbers of contacts by the symptomatic and asymptomatic cases. The permutation test algorithm gives the details of the permutation test, andprovides the diagram for the resampling step in the permutation test. Note that the permutation test requires no assumptions on the data, which simply permutes the data to simulate the null distribution.
Fisher Exact Test
To allow for small cell counts including zero in the contingency table, the Fisher exact tests  were used to investigate the difference in the transmission rates between the symptomatic and asymptomatic patients wherever small cell counts were present (eg, less than 5 as a rule of thumb).
Without making any assumptions on the data, the Fisher exact test simply adopts the hypergeometric distribution to calculate the exact probability of the observed data in the table. For example, as shown in, suppose that there are a infected cases and b uninfected individuals in the close contacts of the symptomatic cases, while there are c infected cases and d uninfected individuals in the close contacts of the asymptomatic cases. The probability of observing such data is given by:
The P value of the Fisher exact test is calculated by summing up all the probabilities of obtaining data as or more extreme than the observed under the null hypothesis (ie, there is no difference between the two groups).
|Number of close contacts of symptomatic cases||a||b|
|Number of close contacts of asymptomatic cases||c||d|
Odds Ratio and Related Confidence Intervals
To gain more insight into the Ningbo data, we further calculated the odds ratio between the symptomatic and asymptomatic groups as well as the corresponding confidence interval. For a 2×2 contingency table with cell counts (a, b, c, d) as shown in, the odds ratio is ad/bc, and the corresponding 95% confidence interval is given by:
If the estimated transmission rate with sample size n is denoted by , the 95% confidence interval of the transmission rate is:
Both confidence intervals for the odds ratio and the transmission rate are based on normal approximation, and in the Ningbo data, the sample sizes for computation of these confidence intervals are reasonably large.
Results of the Permutation Test
From January 21 to March 6, 2020, there were 157 symptomatic cases and 30 asymptomatic cases in the Ningbo COVID-19 data . These infected cases resulted in 2147 close contacts with them, of which 2001 exposures were caused by the symptomatic cases and 146 by the asymptomatic cases. The average number of close contacts by a symptomatic case was 13 and for an asymptomatic case was 5, and the difference is statistically significant with P<.001 from the permutation test. presents the histograms of the average numbers of contacts by the symptomatic and asymptomatic cases as well as the differences after the permutation (ie, under the null distribution) in the average numbers of contacts by the symptomatic and asymptomatic cases in the permutation test. The larger number of close contacts by the symptomatic cases may be due to the medical attention they received after they had the confirmation of a COVID-19 positive test.
Results of the Fisher Exact Test
Under the Fisher exact test, we consider two scenarios: (1) to combine the numbers of symptomatic and asymptomatic cases as the total number of infected patients, leading to a 2×2 table; or (2) to separate them, leading to a 2×3 table, as shown in the primary analysis of close contacts section of.
From the results summarized in, we concluded that there was no significant difference in the transmission rates between the symptomatic and asymptomatic cases, either including or excluding the cases associated with the super-spreader. However, the tests excluding the cases associated with the super-spreader yielded larger P values: P=.84 when combining the numbers of symptomatic and asymptomatic cases, and P=.11 when separating them. As a result, there is no statistical evidence in the data to rule out the transmissibility of asymptotic carriers in comparison with symptomatic cases.
|Analysis||Close contacts, n||Infected||Uninfected, n||P value|
|Symptomatic cases, n||Asymptomatic cases, n||Combineda||Separateb|
|Primary analysis of close contacts by symptomatic and asymptomatic cases||.84 (.37)d||.11 (.08)|
|Symptomatic cases||1904 (97)c||79 (28)||15 (4)||1810 (65)|
|Total||2050 (97)||82 (28)||18 (4)||1950 (65)|
|Subgroup analysis by different relationships with infected cases||<.001||<.001|
|Subgroup analysis by different types of contact with infected cases||<.001||<.001|
aCombined means P values were obtained by pooling the numbers of symptomatic and asymptomatic cases together.
bSeparate means P values were obtained by separating the numbers of symptomatic and asymptomatic cases.
cThe numbers in the parentheses are associated with the super-spreader.
dP values in the parentheses were obtained when including the cases associated with the super-spreader.
Estimation of the Odds Ratio
The estimated odds ratio, transmission rates, and their difference between symptomatic and asymptomatic cases as well as the corresponding 95% confidence intervals are all presented in. The odds of transmitting the coronavirus to a healthy individual by a symptomatic patient is 1.2 times more than that by an asymptomatic patient, which was not statistically significant as the 95% confidence interval covers one. Furthermore, as the 95% confidence intervals for the difference of transmission rates cover zero, we concluded that there is no difference in the transmissibility of the coronavirus through close contacts between symptomatic and asymptomatic cases, which is consistent with the findings using the Fisher exact tests.
The transmission rates under different relationships with the infected cases are significantly different with both P values<.001 whether combining the symptomatic and asymptomatic cases or not. With regard to different types of contact, the transmission rates are also significantly different with P values<.001. As expected, the more close contacts with the infected cases, the higher the likelihood of contracting the coronavirus.
|Variable||Odds ratio (95% CI)||Transmission rate of symptomatic cases, (95% CI)||Transmission rate of asymptomatic cases (95% CI)||Difference of transmission rates (95% CI)|
|With super-spreader cases||1.568 (0.679-3.620)||0.063 (0.053-0.075)||0.041 (0.017-0.091)||0.022 (–0.016 to 0.059)|
|Without super-spreader cases||1.212 (0.522-2.815)||0.049 (0.040-0.060)||0.041 (0.017-0.091)||0.008 (–0.029 to 0.046)|
In summary, we provided a more in-depth analysis of the Ningbo COVID-19 data to examine the difference in the transmissibility of the coronavirus for symptomatic and asymptomatic patients. The conclusion remains the same, that there is no statistically significant difference in the transmissibility of the coronavirus between symptomatic and asymptomatic patients, but our evidence for no difference appears to be stronger with larger P values than the original analysis .
As the proportion of asymptomatic carriers in the total infected cases is considerably high [- ], such findings are crucial to the public health and can help to guide the relevant government agencies on policy making about the asymptomatic cases.
However, our analysis only focuses on the data from the city of Ningbo, China, and the sample size is small. Therefore, the generalization of our findings to a larger and more diverse population is limited. More work is warranted to study the transmissibility of coronavirus by the asymptomatic coronavirus carriers.
Conflicts of Interest
- Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis 2020 May;20(5):533-534. [CrossRef]
- Cucinotta D, Vanelli M. WHO Declares COVID-19 a Pandemic. Acta Biomed 2020 Mar 19;91(1):157-160. [CrossRef] [Medline]
- Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Eurosurveillance 2020 Mar 12;25(10). [CrossRef]
- Nishiura H, Kobayashi T, Miyama T, Suzuki A, Jung S, Hayashi K, et al. Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19). Int J Infect Dis 2020 Feb 17. [CrossRef]
- Song H, Xiao J, Qiu J. A considerable proportion of individuals with asymptomatic SARS-CoV-2 infection in Tibetan population. medRxiv 2020 Mar 20. [CrossRef]
- Qiu H, Wu J, Hong L, Luo Y, Song Q, Chen D. Clinical and epidemiological features of 36 children with coronavirus disease 2019 (COVID-19) in Zhejiang, China: an observational cohort study. Lancet Infect Dis 2020 Mar. [CrossRef]
- Arons MM, Hatfield KM, Reddy SC, Kimball A, James A, Jacobs JR, et al. Presymptomatic SARS-CoV-2 infections and transmission in a skilled nursing facility. N Engl J Med 2020 Apr 24 [FREE Full text] [CrossRef] [Medline]
- Zou L, Ruan F, Huang M, Liang L, Huang H, Hong Z, et al. SARS-CoV-2 viral load in upper respiratory specimens of infected patients. N Engl J Med 2020 Mar 19;382(12):1177-1179. [CrossRef]
- Bai Y, Yao L, Wei T, Tian F, Jin D, Chen L, et al. Presumed asymptomatic carrier transmission of COVID-19. JAMA 2020 Feb 21 [FREE Full text] [CrossRef] [Medline]
- Ye F, Xu S, Rong Z, Xu R, Liu X, Deng P, et al. Delivery of infection from asymptomatic carriers of COVID-19 in a familial cluster. Int J Infect Dis 2020 May;94:133-138. [CrossRef] [Medline]
- Rothe C, Schunk M, Sothmann P, Bretzel G, Froeschl G, Wallrauch C, et al. Transmission of 2019-nCoV infection from an asymptomatic contact in Germany. N Engl J Med 2020 Mar 05;382(10):970-971. [CrossRef]
- Ferretti L, Wymant C, Kendall M, Zhao L, Nurtay A, Abeler-Dörner L, et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science 2020 May 08;368(6491) [FREE Full text] [CrossRef] [Medline]
- Huang L, Zhang X, Zhang X, Wei Z, Zhang L, Xu J, et al. Rapid asymptomatic transmission of COVID-19 during the incubation period demonstrating strong infectivity in a cluster of youngsters aged 16-23 years outside Wuhan and characteristics of young patients with COVID-19: A prospective contact-tracing study. J Infect 2020 Jun;80(6):e1-e13 [FREE Full text] [CrossRef] [Medline]
- Furukawa NW, Brooks JT, Sobel J. Evidence supporting transmission of severe acute respiratory syndrome coronavirus 2 while presymptomatic or asymptomatic. Emerg Infect Dis 2020 May 04;26(7). [CrossRef] [Medline]
- Chen Y, Wang A, Yi B, Keqin D, Haibo W, Jianmei W, et al. The epidemiological characteristics of infection in close contacts of COVID-19 in Ningbo city. Chin J Epidemiol 2020;41(5):668-672. [CrossRef]
- Lydersen S, Pradhan V, Senchaudhuri P, Laake P. Choice of test for association in small sample unordered r x c tables. Stat Med 2007 Oct 15;26(23):4328-4343. [CrossRef] [Medline]
|COVID-19: coronavirus disease|
|RT-PCR: reverse transcriptase polymerase chain reaction|
Edited by T Rashid Soron, G Eysenbach, T Sanchez; submitted 19.04.20; peer-reviewed by Y Zheng, E Da Silva, H Feng; comments to author 08.05.20; revised version received 11.05.20; accepted 21.05.20; published 28.05.20Copyright
©Guosheng Yin, Huaqing Jin. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 28.05.2020.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.