Claim your CPD points
This author believes that by using a simple, reductive approach to estimating community infection, it can be demonstrated that there are substantially more COVID-19 carriers in the community, than confirmed, active cases.
The author notes that other papers have been written internationally making similar assertions around the under-reporting of cases. [i] , [ii]
The key risk to highlight is that the public may misread the current situation by underestimating their exposure to the virus. The goal of sharing this piece of work is to shift the media dialogue towards a focus on community level infection and transmission. A better understanding of risk is important both now, and in the future.
While it relies on crude assumptions, the simple model [iii] output suggests at 9 April there may have been around 20,000 carriers of COVID-19 in Australia, rather than the 3,000 or so active confirmed cases as reported [iv] .
This means the community rate of infection could be about 6-7x that the media is reporting. While, this is a simplistic exercise, the key point is that the reported data is an order of magnitude less than the actual data. With access to more granular data, particularly around the demographics of those tested and the dynamics of the viral transmission, these estimates could be improved.
The public is comfortable with political commentary around public opinion, using opinion polls which have a sample size of approximately 1,000 and which are designed to reflect the electorate.
For COVID-19 we have a sample of 330,000 people tested, which is over 1% of the population. It is reasonable for the public to expect to be provided with experts' estimates of community infection rather than focussing purely on confirmed cases.
With appropriate data on the sample versus the population, and with more random sampling on the broader population, it would be possible to start to refine our understanding of, and hence, the public's understanding of the community infection rate.
As restrictions are relaxed there will need to be an increase in testing and tracking in the community, preferably including random sampling. From the resulting data, it will enhance the management and control of the virus, and public awareness of true risk, if estimates of community level infection can be made, and reported in the media.
The model workings, and all the assumptions are contained below.
As at 3.00pm on 9 April 2020, the following data has been sourced from Australian Government Department of Health:
Additionally, from the Report of the WHO-China Joint Mission on COVID-19 the following can be sourced
This latter set, may not be directly applicable to the Australian situation, and hence could be improved to get a clearer picture than that which follows.
From the data-set I can note the following, Australia-wide:
Taking the following three facts together, allows me to make an 'educated guess' at the carrier mortality rate:
While a purer approach to modelling would generate a range for an estimate, crudely, based on the above three facts I assume a carrier mortality rate of 0.5% for the purposes of illustration.
The second crude assumption, based on the data from the WHO study in China, which Australian health officials may be able to enhance for the local experience, is that simplistically it takes three weeks from infection to death. As this model is for illustrative purposes, it allows it to remain simple and reductive.
If we have a constant carrier mortality rate and a constant time from infection to death, for the purposes of modelling, we can infer new carriers at a point in time, from deaths at a later point in time. This is the critical insight that the model exploits.
Death data for the last three weeks in Australia shows :
Given the small numbers of deaths in Australia, this data is best smoothed. In some nations there are hundreds or more deaths per day which is a more robust dataset.
Applying the carrier mortality rate estimate of 0.5% to the death data suggests:
Summing these up, suggests that there were of the order of 10,000 carriers in the country on 19 March. This is vastly in excess of the 700 or so reported active cases at that time. This the key message to take from this illustration.
From 19 March to 9 April, assumptions must be made about carrier transmissions. The daily deaths data (smoothed) illustrates what this was historically. Smoothing this suggests around 15% daily transmission in the three weeks to 19 March. Note that increasingly, the public was becoming aware of COVID-19, and restrictions were being introduced predominantly for travellers, and gatherings.
This means over the period, each carrier has been infecting 0.15 new people. If a carrier was infectious for three weeks (crudely, for simple illustration, 20 days), a carrier would pass the virus to three people over that time. This is not inconsistent with various studies on R0 ('Basic Reproduction Number'). To slow the spread of the disease this number needs to be as low as one. This is where social distancing remains key on an ongoing basis.
To estimate transmissions since March 19 there are two possible inputs:
The following assumptions are then made for illustrative purposes. Again, a model would look to derive ranges for the variables - the Australian case data can be smoothed, and it seems reasonable versus the overseas experiences. Worldometers.info stores the historic data, and hence for efficiency, it was sourced from there. It would be improved with better, or more granular data on who was being tested.
Assume therefore that:
In order to allow from the lag from infection to testing, the case data used is lagged five days. For example, to understand transmission on 20 March, we look at New Cases v Active Cases on 25 March.
| Date | Active Cases (5 days later) | New Cases (5 days later) | Transmission | Average | Assumed |
| 20-Mar | 2,191 | 359 | 16% | 12% | 12% |
| 21-Mar | 2,547 | 374 | 15% | ||
| 22-Mar | 2,867 | 328 | 11% | ||
| 23-Mar | 3,195 | 257 | 8% | ||
| 24-Mar | 3,451 | 528 | 15% | 8% | 8% |
| 25-Mar | 3,902 | 297 | 8% | ||
| 26-Mar | 4,197 | 303 | 7% | ||
| 27-Mar | 4,398 | 285 | 6% | ||
| 28-Mar | 4,680 | 266 | 6% | ||
| 29-Mar | 4,704 | 140 | 3% | 3% | 3% |
| 30-Mar | 4,841 | 96 | 2% | ||
| 31-Mar | 4,935 | 200 | 4% | ||
| 1-Apr | 4,633 | 145 | 3% | ||
| 2-Apr | 3,418 | 93 | 3% | ||
| 3-Apr | 3,392 | 64 | 2% | ||
| 4-Apr | 3,189 | 100 | 3% | ||
| 5-Apr | - | - | - | ||
| 6-Apr | - | - | - | ||
| 7-Apr | - | - | - | ||
| 8-Apr | - | - | - | ||
| 9-Apr | - | - | - |
Applying the assumptions on community transmission to the population estimate of 10,000 carriers on 19 March, and with the reductive assumption that all carriers are contagious for three weeks from infection to either death or recovery then the path of the carriers may have been as follows.
| Date | Carriers | | | |
| 19-Mar | 10,000 | Transmission | New Carriers | Death/Recovery |
| 20-Mar | 11,000 | 12% | 1,200 | 200 |
| 21-Mar | 12,120 | 1,320 | 200 | |
| 22-Mar | 13,374 | 1,454 | 200 | |
| 23-Mar | 14,779 | 1,605 | 200 | |
| 24-Mar | 15,762 | 8% | 1,182 | 200 |
| 25-Mar | 16,823 | 1,261 | 200 | |
| 26-Mar | 17,968 | 1,346 | 200 | |
| 27-Mar | 19,006 | 1,437 | 400 | |
| 28-Mar | 20,126 | 1,520 | 400 | |
| 29-Mar | 20,330 | 3% | 604 | 400 |
| 30-Mar | 20,540 | 610 | 400 | |
| 31-Mar | 20,756 | 616 | 400 | |
| 1-Apr | 20,979 | 623 | 400 | |
| 2-Apr | 21,208 | 629 | 400 | |
| 3-Apr | 21,045 | 636 | 800 | |
| 4-Apr | 20,876 | 631 | 800 | |
| 5-Apr | 20,702 | 626 | 800 | |
| 6-Apr | 20,523 | 621 | 800 | |
| 7-Apr | 20,339 | 616 | 800 | |
| 8-Apr | 20,149 | 610 | 800 | |
| 9-Apr | 19,954 | 604 | 800 |
[i] Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV2) - R. Li et al., Science 10.1126/science.abb3221 (2020)
[ii] Correcting under-reported COVID-19 case numbers: estimating the true scale of the pandemic. View ORCID Profile Alexander Lachmann, Kathleen M Jagodnik, Federico Manuel Giorgi, Forest Ray https://doi.org/10.1101/2020.03.14.20036178
[iii] Contemporary models would consider multiple scenarios, and put ranges around particular variables. This example is simplified and is for illustrative purposes only. It uses a mathematical, rather than medical perspective and hence there are limitations in its use. Where possible, publically available data applicable to the Australian situation is used, unlike the more theoretical models released by government last week.
[iv] Note that population of 20,000 carriers on the assumptions made, could be expected to have 100 deaths in the following 3 weeks. This is an average of around 5 per day.