Some professions love their amateurs. Astronomers, for instance, are dependent on the efforts of amateurs to pick up on new heavenly bodies, for example, and to process images.
Not so much bio-statisticians, though. Here’s a gracious, but I think quite irritated, Jacques Raubenheimer of the University of Sydney, writing at The Conversation about analysis of Covid-19 in real-time:
5. Yes, the data are messy, incomplete and may change
Some social media users get angry when the statistics are adjusted, fuelling conspiracy theories.
But few realise how mammoth, chaotic and complex the task is of tracking statistics on a disease like this.
Countries and even states may count cases and deaths differently. It also takes time to gather the data, meaning retrospective adjustments are made.
We’ll only know the true figures for this pandemic in retrospect. Equally so, early models were not necessarily wrong because the modellers were deceitful, but because they had insufficient data to work from.
Welcome to the world of data management, data cleaning and data modelling, which many armchair statisticians don’t always appreciate. Until now.
It’s short and to the point. I appreciated point #1 a lot, as I’d deduced it but wondered if I had it right:
1. It’s the infection rate that’s scary, not the death rate
Which is not to say the death rate’s not scary, but it’s not a measurement of potential disaster. There are a lot of diseases that have high fatality rates, but because the infection rate is low, even without social mitigation measures, they don’t mean that much. If you catch it, we’re sorry you’re dying, but the rest of society is not at risk.
But epidemiologists worry about those diseases picking up a mutation or three that increases the infection rate.
Add to that the lack of medical resources when it comes to epidemics, and that’s why the medics have permanent scrunches in their brows – putting those two together means the death rate goes up. And, while I knew this in the back of my mind, I appreciate the reminder:
Flu’s R₀ is about 1.3. Although COVID-19 estimates vary, its R₀ sits around a median of 2.8. Because of the way infections grow exponentially (see below), the jump from 1.3 to 2.8 means COVID-19 is vastly more infectious than flu.
It’s a bit like that old brain-teaser about algae doubling every day, or the one about doubling the number of grains of rice on each successive square of a chessboard – it does up much faster than human intuition would guess.
I liked this article – well-organized, succinct, and politely pissed off at the people who only think they know what they’re doing.