How effective would a data-driven vaccination program be over a simple rules based approach

Data Science Asked by andrew pate on January 13, 2021

The rollout of vaccination programs tends to be based on a given set of rules (advice) devised by government UK example. Such a rule-set are generally limited in complexity in order to be consistently applied by humans administering the roll-out.

There is a strong association between age and risk nature but it appears below the age of 60 that other factors such as sex/deprivation/ethenicity predominate. There is also mounting evidence towards mental disorders holding a significant risk factor risk factors and mental disorders

Is there any openly available papers or modelling showing a data-driven prioritization of vacination (for example using machine-intelligence), especially in the scenario of limited vaccine availability, would be more effective than simply using age alone as the predicate.

2 Answers

It seems age tends to outweigh the effects from other influences. For example a an effect that increases IMF by 2x equates to a 6 year age difference, and an effect (such as a severe mental disorder) which increases IMF by 7x equates to a 17 year age difference. Benoit's answer illustrates the nature of this well.

I've tried creating a crude simulation ( to experiment with the effects of different prioritizations

Here are some findings. They are not peer reviewed - They should not be considered as health guidance, without further validation and research:

  1. Generally a large proportion of the total number that get infected (around 40%) occurs in the early stages, when the proportion of vaccinated people is still low (around 20-30%). So prioritisation has little influence on overall death toll.

  2. Very high risk groups (such as severe mental disorders or morbidly obese) represent only a small proportion of the full cohort, so do not contribute to overall death toll significantly.

  3. In addition to vaccination, partial lockdown (r around 1.3) combined with partial herd immunity (say 25%) can also cause cases to beginning to fall.

  4. Reported number of cases per day does not account for asymtomatic cases, the true number of cases infected each day will be higher - This needs considering in models.

  5. Non-linear effects such as lockdown rules changing when thresholds are reached tend to introduce considerable variability in the model.

  6. Its likely the timelyness and speed of the vaccine rollout (as well as lockdown thresholds) have much more of an impact on overall death toll than vaccine prioritisation.

Answered by andrew pate on January 13, 2021

The reason for the rules based approach is the simpson paradox. enter image description here

In short, the way a test group is sampled can easily overturn the conclusion of a study. It is very important to include sufficient patients from a particular risk group in the study. This is why rules are set. This approach however is the basis of causality theory. A data driven-approach would be as followed:

  1. experts flag expected causal patterns, i.e. draw causal graph of the study. [This is the mathematical formulation of risk factors]
  2. Causal graph generates ideal distribution of study and tools to draw conclusion

Answered by Benoit Descamps on January 13, 2021

Add your own answers!

Related Questions

How to handle unseen labels in test data?

1  Asked on August 22, 2020 by meysam


In a list, find the numbers where they are bigger than the next numbers

1  Asked on August 21, 2020 by ellen-sheldon


Neural networks: how to think of it?

0  Asked on August 18, 2020 by luca-di-mauro


MLPRegressor Output Range

2  Asked on August 18, 2020 by kam


Ask a Question

Get help from others!

© 2022 All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP, SolveDir