# Is there any anova-like approach for calculating contingency tables across multiple levels within a factor

I want to compare success rates across a large number of different levels within a third factor to detect if there are statistically significant differences for at least one of the groups. I’m not specifically interested in the post-hoc question of which groups are different from each other, if any.

For example, I have 3 groups, and total failures and total attempts for each of those groups, lke

group A 15 200
group B 6  100
group C 4  50
group D 45 90


In this particular example, the P value would be (presumably) less than 0.05 because group D 45% compared to between 15 and 20% for the other groups.

The equivalent for continuous variables, like the measured heights of 11 year old children, would be to use a ANOVA.

Does anyone know of a method for doing this?

One possibility would be to create all pairwise combinations, but the number of levels within that variable are too large for it to be practical or useful, let alone having enough data to account for the multiple comparisons for each level.

Another possibility I’ve thought of: take one group, say group A, and compare it against the sum of all other groups and do a standard goodness of fit test, then you only have to do N-1 comparisons for N_group groups AND it tells you if a specific group is an outlier relative to the rest. e.g. group 1 = 15, 200, group 2 = 55, 240? The question then would be how to calculate the correct degrees of freedom for comparison. The normal df = k-c, where k= filled cells and c= estimated parameters, so in this case c = N_group?

Thanks,

## Related Questions

### How does Generalized Policy Iteration stabilize to the optimal policy and value function?

1  Asked on December 6, 2021

### Fire an alert when number of sign up in an app drops. How to find the best condition to maximize accuracy?

2  Asked on December 6, 2021 by omm-kreate

### Is it always possible a closed form solution for a norm minimization problem? Which one is the best approach closed form solution or gradient based?

0  Asked on December 6, 2021 by lakshman-mahto

### Does gradient descent work for tabular Q learning?

1  Asked on December 6, 2021

### prove change in total probability of success in binomial distribution

1  Asked on December 6, 2021 by rambalachandran

### Why we cannot take baseline as predictor for change in this case

0  Asked on December 6, 2021

### Calculate group with highest defective rate

0  Asked on December 6, 2021 by user6883405

### Time series model for multiple different series observations

1  Asked on December 6, 2021

### Whitening a dataset with fewer observations than variables

1  Asked on December 5, 2021 by laos

### Composite Scores and Standardized Composite Scores t test

1  Asked on December 5, 2021 by user41710

### The distribution of the product of a multivariate normal and a lognormal distribution

1  Asked on December 5, 2021 by aae

### How to understand mapping function of kernel?

1  Asked on December 5, 2021

### Attention Mechanisms and Alignment Models in Machine Translation

1  Asked on December 5, 2021

### Difference between Repeated measures ANOVA, ANCOVA and Linear mixed effects model

1  Asked on December 5, 2021

### Time Series Multivariate Forecasting

1  Asked on December 5, 2021

### reporting results of a multivariate logistic regression using the glm function in R

1  Asked on December 5, 2021 by b-kenobi

### Checking the constant variance assumption for residuals vs fitted plots: What about for the same fitted values?

1  Asked on December 5, 2021

### What model is a suitable model for zero-constrained variables?

0  Asked on December 5, 2021

### Why the regression coefficient for normalized continuous variable is unexpected when there is dummy variable in the model?

1  Asked on December 5, 2021 by emberbillow

### Is boosting and bagging only relevant in the context of decision trees?

2  Asked on December 5, 2021