# Is there any anova-like approach for calculating contingency tables across multiple levels within a factor

Cross Validated Asked by Brad Davis on January 1, 2022

I want to compare success rates across a large number of different levels within a third factor to detect if there are statistically significant differences for at least one of the groups. I’m not specifically interested in the post-hoc question of which groups are different from each other, if any.

For example, I have 3 groups, and total failures and total attempts for each of those groups, lke

group A 15 200
group B 6  100
group C 4  50
group D 45 90


In this particular example, the P value would be (presumably) less than 0.05 because group D 45% compared to between 15 and 20% for the other groups.

The equivalent for continuous variables, like the measured heights of 11 year old children, would be to use a ANOVA.

Does anyone know of a method for doing this?

One possibility would be to create all pairwise combinations, but the number of levels within that variable are too large for it to be practical or useful, let alone having enough data to account for the multiple comparisons for each level.

Another possibility I’ve thought of: take one group, say group A, and compare it against the sum of all other groups and do a standard goodness of fit test, then you only have to do N-1 comparisons for N_group groups AND it tells you if a specific group is an outlier relative to the rest. e.g. group 1 = 15, 200, group 2 = 55, 240? The question then would be how to calculate the correct degrees of freedom for comparison. The normal df = k-c, where k= filled cells and c= estimated parameters, so in this case c = N_group?

Thanks,

## Related Questions

### How to model count data with decay

0  Asked on October 15, 2020 by learning-stats-by-example

### Seeking Explanation of how this ANCOVA result in R shows the group means are not equal

2  Asked on October 13, 2020 by kyle

### Why are my coefficients too large when control variables are not added?

1  Asked on October 10, 2020 by kyrhee

### Statistical line comparison

1  Asked on October 6, 2020 by mobeus-zoom

### When running a reliability analysis (using Cronbach’s alpha) in SPSS, what does “scale” refer to? Do I have to compute a scale explicitly beforehand?

0  Asked on October 4, 2020 by nada-al-iskandaraniyyah

### In a 2 class problem how do I compute the normalization constant for finding the posterior distributions?

0  Asked on October 2, 2020 by anonymous

### Using percentage change with different sample sizes, which is more significant?

0  Asked on September 30, 2020 by katie-fenton

### Python Random Forest Prediction Probabilities Reliability, Overfitting?

0  Asked on September 24, 2020 by rkhan8

### Euclidean distance score and similarity

4  Asked on September 24, 2020 by navige

### How could I understand the Self-critical sequence training (SCST) model?

1  Asked on September 22, 2020

### Looking for feedback on my approach to split data into validation and test set?

0  Asked on September 22, 2020 by s_am

### How to set up first differences model?

1  Asked on September 21, 2020 by fabio

### Precision and recall for clustering?

3  Asked on September 20, 2020 by learner

### Mixed Effects Model

1  Asked on September 20, 2020 by seydou-goro

### Which Regression scoring to use when have small dataset?

0  Asked on September 19, 2020 by user293111

### Is my logistic regression model correct?

2  Asked on September 15, 2020 by mustapha-hakkou-asz

### Gibbs entropy and Shannon entropy

0  Asked on September 14, 2020 by alhayer

### Why small values produce undulating densities when ploting logarithm of a loguniform prior (in R)?

1  Asked on September 13, 2020 by prolix

### Seasonality in data

1  Asked on September 12, 2020 by madhur-mehta

### Can we maximize the log of the odds instead of log of the probability?

1  Asked on September 9, 2020

### Ask a Question

Get help from others!