Cross Validated Asked by CBGodbole on January 1, 2022

I am performing multiple linear regression to predict a score (dependent variable) from multiple categorical variables. My dependent variable has skewed distribution with a large number of zero values but no negative values.

Can I use Box-Cox transformation in this scenario?

I tried to run it in R, but got the error message –

"Error in boxcox.default(linreg1) : response variable must be positive"

Box-Cox transformation works fine with zeros. Hope you are using boxcox.fit() in package named geoR.

However, you can solve your problem of skewness with other transformations like:

- Square root transformation. However, often the square root is not a strong enough transformation to deal with the high levels of skewness.
- Use log(x+1) transformation which is a widely accepted way of feature transformation.

Also, I don't understand why you are doing transformation of the dependent variable. I agree with @dave for the assumption of normality in regression.

Answered by Vivek on January 1, 2022

2 Asked on December 21, 2020 by uzbekistan

dynamic regression fixed effects model generalized moments panel data random effects model

2 Asked on December 20, 2020 by david-ernst

anova experiment design multiple comparisons post hoc statistical power

0 Asked on December 19, 2020 by user276835

conditional probability machine learning mathematical statistics nested data regression

0 Asked on December 18, 2020 by oumayma-bounouh

forecast combination forecasting normal distribution pdf weights

1 Asked on December 18, 2020 by farzad

0 Asked on December 18, 2020 by germania

0 Asked on December 17, 2020 by pavel

1 Asked on December 17, 2020

0 Asked on December 16, 2020 by whuber

1 Asked on December 16, 2020 by jlearner

1 Asked on December 16, 2020 by ber08

forecasting garch hidden markov model variance volatility forecasting

0 Asked on December 16, 2020 by noumenal

1 Asked on December 15, 2020 by emmettcc

1 Asked on December 15, 2020 by suzee

generalized linear model machine learning multiple regression neural networks regression

0 Asked on December 15, 2020

1 Asked on December 14, 2020 by bandit_king28

9 Asked on December 13, 2020 by jessica

1 Asked on December 13, 2020 by kh_one

1 Asked on December 13, 2020 by l-sicilis

Get help from others!

Recent Answers

- Peter Machado on Why fry rice before boiling?
- Joshua Engel on Why fry rice before boiling?
- haakon.io on Why fry rice before boiling?
- Jon Church on Why fry rice before boiling?
- Lex on Does Google Analytics track 404 page responses as valid page views?

Recent Questions

- How Do I Get The Ifruit App Off Of Gta 5 / Grand Theft Auto 5
- Iv’e designed a space elevator using a series of lasers. do you know anybody i could submit the designs too that could manufacture the concept and put it to use
- Need help finding a book. Female OP protagonist, magic
- Why is the WWF pending games (“Your turn”) area replaced w/ a column of “Bonus & Reward”gift boxes?
- Does Google Analytics track 404 page responses as valid page views?

© 2023 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP