Cross Validated Asked by CBGodbole on January 1, 2022

I am performing multiple linear regression to predict a score (dependent variable) from multiple categorical variables. My dependent variable has skewed distribution with a large number of zero values but no negative values.

Can I use Box-Cox transformation in this scenario?

I tried to run it in R, but got the error message –

"Error in boxcox.default(linreg1) : response variable must be positive"

Box-Cox transformation works fine with zeros. Hope you are using boxcox.fit() in package named geoR.

However, you can solve your problem of skewness with other transformations like:

- Square root transformation. However, often the square root is not a strong enough transformation to deal with the high levels of skewness.
- Use log(x+1) transformation which is a widely accepted way of feature transformation.

Also, I don't understand why you are doing transformation of the dependent variable. I agree with @dave for the assumption of normality in regression.

Answered by Vivek on January 1, 2022

2 Asked on November 24, 2021 by dmittal

2 Asked on November 24, 2021 by appleseed

confidence interval missing data multiple imputation multiple regression regression

0 Asked on November 24, 2021 by mrmartin

1 Asked on November 24, 2021

mixed model multilevel analysis multiple regression regression coefficients repeated measures

0 Asked on November 24, 2021

3 Asked on November 24, 2021

generalized linear model regression regression coefficients standard error statistical significance

1 Asked on November 24, 2021 by jack-arthur

1 Asked on November 24, 2021

0 Asked on November 24, 2021 by doug-sponsler

0 Asked on November 24, 2021

0 Asked on November 24, 2021

computer vision neural networks optical character recognition pattern recognition

0 Asked on November 24, 2021 by zhutchens1

1 Asked on November 24, 2021

conditional expectation conditional probability econometrics expected value regression

0 Asked on November 24, 2021 by chiara-toschi

1 Asked on November 24, 2021 by tzirtzi

1 Asked on November 21, 2021 by user3358740

1 Asked on November 21, 2021 by scott-thibault

1 Asked on November 21, 2021 by arctan27

1 Asked on November 21, 2021

multilevel analysis multiple regression poisson regression regression repeated measures

Get help from others!

Recent Answers

- Peter Machado on Why fry rice before boiling?
- Lex on Does Google Analytics track 404 page responses as valid page views?
- Jon Church on Why fry rice before boiling?
- haakon.io on Why fry rice before boiling?
- Joshua Engel on Why fry rice before boiling?

Recent Questions

- How Do I Get The Ifruit App Off Of Gta 5 / Grand Theft Auto 5
- Iv’e designed a space elevator using a series of lasers. do you know anybody i could submit the designs too that could manufacture the concept and put it to use
- Need help finding a book. Female OP protagonist, magic
- Why is the WWF pending games (“Your turn”) area replaced w/ a column of “Bonus & Reward”gift boxes?
- Does Google Analytics track 404 page responses as valid page views?

© 2023 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP, SolveDir