# Error propagation in combined linear models

Cross Validated Asked by Chochot on January 5, 2022

I have a set of observed values ($y_{1Obs}$) and 3 predictive variables (n = 27). I use multiple linear regression to create a linear model:

$Z_1=alpha_0 + alpha_1 W_1 + alpha_2 X_1 + alpha_3 Y_1$

Where $Z_1$ is my response variable, $alpha_0$ is the intercept, $alpha_1 , alpha_2 , alpha_3$ are regression coefficients and $W_1, X_1, Y_1$ are predictive variables.

I also have a second set of observed values ($y_{2Obs}$) and 3 different predictive variables (n = 27). I again use multiple linear regression to create a second linear model of the same form:

$Z_2=beta_0 + beta_1 W_2 + beta_2 X_2 + beta_3 Y_2$

Where $Z_2$ is my response variable, $beta_0$ is the intercept, $beta_1 , beta_2 , beta_3$ are regression coefficients and $W_2, X_2, Y_2$ are predictive variables.

With 27 predicted values from each model I then calculate the predicted change between the two:

$Delta Z_{Pred}=Z_2 – Z_1$

While my observed change comes from the two sets of observed values used to create the two linear models:

$Delta Z_{Obs}=y_{2Obs} – y_{1Obs}$

My question is: what formula do I use to calculate the RMSE of my predictions ($Delta Z_{Pred}$) that will propagate the errors from predictions of $Z_1$ and $Z_2$?

I’ve tried using the following formula though I’m not convinced this is correct:

$RMSE_{Delta Z_{Pred}} = sqrt{{RMSE_1}^2 + {RMSE_2}^2}$

Where $RMSE_1$ and $RMSE_2$ are the RMSEs from the first two models shown above.

• The formula $RMSE_{Delta Z_{Pred}} = sqrt{{RMSE_1}^2 + {RMSE_2}^2}$ may work for independent sets of responses and predictors, but is likely to be false if the two sets are correlated. For example, if your two sets of predictors and responses are the same $RMSE_{Delta Z_{Pred}}=0$, while $sqrt{{RMSE_1}^2 + {RMSE_2}^2} > 0$.
• If you are interested in $Delta Z_{Pred}$, I suggest regressing it on the six predictors - although this is not what you are asking, it might server your goal.

Answered by Pere on January 5, 2022

## Related Questions

### Difference of two non-central chi squared random variables

1  Asked on February 14, 2021 by pol

### Why do the mean and proportion measurements take the spotlight in estimation?

1  Asked on February 13, 2021

### Feature engineering: Measuring proportional changes across two item given two conditions

0  Asked on February 13, 2021 by comte

### Validation of my implementation for trace norm and L1 regularization

0  Asked on February 13, 2021 by rando

### Multivariate Normal Quadratic MGF: Eigendecomposition to Matrix form

0  Asked on February 12, 2021 by itsallpurple

### Sample Clustering by Accounting for Gene Fold Change and p-value

0  Asked on February 12, 2021 by user294496

### How do we construct features to use as input to machine learning algorithms for the purpose of movie recommendations(using collaborative filtering)?

1  Asked on February 11, 2021 by user3676846

### subspace clustering with density threshold

0  Asked on February 11, 2021 by pianobegginer

### In real-life applications, which continuous distributions have NON-CONVERGENT expectations that require Lebesgue integration?

0  Asked on February 11, 2021 by iterator516

### How many combinations and how to find a (standard?) distribution that matches?

2  Asked on February 10, 2021 by d-b

### Does PyMC3’s Timeseries API allow for time varying parameters of any model that would require fixed values under a frequentist approach?

0  Asked on February 10, 2021 by lisa-ann

### How do I modify data to test retest measure as more normal?

1  Asked on February 10, 2021 by user136083

### Anomaly detection in Text Classification

2  Asked on February 9, 2021 by naveen-y

### Which output activation is recommended when predicting a variable with a lower but not an upper bound?

2  Asked on February 9, 2021 by user26067

### Extracting a parameter from a probability problem

1  Asked on February 8, 2021 by mishe-mitasek

### Missing target variable values for a labeled model with just a time feature (timeseries)?

0  Asked on February 8, 2021 by amit-s

### What happens to the Gaussian as $sigma to infty$ at $x to infty$?

1  Asked on February 6, 2021 by rkabra

### Deriving comparable probabilites from continuous and discrete data

0  Asked on February 6, 2021 by pst0102