# Trying to wrap my arms around copulas

Cross Validated Asked by eSurfsnake on January 7, 2022

This topic is dense with notation that makes things a bit confusing. But is this the correct interpretation?

Suppose we have two jointly distributed random variables – $X$ and $Y$ – of arbitrary (but let’s assume known) CDFs. The problem is the joint probability for any pair of values ($x$,$y$) is not simply $F_X(x)F_Y(y)$ because they are not independent. That actual joint distribution seems to be often called $H(x,y)$.

Now, it seems to me that the copula, in the end, is simply a function such that $C(F_X(x),F_Y(y))$ actually maps to the value for $H(x,y)$.

It accomplishes this by sort of running the marginals “backwards”, and also baking in the jointness into another joint distribution specified on $[0,1]^{2}$ where the marginals are uniform.

So, in the end, is it just a mapping that has the right “correction” for jointness, where all you need to give it (once you have it calculated) is the naïve values $F_X(x)$ and $F_Y(y)$, and it delivers the actual $H(x,y)$? In other words, Sklar’s theorem guarantees there is a one-to-one mapping from the pair $F_X(x)$, $F_Y(y)$ to $H(x,y)$, and the copula captures all that information?

You probably haven't got an answer because everything you said is correct! As you said, Sklar's Theorem implies that everything about the "jointness" of the distribution is encoded by the copula, so once you know the $F_X(x)$ and $F_Y(y)$ and have a copula, then you know the joint cumulative distribution function.

$$H(x,y) = P(X \le x \quad \& \quad Y \le y) = C(F_X(x), F_Y(y))$$
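If it helps to see the identity numerically, here is a minimal sketch. It assumes a Gaussian copula with correlation parameter 0.7 and two marginals picked purely for illustration (an Exponential(1) for $X$ and a Normal(2, 3²) for $Y$); none of these choices come from the question. It compares a Monte Carlo estimate of $H(x,y)$ with $C(F_X(x), F_Y(y))$.

```python
import numpy as np
from scipy.stats import norm, expon, multivariate_normal

RHO = 0.7  # assumed correlation parameter of the Gaussian copula


def gaussian_copula_cdf(u, v, rho=RHO):
    """C(u, v) = Phi_2(Phi^{-1}(u), Phi^{-1}(v); rho) for the Gaussian copula."""
    cov = [[1.0, rho], [rho, 1.0]]
    point = [norm.ppf(u), norm.ppf(v)]
    return multivariate_normal(mean=[0.0, 0.0], cov=cov).cdf(point)


def sample_xy(n, rho=RHO, seed=0):
    """Draw (X, Y) whose dependence is the Gaussian copula above."""
    rng = np.random.default_rng(seed)
    z = rng.multivariate_normal([0.0, 0.0], [[1.0, rho], [rho, 1.0]], size=n)
    u = norm.cdf(z)                            # uniform marginals on [0, 1]^2
    x = expon.ppf(u[:, 0])                     # X ~ Exponential(1) (illustrative)
    y = norm.ppf(u[:, 1], loc=2.0, scale=3.0)  # Y ~ Normal(2, 3^2) (illustrative)
    return x, y


x0, y0 = 1.0, 2.5
fx, fy = expon.cdf(x0), norm.cdf(y0, loc=2.0, scale=3.0)

x, y = sample_xy(200_000)
h_empirical = np.mean((x <= x0) & (y <= y0))  # Monte Carlo estimate of H(x0, y0)
h_copula = gaussian_copula_cdf(fx, fy)        # C(F_X(x0), F_Y(y0))
h_naive = fx * fy                             # what independence would give

print(f"empirical H: {h_empirical:.4f}")
print(f"copula    C: {h_copula:.4f}")
print(f"naive F*F  : {h_naive:.4f}")
```

The first two numbers should agree up to simulation noise, while the naive product should come out noticeably smaller because the assumed dependence is positive.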

Or, if things are differentiable and you prefer to express things in terms of densities, you can write the joint density as

$$h(x, y) = \color{red}{\frac{\partial^2 C}{\partial u \, \partial v}(F_X(x), F_Y(y))} f_X(x) f_Y(y)$$

where $u$ and $v$ denote the first and second arguments of $C$, and you can think of the red term as the "correction" for non-independence.
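As a rough check of the density form, the sketch below assumes a bivariate normal distribution, for which the copula density has a known closed form. The parameter values and the evaluation point are arbitrary illustrations. It multiplies the copula density (the "correction") by the two marginal densities and compares the result with the joint density computed directly.

```python
import numpy as np
from scipy.stats import norm, multivariate_normal


def gaussian_copula_density(u, v, rho):
    """Mixed second partial derivative of the Gaussian copula at (u, v)."""
    a, b = norm.ppf(u), norm.ppf(v)
    exponent = (2 * rho * a * b - rho**2 * (a**2 + b**2)) / (2 * (1 - rho**2))
    return np.exp(exponent) / np.sqrt(1 - rho**2)


# Illustrative parameters (assumptions, not from the original post)
rho = 0.6
mu_x, sd_x = 1.0, 2.0
mu_y, sd_y = -0.5, 0.5
x0, y0 = 2.2, -0.1

# "Correction" term evaluated at (F_X(x0), F_Y(y0))
correction = gaussian_copula_density(
    norm.cdf(x0, mu_x, sd_x), norm.cdf(y0, mu_y, sd_y), rho
)

# Joint density via the copula factorisation
h_factored = correction * norm.pdf(x0, mu_x, sd_x) * norm.pdf(y0, mu_y, sd_y)

# Joint density computed directly from the bivariate normal
cov = [[sd_x**2, rho * sd_x * sd_y], [rho * sd_x * sd_y, sd_y**2]]
h_direct = multivariate_normal(mean=[mu_x, mu_y], cov=cov).pdf([x0, y0])

print(correction, h_factored, h_direct)
```

With `rho = 0` the correction is identically 1, so the joint density collapses to the product of the marginal densities, as you would expect under independence.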

Answered by Flounderer on January 7, 2022

(I originally posted this as a comment. While I think this is more of a comment than an answer, this interesting question has gone two years without any answers, and a short answer is better than no answer.)

Think of the density on the unit square as describing relationships between quantiles of the marginal distributions. If you look at a Gaussian copula for a high, positive correlation, when $X$ has a low quantile (say $0.1$), it’s likely that $Y$ will, too, and it is unlikely that $Y$ will have a high quantile. Ditto for when $X$ has a high quantile (say $0.9$).
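A quick simulation can make this concrete. The sketch below assumes a Gaussian copula with correlation 0.9 and uses cutoffs of 0.2 and 0.8 for "low" and "high" quantiles of $Y$; these numbers are illustrative choices, not part of the original argument.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(42)
rho = 0.9  # assumed "high, positive" correlation

# Sample from the Gaussian copula: correlated normals pushed through Phi
z = rng.multivariate_normal([0.0, 0.0], [[1.0, rho], [rho, 1.0]], size=100_000)
u, v = norm.cdf(z[:, 0]), norm.cdf(z[:, 1])  # quantiles of X and Y

low_x = u <= 0.1   # X is at or below its 0.1 quantile
high_x = u >= 0.9  # X is at or above its 0.9 quantile

p_y_low_given_x_low = np.mean(v[low_x] <= 0.2)
p_y_high_given_x_low = np.mean(v[low_x] >= 0.8)
p_y_high_given_x_high = np.mean(v[high_x] >= 0.8)

print(f"P(Y quantile <= 0.2 | X quantile <= 0.1) ~ {p_y_low_given_x_low:.3f}")
print(f"P(Y quantile >= 0.8 | X quantile <= 0.1) ~ {p_y_high_given_x_low:.3f}")
print(f"P(Y quantile >= 0.8 | X quantile >= 0.9) ~ {p_y_high_given_x_high:.3f}")
```

Because only the quantiles `u` and `v` are used, the same conditional frequencies would apply whatever marginal distributions you attach to $X$ and $Y$.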

Answered by Dave on January 7, 2022
