3 개의 랜덤 변수의 상관 관계에 대한 경계

28

세 가지 랜덤 변수가 있습니다 . 세 변수 사이의 세 가지 상관 관계는 동일합니다. 그건, $x,y,z$

ρ = cor (x, y) = cor (x, z) = cor (y, z)

$\rho=\textrm{cor}(x,y)=\textrm{cor}(x,z)=\textrm{cor}(y,z)$

줄 수있는 가장 빡빡한 것은 무엇입니까 ? $\rho$

correlation correlation-matrix

— 사용자 1352399
소스

1

아마도 "pho"는 rho (

ρ

$\rho$ ) 를 의미 합니다. 그러나 귀하의 질문은 명확하지 않습니다. "당신이 줄 수있는 가장 엄격한 한계는 무엇입니까?"는 무엇을 의미합니까?

— gung-Monica Monica 복원

변수의 이름은 단지 더미입니다. 가장 밀접한 관계로, 상관 관계에 대해 [-1, 1]과 같은 것을 의미하지만 이것은 가능한 가장 엄격한 한계는 아닙니다.

— user1352399

rho = cor (x, y) = cor (x, z) = cor (y, z)이고 rho의 한계는 무엇입니까?

— user31264

네, rho = cor (x, y) = cor (x, z) = cor (y, z)이고 rho의 한계는 무엇입니까? Dilip, rho가 음이 아니어야한다는 것을 말해 줄 수 있습니까?

— user1352399

1

이것을 인용 할 교과서는 Seber & Lee "선형 회귀 분석"(적어도 첫 번째 판에서는 ...)

— kjetil b halvorsen

29

공통 상관 관계 $\rho$ 는 값 가질 수 $+1$ 있지만 아닙니다 $-1$ . 경우 $\rho_{X,Y}= \rho_{X,Z}=-1$ , 다음 $\rho_{Y,Z}$ 동일하지 않은 수 $-1$ 하지만 사실상 $+1$ . 세 확률 변수의 일반적인 상관 관계의 가장 작은 값은 $-\frac{1}{2}$ . 보다 일반적으로, 최소한의 공통 상관 $n$ 랜덤 변수이다 $-\frac{1}{n-1}$ 벡터로 간주 할 때, 그들은 (차원 단면의 정점에있는 $n-1$ )에서 $n$ 차원 공간.

$n$ 단위 분산 랜덤 변수 의 합의 분산을 고려하십시오 $X_i$ . 우리는 그 여기서 인평균값의

\begin{aligned} var (\sum_{i = 1}^{n} X_{i}) & = \sum_{i = 1}^{n} var (X_{i}) + \sum_{i = 1}^{n} \sum_{j \neq i}^{n} cov (X_{i}, X_{j}) \\ = n + \sum_{i = 1}^{n} \sum_{j \neq i}^{n} ρ_{X_{i}, X_{j}} \\ (1) & = n + n (n - 1) \bar{ρ} \end{aligned}

$\begin{align*} \operatorname{var}\left(\sum_{i=1}^n X_i\right) &= \sum_{i=1}^n \operatorname{var}(X_i) + \sum_{i=1}^n\sum_{j\neq i}^n \operatorname{cov}(X_i,X_j)\\ &= n + \sum_{i=1}^n\sum_{j\neq i}^n \rho_{X_i,X_j}\\ &= n + n(n-1)\bar{\rho} \tag{1} \end{align*}$

\bar{ρ}

$\bar{\rho}$

상관 계수. 그러나 이후

, 우리는 쉽게에서 얻을

그

(\binom{n}{2})

$\binom{n}{2}$

var (\sum_{i} X_{i}) \geq 0

$\operatorname{var}\left(\sum_i X_i\right) \geq 0$

(1)

$(1)$

\bar{ρ} \geq - \frac{1}{n - 1} .

$\bar{\rho} \geq -\frac{1}{n-1}.$

그래서, 상관 계수의 평균 값은 적어도 . 경우모든상관 계수는이같은값, 그들의 평균은 동일하고 우리가 그래서 $-\frac{1}{n-1}$ $\rho$ $\rho$ 그것은 일반적인 상관 치되는 확률 변수 가질 수 있습니다동일을

ρ \geq - \frac{1}{n - 1} .

$\rho \geq -\frac{1}{n-1}.$

ρ

$\rho$

? 예.

가상관되지 않은 단위 분산 랜덤 변수이고

설정 한다고 가정합니다.

- \frac{1}{n - 1}

$-\frac{1}{n-1}$

X_{i}

$X_i$

. 그런 다음

이고

Y_{i} = X_{i} - \frac{1}{n} \sum_{j = 1}^{n} X_{j} = X_{i} - \bar{X}

$Y_i = X_i - \frac{1}{n}\sum_{j=1}^n X_j = X_i -\bar{X}$

E [Y_{i}] = 0

$E[Y_i]=0$

및

var (Y_{i}) = {(\frac{n - 1}{n})}^{2} + (n - 1) {(\frac{1}{n})}^{2} = \frac{n - 1}{n}

$\displaystyle \operatorname{var}(Y_i) = \left(\frac{n-1}{n}\right)^2 + (n-1)\left(\frac{1}{n}\right)^2 = \frac{n-1}{n}$

은

cov (Y_{i}, Y_{j}) = - 2 (\frac{n - 1}{n}) (\frac{1}{n}) + (n - 2) {(\frac{1}{n})}^{2} = - \frac{1}{n}

$\operatorname{cov}(Y_i,Y_j) = -2\left(\frac{n-1}{n}\right)\left(\frac{1}{n}\right) + (n-2)\left(\frac{1}{n}\right)^2 = -\frac{1}{n}$

따라서,

최소 공통 상관 값을 달성하는 임의의 변수는

ρ_{Y_{i}, Y_{j}} = \frac{cov (Y_{i}, Y_{j})}{\sqrt{var (Y_{i}) var (Y_{j})}} = \frac{- 1 / n}{(n - 1) / n} = - \frac{1}{n - 1} .

$\rho_{Y_i,Y_j} = \frac{\operatorname{cov}(Y_i,Y_j)}{\sqrt{\operatorname{var}(Y_i)\operatorname{var}(Y_j)}} =\frac{-1/n}{(n-1)/n} = -\frac{1}{n-1}.$

Y_{i}

$Y_i$

. 우연히도,

이므로 벡터로 간주되는 랜덤 변수는

차원 공간의

차원 초평면에있습니다.

- \frac{1}{n - 1}

$-\frac{1}{n-1}$

\sum_{i} Y_{i} = 0

$\sum_i Y_i = 0$

(n - 1)

$(n-1)$

n

$n$

— 디립 사르 베이트
소스

25

소감 가능한 바인딩입니다 . $-1/2 \le \rho \le 1$ 그러한 모든 가치는 실제로 나타날 수 있습니다.

결과에 대해 특별히 깊거나 신비로운 것이 없다는 것을 보여주기 위해이 대답은 먼저 완전한 기본 솔루션을 제시하며, 예상되는 제곱의 값이되는 편차가 음이 아니어야한다는 명백한 사실 만 요구합니다. 그 다음에는 일반적인 해결책 (약간 더 복잡한 대수적 사실을 사용함)이 이어집니다.

기본 솔루션

의 선형 조합의 분산은 음이 아니어야합니다. $x,y,z$ 수 (가) 이러한 변수의 편차를 보자 및 각각. 모두 0이 아닙니다 (그렇지 않으면 일부 상관 관계가 정의되지 않음). 우리가 계산할 수있는 분산의 기본 속성을 사용하여 $\sigma^2, \tau^2,$ $\upsilon^2$

0 \leq Var (α x / σ + β y / τ + γ z / υ) = α^{2} + β^{2} + γ^{2} + 2 ρ (α β + β γ + γ α)

$0 \le \text{Var}(\alpha x/\sigma + \beta y/\tau + \gamma z/\upsilon) = \alpha^2 +\beta^2+\gamma^2 + 2\rho(\alpha\beta+\beta\gamma+\gamma\alpha)$

모든 실수 . $(\alpha, \beta, \gamma)$

이라고 가정하면 약간의 대수적 조작은 다음과 같습니다. $\alpha+\beta+\gamma\ne 0$

\frac{- ρ}{1 - ρ} \leq \frac{1}{3} {(\frac{\sqrt{(α^{2} + β^{2} + γ^{2}) / 3}}{(α + β + γ) / 3})}^{2} .

$\frac{-\rho}{1-\rho} \le \frac{1}{3} \left(\frac{\sqrt{(\alpha^2+\beta^2+\gamma^2)/3}}{(\alpha+\beta+\gamma)/3}\right)^2.$

$(\alpha, \beta, \gamma)$ $(1/3, 1/3, 1/3)$ $1$ $1$ $\alpha=\beta=\gamma\ne 0$

ρ \geq - 1 / 2.

$\rho \ge -1/2.$

의 명시 적 예 $n=3$ below (involving trivariate Normal variables $(x,y,z)$ ) shows that all such values, $-1/2 \le \rho \le 1$ , actually do arise as correlations. This example uses only the definition of multivariate Normals, but otherwise invokes no results of Calculus or Linear Algebra.

General solution

Overview

Any correlation matrix is the covariance matrix of the standardized random variables, whence--like all correlation matrices--it must be positive semi-definite. Equivalently, its eigenvalues are non-negative. This imposes a simple condition on $\rho$ : it must not be any less than $-1/2$ (and of course cannot exceed $1$ ). Conversely, any such $\rho$ actually corresponds to the correlation matrix of some trivariate distribution, proving these bounds are the tightest possible.

Derivation of the conditions on $\rho$

Consider the $n$ by $n$ correlation matrix with all off-diagonal values equal to $\rho.$ (The question concerns the case $n=3,$ but this generalization is no more difficult to analyze.) Let's call it $\mathbb{C}(\rho, n).$ By definition, $\lambda$ is an eigenvalue of provided there exists a nonzero vector $\mathbf{x}_\lambda$ such that

C (ρ, n) x_{λ} = λ x_{λ} .

$\mathbb{C}(\rho,n) \mathbf{x}_\lambda = \lambda \mathbf{x}_\lambda.$

These eigenvalues are easy to find in the present case, because

Letting $\mathbf{1} = (1, 1, \ldots, 1)'$ , compute that

$C (ρ, n) 1 = (1 + (n - 1) ρ) 1 .$ $\mathbb{C}(\rho,n)\mathbf{1} = (1+(n-1)\rho)\mathbf{1}.$
Letting $\mathbf{y}_j = (-1, 0, \ldots, 0, 1, 0, \ldots, 0)$ with a $1$ only in the $j^\text{th}$ place (for $j = 2, 3, \ldots, n$ ), compute that

$C (ρ, n) y_{j} = (1 - ρ) y_{j} .$ $\mathbb{C}(\rho,n)\mathbf{y}_j = (1-\rho)\mathbf{y}_j.$

Because the $n$ eigenvectors found so far span the full $n$ dimensional space (proof: an easy row reduction shows the absolute value of their determinant equals $n$ , which is nonzero), they constitute a basis of all the eigenvectors. We have therefore found all the eigenvalues and determined they are either $1+(n-1)\rho$ or $1-\rho$ (the latter with multiplicity $n-1$ ). In addition to the well-known inequality $-1 \le \rho \le 1$ satisfied by all correlations, non-negativity of the first eigenvalue further implies

ρ \geq - \frac{1}{n - 1}

$\rho \ge -\frac{1}{n-1}$

while the non-negativity of the second eigenvalue imposes no new conditions.

Proof of sufficiency of the conditions

The implications work in both directions: provided $-1/(n-1)\le \rho \le 1,$ the matrix $\mathbb{C}(\rho, n)$ is nonnegative-definite and therefore is a valid correlation matrix. It is, for instance, the correlation matrix for a multinormal distribution. Specifically, write

Σ (ρ, n) = (1 + (n - 1) ρ) I_{n} - \frac{ρ}{(1 - ρ) (1 + (n - 1) ρ)} 1 1^{'}

$\Sigma(\rho, n) = (1 + (n-1)\rho)\mathbb{I}_n - \frac{\rho}{(1-\rho)(1+(n-1)\rho)}\mathbf{1}\mathbf{1}'$

for the inverse of $\mathbb{C}(\rho, n)$ when $-1/(n-1) \lt \rho \lt 1.$ For example, when $n=3$

Σ (ρ, 3) = \frac{1}{(1 - ρ) (1 + 2 ρ)} (\begin{array}{ccc} ρ + 1 & - ρ & - ρ \\ - ρ & ρ + 1 & - ρ \\ - ρ & - ρ & ρ + 1 \end{array}) .

$\color{gray}{\Sigma(\rho, 3) = \frac{1}{(1-\rho)(1+2\rho)} \left( \begin{array}{ccc} \rho +1 & -\rho & -\rho \\ -\rho & \rho +1 & -\rho \\ -\rho & -\rho & \rho +1 \\ \end{array} \right)}.$

Let the vector of random variables $(X_1, X_2, \ldots, X_n)$ have distribution function

f_{ρ, n} (x) = \frac{\exp (- \frac{1}{2} x Σ (ρ, n) x^{'})}{(2 π)^{n / 2} {((1 - ρ)^{n - 1} (1 + (n - 1) ρ))}^{1 / 2}}

$f_{\rho, n}(\mathbf{x}) = \frac{\exp\left(-\frac{1}{2}\mathbf{x}\Sigma(\rho, n)\mathbf{x}'\right)}{(2\pi)^{n/2}\left((1-\rho)^{n-1}(1+(n-1)\rho)\right)^{1/2}}$

where $\mathbf{x} = (x_1, x_2, \ldots, x_n)$ . For example, when $n=3$ this equals

\frac{1}{\sqrt{(2 π)^{3} (1 - ρ)^{2} (1 + 2 ρ)}} \exp (- \frac{(1 + ρ) (x^{2} + y^{2} + z^{2}) - 2 ρ (x y + y z + z x)}{2 (1 - ρ) (1 + 2 ρ)}) .

$\color{gray}{\frac{1}{\sqrt{(2\pi)^{3}(1-\rho)^2(1+2\rho)}} \exp\left(-\frac{(1+\rho)(x^2+y^2+z^2) - 2\rho(xy+yz+zx)}{2(1-\rho)(1+2\rho)}\right)}.$

The correlation matrix for these $n$ random variables is $\mathbb{C}(\rho, n).$

Contours of the density functions $f_{\rho,3}.$ From left to right, $\rho=-4/10, 0, 4/10, 8/10$ . Note how the density shifts from being concentrated near the plane $x+y+z=0$ to being concentrated near the line $x=y=z$ .

The special cases $\rho = -1/(n-1)$ and $\rho = 1$ can also be realized by degenerate distributions; I won't go into the details except to point out that in the former case the distribution can be considered supported on the hyperplane $\mathbf{x}.\mathbf{1}=0$ , where it is a sum of identically distributed mean- $0$ Normal distribution, while in the latter case (perfect positive correlation) it is supported on the line generated by $\mathbf{1}'$ , where it has a mean- $0$ Normal distribution.

More about non-degeneracy

A review of this analysis makes it clear that the correlation matrix $\mathbb{C}(-1/(n-1), n)$ has a rank of $n-1$ and $\mathbb{C}(1, n)$ has a rank of $1$ (because only one eigenvector has a nonzero eigenvalue). For $n\ge 2$ , this makes the correlation matrix degenerate in either case. Otherwise, the existence of its inverse $\Sigma(\rho, n)$ proves it is nondegenerate.

— whuber
소스

20

Your correlation matrix is

(\begin{matrix} 1 & ρ & ρ \\ ρ & 1 & ρ \\ ρ & ρ & 1 \end{matrix})

$\begin{pmatrix} 1&\rho&\rho\\ \rho&1&\rho\\ \rho&\rho&1 \end{pmatrix}$

The matrix is positive semidefinite if the leading principal minors are all non-negative. The principal minors are the determinants of the "north-west" blocks of the matrix, i.e. 1, the determinant of

(\begin{matrix} 1 & ρ \\ ρ & 1 \end{matrix})

$\begin{pmatrix} 1&\rho\\ \rho&1\end{pmatrix}$

and the determinant of the correlation matrix itself.

1 is obviously positive, the second principal minor is $1-\rho^2$ , which is nonnegative for any admissible correlation $\rho\in[-1,1]$ . The determinant of the entire correlation matrix is

2 ρ^{3} - 3 ρ^{2} + 1.

$2\rho^3-3\rho^2+1.$

The plot shows the determinant of the function over the range of admissible correlations $[-1,1]$ . enter image description here

You see the function is nonnegative over the range given by @stochazesthai (which you could also check by finding the roots of the determinantal equation).

— Christoph Hanck
소스

Aren't we assuming in your answer that

V a r () = 1

$Var( )=1$ ? Why can we?

— An old man in the sea.

1

@Anold You seem to be reading "covariance" where "correlation" is written.

— whuber

6

There exist random variables $X$ , $Y$ and $Z$ with pairwise correlations $\rho_{XY} = \rho_{YZ} = \rho_{XZ} = \rho$ if and only if the correlation matrix is positive semidefinite. This happens only for $\rho \in [-\frac{1}{2},1]$ .

— stochazesthai
소스

2

can you explain this in very simple terms.

— Elizabeth Susan Joseph

1

I don't think there exists an explanation that does not require the knowledge of matrix algebra. I suggest you to look at the Wikipedia page (en.wikipedia.org/wiki/…).

— stochazesthai

4

I found an explanation that requires only basic (high school level) algebra and have included that in my answer.