간단한 선형 모형을 고려하십시오.

y y = X' β β + ϵ

$\pmb{y}=X'\pmb{\beta}+\epsilon$

여기서 및 , 및 는 열을 포함합니다. 상수. $\epsilon_i\sim\mathrm{i.i.d.}\;\mathcal{N}(0,\sigma^2)$ $X\in\mathbb{R}^{n\times p}$ $p\geq2$ $X$

내 질문은 $\mathrm{E}(X'X)$ , $\beta$ 및 가 주어지면 * $\sigma$ 에 사소한 상한에 대한 공식이 있습니까? (모델이 OLS에 의해 추정되었다고 가정). $\mathrm{E}(R^2)$

* 내가이 글을 쓰는, 가정이 점점 $E(R^2)$ 자체가 불가능했을 것입니다.

편집 1

Stéphane Laurent (아래 참조)에서 파생 된 솔루션을 사용하여 에 대한 사소한 상한을 얻을 수 있습니다 $E(R^2)$ . 일부 수치 시뮬레이션 (아래)은이 한계가 실제로 매우 엄격하다는 것을 보여줍니다.

: 스테판 랑은 다음 유래 비 중심적 파라미터 비 중앙 베타 분포 로를 $R^2\sim\mathrm{B}(p-1,n-p,\lambda)$ $\mathrm{B}(p-1,n-p,\lambda)$ $\lambda$

λ = | | X ' β - E ( X ) ' β 1 n | | 2 σ 2

$\lambda=\frac{||X'\beta-\mathrm{E}(X)'\beta1_n||^2}{\sigma^2}$

그래서

E (R 2) = E (χ 2 p - 1 ( λ ) χ 2 p - 1 ( λ ) + χ 2 n - p) \geq E ( χ 2 p - 1 ( λ ) ) E ( χ 2 p - 1 ( λ ) ) + E ( χ 2 n - p )

$\mathrm{E}(R^2)=\mathrm{E}\left(\frac{\chi^2_{p-1}(\lambda)}{\chi^2_{p-1}(\lambda)+\chi^2_{n-p}}\right)\geq\frac{\mathrm{E}\left(\chi^2_{p-1}(\lambda)\right)}{\mathrm{E}\left(\chi^2_{p-1}(\lambda)\right)+\mathrm{E}\left(\chi^2_{n-p}\right)}$

여기서 는 매개 변수 와 자유도를 가진 비 중심 입니다 . 따라서 대한 사소한 상한 은 $\chi^2_{k}(\lambda)$ $\chi^2$ $\lambda$ $k$ $\mathrm{E}(R^2)$

λ + p - 1 λ + n - 1

$\frac{\lambda+p-1}{\lambda+n-1}$

그것은이다 매우 (I 가능한 것 기대했던 것보다 훨씬 엄격한) 꽉 :

예를 들어 다음을 사용합니다.

rho<-0.75
p<-10
n<-25*p
Su<-matrix(rho,p-1,p-1)
diag(Su)<-1
su<-1
set.seed(123)
bet<-runif(p)

1000 개 이상의 시뮬레이션 에서 의 평균은 입니다 . 위의 이론 상한은 다음과 같습니다 . 바운드는 많은 값에서 똑같이 정확 해 보입니다 . 정말 놀랍습니다! $R^2$ 0.9608190.9609081 $R^2$

EDIT2 :

더욱 연구 한 결과,이 표시 의 상한의 근사 품질 것을 보다로서 얻을 증가 (과 다른 모든 동등 로 증가 ). $E(R^2)$ $\lambda+p$ $\lambda$ $n$

linear-model expected-value

— 사용자 603
소스

는

과

에만 의존하는 매개 변수를 가진 베타 분포를 가지고있습니다. 아니 ? R2 $R^2$

n $n$

p $p$

— Stéphane Laurent

죄송합니다. 이전의 주장은 "널 모델"(인터셉트 만)의 가설 하에서 만 사실입니다. 그렇지 않으면

의 분포는 비 중심 베타 분포와 유사해야하며 알 수없는 매개 변수를 포함하는 비 중심 매개 변수가 있어야합니다. R2 $R^2$

— Stéphane Laurent

@ StéphaneLaurent : 감사합니다. 알려지지 않은 매개 변수와 베타의 매개 변수 사이의 관계에 대해 더 알고 싶으십니까? 나는 붙어있어 어떤 포인터라도 환영받을 것이다 ...

— user603

당신은 절대적으로 대처해야합니까

? 아마도위한 간단한 정확한 수식가

. E[R2] $E[R^2]$

E[R2/(1−R2)] $E[R^2/(1-R^2)]$

— Stéphane Laurent

내 대답의 표기법으로 일부 스칼라

경우

이며 비 중앙

분포 의 첫 번째 순간 은 간단합니다. R2/(1−R2)=kF $R^2/(1-R^2) = k F$

k $k$

F $F$

— Stéphane Laurent

Any linear model can be written $\boxed{Y=\mu+\sigma G}$ where $G$ has the standard normal distribution on $\mathbb{R}^n$ and $\mu$ is assumed to belong to a linear subspace $W$ of $\mathbb{R}^n$ . In your case $W=\text{Im}(X)$ .

Let $[1] \subset W$ be the one-dimensional linear subspace generated by the vector $(1,1,\ldots,1)$ . Taking $U=[1]$ below, the $R^2$ is highly related to the classical Fisher statistic

F = ∥ P Z Y ∥ 2 / ( m - ℓ ) ∥ P ⊥ W Y ∥ 2 / ( n - m ),

$F = \frac{{\Vert P_Z Y\Vert}^2/(m-\ell)}{{\Vert P_W^\perp Y\Vert}^2/(n-m)},$ for the hypothesis test of

H0:{μ∈U} $H_0\colon\{\mu \in U\}$ where

U⊂W $U\subset W$ is a linear subspace, and denoting by

Z=U⊥∩W $Z=U^\perp \cap W$ the orthogonal complement of

U $U$ in

W $W$ , and denoting

m=dim(W) $m=\dim(W)$ and

ℓ=dim(U) $\ell=\dim(U)$ (then

m=p $m=p$ and

ℓ=1 $\ell=1$ in your situation).

Indeed,

∥ P Z Y ∥ 2 ∥ P ⊥ W Y ∥ 2 = R 2 1 - R 2

$\dfrac{{\Vert P_Z Y\Vert}^2}{{\Vert P_W^\perp Y\Vert}^2} = \frac{R^2}{1-R^2}$ because the definition of

R2 $R^2$ is

R 2 = ∥ P Z Y ∥ 2 ∥ P ⊥ U Y ∥ 2 = 1 - ∥ P ⊥ W Y ∥ 2 ∥ P ⊥ U Y ∥ 2 .

$R^2 = \frac{{\Vert P_Z Y\Vert}^2}{{\Vert P_U^\perp Y\Vert}^2}=1 - \frac{{\Vert P^\perp_W Y\Vert}^2}{{\Vert P_U^\perp Y\Vert}^2}.$

Obviously $\boxed{P_Z Y = P_Z \mu + \sigma P_Z G}$ and $\boxed{P_W^\perp Y = \sigma P_W^\perp G}$ .

When $H_0\colon\{\mu \in U\}$ is true then $P_Z \mu = 0$ and therefore

F = ∥ P Z G ∥ 2 / ( m - ℓ ) ∥ P ⊥ W G ∥ 2 / ( n - m ) \sim F m - ℓ, n - m

$F = \frac{{\Vert P_Z G\Vert}^2/(m-\ell)}{{\Vert P_W^\perp G\Vert}^2/(n-m)} \sim F_{m-\ell,n-m}$ has the Fisher

Fm−ℓ,n−m $F_{m-\ell,n-m}$ distribution. Consequently, from the classical relation between the Fisher distribution and the Beta distribution,

R2∼B(m−ℓ,n−m) $R^2 \sim {\cal B}(m-\ell, n-m)$ .

In the general situation we have to deal with $P_Z Y = P_Z \mu + \sigma P_Z G$ when $P_Z\mu \neq 0$ . In this general case one has ${\Vert P_Z Y\Vert}^2 \sim \sigma^2\chi^2_{m-\ell}(\lambda)$ , the noncentral $\chi^2$ distribution with $m-\ell$ degrees of freedom and noncentrality parameter $\boxed{\lambda=\frac{{\Vert P_Z \mu\Vert}^2}{\sigma^2}}$ , and then $\boxed{F \sim F_{m-\ell,n-m}(\lambda)}$ (noncentral Fisher distribution). This is the classical result used to compute power of $F$ -tests.

The classical relation between the Fisher distribution and the Beta distribution hold in the noncentral situation too. Finally $R^2$ has the noncentral beta distribution with "shape parameters" $m-\ell$ and $n-m$ and noncentrality parameter $\lambda$ . I think the moments are available in the literature but they possibly are highly complicated.

Finally let us write down $P_Z\mu$ . Note that $P_Z = P_W - P_U$ . One has $P_U \mu = \bar\mu 1$ when $U=[1]$ , and $P_W \mu = \mu$ . Hence $P_Z \mu =\mu - \bar\mu 1$ where here $\mu=X\beta$ for the unknown parameters vector $\beta$ .

— Stéphane Laurent
소스

PZx $P_Z x$ is the orthogoanl projection of

x $x$ on the linear subspace

Z $Z$ . And

P⊥ $P^\perp$ denotes projection on the orthogonal.

— Stéphane Laurent

Beware of

Px≠∥Px∥2 $Px \neq \Vert P x \Vert^2$ . I'm going to edit my post to write the formulas.

— Stéphane Laurent

Done - do you see any simplification ?

— Stéphane Laurent

$\bar \mu = \frac{1}{n} \sum \mu_i$

— Stéphane Laurent

Type I, obviously: type II are distributed on

$(0, \infty)$ . Actually

$R^2/(1-R^2)$ has the type II distribution. I have done the last corrections for today.

— Stéphane Laurent

R- 제곱의 조건부 기대

편집 1

EDIT2 :