감마 랜덤 변수의 로그 왜곡


16

고려 감마 확률 변수 . 평균, 분산 및 왜곡에 대한 깔끔한 수식이 있습니다.XΓ(α,θ)

E[X]=αθVar[X]=αθ2=1/αE[X]2Skewness[X]=2/α

이제 로그 변환 된 랜덤 변수 . Wikipedia는 평균과 분산에 대한 공식을 제공합니다.Y=log(X)

E[Y]=ψ(α)+log(θ)Var[Y]=ψ1(α)

감마 함수의 로그의 1 차 및 2 차 미분으로 정의되는 디 감마 및 트리 감마 함수를 통해.

왜도의 공식은 무엇입니까?

테트라 감마 기능이 나타 납니까?

(이것에 대해 궁금한 점은 로그 정규 분포와 감마 분포 사이의 선택 입니다. 감마 대 로그 정규 분포를 참조하십시오 . 무엇보다도 왜곡 특성이 다릅니다. 특히, 로그 정규 로그의 왜곡은 거의 0과 같습니다. 감마 로그의 왜도는 음수이지만 얼마나 음입니까? ..)


1
합니까 도움을? 아니면 이것 ?
S. Kolassa-복원 모니카

로그 감마 분포가 무엇인지 잘 모르겠습니다. lognormal이 normal과 관련되어 있기 때문에 감마와 관련이 있다면 다른 것에 대해 묻고 있습니다 ( "lognormal"은 혼란 스럽기 때문에 log (normal)가 아닌 exp (normal)의 분포이기 때문에).
amoeba는 Reinstate Monica가

1
@Glen_b : 솔직히 말하면, 정규의 지수를 "로그 정규"라고 부르는 것이 훨씬 더 일관성이없고 혼란 스럽다고 말하고 싶습니다. 불행히도, 더 확립되었습니다.
S. Kolassa-복원 모니카

2
@Stephan은 log-logistic, log-Cauchy, log-Laplace 등을 참조하십시오. 반대보다 더 명확하게 확립 된 규칙
Glen_b -Reinstate Monica

1
네; 이런 이유로이 분포와 관련하여 "log-gamma"라고 말하지 않도록주의했습니다. (나는 과거에 로그 정규에 일관된 방식으로 그것을 사용했다)
Glen_b-복지국 Monica

답변:


12

순간 발생 함수 미디엄()와이=ln엑스 그것이 간단한 대수 형식을 갖기 때문에,이 경우에 유용하다. mgf의 정의에 따라

미디엄()=이자형[이자형ln엑스]=이자형[엑스]=1Γ(α)θα0엑스α+1이자형엑스/θ엑스=θΓ(α)0와이α+1이자형와이와이=θΓ(α+)Γ(α).

당신이 준 기대와 분산을 확인합시다. 도함수를 취하면, 우리는 M'(t)=Γ(α+t)

미디엄'()=Γ'(α+)Γ(α)θ+Γ(α+)Γ(α)θln(θ)
따라서,E[Y]=ψ(0)(α)+ln(θ),
미디엄()=Γ(α+)Γ(α)θ+2Γ'(α+)Γ(α)θln(θ)+Γ(α+)Γ(α)θln2(θ).
그런 다음Var(Y)=E[Y2]E[Y]2=Γ(α)
E[Y]=ψ(0)(α)+ln(θ),E[Y2]=Γ(α)Γ(α)+2ψ(0)(α)ln(θ)+ln2(θ).
Var(Y)=E[Y2]E[Y]2=Γ(α)Γ(α)(Γ(α)Γ(α))2=ψ(1)(α).

비대칭도를 찾기 위해 (줘서 고마워요 @probabilityislogic)를 cumulant 발생 기능을 유의하는 것은 따라서 제 1 누적량은 간단히 K ' ( 0 ) = ψ ( 0 ) ( α ) + ln ( θ ) 이다. 기억해

K(t)=lnM(t)=tlnθ+lnΓ(α+t)lnΓ(α).
K(0)=ψ(0)(α)+ln(θ)ψ(n)(x)=dn+1lnΓ(x)/dxn+1, so the subsequent cumulants are K(n)(0)=ψ(n1)(α), n2. The skewness is therefore
E[(YE[Y])3]Var(Y)3/2=ψ(2)(α)[ψ(1)(α)]3/2.

As a side note, this particular distribution appeared to have been thoroughly studied by A. C. Olshen in his Transformations of the Pearson Type III Distribution, Johnson et al.'s Continuous Univariate Distributions also has a small piece about it. Check those out.


3
You should differentiate K(t)=log[M(t)]=tlog[θ]+log[Γ(α+t)]log[Γ(α)] instead of M(t) as this is the cumulant generating function - more directly related to central moments - skew=K(3)(0)=ψ(2)(α) where ψ(n)(z) is the polygamma function
probabilityislogic

1
@probabilityislogic: very good call, changed my answer
Francis

@probabilityislogic This is a great addition, thanks a lot. I just want to note, lest some readers be confused, that skewness is not directly given by the third cumulant: it's the third standardized moment, not the third central moment. Francis has it correct in his answer, but the last formula in your comment is not quite right.
amoeba says Reinstate Monica

13

I. Direct computation

Gradshteyn & Ryzhik [1] (sect 4.358, 7th ed) list explicit closed forms for

0xν1eμx(lnx)pdx
for p=2,3,4 while the p=1 case is done in 4.352 (assuming you regard expressions in Γ,ψ and ζ functions as closed form) -- from which it is definitely doable up to kurtosis; they give the integral for all p as a derivative of a gamma function so presumably it's feasible to go higher. So skewness is certainly doable but not especially "neat".

Details of the derivation of the formulas in 4.358 are in [2]. I'll quote the formulas given there since they're slightly more succinctly stated and put 4.352.1 in the same form.

Let δ=ψ(a)lnμ. Then:

0xa1eμxlnxdx=Γ(a)μa{δ}0xa1eμxln2xdx=Γ(a)μa{δ2+ζ(2,a)}0xa1eμxln3xdx=Γ(a)μa{δ3+3ζ(2,a)δ2ζ(3,a)}0xa1eμxln4xdx=Γ(a)μa{δ4+6ζ(2,a)δ28ζ(3,a)δ+3ζ2(2,a)+6ζ(4,a))}

where ζ(z,q)=n=01(n+q)z is the Hurwitz zeta function (the Riemann zeta function is the special case q=1).

Now on to the moments of the log of a gamma random variable.

Noting firstly that on the log scale the scale or rate parameter of the gamma density is merely a shift-parameter, so it has no impact on the central moments; we may take whichever one we're using to be 1.

If XGamma(α,1) then

E(logpX)=1Γ(α)0logpxxα1exdx.

We can set μ=1 in the above integral formulas, which gives us raw moments; we have E(Y), E(Y2), E(Y3), E(Y4).

Since we have eliminated μ from the above, without fear of confusion we're now free to re-use μk to represent the k-th central moment in the usual fashion. We may then obtain the central moments from the raw moments via the usual formulas.

Then we can obtain the skewness and kurtosis as μ3μ23/2 and μ4μ22.


A note on terminology

It looks like Wolfram's reference pages write the moments of this distribution (they call it ExpGamma distribution) in terms of the polygamma function.

By contrast, Chan (see below) calls this the log-gamma distribution.


II. Chan's formulas via MGF

Chan (1993) [3] gives the mgf as the very neat Γ(α+t)/Γ(α).

(A very nice derivation for this is given in Francis' answer, using the simple fact that the mgf of log(X) is just E(Xt).)

Consequently the moments have fairly simple forms. Chan gives:

E(Y)=ψ(α)

and the central moments as

E(YμY)2=ψ(α)E(YμY)3=ψ(α)E(YμY)4=ψ(α)

and so the skewness is ψ(α)/(ψ(α)3/2) and kurtosis is ψ(α)/(ψ(α)2). Presumably the earlier formulas I have above should simplify to these.

Conveniently, R offers digamma (ψ) and trigamma (ψ) functions as well as the more general polygamma function where you select the order of the derivative. (A number of other programs offer similarly convenient functions.)

Consequently we can compute the skewness and kurtosis quite directly in R:

skew.eg <- function(a) psigamma(a,2)/psigamma(a,1)^(3/2)
kurt.eg <- function(a) psigamma(a,3)/psigamma(a,1)^2

Trying a few values of a (α in the above), we reproduce the first few rows of the table at the end of Sec 2.2 in Chan [3], except that the kurtosis values in that table are supposed to be excess kurtosis, but I just calculated kurtosis by the formulas given above by Chan; these should differ by 3.

(E.g. for the log of an exponential, the table says the excess kurtosis is 2.4, but the formula for β2 is ψ(1)/ψ(1)2 ... and that is 2.4.)

Simulation confirms that as we increase sample size, the kurtosis of a log of an exponential is converging to around 5.4 not 2.4. It appears that the thesis possibly has an error.

Consequently, Chan's formulas for central moments appear to actually be the formulas for the cumulants (see the derivation in Francis' answer). This would then mean that the skewness formula was correct as is; because the second and third cumulants are equal to the second and third central moments.

Nevertheless these are particularly convenient formulas as long as we keep in mind that kurt.eg is giving excess kurtosis.

References

[1] Gradshteyn, I.S. & Ryzhik I.M. (2007), Table of Integrals, Series, and Products, 7th ed.
Academic Press, Inc.

[2] Victor H. Moll (2007)
The integrals in Gradshteyn and Ryzhik, Part 4: The gamma function
SCIENTIA Series A: Mathematical Sciences, Vol. 15, 37–46
Universidad Técnica Federico Santa María, Valparaíso, Chile
http://129.81.170.14/~vhm/FORM-PROOFS_html/final4.pdf

[3] Chan, P.S. (1993),
A statistical study of log-gamma distribution,
McMaster University (Ph.D. thesis)
https://macsphere.mcmaster.ca/bitstream/11375/6816/1/fulltext.pdf


1
Cool. Thanks a lot! According to the encyclopedia entry that Stephan linked to above, the final answer for skewness is ψ(α)/ψ(α)3/2 (which almost qualifies as "neat"!). So it seems that all the scary zetas will have to cancel out.
amoeba says Reinstate Monica

1
Sorry only just now saw your comment (I've been editing for about an hour or so); that's correct, though if the Encyclopedia gives kurtosis the way Chan gives it in his thesis, it seems that it's wrong (as given above), but readily corrected. The neat formulas appear to be for cumulants rather than standardized central moments.
Glen_b -Reinstate Monica

Yes, the Encyclopedia does give the same formula for kurtosis.
amoeba says Reinstate Monica

Hmm, I mean to refer to the things normally denoted γ1 and γ2. I will fix.
Glen_b -Reinstate Monica

2
I should probably add the note that the Hurwitz zeta function can be expressed in terms of the polygamma function, and vice versa:
ψ(n)(z)=(1)n+1Γ(n+1)ζ(n+1,z)
So, the answer to the @amoeba's question of "will the tetragamma function appear?" is YES.
J. M. is not a statistician
당사 사이트를 사용함과 동시에 당사의 쿠키 정책개인정보 보호정책을 읽고 이해하였음을 인정하는 것으로 간주합니다.
Licensed under cc by-sa 3.0 with attribution required.