Example Problems on Sufficient Statistics
ST702 Homework 1 on Sufficient Statistics
5.23
Let Ui, i=1,2,… be independent uniform(0,1) random variables, and let X have the distribution
P(X=x)=cx!, x=1,2,3,…where c=1/(e−1). Find the distribution of
Z=min{U1,…,UX}(Hint: Note that the distribution of Z|X=x is that of the first-order statistic from the sample of size x.)
P(Z>z)=∞∑x=1P(Z>z|x)P(X=x)Notice that P(Z>z|x) is the CDF of the minimum order statistic.
P(Z>z)=∞∑x=1P(Z>z|x)P(X=x)=∞∑x=1[1−PU(z)]xP(X=x)=∞∑x=1[1−z−01−0]xcx!=c∞∑x=1(1−z)xx!=c(e1−z−1)Taylor series=e1−z−1e−1for z∈[0,1]6.1
Let X be one observation from a n(0,σ2) population. Is |X| a sufficient statistic?
Notice that X2=|X|2.
f(x|σ2)=1√2πσ2e−(x−0)2/(2σ2)=1√2πσ2e−|x|2/(2σ2)⏟g(T(x)|σ2)⋅1⏟h(x)Thus, |X| is a sufficient statistic by the Factorization Theorem.
6.3
Let X1,…,Xn be a random sample from the pdf
f(x|μ,σ)=1σe−(x−μ)/σ,μ<x<∞, 0<σ<∞Since we have a random sample, our joint pdf is the product of the marginals.
f(x|μ,σ)=n∏i=1fX=n∏i=11σe−(xi−μ)/σI(μ<xi<∞)=1σe−∑(xi−μ)/σI(μ<x(1)<∞)Notice that the indicator function becomes only a function of the miminum order statistic. Thus, our sufficient statistic is (∑Xi,X(1)).
6.5
Let X1,…,Xn be independent random variables with pdfs
f(xi|θ)={12iθ−i(θ−1)<xi<i(θ+1)0otherwise,where θ>0. Find a two-dimensional sufficient statistic for θ.
Since we have an independent sample, our joint pdf is the product of the marginals.
fx=n∏i=1f(xi|θ)=n∏i=112iθI(−i(θ−1)<xi<i(θ+1))=(12θ)2n∏i=11iI(−i(θ−1)<xi)I(xi<i(θ+1))=(12θ)2n∏i=11iI(−(θ−1)<xi/i)I(xi/i<(θ+1))Notice that the first indicator is only 1 if −(θ−1)<min(xi/i) and the second indicator is only 1 if max(xi/i)<θ+1. So, our sufficient statistic is (min(xi/i), max(xi/i)).
6.13
Suppose X1 and X2 are iid observations from the pdf f(x|α)=αxα−1e−xα, x>0, α>0. Show that log(X1)/log(X2) is an ancillary statistic.
Take U=log(X1) and V=log(X2). Then,
FU(u)=P(U≤u)=P(log(X1)≤u)=P(X1≤eu)fU(u)=fx(eU)eU=α(eu)α−1e−(eu)αeu=αeuαe−euα=αeuα−euα−∞<u<∞Notice that V follows the same form. In fact, these form a scale family with rate parameter 1α. Take U=A1 and V=A2. In chapter 3, we learned that we can write Ai=1αBi where B1 and B2 are iid from the same scale family with the scale parameter set to 1. So, Bi looks like
fBi(bi)=1⋅ebi⋅1−ebi⋅1=ebi−ebi.Notice that this does not depend on α. Thus,
log(X1)log(X2)=UV=A1A2=1/α⋅B11/α⋅B2.This does not depend on α, so log(X1)log(X2) is an ancillary statistic for α.