Journal of Statistical Theory and Applications

Volume 18, Issue 3, September 2019, Pages 303 - 308

Characterization of Exponential Distribution Through Normalized Spacing of Generalized Order Statistics

Authors
M. J. S. Khan*, S. Iqrar, M. Faizan
Department of Statistics and Operations Research, Aligarh Muslim University, Aligarh-202 002, India
*Corresponding author. Email: jahangirskhan@gmail.com
Corresponding Author
M. J. S. Khan
Received 19 October 2017, Accepted 24 December 2018, Available Online 2 September 2019.
DOI
10.2991/jsta.d.190818.004How to use a DOI?
Keywords
Continuous distribution; Generalized order statistics; Meijer's G-function; Characterization
Abstract

In this paper, exponential distribution is characterized by normalized spacing of generalized order statistics (gos) using Meijer's G-function. While the necessary part of the theorem was given by U. Kamps, E. Cramer, Statistics. 35 (2001), 269–280, we have given an easy proof of sufficient part in this paper. This paper contains the result of characterization of exponential distribution through normalized spacing of order statistics, sequential order statistics, progressive type II censoring and record values. Also, by simulation study, we have shown that the confidence interval based on upper records is shorter in length than asymptotic confidence interval constructed by maximum likelihood estimate.

Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

1. INTRODUCTION

Ordered random variables like order statistics, record values, sequential order statistics and progressive type II censored order statistics frequently arise in real life situations. For application of ordered random variables, one may refer to [1], to [2] for order statistics, to [3] and [4] for record values and to [5] for progressive type II censored order statistics and reference cited therein. The concept of generalized order statistics (gos) was first introduced by [6]. Goss includes various important models related to ordered random variables like order statistics, record values, sequential order statistics, order statistics with non-integral sample size and progressive type II censored order statistics etc.

Let X1,X2,,Xn be a sequence of independent and identically distributed (iid) random variables (rv) with the distribution function (df) F(x) and the probability density function (pdf) f(x). Let nN, k>0, m~=(m1,m2,,mn1)Rn1 and Mr=j=rn1mj, such that γr=k+nr+Mr>0 for all r{1,2,,n1}, then X(1,n,m~,k),X(2,n,m~,k),,X(n,n,m~,k) are said to be the gos if their joint pdf is given by

kj=1n1γji=1n1[1F(xi)]mif(xi)[1F(xn)]k1f(xn)
on the cone F1(0+)<x1x2xn<F1(1) of Rn. By adjusting the parameters of gos, one may get almost all important models related to ordered random variables. For example, at m=0,k=1i.e.γi=ni+1, we get ordinary order statistics. At m=1,kNi.e.γi=k, we shall get kth record values. Similarly sequential order statistics (γi=(ni+1)βi;β1,β2,,βn>0), order statistics with non-integral sample size (γi=βi+1;β>0), Pfeifer record values (γi=βi;β1,β2,,βn>0) and progressive type II censored order statistics (N,nN,N=n+j=1nRj,RjN0 and γj=Nν=1j1Rνj+1,1jn) can be obtained as specific cases of gos.

Let PF stands for the probability measure on R determined by F(x). Define α=inf{xR:F(x)>0} and β=sup{xR:F(x)<1}, then the pdf of X(r,n,m~,k) with respect to a measure PF can be expressed as (see [7]):

fr(x)=cr1Gr[F¯(x)|γ1,,γr]I(α,β)(x),
where cr1=j=1rγj,F¯(x)=1F(x) and IA denotes the indicator function of the set A. Further Gr(x)=Gr,rr,0(x|γ1,,γr)=Gr,rr,0xγ11,,γr1γ1,,γr is the particular Meijer's G-function defined by
Gr,rr,0xγ11,,γr1γ1,,γr=12πiLxzj=1r(γi1z)dz,| x |<1
and L is an appropriately chosen contour of integration. For the definition of G-function and its numerous properties and applications, one may refer to [8](Chapter 2). The joint PFPF density of X(r,n,m~,k) and X(s,n,m~,k), 1r<sn is given by [7]
fr,s(x,y)=cs1GsrF¯(y)F¯(x)|γr+1,,γsGr[F¯(x)|γ1,,γr]F¯(x)IA1(x,y),
where the set A1 is defined as A1={(x,y)R2:α<x<y<β}.

Suppose X1,X2,,Xn are iid random variables from exponential distribution with scale parameter θ, i.e., F(x)=1exθ,x>0,θ>0 and X1:nX2:nXn:n are the corresponding orders statistics. The normalized spacing of order statistics (when the random variables are iid and from exponential distribution) was introduced by [9] as: D1:n=nX1:n and Dr:n=(nr+1)(Xr:nXr1:n),2rn. [9] proved that Dr:n(1rn) are iid from exponential distribution with scale parameter θ. Utilizing the above transformation, it can be shown that Xs:nXr:n and Xsr:nr are identically distributed across all 1r<sn. Based on order statistics, the characterization of exponential distribution through normalized spacing was discussed by [1013] and [14]. The related results of characterization of exponential distribution for record values through spacing was established by [11,13,15,16] and [14]. [17] has shown that using linear function of spacings instead of linear function of order statistics is more suitable for theoretical and applied purposes. Normalized spacing for gos was given by [6] (p. 81). Suppose X1,X2,,Xn be the iid random variables from exponential distribution and X(1,n,m~,k),X(2,n,m~,k),X(n,n,m~,k) be the corresponding goss. [6] has shown that D(1,n,m~,k)=γ1X(1,n,m~,k) and D(r,n,m~,k)=γr[X(r,n,m~,k)X(r1,n,m~,k)],2rn are iid from exponential distribution. Further, [18] and [7] characterized the exponential distribution using normalized spacing of goss based on increasing failure rate and decreasing failure rate distributions. [19] characterized Weibull and Uniform distributions using record values.

In this paper, we have extended the result of [16] and [14] and characterized the exponential distribution using normalized spacing of goss. While the necessary part of the main theorem was given by [20], in this paper, we have given a simple and easy proof for the sufficient part of the theorem. The paper is divided into four sections. In Section 2, based on gos, we have characterized the exponential distribution. Section 3 contains a simulation study where we have shown that confidence interval based on upper records is shorter in length than asymptotic confidence interval constructed by maximum likelihood estimate. Concluding remark focusing on the use of the result given in Section 2, is contained in Section 4.

2 CHARACTERIZATION OF EXPONENTIAL DISTRIBUTION

In this section, exponential distribution has been characterized through the normalized spacing based on goss.

The following two Lemmas are given, which have been used in the proof of the theorem. Proof of the Lemmas are straightforward and hence omitted.

Lemma 2.1.

Let X(r,n,m~,k) be the rth gos with pdf defined in (2), then

1cr1=αβGr[F¯(x)|γ1,,γr]f(x)dx.

Lemma 2.2.

Let X(sr,nr,μ~,k) andX(sr1,nr,μ~,k) be the two adjacent gos and μ~=(mr+1,,mn1)ϵRnr1, then

F¯sr,nr,μ~,k(u)F¯sr1,nr,μ~,k(u)=cs2cr1Gsr[F¯(u)|γr+1,,γs]F¯(u).

Theorem 2.1.

Let X1,X2,,Xn be n non-negative iid random variables with absolutely continuous df (with respect to Lebesgue measure) F(x) andpdf f(x), where F(x) is strictly increasing over the support (0,), then for 1r<s1<n,

X(l,n,m~,k)X(r,n,m~,k)=dX(lr,n,μ~,k),l=s1,s
if and only if
f(x)=1exθ,θ>0.

Proof.

The necessary part was established by [20] as follows: X(l,n,m~,k)X(r,n,m~,k)=j=r+1l(zjγj),l=s1,s, where zjexp(θ).

This implies that X(l,n,m~,k)X(r,n,m~,k)=dX(lr,nr,μ~,k),l=s1,s, where the symbol =d represents equal in distribution.

To prove the sufficient part, we have for any positive and finite u

P[X(s,n,m~,k)X(r,n,m~,k)u]=cs10x+uGsr[F¯(y)F¯(x)|γr+1,,γs]f(y)F¯(x)Gr[F¯(x)|γ1,,γr]f(x)dydx.

Let

A(x,y)=GsrF¯(y)F¯(x)γr+1,,γsf(y)F¯(x).

Integrating A(x,y) w.r.t, y over (x+u,), we have

x+uA(x,y)dy=x+uGsrF¯(y)F¯(x)γr+1,,γsf(y)F¯(x)dy=GsrF¯(y)F¯(x)γr+1,,γsF¯(y)F¯(x)(x+u)+x+uyGsrF¯(y)F¯(x)γr+1,,γsF¯(y)F¯(x)dy.

Now, we know from [8] (p. 130), that

xaGr(x|γ1,,γr)=Gr(x|γ1+a,,γr+a),aϵR.

Using (11) in (10), we have

A(x,y)=GsrF¯(y)F¯(x)γr+1,,γs(x+u)+x+uyGsrF¯(y)F¯(x)γr+1,,γsF¯(y)F¯(x)dy,
where γr+i=γr+i+1,i=1,2,,sr.

Let γ1:r...γr:r denote the ordered parameters of γ1,...,γr,r2, then

limx1Gr(x|γ1,,γr)=limx0+Gr(x|γ1,,γr)=0,ifγ1:r>1.
[21] and
ddxGr(x|γ1,,γr)=1x[(γr1)Gr(x|γ1,,γr)Gr1(x|γ1,,γr1)].
[8] (p. 94)

Using (13) and (14) in (12), we get

A(x,y)=GsrF¯(x+u)F¯(x)γr+1,,γs+x+uGsr1F¯(y)F¯(x)γr+1,,γsf(y)F¯(x)dy(γs1)x+uGsrF¯(y)F¯(x)γr+1,,γsf(y)F¯(x)dy.

This implies that

P[X(s,n,m~,k)X(r,n,m~,k)u]  =cs10GsrF¯(x+u)F¯(x)γr+1,,γsGr[F¯(x)|γ1,,γr]f(x)dx   +cs10x+uGsr1F¯(y)F¯(x)|γr+1,,γs1f(y)F¯(x)Gr[F¯(x)|γ1,,γr]f(x)dydx   (γs1)cs10x+uGsrF¯(y)F¯(x)|γr+1,,γsf(y)F¯(x)Gr[F¯(x)|γ1,,γr]f(x)dydx
=cs10GsrF¯(x+u)F¯(x)γr+1,,γsGr[F¯(x)|γ1,,γr]f(x)dx+γsP[X(s1,n,m~,k)X(r,n,m~,k)u](γs1)P[X(s,n,m~,k)X(r,n,m~,k)u]
that is
P[X(s,n,m~,k)X(r,n,m~,k)u]P[X(s1,n,m~,k)X(r,n,m~,k)u]  =cs20GsrF¯(x+u)F¯(x)|γr+1,,γsGr[F¯(x)|γ1,,γr]f(x)dx.

Now in view of Lemma 2.1 together with Lemma 2.2, we have

P[Xsr,nr,μ~,k]P[X(sr1,nr,μ~,k)u]  =cs2Gsr[F¯(u)|γr+1,,γs]0Gr[F¯(x)|γ1,,γr]f(x)dx.
(16) and (17) will be equal if and only if
cs20GsrF¯(x+u)F¯(x)|γr+1,,γsGsr[F¯(u)|γr+1,,γs]   ×Gr[F¯(x)|γ1,,γr]f(x)dx=0.

Since both Gr[F¯(x)|γ1,,γr] and f(x) are positive, using the generalization of Muntz-Swartz theorem (see [22]), the above integral will be zero only if

GsrF¯(x+u)F¯(x)|γr+1,,γsGsr[F¯(u)|γr+1,,γs]=0.

For s>r+1, the function γr+1:s>1, GsrF¯(x+u)F¯(x)|γr+1,,γs is unimodal, strictly increasing up to mode and then strictly decreasing. Further, the value of mode only depends upon the parameter γr+1,,γs, hence for the functions

GsrF¯(x+u)F¯(x)|γr+1,,γs
and
GsrF¯(u)|γr+1,,γs
mode occurs at the same point (see [21]; Theorem 2.1). The value of GsrF¯(x+u)F¯(x)|γr+1,,γs and GsrF¯(u)|γr+1,,γs are equal only at the mode. At mode, the random variable X satisfy F¯(x+u)F¯(x)=F¯(u), which imply that X satisfies the memoryless property. Only continuous distribution over the support (0,), which satisfies the memoryless property is exponential distribution and hence the theorem.

3 SIMULATION STUDY

Here we have followed the procedure adopted by [19] and carried out the simulation to construct the confidence interval for scale parameter θ of exponential distribution.

Let X1,...,Xn is a random sample from pdf f(x)=1θexθ,x>0,θ>0. Based on maximum likelihood estimator of exponential distribution, 100(1α)%,0<α<1, asymptotic confidence interval (CIAsymptotic) for θ is given by

CIAsymptotic=X¯1+Zα2n,X¯1Zα2n
where X¯ is sample mean of random variables and Zα2 is a positive constant satisfying the relation Φ(Zα2)Φ(Zα2)=1α and Φ(.) is the cumulative distribution function of the standard normal distribution. Using Theorem 2.1, for record statistics, we have XU(n)XU(m)d__XU(nm). As Y=XU(nm)G(nm,θ), based on Y, a 100(1α)% confidence interval for θ is given by
CI=2Yχ2n,1α22,2Yχ2n,α22.

To construct these confidence intervals we simulate 10,000 samples of size n from (8) for n=4,5,...,100,m=1 and θ=0.2,0.5,1,2. For each sample, we calculate the limits of confidence interval given in (20) and (21). The plot of coverage lengths versus n=4,5,...,100, is shown in Figs. 14. From the figures, it can be seen that the confidence interval obtained from (21) is smaller than the confidence interval obtained from (20) for n60. Thus, we get an improved estimator based on XU(nm).

Figure 1

Plot of confidence interval for θ = 0.2.

Figure 2

Plot of confidence interval for θ = 0.5.

Figure 3

Plot of confidence interval for θ = 1.

Figure 4

Plot of confidence interval for θ = 2.

4 CONCLUDING REMARK

In real life, a statistician is often interested in guessing the probability distribution from which the true data is obtained. Characterization problem is a theoretical approach to obtain the distribution function if certain conditions are fulfilled. A probability distribution can be characterized in many ways and the method under study here is one of them. Further this result may be used to ease statistical computation. For example, if we are computing moments of difference between two gos (order statistics, records, sequential order statistics, progressive type II censored order statistics), when the random variables X1,X2,,Xn follow exponential distribution, then we may utilize the result of Theorem 2.1. This result may also be utilized for calculating the sample quasi range, when the random variables X1,...,Xn are iid having pdf f(x)=1θexθ,x>0,θ>0. Various statistical properties of quasi range can be established using Theorem 2.1. Sample quasi range is used in estimating the population standard deviation if the sample size n is small (see [23]). [24] used the sample quasi range in setting confidence intervals for the population standard deviation. Further, [25] and [26] have used the sample quasi range in estimating the scale parameter for a location scale family of distribution, if the parent distribution is symmetric about the location parameter. Sample quasi range and range are also extensively used in quality control.

REFERENCES

1.N. Balakrishnan and A.C. Cohen, Order Statsitics and Inference, Academic Press, Boston, 1991.
4.M. Ahsanullah, Record Values - Theory and Applications, University Press of America, Lanham, 2004.
8.A. M. Mathai, A Handbook of Generalized Special Functions for Statistical and Physical Sciences, Oxford Science Publications, New York, 1993.
11.M. Iwińska, Fasc. Math., Vol. 16, 1986, pp. 101-107.
14.M.J.S. Khan, M. Faizan, and S. Iqrar, Align. J. Stat., Vol. 36, 2016, pp. 57-62.
25.P.Y. Thomas, J. Indian Soc. Agric. Stat., Vol. 42, 1990, pp. 250-256.
Journal
Journal of Statistical Theory and Applications
Volume-Issue
18 - 3
Pages
303 - 308
Publication Date
2019/09/02
ISSN (Online)
2214-1766
ISSN (Print)
1538-7887
DOI
10.2991/jsta.d.190818.004How to use a DOI?
Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - M. J. S. Khan
AU  - S. Iqrar
AU  - M. Faizan
PY  - 2019
DA  - 2019/09/02
TI  - Characterization of Exponential Distribution Through Normalized Spacing of Generalized Order Statistics
JO  - Journal of Statistical Theory and Applications
SP  - 303
EP  - 308
VL  - 18
IS  - 3
SN  - 2214-1766
UR  - https://doi.org/10.2991/jsta.d.190818.004
DO  - 10.2991/jsta.d.190818.004
ID  - Khan2019
ER  -