1Unité de Formation et de Recherche des Sciences Appliquées à la Technologie, Laboratoire d'Etudes et de Recherches en Statistiques et Développement, Gaston Berger University, Saint Louis, Sénégal
2LERSTAD, Gaston Berger University, Saint-Louis, Senegal, Evanston Drive, NW, Calgary, Canada, T3P 0J9, Associate Researcher, LSTA, Pierre et Marie University, Paris, France, Associated Professor, African University of Sciences and Technology, Abuja, Nigeria
3Unité de Formation et de Recherche des Sciences Appliquées à la Technologie, Laboratoire d'Etudes et de Recherches en Statistiques et Développement, Gaston Berger University, Saint Louis, Sénégal
In the two previous papers of this series, the main results on the asymptotic behaviors of empirical divergence measures based on wavelets theory have been established and particularized for important families of divergence measures like Rényi and Tsallis families and for the Kullback-Leibler measures. While the proofs of the results in the second paper may be skipped, the proofs of those in paper 1 are to be thoroughly proved since they serve as a foundation to the whole structure of results. We prove them in this last paper of the series. We will also address the applicability of the results to usual distribution functions.
1. INTRODUCTION AND RECALL OF THE RESULTS TO BE PROVED
For a general introduction, we refer the reader to the ten (10) first pages in [1] in which the notation and the assumption are exposed.
Let us recall here the main results we exposed in. The first is related to the empirical process based on wavelets.
Theorem 1.1.
Given theXnn≥1, defined in Condition(8)such thatf∈B∞,∞tℝand letfndefined as Formula(13)andGn,Xwdefined as in Formula(17). Then, underAssumptions [1–3], all in [1] and for any bounded functionh, defined onD, belonging toB∞,∞tℝ, we have
σh,n−1Gn,Xwh⇝N0,1asn→∞,
where we have
σh,n2=EXKjnhX2−EXKjnhX2→VarhXasn→∞.
Proof.
Suppose that Assumptions 1 and 3, in [1], are satisfied and h∈B∞,∞tℝ.
We have
∫Dfnx−fxhxdx=ℙn,XKjnh−EXh=ℙn,X−EXKjnh+EXKjnhX−hX.
It comes that
Gn,Xwh=nℙn,X−EXKjnh+nR1,n,
where R1,n=EXKjnhX−hX.
To complete the proof, we have to show that (1)nℙn,X−EXKjnh converges in distribution to a centered normal distribution and (2)nR1,n converges to zero in probability, as n→∞. By the way, we will assume that, in the sequel, all the limits as meant as n→∞, unless the contrary is specified.
For the first point, we show that
nℙn,X−EXKjnhX⇝N0,VarhX as n→∞,
by applying the central theorem for independent randoms variables.
Let us denote Zi,n=Zi,nh=KjnhXi, μjn=EZi,n, and σi,n2=σi,nh2=VarZi,n<∞. Let
Tn=∑i=1nZi,n−μjn,
sn2=VarTn=∑i=1nσi,n2.
Tn/sn has mean 0 and variance 1; our goal is to give conditions under which
Tnsn⇝N0,1, as n→+∞.
Such conditions are given in the Lindeberg-Feller-Levy conditions (See [4]), Point B, pp. 292).
The two others main results are related to the asymptotics of class of the ϕ-divergence measures. They concern the almost sure efficiency of them.
Theorem 1.2.
UnderAssumptions [1–3], C-A, C-h, C1-ϕ, C2-ϕ, and (BD) all in [1], we have
limsupn→+∞|Jfn,g−Jf,g|an≤A1,a.s
limsupn→+∞|Jf,gn−Jf,g|bn≤A2,a.s
limsupn,m→+∞,+∞Jfn,gm−Jf,gcn,m≤A1+A2a.s
where an, bn and cn are as in Formulas (16) in [1].
Proof.
In the proofs, we will systematically use the mean values theorem. In the multivariate handling, we prefer to use the Taylor-Lagrange-Cauchy as stated in [5], page 230. The assumptions have already been set up to meet these two rules. To keep the notation simple, we introduce the two following notations:
So by applying the mean value theorem to the function u1x↦ϕu1x,gx, we have
ϕfnx,gx=ϕfx,gx+Δnfxϕ11fx+θ1xΔnfx,gx
where θ1x is some number lying between 0 and 1. In the sequel, any θi satisfies |θi|<1. By applying again the mean values theorem to the function u2x↦ϕ11u2x,gx, we have
It remains to prove that nmmVh1X+nVh2YRn,m=oℙ1. But we have by the continuity assumptions on ϕ and on its partial derivatives and by the uniform convergence of Δnfx and Δngx to zero, that
As previously, we have nan2=oℙ1, mbm2=oℙ1 and nambm=oℙ1.
From there, the conclusion is immediate.
We finish the series by this section on the applicability of our results for usual pdf's.
2. APPLICABILITY OF THE RESULTS FOR USUAL PROBABILITY LAWS
Here, we address the applicability of our results on usual distribution functions. We have seen that we need to avoid infinite and null values. For example, integrals in the Rényi's and the Tsallis family, we may encounter such problems as signaled in the first pages of paper [1]. To avoid them, we already suggested to used a modification of the considered divergence measure in the following way:
First of all, it does not make sense to compare two distributions of different supports. Comparing a pdf with support ℝ, like the Gaussian one, with another with support 0,1, like the standard uniform one, is meaningless. So, we suppose that the pdf's we are comparing have the same support D.
Next, for each ε>0, we find a domain Dε included in the common support D of f and g such that
∫Dεfxdx≤1−ε and ∫Dεgxdx≤1−ε.
And there exist two finite numbers κ1>0 and κ2>0, such that we have
κ1≤f1Dε,g1Dε≤κ2.
Besides, we choose the Dϵ's increasing to D as ϵ decreases to zero. We define the modified divergence measure
Dεf,g=Dfε,gε,
where
fε=D1−1f1Dε,gε=D2−1g1Dε,
with D1=∫Dfxdx and D2=∫Dgxdx.
Based on the remarks that the Dϵ's increases to D as ϵ decreases to zero and that the equality between f and g implies that of fε and gε, we recommend to replace the exact test of f=g by the approximated test fε=gε, for ε as small as possible.
So each application should begin by a quick look at the domain D of the two pdfs and the founding of the appropriate sub-domain Dε on which are applied the tests.
Assumption (20) also ensures that the pdf's fε and gε lie in B∞,∞t for almost all the usual laws. Actually, according to [3], page 104, we have that f∈lB∞,∞t, for some t>0, if and only if
where t stands for the integer part of the real number t, that is the greatest integer less or equal to t and fp denotes the p-th derivative function of f.
Whenever the functions fε and gε have t+1-th derivatives bounded and not vanishing on Dε, they will belong to B∞,∞t. Assumption (20) has been set on purpose for this. Once this is obtained, all the functions that are required to lie on B∞,∞t for the validity of the results, effectively are in that space. All examples we will use in this sections satisfy these conditions, including the following random variables to cite a few: Gaussian, Gamma, Hyperbolic, and so on.
3. CONCLUSION
In this last paper of this series, the main results have been proved. Wavelet theory has proved to be a good framework for processing estimates of divergence measures. We believe that having exactly the values of the scaling function will give better results in our work.
ACKNOWLEDGMENTS
The three (1 &2 &3) authors acknowledges support from the World Bank Excellence Center (CEA-MITIC) that is continuously funding his research activities from starting 2014.
TY - JOUR
AU - Amadou Diadié Bâ
AU - Gane Samb Lo
AU - Diam Bâ
PY - 2019
DA - 2019/05/23
TI - Divergence Measures Estimation and Its Asymptotic Normality Theory Using Wavelets Empirical Processes III
JO - Journal of Statistical Theory and Applications
SP - 113
EP - 122
VL - 18
IS - 2
SN - 2214-1766
UR - https://doi.org/10.2991/jsta.d.190514.002
DO - 10.2991/jsta.d.190514.002
ID - Bâ2019
ER -