Journal of Statistical Theory and Applications

Volume 19, Issue 1, March 2020, Pages 102 - 108

A New Flexible Discrete Distribution with Applications to Count Data

Authors
Anwar Hassan1, *, Gulzar Ahmad Shalbaf1, Sheikh Bilal2, Adil Rashid2
1P.G. Department of Statistics, University of Kashmir, Srinagar
2Goverenment Degree College Ganderbal
*Corresponding author. Email: anwar.hassan2007@gmail.com
Corresponding Author
Anwar Hassan
Received 4 February 2018, Accepted 17 March 2019, Available Online 2 March 2020.
DOI
10.2991/jsta.d.200224.006How to use a DOI?
Keywords
Poisson distribution; Ailamujia distribution; Count data
Abstract

In this paper we have explored a new discrete probability mass function that has been generated through compounding mechanism. This newly proposed probability mass function is essentially a mixture of Poisson and Ailamujia distribution. Furthermore the parameter estimation has also been discussed by using Maximum likelihood estimation (MLE) technique. Moreover, we have also studied some important properties of the proposed model that include factorial moments, raw moments, mean, variance, and coefficient of variation. In the end, the application and potentiality of the proposed model have been tested statistically and it has been shown that the proposed model can be employed to model a real life data set to get an adequate fit that has also corroborated through graphically.

Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

1. INTRODUCTION

In probability distribution theory there are vast numbers of discrete distributions that have specified applications but at times the observable data have distinct features that are not exhibited by these classical discrete distributions. So to overcome these limitations researchers often develop new probability distributions so that these new distributions can be employed in those situations where classical distributions are not providing any adequate fit. There are so many techniques by which we can obtain new distributions such as transmutation, compounding, etc. but compounding of distributions has received maximum attention from the last few decades due to its simplified approach. Some new compound distributions were obtained in the recent past by the numerous researchers for instance, Adil et al. [14] and Adil and Jan [58] obtained a new lifetime distributions to address issues related to count data and lifetime of series system and parallel systems.

2. MATERIALS AND METHODS

A random variable X has a Poisson distribution if for λ0, its probability mass function is

p1x=eλλxx!,  where   x=0,1,2,(1)

The factorial moment of the Poisson distribution and hence mean and variance are given by

μ[r]=λr    for   r=1,2,3,(2)
EX=λ     and     VX=λ

Also, the probability density function associated with Ailamujia random variate is

f1X=x,α=4xα2e2αx,  x0,α>0.(3)

The raw moments of Ailamujia distribution (AD) are given by

E(Xr)=0xrf1(X=x,α)dx
μr=(r+1)!2αr(4)

AD is formulated by Lv et al. [9] for several engineering application. This distribution was also studied by Pan et al. [10] for interval estimation and hypothesis testing based on sample of small size. The Bayesian estimation of AD was obtained by Long [11] under type ii censoring using the three different priors based on missing data. The minimax estimation of the parameter of Ailamujia model was evaluated by Li [12].

Usually the parameter λ in Poisson distribution is fixed constant but here we will consider a problem in which the parameter λ is itself a random variable following AD with probability density function (3).

3. DEFINITION OF PROPOSED MODEL

If X|λ follows P(λ) where λ is itself a random variable following AD then determining the distribution that results from marginalizing over λ will be known as a compound of Poisson distribution with that of AD which is called Poisson Ailamujia distribution and is denoted by PADλ,α. It may be noted here that proposed model will be a discrete since the parent distribution is discrete.

3.1. Theorem

The probability mass function of a compound of P(λ) with ADα is given by

fPAD(X=x,α)=4α2(1+x)(1+2α)x+2,x=0,1,2,,α0.

Proof:

Using the definition (3), the probability mass function of a compound of Pλ with ADα can be obtained as

fPAD(X=λ,α)=0p1(x|λ)f1(x,λ)dλfPAD(X=λ,α)=0eλλxx!4λα2e2αλdλfPAD(X=λ,α)=4α2x!0e(1+2α)λλ(x+2)1dλfPAD(X=λ,α)=4α2(1+x)(1+2α)x+2,x=0,1,2,,α0.fPAD(X=x,α)=4α2(1+x)(1+2α)x+2,x=0,1,2,,α0.(5)

3.2. Cumulative Distribution Function of the Proposed Model

FX(x)=i=xi  xPX=xi=x=0x4α2(x+1)(1+2α)x+2=4α2(1+2α)2x=0x(x+1)(1+2α)xFX(x)=1(4α+2αx+1)(1+2α)x+2

4. FACTORIAL MOMENTS OF PROPOSED MODEL

We shall obtain factorial moments of the proposed model which is very useful to study some of the most important properties such as mean, variance, standard deviation, etc.

4.1. Theorem

The factorial moments of order “r” of the proposed model is given by

μ[r]X=(r+1)!(2α)r,  X=0,1,2,,α0.

Proof:

The factorial moment of order r of Poisson distribution is

μ[r]=λr    for   r=1,2,.

Since λ itself is a random variable following AD, therefore one can obtain factorial moment of the proposed model by using the definition

μ[r]x=EλmrX|λ

Where mrX|λ are factorial moments of order “r” of a Poisson distribution.

We know that

mrX|λ=λr

Therefore

μ[r](x)=E[λr]=0λr4λα2e2αλdλ=4α20λ(r+2)1e2αλdλ
μ[r](X)=(r+1)!(2α)r,  X=0,1,2,,α0.

For r = 1, we get the mean of PADα

μ[1](x)=4α2(1+1)!(2α)1+2
μ[1](x)=4α2(2)!(2α)3
mean=μ[1](x)=1α
μ[2](x)=4α2(2+1)!(2α)2+2
μ[2](x)=32α2
μ[2](x)=μ2μ1

μ2=3+2α2α2 And hence Variance =2α+12α2 SD =2α+1α2=1α(2α+1)/2

Moreover the CV=2α+12×100

5. PARAMETER ESTIMATION

The estimation of a parameter of PAD(α) model will be discussed here in this section through maximum likelihood estimation.

5.1. Maximum Likelihood Estimation

The estimation of a parameter of PAD(α) model via maximum likelihood estimation method requires the log likelihood function of PAD(α) as

Let X1,X2,,Xn are i.i.d PAD(α). Then

L=i=1nPX=x,α
L=i=1n4α2(xi+1)(1+2α)xi+2
L=4α2ni=1n(xi+1)(1+2α)xi+2n
logL=nlog4+2nlogα+i=1nlog(xi+1)i=1nxi+2nlog(1+2α).
logLα=2nα2i=1nxi+2n(1+2α)

For estimating the unknown parameter we set the first order partial derivative equal to zero

logLα=2nα2i=1nxi+2n(1+2α)=0

Hence we get that α=1x¯

6. APPLICATION IN BIOLOGICAL SCIENCE

Genetics is the biological science which deals with the mechanisms responsible for similarities and differences among closely related species. The term “genetic” is derived from the Greek word “genesis” meaning grow into or to become. So, genetic is the study of heredity and hereditary variations. It is the study of transmission of body features that is similarities and difference, from parents to offspring's and the laws related to this transmission, any difference between individual organisms or groups of organisms of any species, caused either by genetic difference or by the effect of environmental factors, is called variation. Variation can be shown in physical appearance, metabolism, behavior, learning and mental ability, and other obvious characters. In this section the potentiality of proposed model will be justified by fitting it to the reported genetics count data sets of Catcheside et al. (Tables 1 and 2).

Fitted Distribution
Number of Aberrations Observed Frequency Poisson PAD
0 268 231.3 246.1
1 87 126.1 106.1
2 26 34.7 34.3
3 9 6.30.80.10.10.1} 9.8
4 4 2.60.60.10.04}
5 2
6 1
7+ 3
Total 400 400 400
Chi-square estimate 40.8 10.05
Parameter estimation θ^=0.55 α^=1.82
p-Value <.0001 .018

PAD, Poisson Ailamujia Distribution.

Table 1

Distribution of number of Chromatid aberrations (0.2 g chinon 1, 24 hours).

Fitted Distribution
Class/Exposure (μg/kg) Observed Frequency Poisson PAD
0 200 172.5 184.6
1 57 95.4 79.5
2 30 26.4 25.7
3 7 7.31.9.50.1} 4.90.70.10.0}
4 4
5 0
6 2
Total 300 300 300
Parameter estimation θ^=0.55 α^=1.82
Chi-square estimate 29.6 9.4
p-Value <.0001 .09

PAD, Poisson Ailamujia Distribution.

Table 2

Mammalian cytogenetic dosimetry lesions in rabbit lymphoblast induced by Streptonigrin (NSC-45383) Exposure −70 μg/kg.

7. CONCLUSIONS

In this paper we have proposed a new model by compounding Poison distribution with Ailamujia Distribution (AD) and it has been shown that proposed model can be nested to different compound distributions. Furthermore, we have derived several properties of proposed model such as factorial moments, mean, variance. In addition to this parameter estimation of the proposed model has been discussed by means of method of moments and MLE. Finally, the application of the proposed model have been explored in genetics, based on the goodness of fit test and it has been shown that our model offers a better fit as compared to classical Poisson distribution. We hope that the proposed model will serve as an alternative to various models available in literature.

REFERENCES

5.R. Adil and T.R. Jan, J. Mod. Appl. Stat. Methods, Vol. 9, 2014, pp. 17-24.
7.R. Adil and T.R. Jan, Int. J. Math. Stat., Vol. 17, 2016, pp. 23-38.
8.R. Adil and T.R. Jan, J. Reliab. Stat. Stud., Vol. 6, 2013, pp. 11-19.
9.H.Q. Lv, L.H. Gao, and C.L. Chen, J. Acad. Armed Force Eng., Vol. 16, 2002, pp. 48-52.
10.G.T. Pan, B.H. Wang, C.L. Chen, Y.B. Hang, and M.T. Dang, Appl. Stat. Manag., Vol. 28, 2009, pp. 468-472.
11.B. Long, Math. Pract. Theor., 2015, pp. 186-192.
Journal
Journal of Statistical Theory and Applications
Volume-Issue
19 - 1
Pages
102 - 108
Publication Date
2020/03/02
ISSN (Online)
2214-1766
ISSN (Print)
1538-7887
DOI
10.2991/jsta.d.200224.006How to use a DOI?
Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Anwar Hassan
AU  - Gulzar Ahmad Shalbaf
AU  - Sheikh Bilal
AU  - Adil Rashid
PY  - 2020
DA  - 2020/03/02
TI  - A New Flexible Discrete Distribution with Applications to Count Data
JO  - Journal of Statistical Theory and Applications
SP  - 102
EP  - 108
VL  - 19
IS  - 1
SN  - 2214-1766
UR  - https://doi.org/10.2991/jsta.d.200224.006
DO  - 10.2991/jsta.d.200224.006
ID  - Hassan2020
ER  -