A Quasi Poisson-Rama Distribution with Properties and Applications

Rama Shanker, Riki Tabassum and Jyotirmoyee Baishya*

Department of Statistics, Assam University, Silchar, Assam, India
E-mail: shankerrama2009@gmail.com; rikitabassum18@gmail.com; jyotirmoyeebaishya790@gmail.com
*Corresponding Author

Received 03 November 2025; Accepted 16 April 2026

Abstract

A quasi Poisson-Rama distribution has been introduced which is the Poisson compound of quasi Rama distribution. The moments based descriptive properties have been discussed. The proposed distribution is unimodal, has increasing hazard rate and over-dispersed. The estimation of parameters has been studied using the method of moments and the method of maximum likelihood. A simulation study has been done to test the performance of maximum likelihood estimators of parameters. Finally, the goodness of fit of the proposed distribution has been discussed with two discrete datasets, the first from biological sciences and the second from thunderstorms and compared with the goodness of fit of Poisson-Lindley distribution, Poisson-Akash distribution, Poisson-Sujatha distribution, Poisson-Rama distribution, quasi Poisson-Lindley distribution, quasi Poisson-Akash distribution and quasi Poisson-Sujatha distribution.

Keywords: Quasi Rama distribution, compounding, mathematical and statistical properties, estimation of parameters, simulation, applications.

Abbreviations

PD Poisson Distribution
NBD Negative Binomial Distribution
PLD Poisson-Lindley Distribution
PShD Poisson-Shanker Distribution
PAD Poisson-Akash Distribution
PSD Poisson-Sujatha Distribution
RD Rama Distribution
PDF Probability Density Function
QRD Quasi Rama Distribution
PRD Poisson Rama Distribution
PMF Probability Mass Function
QPRD Quasi Poisson-Rama Distribution
QPLD Quasi Poisson-Lindley Distribution
QPAD Quasi Poisson-Akash Distribution
QPSD Quasi Poisson-Sujatha Distribution
IHR Increasing Hazard Rate
CV Coefficient of Variation
CS Coefficient of Skewness
CK Coefficient of Kurtosis
ID Index of Dispersion
CDF Cumulative Distribution Function
MLE Maximum Likelihood Estimator
MSE Mean Square Error

1 Introduction

The PD is one of the primary discrete distributions for modeling equi-dispersed count data, where the mean equals the variance. In general, count data can be found in many areas of knowledge, such as biology, insurance, health, agriculture, engineering, etc. In real-world scenarios, it has been noted that the majority of stochastic datasets are either under-dispersed or over-dispersed. Numerous statistical methods, including weighted distributions and mixtures of distributions are suggested to address the over-dispersed count data. In recent decades, some researchers have attempted to create a one-parameter over-dispersed discrete distribution by compounding the PD with one parameter continuous lifetime distributions that are positively skewed. The chief characteristics of the Poisson mixture of positively skewed lifetime distribution is that the resultant distribution follows some characteristics of its mixing distribution.

During recent decades several over-dispersed count distributions have been introduced in statistics literature including NBD which is the PD compound of gamma distribution, PLD [1] which is the Poisson compound of Lindley distribution [2], PShD [3] which is the PD compound of Shanker distribution [4], PAD [5] which is the PD compound of Akash distribution [6] and PSD [7] which is the PD compound of Sujatha distribution [8]. A discrete quasi Akash distribution using discretization technique has been proposed by [9].

RD [10] is defined by its PDF

f(x;θ)=θ4θ3+6(1+x3)eθx;x>0,θ>0.

Various statistical and reliability properties along with applications of RD have been discussed in [10]. A lot of works have been done in a short period of time on RD and its modifications. Two-parameter RD has been introduced where its properties and applications have also been discussed in [11]. The power version of RD and the weighted version of RD have been proposed by [12] and [13], respectively. Further extensions of RD include exponentiated RD by [14], extended RD by [15], new generalization of RD by [16], new variant of RD to model blood cancer data by [17], QRD by [18], wrapped Rama distribution by [19], some among others. A simple generalization of QRD has been proposed by [20].

Assuming the parameter λ of the PD following RD, the PD compound of RD, named Poisson-Rama distribution (PRD) has been derived and studied by [21]. The PRD is defined by its pmf.

P(X=x)=θ4θ3+6x3+6x2+11x+(θ3+3θ2+3θ+7)(θ+1)x+4;
x=0,1,2,,θ>0

The descriptive properties based on moments, pmf and other statistical and reliability properties along with applications of PRD have been discussed in [21].

The QRD by [18] is defined by its PDF

f(x;θ,α)=θ3αθ2+6(α+θx3)eθx;x>0,θ>0,α>0

At α=θ, it reduces to RD. Descriptive statistical properties and applications of QRD are available in [18]. The weighted quasi Rama distribution (WQRD) and the power quasi Rama distribution (PQRD) have been proposed by [22] and [23], respectively.

The main motivation to propose QPRD which is the Poisson mixture of QRD is that it is highly flexible and applicable to over dispersed real-world count data. The interesting feature of QPRD is that it has the potential to handle a wide range of over-dispersed count data patterns and thus its versatility makes it useful for over-dispersed complex datasets commonly arising in the field of biomedical sciences and environmental studies. Although several Poisson mixture distributions are available in statistics literature but the main advantage of proposing QPRD for overdispersed data is that it allows the modeling of more extreme skewness than the standard one parameter PLD, PAD, PSD and PRD and two-parameter QPLD, QPAD and QPSD. The key difference between PRD and QPRD are that (i) PRD is of one parameter one parameter and QPRD is of two parameter and hence QPRD provides better fit in analysing real-world count datasets, where the over-dispersion is high often outperforming PRD in biological, medical and social sciences modeling. (ii) QPRD is a generalization of PRD and hence serves as a more flexible model for situations where PRD cannot sufficiently explain the variance in the dataset.

Various mathematical and statistical properties and estimation of parameters of QPRD have been discussed. Simulation study has been presented. The goodness of fit of QPRD has been compared with other one parameter and two-parameter distributions including PLD, PAD, PSD, PRD, QPLD [24], QPAD [25] and QPSD [26].

2 A Quasi Poisson-Rama Probability Model

Definition 1: The random variable X is said to have QPRD if it follows the stochastic representation

XλPD(λ)and(λθ,α)QRD(θ,α)forλ>0,θ>0,α>0.

The unconditional distribution of the stochastic representation is denoted as QPRD(θ,α).

Theorem 1: If XQPRD(θ,α), then the PMF of X is given by

P(x;θ,α)=θ3αθ2+6θ(x3+6x2+11x+6)+α(θ+1)3(θ+1)x+4;
x=0,1,2,,(θ,α)>0

Proof: If XλPD(λ) and λ(θ,α)QRD(θ,α), then the PMF of X can be derived as

P(x;θ,α)=P(X=x)=0P(X=xλ)f(λθ,α)dλ

where f(λθ,α) is the QRD with parameter θ and α.

We have

P(x;θ,α) =0eλλxx!θ3αθ2+6(α+θλ3)eθλdλ
=θ3(αθ2+6)x![α0e(θ+1)λλxdλ+θ0e(θ+1)λλx+3dλ] (1)
=θ3(αθ2+6)x![αΓ(x+1)(θ+1)x+1+θΓ(x+4)(θ+1)x+4]
=θ3αθ2+6θ(x3+6x2+11x+6)+α(θ+1)3(θ+1)x+4;
x=0,1,2,,(θ,α)>0 (2)

Since this is the PD compound with QRD, we would call this probability model as QPRD. Also, at α=θ,QPRD reduces to PRD. The PMF plot of QPRD for different values of parameters θ and α are presented in Figure 1.

images

Figure 1 PMF of QPRD.

Theorem 2: The QPRD has IHR and is unimodal.

Proof: Since

Q(x;θ,α) =P(x+1;θ,α)P(x;θ,α)
=1θ+1[1+θ(3x2+15x+18)θ(x3+6x2+11x+6)+α(θ+1)3]

is decreasing function of x for a given θ and α,P(x;θ,α) is log-concave. Using the result about log-concave that log-concave functions are unimodal and has IHR from [27], we conclude that QPRD has an IHR and is unimodal.

Theorem 3: The P(x;θ,α) of QPRD is decreasing function of x for given θ and α.

Proof: We have

P(x+1;α,θ)P(x;α,θ)=1θ+1[1+θ(3x2+15x+18)θ(x3+6x2+11x+6)+α(θ+1)3]
x=0,1,2,3,

So for sufficiently large value of x,P(x+1;α,θ)P(x;α,θ)<1. In fact from and after that stage for every x, we have P(x+1;α,θ)<P(x;α,θ). Therefore, P(x;α,θ) is decreasing function of x from and after a certain stage.

Theorem 4: The pmfP(x;θ,α) of QPRD is log-concave and unimodel.

Proof: From [28, 29] and [30], a distribution having PMF P(x;θ) is log-concave if

[P(x+1;θ)]2>P(x;θ)P(x+2;θ)for xN (3)

Now for θ>0,α>0, it can be easily shown that the PMF given in (2)

[P(x+1;θ)]2P(x;θ)P(x+2;θ)>0for all xN.

Therefore, Equation (3) is satisfied for the pmf of QPRD given in (2). Again, using the results of [31] which states that log-concave pmf are strongly unimodal, the QPRD is also unimodal.

Theorem 5: The QPRD is a two-component mixture of NBD and can be expressed as

P(x;θ,α)=pP1(x;θ)+(1p)P2(x;θ),

where Pi(x;θ) is the PMF of NBD with parameters the number of successes i and p=αθ2αθ2+6 with

P1(x;θ)=θ(θ+1)x+1asNBD(1,θθ+1)

and

P2(x;θ)=(x+1)(x+2)(x+3)θ46(θ+1)x+4

as the NBD (4,θθ+1) respectively.

Proof: We have

P(x;θ,α) =0eλλxx!θ3αθ2+6(α+θλ3)eθλdλ
=0eλλxx!{αθ2αθ2+6(θeθλ)
+6αθ2+6(θ4Γ(4)eθλλ3)}dλ
=αθ2αθ2+6[θx!Γ(x+1)(θ+1)x+1]
+6αθ2+6[θ4x!Γ(4)Γ(x+4)(θ+1)x+4]
=αθ2αθ2+6[NBD(1,θθ+1)]+6αθ2+6[NBD(4,θθ+1)]
=pP1(x;θ)+(1p)P2(x;θ)

3 Moments Based Descriptive Statistics

Theorem 6: The rth factorial moment about origin μ(r) of QPRD is given by

μ(r)=r!{αθ2+(r+1)(r+2)(r+3)}θr(αθ2+6);r=1,2,3,

Proof: Using (2), μ(r) can be obtained as

μ(r) =E[E(X(r)λ)]
=0[x=0x(r)eλλxx!]θ3αθ2+6(α+θλ3)eθλdλ
=0[λrx=reλλxr(xr)!]θ3αθ2+6(α+θλ3)eθλdλ
=r!{αθ2+(r+1)(r+2)(r+3)}θr(αθ2+6);r=1,2,3,

The first four factorial moments of QPRD is thus given by

μ(1) =αθ2+24θ(αθ2+6),μ(2)=2(αθ2+60)θ2(αθ2+6)
μ(3) =6(αθ2+120)θ3(αθ2+6),μ(4)=24(αθ2+210)θ4(αθ2+6)

The first four moments about origin of the QPRD are given by

μ1=αθ2+24θ(αθ2+6)
μ2=αθ3+2αθ2+24θ+120θ2(αθ2+6)
μ3=αθ4+6αθ3+6αθ2+24θ2+360θ+720θ3(αθ2+6)
μ4=αθ5+14αθ4+36αθ3+24αθ2+24θ3+840θ2+4320θ+5040θ4(αθ2+6)

The moments about mean of QPRD can thus be obtained as

μ2=α2θ5+α2θ4+30αθ3+84αθ2+144θ+144θ2(αθ2+6)2
μ3={α3θ8+3α3θ7+2α3θ6+36α2θ6+270α2θ5+396α2θ4+ 324αθ4+1944αθ3+648αθ2+864θ2+2592θ+1728}θ3(αθ2+6)3
μ4={α4θ11+10α4θ10+18α4θ9+15α4θ8+42α3θ9+852α3θ8+ 3132α3θ7+3384α3θ6+540α2θ7+1180α2θ6+ 34992α2θ5+41472α2θ4+2808αθ5+59184αθ4+ 132192αθ3+425088αθ2+5184θ3+98496θ2+186624θ+ 2083968}θ4(αθ2+6)4

The descriptive constants based on moments namely CV, CS, CK and the ID of QPRD are given by

CV=μ2μ1=α2θ5+α2θ4+30αθ3+84αθ2+144θ+144αθ2+24
CS=μ3μ23/2={α3θ8+3α3θ7+2α3θ6+36α2θ6+270α2θ5+ 396α2θ4+324αθ4+1944αθ3+648αθ2+864θ2+ 2592θ+1728}(α2θ5+α2θ4+30αθ3+84αθ2+144θ+144)3/2
CK=μ4μ22={α4θ11+10α4θ10+18α4θ9+15α4θ8+42α3θ9+ 852α3θ8+3132α3θ7+3384α3θ6+540α2θ7+ 1180α2θ6+34992α2θ5+41472α2θ4+2808αθ5+ 59184αθ4+132192αθ3+425088αθ2+5184θ3+ 98496θ2+186624θ+2083968}(α2θ5+α2θ4+30αθ3+84αθ2+144θ+144)2
ID=μ2μ1=α2θ5+α2θ4+30αθ3+84αθ2+144θ+144θ(αθ2+6)(αθ2+24)

Behaviour of CV, CS, CK and ID of QPRD for various values of parameters are shown in Figure 2.

images

Figure 2 CV, CS, CK and ID of QPRD.

CV increases for increasing values of α and fixed θ while decreases for increasing values of θ and fixed α. CS increases for increasing values of α and θ and the parameter α has more influence on CS than the parameter θ. CK is highest when both the parameter α and θ is smaller which means that the distribution has heavier tails and sharper peaks in that region. As either parameter increases the kurtosis drops sharply that leads to a flatter and more uniform distribution. For higher values of the parameters α and θ, ID increases which means that α and θ increases, the variance increases more than the mean, resulting into over-dispersion.

Theorem 7: The QPRD is over-dispersed, that is, μ2>μ1.

Proof: We have

μ2 =α2θ5+α2θ4+30αθ3+84αθ2+144θ+144θ2(αθ2+6)2
=αθ2+24θ(αθ2+6)[α2θ5+α2θ4+30αθ3+84αθ2+144θ+144θ(αθ2+6)(αθ2+24)]
=μ1[(α2θ5+30αθ3+144θ)+(α2θ4+84αθ2+144)α2θ5+30αθ3+144θ]
=μ1[1+α2θ4+84αθ2+144θ(αθ2+6)(αθ2+24)].

This gives μ2>μ1.

4 Reliability Properties

In this section, reliability properties of QPRD including survival function, hazard function, reverse hazard function and Mill’s ratio have been discussed.

The cdf of QPRD can be derived as

F(x) =F(x;θ,α)=P(Xx)=1P(Xx+1)
=1t=x+10P(X=tλ)f(λ;θ,α)dλ
=1t=x+10eλλtt!θ3αθ2+6(α+θλ3)eθλdλ
=1t=x+1θ3(αθ2+6)[α(θ+1)3+θ(t+1)(t+2)(t+3)(θ+1)t+4]
=1[θ3x3+(9θ3+3θ2)x2+(26θ3+21θ2+6θ)x]+{24θ3+36θ2+14θ+αθ2(θ+1)3}](αθ2+6)(θ+1)x+4.

The survival function of QPRD can thus be given by

S(x)=S(x;θ,α)=[θ3x3+(9θ3+3θ2)x2+(26θ3+21θ2+6θ)x+{24θ3+36θ2+14θ+αθ2(θ+1)3}](αθ2+6)(θ+1)x+4.

The hazard function of QPRD can be expressed as

h(x) =h(x;θ,α)
=P(x;θ,α)S(x;θ,α)
=θ4(x3+6x2+11x+6)+αθ3(θ+1)3[θ3x3+(9θ3+3θ2)x2+(26θ3+21θ2+6θ)x+{24θ3+36θ2+14θ+αθ2(θ+1)3}]

The cdf, survival function and hazard function of QPRD shown in Figure 3. From the Graph of cdf of QPRD, it is clear that QPRD has a valid cdf because for x,F(x)1. Also, for fixed α,limxh(x)=θ and for fixed θ, limxh(x)=α.

images

Figure 3 cdf, survival function and hazard function of QPRD.

The CDF shows how the probability builds up as the count variable increases. From the plots, it is clear that the cdf of QPRD increases smoothly with increasing x and always remains between 0 and 1. This confirms that the proposed model is mathematically valid and suitable for count data. The parameter α affects how fast the cdf increases and θ affects the shape of the distribution.

The survival function decreases smoothly with increasing x. The parameter θ controls how quickly survival declines, while α affects the early behaviour of the curve. This shows that the distribution can model both rapid and slow decay in survival probabilities.

Hazard function represents the happening of an event at a particular value of x given that the event has not occurred before. In the first graph of hazard function θ is fixed and α varies. In the 1st graph we have seen that as the value of alpha increases, initially the risk decreases then it increases steadily and after a certain point, it becomes almost constant. For smaller value of alpha, the risk increases in a very slow manner.

The second graph allows θ to vary and α to be fixed. In the 2nd graph we observed that the hazard function increases for all values of θ. Initially there is a small decrease in risk for the highest value of α. For smaller values of θ, the risk remains low and increases gradually. As the value of θ increases, the hazard level becomes higher, but the overall pattern remains similar, this means that the rate of increase in risk is stable.

Reverse hazard function and Mill’s ratio of QPRD are obtained as

Rh(x;θ,α) =P(x;θ,α)F(x;θ,α)
=θ3{θ(x3+6x2+11x+6)+α(θ+1)3}(αθ2+6)(θ+1)x+4[θ3x3+(9θ3+3θ2)x2+(26θ3+21θ2+6θ)x+{24θ3+36θ2+14θ+αθ2(θ+1)3}]

and

M(x;θ,α) =S(x;θ,α)P(x;θ,α)
=[θ3x3+(9θ3+3θ2)x2+(26θ3+21θ2+6θ)x+{24θ3+36θ2+14θ+αθ2(θ+1)3}]θ3{θ(x3+6x2+11x+6)+α(θ+1)3}

5 Parameter Estimation

5.1 Method of Moment Estimation

Let (x1,x2,,xn) be a random sample of size n from the QPRD. We have

μ2μ1(μ1)2=(2αθ2+120)(αθ2+6)(αθ2+24)2=k

Taking αθ2=c, we have

2(c+60)(c+6)(c+24)2=k
(k2)c2+(48k132)c+576k720=0

Replacing μ1 and μ2 with their corresponding sample moments, an estimate of k can be obtained and substituting the value of k in the above equation, estimate of c can be obtained.

Now, for real root of c, we have

(48k132)24(k2)(576k720)0k2.25

Thus, the method of moments estimate is applicable if k=m2x¯(x¯)22.25, where m2 and x¯ are the corresponding second and first sample moments, respectively.

Now taking αθ2=c in the expression for mean and equating the population mean to the sample mean, we get the moment estimate θ~ of θ as

αθ2+24θ(αθ2+6)=c+24θ(c+6)=x¯θ~=c+24x¯(c+6)

From αθ2=c, we get the moment estimate α~ of α as

α~=c(θ~)2=c(c+6)2(x¯)2(c+24)2

Thus, the method of moment estimates (θ~,α~) of (θ,α) of QPRD are given by

(θ~,α~)=(c+24x¯(c+6),c(c+6)2(x¯)2(c+24)2)

5.2 Method of Maximum Likelihood Estimation

Suppose (x1,x2,,xn) be a random sample from QPRD and let fx be the observed frequency corresponding to X=x(x=1,2,3,k) such that x=1kfx=n, where k is the largest observed value having non-zero frequency. The likelihood function, L of the QPRD is given by

L =(θ3αθ2+6)n
×1(θ+1)x=1k(x+4)fxx=1k[θ(x3+6x2+11x+6)+α(θ+1)3]

The log-likelihood function is given by

logL =n[3logθlog(αθ2+6)]x=1kfx(x+4)log(θ+1)
+x=1kfxlog[θ(x3+6x2+11x+6)+α(θ+1)3]

The maximum likelihood estimates (θ^,α^) of (θ,α) are the solutions of equations

logLθ =3nθ2nαθαθ2+6n(x¯+4)θ+1
+x=1k(x3+6x2+11x+6)+3α(θ+1)2θ(x3+6x2+11x+6)+α(θ+1)3=0
logLα =nθ2αθ2+6
+x=1k(θ+1)3fxθ(x3+6x2+11x+6)+α(θ+1)3=0

where x¯ is the sample mean. Since these two log-likelihood equations are not in closed form, it cannot be solved directly. But these equations can be solved using R-software.

6 A Simulation Study

In this simulation study, we evaluate the effectiveness of MLE for parameters θ and α of QPRD. The study considers multiple sample sizes (n = 50, 100, 200, 300, 400, 500), with true values set at θ=3,α=1 and θ=3.5,α=0.5. The values of θ and α has been chosen on the basis of the estimates of the parameters for the proposed distribution. For simulating the sample, we used inverse transformation sampling. MLE is implemented using the stats4 package in R, applying the L-BFGS-B optimization method. For each sample size, 5,000 replications are used to compute performance metrics such as mean, variance, bias, and MSE. The results highlight improvements in estimation accuracy for increasing sample size and affirm the robustness of the MLE approach under this specific distributional setting.

bias(θ^)=E(θ^)θ=1Ni=1Nθ^iθ
MSE(θ^)=E[(θ^θ)2]=1Ni=1N(θ^iθ)2
bias(α^)=E(α^)α=1Ni=1Nα^iα
MSE(α^)=E[(α^α)2]=1Ni=1N(α^iα)2

Table 1 MSE and Bias for θ=3 and α=1

θ=3 α=1
N Mean Variance MSE Bias Mean Variance MSE Bias
50 3.7184 1.5399 2.0559 0.7183 0.7835 0.3906 0.4374 0.2165
100 3.4526 0.8414 1.0462 0.4526 0.8591 0.3401 0.3610 0.1409
200 3.2886 0.4067 0.4838 0.3886 0.9104 0.2794 0.2874 0.0896
300 3.1959 0.2626 0.3009 0.1959 0.9416 0.2480 0.2515 0.0584
400 3.1454 0.1909 0.2182 0.1454 0.9621 0.2166 0.2025 0.0378
500 3.1274 0.1659 0.1822 0.1273 0.9729 0.2018 0.2181 0.0271

Table 2 MSE and Bias for θ=3.5 and α=0.5

θ=3.5 α=0.5
N Mean Variance MSE Bias Mean Variance MSE Bias
50 4.0436 1.7796 2.0751 0.5436 0.6308 0.3742 0.3912 0.1308
100 3.8162 1.0729 1.1729 0.3162 0.6502 0.3263 0.3489 0.1523
200 3.6804 0.5922 0.6247 0.1804 0.6269 0.2531 0.2692 0.1269
300 3.6116 0.4122 0.4246 0.1116 0.6194 0.2175 0.2318 0.1194
400 3.5700 0.3223 0.3222 0.0700 0.6108 0.1841 0.1963 0.1108
500 3.5574 0.2632 0.2632 0.0574 0.5997 0.1592 0.1692 0.0997

From the Table 1 we observed that for the parameter θ, the estimated mean moves closer to the true value θ=3 as the sample size increases. At smaller sample sizes, the estimator shows more bias, but this bias steadily decreases with increasing sample size. Both the variance and the mean squared error (MSE) also decline sharply as the sample size increases, indicating improved precision and stability of the estimator. A similar trend is observed for the parameter α. Same pattern has been seen in Table 2. For both table we can say that the estimators are consistent.

Table 3 Mammalian cytogenetic dosimetry lesions in rabbit lymphoblast induced by streptogramin

Number of Observations Observed Frequency Expected Frequency
PLD PAD PSD PRD QPLD QPAD QPSD QPRD
0 413 405.7 420.1 406.1 413.9 407.5 409.9 395.1 413.2
1 124 133.6 124.8 133.0 123.5 131.3 128.5 127.4 123.9
2 42 42.6 38.6 42.7 41.2 42.2 41.8 45.9 41.8
3 15 13.3 12.1 13.4 14.5 13.6 13.9 18.4 14.6
4 5 4.1 3.8 4.1 5.2 4.3 4.6 8.0 5.0
5 0 1.2 1.2 1.2 1.8 1.4 1.5 3.5 1.7
6 2 0.7 0.7 1.5 0.9 0.7 0.8 2.7 0.8
Total 601 601 601 601 601 601 601 601 601
θ^(SE) 2.6853 (0.1670) 3.0872 (0.1593) 3.1257 (0.1640) 3.1369 (0.1280) 2.2403 (0.4188) 2.6253 (1.0183) 2.4153 (0.7841) 3.4605 (1.3479)
α(SE) 15.0060 (47.0648) 5.45822 (14.1681) 16.4112 (48.9510) 1.8442 (3.7322)
χ2 4.8603 5.1153 1.1771 0.13929 0.68141 0.27049 5.512 0.0454
d.f. 3 3 3 3 2 2 2 2
p-value 0.5618 0.5291 0.8819 0.9977 0.9536 0.9916 0.2387 0.9997

Table 4 Number of thunderstorm events in the month of June

Number of Thunderstorms Observed Frequency Expected Frequency
PLD PAD PSD PRD QPLD QPAD QPSD QPRD
0 187 185.3 194.8 184.8 191.1 184.3 184.6 184.4 185.3
1 77 83.5 79.0 83.6 76.4 84.4 83.4 84.0 81.9
2 40 36.0 33.1 36.3 34.3 36.3 36.8 36.5 37.3
3 17 15.0 13.8 15.2 15.8 15.0 15.4 15.2 15.8
4 6 6.1 5.6 6.1 7.1 6.0 6.1 6.1 6.2
5 2 2.5 2.2 2.4 3.1 2.4 2.3 2.4 2.3
6 1 1.6 1.5 1.6 2.2 1.6 1.4 1.4 1.2
Total 330 330 330 330 330 330 330 330 330
θ^(SE) 1.8042 (0.1257) 2.2510 (0.1260) 2.2298 (0.1295) 2.4547 (0.1047) 1.8978 (0.4497) 2.5365 (0.5030) 2.2697 (0.4608) 3.2501 (0.5556)
α^(SE) 1.3459 (1.7615) 0.9516 (0.8563) 2.0299 (2.1131) 0.61325 (0.5117)
χ2 1.3739 2.553 1.2573 2.0633 1.4322 1.0321 1.2506 0.6458
d.f. 3 3 3 3 2 2 2 2
p-value 0.8487 0.6352 0.8686 0.7241 0.8386 0.9049 0.8697 0.9578

7 Applications

In this section QPRD has been fitted on two real-life examples of observed count datasets and its goodness of fit has been compared with PLD, PAD, PSD, PRD, QPLD, QPAD and QPSD and presented in the Tables 3 and 4. The first dataset in Table 3 is related to mammalian cytogenetic dosimetry lesions in rabbit lymphoblast [32] and the second dataset is related to number of days that experienced the number of thunderstorms events at Cape Kennedy, Florida for the month of June, 11-year period of record January 1957 to December 1967 [33]. Based on the values of χ2, it is quite obvious that QPRD competes well with considered over-dispersed one parameter and two-parameter distributions and thus provides best fit among all considered distributions. In both datasets, QPRD yields smaller chi-square values and higher p-values among the considered distributions. Here, we would like to emphasize that algorithm for generating neutrosophic gamma distributed data by [34] and length-biased weighted Ishita distribution by [35] are popular for modeling continuous data but the Poisson mixture of these distributions is useful for modeling discrete data.

8 Conclusion

In this study, the PD compound of QRD named the QPRD has been proposed. It is specifically designed to model over-dispersed non-negative count data, particularly when the right tail of the data approaches zero quickly. Coefficients of variation, skewness, kurtosis, and index of dispersion are the measures based on moments have been derived and their behaviour for different parameter values has been studied. The QPRD is shown to be overdispersed, unimodal, has an increasing hazard rate and right-skewed. It is a two-component mixture of NBDs, yet it maintains its unimodality in its PMF which suggest that modes of its constituent sub-populations are located very close to each other. The estimation of parameters of QPRD has been discussed with both the method of moments and the method of maximum likelihood. The effectiveness of maximum probability estimates has been tested using simulation. The goodness of fit of QPRD has been tested with two datasets and compared with the goodness of fit of PLD, PAD, PSD, PRD, QPLD, QPAD and QPSD. It is clear that QPRD is the best distribution among all one parameter and two-parameter over-dispersed discrete distributions and therefore, QPRD would be the best probability model for over-dispersed count data in data science. It is quite clear that QPRD is particularly suited for biological sciences, insurance and other fields characterized by heavily over-dispersed count data.

The QPRD has some limitations. Since the proposed distribution has only two-parameter which may limits its flexibility compared to three-parameter or more complex models when dealing with extremely diverse datasets. It is designed for over-dispersed dataset and is not suitable for under-dispersed dataset. Although the proposed distribution is flexible, its unimodal nature means that it cannot be used to model multi-modal data structures.

As the distribution is new and thus have high potential for applications in different fields of knowledge and we are sure that in future the proposed distribution will draw attention of researchers in biomedical sciences and engineering to model survival time data. The future research possibilities are that size-biased and zero-truncated versions of QPRD can be derived and their applications for the data excluding zero counts can be examined. Some more areas of applications in other fields of knowledge can be explored for big data in the data science. For example, zero-inflated Poisson quasi Rama distribution can be developed to model data with excessive zeros. The QPRD can be extended into a regression framework named Poisson quasi Rama regression to examine the relationship between the dependent variable and covariates. The proposed distribution can be modified to handle under-dispersed data.

Acknowledgements

Authors are grateful to the editor in chief and the two anonymous reviewers for their fruitful comments which improved both the quality and presentation of the paper.

References

[1] Sankaran, M. The discrete Poisson-Lindley distribution, Biometrics, 26(1): 145–149, 1970.

[2] Lindley, D. V. Fiducial distributions and Bayes theorem, Journal of Royal Statistical Society, 20(1): 102–107, 1958.

[3] Shanker, R. The discrete Poisson-Shanker distribution, Jacobs Journal of Biostatics, 1(1): 1–7, 2016.

[4] Shanker, R. Shanker distribution and its applications, International Journal of Statistics and Applications, 5(6): 338–348, 2015.

[5] Shanker, R. The discrete Poisson-Akash distribution, International Journal of Probability and Statistics, 6(1): 1–10, 2017.

[6] Shanker, R. Akash distribution and its applications, International Journal of Probability and Statistics, 4(3): 65–75, 2015.

[7] Shanker, R. The discrete Poisson-Sujatha distribution, International Journal of Probability and Statistics, 5(1): 1–9, 2016.

[8] Shanker, R. Sujatha distribution and its applications, Statistics in Transition-New Series, 17(3): 391–410, 2016.

[9] Abebe, B., and Shanker, R. A discrete quasi Akash distribution with applications, Turkiye Klinikleri Journal of Biostatistics, 10(3): 187–199, 2018.

[10] Shanker, R. Rama distribution and its application, International Journal of Statistics and Applications, 7(1): 26–35, 2017.

[11] Edith, U. U., Ebele, T. U., and Henrietta, A. I. A two-parameter Rama distribution, Earthline Journal of Mathematical Sciences, 2(2): 365–382, 2019.

[12] Abebe, B., Tesfay, M., Eyob, T., and Shanker, R. A two-parameter power Rama distribution with properties and applications, Biometrics & Biostatistics International Journal, 8(1): 6–11, 2019.

[13] Eyob, T., and Shanker, R. A two-parameter weighted Rama distribution with properties and application, Universal Journal of Mathematics and Applications, 2(2): 48–58, 2019.

[14] Onyekwere, C. K., Osuji, G. A., Enogwe, S. U., Okoro, M. C., and Ihedioha, V. N. Exponentiated Rama Distribution: properties and application, Mathematical theory and Modelling, 11(1): 2224–5804, 2021.

[15] Alhyasat, K. M., Ibrahim, K., Al-Omari, A. I., and Bakar, M. A. A. Extended Rama distribution: properties and applications, Computer System Science Engineering, 39(1): 55–67, 2021.

[16] Mohiuddin, M., and Kannan, R. A new generalization of Rama distribution with application to machinery data, International Journal of Emerging Technologies in Engineering Research, 9(9): 1–13, 2021.

[17] Omoruyi, F. A., Omeje, I. L., Anabike, I. C., and Obulezi, O. J. A new variant of Rama distribution with simulation study and application to blood cancer data, European Journal of Theoretical and Applied Sciences, 1(4): 389–409, 2023.

[18] Shanker, R., Das, H., and Shukla, K. K. A quasi Rama distribution, Model Assisted Statistics and Applications, 18(3): 267–277, 2023.

[19] Bell, W., and Nadarajah, S. The wrapped Rama distribution, Scientific Reports, 14(1): 31936, 2024.

[20] Arunachalam, R. A new generalisation of the quasi Rama distribution: Exploring its Statistical Properties and Applications, Mathematics and Computer Science: Research Updates, 4: 64–83, 2025.

[21] Shanker, R., Ray, M., and Shukla, K. K. The Poisson-Rama distribution with properties and applications to model over-dispersed count data, Brazilian Journal of Biometrics, 43(2): 1–15, 2025.

[22] Shanker, R., Ray, M., Prodhani, H. R., and Boruah, J. Weighted quasi Rama distribution with properties and applications in Engineering, International Journal of Statistics and Reliability Engineering, 12(1): 18–34, 2025.

[23] Shanker, R., Prodhani, H. R., Ray, M., and Deb, J. Power quasi Rama distribution with properties and applications in reliability engineering, Journal of Probability and Statistical Science, 24(1): 50–73, 2026.

[24] Shanker, R., and Mishra, A. A quasi Poisson-Lindley distribution, Journal of Indian Statistical Association, 54(1&2): 113–125, 2016.

[25] Shanker, R., Shukla, K. K., and Shanker, R. A quasi Poisson-Akash distribution and its applications to ecology, International Journal of Statistics and Applied Mathematics, 3(3): 111–119, 2018.

[26] Shanker, R., and Shukla, K. K. A quasi Poisson-Sujatha distribution with applications, Journal of Applied Probability and Statistics, 14(1): 103–115, 2019.

[27] Grandell, J. Mixed Poisson Processes, Chapman & Hall, London, 1997.

[28] Gupta, P. L., Gupta, R. C., Tripathi, R. C. On the monotonic properties of discrete failure rates, Journal of statistical planning and inference, 65(2): 255–268, 1997.

[29] Steutal, F. Log-concave and log-convex distribution, Encyclopedia of Statistical Sciences, 2004.

[30] Bagnoli, M., Bergstrom, T. Log-concave probability and its application in Rationality and Equilibrium: A symposium is Honoured of Marchel K. Richer, Springer, 217–241, 2006.

[31] Keilson, J., Gerber, H. Some results for discrete unimodality, Journal of the American Statistical Association, 66(334): 386–389, 1971.

[32] Pittillo, R. F., and Woolley, C. Biological assay of streptonigrin (NSC 45383) in body fluids and tissues of mice, Antimicrobial agents and chemotherapy, 5(1): 82–85, 1974.

[33] Falls, L. W., Williford, W. O., and Carter, M. C. Probability distributions for thunderstorm activity at Cape Kennedy, Florida, Journal of Applied Meteorology, 10(1): 97–104, 1971.

[34] Saleem, M., and Aslam, M. Algorithms for Generating Neutrosophic Gamma Distributed Data, Journal of Reliability and Statistical Studies, 2025.

[35] Pushkarna, N., and Mustafa, M. Length Biased Weighted Ishita Distribution and Its Applications on Real Life Data Sets, Journal of Reliability and Statistical Studies, 2025.

Biographies

images

Rama Shanker received his Ph.D. Degree in Statistics from Patna University, Patna, India. He is currently working as Professor of Statistics in the Department of Statistics, Assam University, Silchar, Assam, India. He Has 235 research papers in his name in reputed Journals of Statistics, Mathematics, Operations Research and Biostatistics. He has 22 years of post-Ph.D teaching and research experience at undergraduate and postgraduate level. He has successfully guided four Ph.D. Scholars and currently five research scholars are working under his supervision for Ph.D. degree.

images

Riki Tabassum obtained her master Degree in Statistics from Guwahati University, Guwahati, India. She is currently a research scholar in the Department of Statistics, Assam University, Silchar and working for her Ph.D. Degree in Distribution Theory. She has published seven research papers in reputed Journals of Statistics. She has also presented three research papers in international conferences.

images

Jyotirmoyee Baishya obtained her master Degree in Statistics from Assam University, Silchar, India. She is currently a research scholar in the Department of Statistics, Assam University, Silchar and working for her Ph.D. Degree in Distribution Theory. She has published six research papers in reputed Journals of Statistics. She has also presented two research papers in international conferences.