While all efforts have been made to ensure the accuracy of the information in this journal, opinions expressed are those of the authors and do not necessarily reflect the views of The Publishers, Editor or the Editorial Board. The publishers, Editor and Editorial Board cannot be held responsible for errors or any consequences arising from the use of information contained in this journal; or the views and opinions expressed. Publication of any advertisements does not constitute any endorsement by the Publishers and Editors of the product advertised.

The contents of this journal are copyright. Apart from any fair dealing for purposes of private study, research, criticism or review, as permitted under the Australian Copyright Act, no part of this program may be reproduced without the permission of the publisher.

 

 
April 2008, Volume 5 - Issue 2

Modeling and Forecasting of l2 (Survivors at Age 2) Values for Male and Female Population of Bangladesh: Regression Model Approach

Md.Rafiqul Islam

Correspondence:
Dr. Md. Rafiqul Islam
Associate Professor and Chairman Dept. of Population Science and Human Resource Development,
University of Rajshahi Bangladesh.
E-mail:
rafique_pops@yahoo.com


ABSTRACT

The purpose of the present study is to build up statistical models to l2 (number of survivors at age 2) values for male and female population of Bangladesh. For this, the secondary data of l2 values for male and female population of Bangladesh have been taken from various sources. To check the soundness of the model, the model validation technique, cross-validity prediction power (CVPP), is applied. It is seen that a simple linear regression model is fitted to l2 values for male and female population of Bangladesh. Then, these are forecasted for 2008–2031using these fitted time trend statistical models.

Keywords: l2 (Survivors at Age 2) Values for male and female Population of Bangladesh Modeling Cross- validity prediction power (CVPP)  F-test.



INTRODUCTION

Modeling in Demography in the Asian region, in particular, in Bangladesh has hardly ever been used. For representing data in the up to date hi-tech era, statistical models are very sophisticated and pragmatic devices. The statistical model is very much significant and imperative in differentiating necessary and unnecessary characteristics in the midst of a variety of socio-economic and demographic phenomena. Modeling is in fact essentially an endeavor to find out the functional interaction and their vibrant behaviors surrounded by the various components, not only in demographic but also in socio-economic analysis. Last but not least, a model is very important for the estimation of population projections and estimations. Normally, one can depict a number of figures for the demographic parameters as well as socio-economic indicators but, in the perception of Bangladesh, very few of us comprehend which types of functional or mathematical shapes are more appropriate for the parameters and social indicators. In this study, l2 is defined as the number of survivors at age 2 in the lx function of a life table in life table analysis. l2 is very important and needed for the linking to adult mortality to attain a complete lx function or column of a life table that is employed in the application of Orphanhood method, Widowhood method and other indirect methods  for the estimation of demographic parameters of Bangladesh.

Ali (1994) found that the relationship of total separation rates and separation rates due to death, with their age variable and found a semi-log function of the type . In Islam et al (2003), it was reported that age distribution, age specific death rates (ASDRs) and the number of persons surviving at an exact age x (lx) for male population of Bangladesh in 1991 follow a modified negative exponential model, 4th degree polynomial model and 3rd degree polynomial model, respectively. Islam (2005) observed that age structure, ASDRs and lx for female population of Bangladesh follow a modified negative exponential model, 4th degree polynomial model and biquadratic polynomial model respectively. It was set up that the values of a life table for the male population followed a four parameters 3rd degree polynomial model, i. e. cubic polynomial model (Islam, 2006).

Therefore, the fundamental aims and objectives of this study are as follows:

i) to build up time trend statistical models to l2 values for male and female population of Bangladesh and,
ii) then to forecast these values employing these fitted statistical regression models for 2008-2031.


DATA AND METHODOLOGY

Data Sources

To fulfill the objectives mentioned above the secondary data on l2 values for male and female population of Bangladesh have been taken from (Islam, 2003, 2006, 2007 and 2007). These have been utilized as raw materials in the current study that are shown in Table 1.

Data Smoothing

It is observed that there is some kind of unpredicted distortions in the data aggregate if it is placed on a graph paper. Therefore, before going to fit the models to this data, an adjustment is needed to relieve these unpredicted distortions. So, these are smoothed using the Package Minitab Release 12.1 by the latest smoothing technique “4253H, twice” (Velleman, 1980). Afterward, the smoothed data are used to fit statistical models and these are shown in Table 1.

Regression Model Fitting

Using the scattered plot of l2 values for male and female population of Bangladesh, it appears that these are linearly distributed. Therefore, a statistical model, that is, a simple linear regression model is considered and the structure of the model is

yt=a0 +a1t+u

where, t represents time (years); yt represents l2 values;  a0, a1 are unknown parameters and u is the stochastic disturbance term of the model.

Note that these models are fitted using the software STATISTICA.

 

Model Validation Technique

To test out the validity or legitimacy of these models, the CVPP, , is applied. The mathematical formulation for CVPP is given as

; where, n is the number of classes, k is the number of regressors in the model and the cross-validated R is the correlation between observed and predicted values of the dependent variables (Stevens, 1996). The shrinkage of the model is the positive value of ( - R2); where is CVPP and R2 is the coefficient of determination of the fitted model. As well, 1-shrinkage is the stability of R2 of the model. The estimated CVPP analogous to their R2 and information on model fittings are summarized in Table 2. It is noted that CVPP was also employed by Islam (2003 and 2005), Islam et al (2003 and 2005) and Khan and Ali (2004) as the model justification method.

To find out the overall measure of significance level of the fitted models as well as the significance of R2 , the F-test is employed in this information. The F-test is specified by
with (l-1, n-l) degrees of freedom (d.f.);
where l = the number of parameters is to be estimated in the fitted model, n is the number of cases and R2 is the coefficient of determination of the model (Gujarati, 1998).

Table 1 Observed, Predicted and Residual of l2 Values for Male and Female Population of Bangladesh During 1961-2007

Year Male Female
Observed Smoothed Predicted Residual Observed Smoothed Predicted Residual
1961 0.77813 0.778130 0.76803 0.01011 0.78989 0.789890 0.778273 0.01162
1974 0.77067 0.793406 0.80810 -0.0147 0.77815 0.802052 0.813988 -0.01194
1981 0.81750 0.826434 0.82968 -0.0032 0.82571 0.828114 0.833219 -0.00511
1991 0.87999 0.867742 0.86051 0.00723 0.86339 0.861243 0.860692 0.00055
2005 0.90280 0.896771 0.90367 -0.0069 0.89020 0.888750 0.899154 -0.01040
2006 0.90862 0.909496 0.90676 0.00274 0.91015 0.906124 0.901901 0.00422
2007 0.91161 0.914609 0.90984 0.00477 0.91282 0.915702 0.904648 0.01105

Table 2 Information on Model Fittings

Model n k

R2

Shrinkage Parameters Significant Probability (p)

(i)

7

2

0.97579

0.958497

0.0173 a0
a1
0.0001
0.00003

(ii)

7

2

0.96377

0.937891

0.0259 a0
a1
0.00020
0.00009


Table 3 Forecasted l2 Values for Male and Female Population of Bangladesh During 2008-2031

Year Male Female
2008 0.90714 0.91284
2009 0.91022 0.91559
2010 0.91330 0.91834
2011 0.91638 0.92109
2012 0.91946 0.92384
2013 0.92254 0.92659
2014 0.92562 0.92934
2015 0.92870 0.93209
2016 0.93178 0.93484
2017 0.93486 0.93759
2018 0.93794 0.94034
2019 0.94102 0.94309
2020 0.94410 0.94584
2021 0.94718 0.94859
2022 0.95026 0.95134
2023 0.95334 0.95409
2024 0.95642 0.95684
2025 0.95950 0.95959
2026 0.96258 0.96234
2027 0.96566 0.96509
2028 0.96874 0.96784
2029 0.97182 0.97059
2030 0.97490 0.97334
2031 0.97798 0.97609


Figure 1 Observed, Smoothed and Predicted l2 Values for Male Population of Bangladesh. X axis represents Year (Time) and Y axis represents l2 Values.

Figure 2 Observed, Smoothed and Predicted l2 Values for Female Population of Bangladesh. X axis represents Year (Time) and Y axis represents l2 Values.

Figure 3 Forecasted l2 Values for Male and Female Population of Bangladesh During 2008-2031. X axis represents Year (Time) and Y axis represents forecasted l2 Values.


 

RESULTS and DISCUSSION

The statistical models, that is, simple linear regression model is assumed to fit to l2 values for male and female population of Bangladesh and the fitted time trend models are in the following:

yt=-5.2775 +0.00308t for male … (1)
t-stats (-12.2147) (14.1946)

yt=-4.60916 +0.00275t for female … (2)
t-stats (-9.72565) (11.53224)

The information on model fittings and estimated CVPP, , analogous to their R2 of these models is shown in Table 2. From this table it appears that the fitted models (1) - (2) are highly cross-validated and their shrinkages are 0.0173 and 0.0259 respectively. These imply that the fitted models (1) - (2) will be stable more than 95% and 93% respectively. Moreover, it is found that the parameters of the fitted models (1) - (2) are highly statistically significant with significant of variance explained. The stability for R2 of these models is more than 98% and 97% respectively.

The calculated values of F statistic for the models (1) - (2) are 201.53 with (1, 5) d.f. and 133.01 with (1, 5) d.f. respectively whereas the analogous tabulated values are only 16.3 for (1) - (2) models at 1% level of significance. Therefore, from these statistics it is seen that these models and their analogous R2 are highly statistically significant. Hence, the fits of these models are well.

It should be mentioned here for information that others models such as exponential, logistic, quadratic, cubic, biquadratic were also applied to fit model to these data but those are not fit well due to shrinkage and proportion of variance explained.
Thereafter, the forecasted values are estimated using these fitted time trend regression models that are presented in Table 3. It is found from the Table 3 that l2 values are increasing, i.e., upward trend due to time during the forecasted period 2008-2031.

 

CONCLUSION

In this study it is found that l2 values for male and female population of Bangladesh follow a simple linear regression model. Then these are forecasted using these statistical models during 2008– 2031. These might be used as predicted l2 values for male and female population of Bangladesh for 2008–2031 for further higher study as these may be used in the application of Orphanhood method, Widowhood method and other indirect techniques for the estimation of demographic parameters for the forecasted period 2008– 2031.

 

REFERENCES

Ali, M. Korban. (1994) Modeling of Labour Force Dynamics in Bangladesh: An Evidence from 1981 Census, The Rajshahi University Studies, Part-B, Vol. 22: 259-266.

Gujarati, Damodar N. (1998). Basic Econometrics, Third Edition, McGraw Hill, Inc., New York.

Islam, Md. Rafiqul (2003). Modeling of Demographic Parameters of Bangladesh-An Empirical Forecasting, Unpublished Ph.D. Thesis, Rajshahi University.

Islam, Md. Rafiqul, Islam, Md. Nurul, Ali, Md. Ayub & Mostofa, Md. Golam. (2003).
Construction of Male Life Table from Female Widowed Information of Bangladesh, International Journal of Statistical Sciences, Dept. of Statistics, University of Rajshahi, Bangladesh, Vol. 2: 69-82.

Islam, Md. Rafiqul. (2005). Construction of Female Life Table from Male Widowed Information of Bangladesh, Pakistan Journal of Statistics, Vol. 21(3): 275-284.

Islam, Md. Rafiqul, Islam, Md. Nurul, Ali, M. Korban & Mondal, Md. Nazrul Islam (2005). Indirect Estimation and Mathematical Modeling of Some Demographic Parameters of Bangladesh, The Oriental Anthropologist, Vol. 5(2): 163 - 171.

Islam, Md. Rafiqul (2006). Construction of Abridged Life Tables and Indirect Estimation of Some Mortality Measures of Bangladesh in 2005, Journal of Population, Indonesia, Vol.11 (2):117-130.

Islam, Md. Rafiqul (2007). Estimation of Some Mortality Measures, Modeling of lx and Age Associated with Force of Mortality of Bangladesh in 2006, Middle East J. of Age and Ageing, Vol. 4(5):23-28.

Islam, Md. Rafiqul (2007). Estimation of Some Mortality Measures, Modeling of lx and Age Associated with Force of Mortality of Bangladesh in 2007, Middle East J. of Age and Ageing, Vol. 4(6): 32-38.

Khan, Md. Atikur Rahman and Ali, Md. Ayub (2004). Dynamics of Mean Age at Marriage, TFR and NRR in Bangladesh, International J. of Statistical Sciences, Dept. of Statistics, University of Rajshahi, Vol. 3(Special Issue): 297-309.

Stevens, J. (1996). Applied Multivariate Statistics for the Social Sciences, Third Edition, Lawrence Erlbaum Associates, Inc., Publishers, New Jersey.

Velleman, P. F. (1980). Definition and Comparison of Robust Nonlinear Data Smoothing Algorithms, Journal of the American Statistical Association, Vol. 75 (371): 609-615.