|
ABSTRACT
The purpose of the
present study is to build up statistical models
to l2 (number of survivors at age 2)
values for male and female population of Bangladesh.
For this, the secondary data of l2 values
for male and female population of Bangladesh have
been taken from various sources. To check the soundness
of the model, the model validation technique, cross-validity
prediction power (CVPP), is applied. It is seen
that a simple linear regression model is fitted
to l2 values for male and female population
of Bangladesh. Then, these are forecasted for 2008–2031using
these fitted time trend statistical models.
Keywords:
l2
(Survivors at Age 2) Values for male and
female Population of Bangladesh Modeling Cross-
validity prediction power (CVPP) F-test. |
INTRODUCTION
Modeling
in Demography in the Asian region, in particular, in
Bangladesh has hardly ever been used. For representing
data in the up to date hi-tech era, statistical models
are very sophisticated and pragmatic devices. The statistical
model is very much significant and imperative in differentiating
necessary and unnecessary characteristics in the midst
of a variety of socio-economic and demographic phenomena.
Modeling is in fact essentially an endeavor to find
out the functional interaction and their vibrant behaviors
surrounded by the various components, not only in demographic
but also in socio-economic analysis. Last but not least,
a model is very important for the estimation of population
projections and estimations. Normally, one can depict
a number of figures for the demographic parameters as
well as socio-economic indicators but, in the perception
of Bangladesh, very few of us comprehend which types
of functional or mathematical shapes are more appropriate
for the parameters and social indicators. In this study,
l2 is defined as the number of survivors
at age 2 in the lx function of a life table
in life table analysis. l2 is very important
and needed for the linking to adult mortality to attain
a complete lx function or column of a life
table that is employed in the application of Orphanhood
method, Widowhood method and other indirect methods
for the estimation of demographic parameters of Bangladesh.
Ali (1994) found
that the relationship of total separation rates and
separation rates due to death, with their age variable
and found a semi-log function of the type . In Islam
et al (2003), it was reported that age distribution,
age specific death rates (ASDRs) and the number of persons
surviving at an exact age x (lx) for male
population of Bangladesh in 1991 follow a modified negative
exponential model, 4th degree polynomial
model and 3rd degree polynomial model, respectively.
Islam (2005) observed that age structure, ASDRs and
lx for female population of Bangladesh follow
a modified negative exponential model, 4th
degree polynomial model and biquadratic polynomial model
respectively. It was set up that the values
of a life table for the male population followed a four
parameters 3rd degree polynomial model, i. e. cubic
polynomial model (Islam, 2006).
Therefore, the fundamental
aims and objectives of this study are as follows:
i) to build up time trend
statistical models to l2 values for male
and female population of Bangladesh and,
ii) then to forecast
these values employing these fitted statistical regression
models for 2008-2031.
DATA AND METHODOLOGY
Data Sources
To fulfill the objectives mentioned
above the secondary data on l2 values for
male and female population of Bangladesh have been taken
from (Islam, 2003, 2006, 2007 and 2007). These have
been utilized as raw materials in the current study
that are shown in Table 1.
Data Smoothing
It is observed that there is
some kind of unpredicted distortions in the data aggregate
if it is placed on a graph paper. Therefore, before
going to fit the models to this data, an adjustment
is needed to relieve these unpredicted distortions.
So, these are smoothed using the Package Minitab Release
12.1 by the latest smoothing technique “4253H, twice”
(Velleman, 1980). Afterward, the smoothed data are used
to fit statistical models and these are shown in Table
1.
Regression
Model Fitting
Using
the scattered plot of l2 values for male
and female population of Bangladesh, it appears that
these are linearly distributed. Therefore, a statistical
model, that is, a simple linear regression model is
considered and the structure of the model is
yt=a0
+a1t+u
where, t represents time
(years); yt represents l2 values;
a0, a1 are unknown parameters
and u is the stochastic disturbance term of the model.
Note that these models are fitted
using the software STATISTICA.
Model Validation Technique
To test out the validity or
legitimacy of these models, the CVPP, , is
applied. The mathematical formulation for CVPP is given
as
; where,
n is the number of classes, k is the number of regressors
in the model and the cross-validated R is the correlation
between observed and predicted values of the dependent
variables (Stevens, 1996). The shrinkage of the model
is the positive value of ( - R2);
where is
CVPP and R2 is the coefficient of determination of the
fitted model. As well, 1-shrinkage is the stability
of R2 of the model. The estimated CVPP analogous to
their R2 and information on model fittings are summarized
in Table 2. It is noted that CVPP was also employed
by Islam (2003 and 2005), Islam et al (2003 and 2005)
and Khan and Ali (2004) as the model justification method.
To find out the overall measure
of significance level of the fitted models as well as
the significance of R2 , the F-test is employed in this
information. The F-test is specified by
with
(l-1, n-l) degrees of freedom (d.f.);
where l = the number of parameters is to be estimated
in the fitted model, n is the number of cases and R2
is the coefficient of determination of the model (Gujarati,
1998).
Table
1 Observed, Predicted and Residual of l2 Values
for Male and Female Population of Bangladesh During
1961-2007
| Year |
Male |
Female |
| Observed |
Smoothed |
Predicted |
Residual |
Observed |
Smoothed |
Predicted |
Residual |
| 1961 |
0.77813 |
0.778130 |
0.76803 |
0.01011 |
0.78989 |
0.789890 |
0.778273 |
0.01162 |
| 1974 |
0.77067 |
0.793406 |
0.80810 |
-0.0147 |
0.77815 |
0.802052 |
0.813988 |
-0.01194 |
| 1981 |
0.81750 |
0.826434 |
0.82968 |
-0.0032 |
0.82571 |
0.828114 |
0.833219 |
-0.00511 |
| 1991 |
0.87999 |
0.867742 |
0.86051 |
0.00723 |
0.86339 |
0.861243 |
0.860692 |
0.00055 |
| 2005 |
0.90280 |
0.896771 |
0.90367 |
-0.0069 |
0.89020 |
0.888750 |
0.899154 |
-0.01040 |
| 2006 |
0.90862 |
0.909496 |
0.90676 |
0.00274 |
0.91015 |
0.906124 |
0.901901 |
0.00422 |
| 2007 |
0.91161 |
0.914609 |
0.90984 |
0.00477 |
0.91282 |
0.915702 |
0.904648 |
0.01105 |
Table 2
Information on Model Fittings
|
Model |
n |
k |
R2
|
|
Shrinkage |
Parameters |
Significant Probability (p) |
|
(i)
|
7
|
2
|
0.97579
|
0.958497
|
0.0173 |
a0
a1 |
0.0001
0.00003 |
|
(ii)
|
7
|
2
|
0.96377
|
0.937891
|
0.0259 |
a0
a1 |
0.00020
0.00009 |
Table
3 Forecasted l2 Values for Male and Female Population
of Bangladesh During 2008-2031
| Year |
Male |
Female |
| 2008 |
0.90714 |
0.91284 |
| 2009 |
0.91022 |
0.91559 |
| 2010 |
0.91330 |
0.91834 |
| 2011 |
0.91638 |
0.92109 |
| 2012 |
0.91946 |
0.92384 |
| 2013 |
0.92254 |
0.92659 |
| 2014 |
0.92562 |
0.92934 |
| 2015 |
0.92870 |
0.93209 |
| 2016 |
0.93178 |
0.93484 |
| 2017 |
0.93486 |
0.93759 |
| 2018 |
0.93794 |
0.94034 |
| 2019 |
0.94102 |
0.94309 |
| 2020 |
0.94410 |
0.94584 |
| 2021 |
0.94718 |
0.94859 |
| 2022 |
0.95026 |
0.95134 |
| 2023 |
0.95334 |
0.95409 |
| 2024 |
0.95642 |
0.95684 |
| 2025 |
0.95950 |
0.95959 |
| 2026 |
0.96258 |
0.96234 |
| 2027 |
0.96566 |
0.96509 |
| 2028 |
0.96874 |
0.96784 |
| 2029 |
0.97182 |
0.97059 |
| 2030 |
0.97490 |
0.97334 |
| 2031 |
0.97798 |
0.97609 |
Figure 1 Observed, Smoothed
and Predicted l2 Values for Male Population of Bangladesh.
X axis represents Year (Time) and Y axis represents
l2 Values.

Figure 2 Observed, Smoothed
and Predicted l2 Values for Female Population of Bangladesh.
X axis represents Year (Time) and Y axis represents
l2 Values.

Figure 3
Forecasted l2 Values for Male and Female Population
of Bangladesh During 2008-2031. X axis represents Year
(Time) and Y axis represents forecasted l2 Values.

RESULTS
and DISCUSSION
The statistical models, that
is, simple linear regression model is assumed to fit
to l2 values for male and female population of Bangladesh
and the fitted time trend models are in the following:
yt=-5.2775 +0.00308t for male
(1)
t-stats (-12.2147) (14.1946)
yt=-4.60916 +0.00275t for female
(2)
t-stats (-9.72565) (11.53224)
The information on model fittings
and estimated CVPP, , analogous
to their R2 of these models is shown in Table 2. From
this table it appears that the fitted models (1) - (2)
are highly cross-validated and their shrinkages are
0.0173 and 0.0259 respectively. These imply that the
fitted models (1) - (2) will be stable more than 95%
and 93% respectively. Moreover, it is found that the
parameters of the fitted models (1) - (2) are highly
statistically significant with significant of variance
explained. The stability for R2 of these models is more
than 98% and 97% respectively.
The calculated values of F statistic
for the models (1) - (2) are 201.53 with (1, 5) d.f.
and 133.01 with (1, 5) d.f. respectively whereas the
analogous tabulated values are only 16.3 for (1) - (2)
models at 1% level of significance. Therefore, from
these statistics it is seen that these models and their
analogous R2 are highly statistically significant. Hence,
the fits of these models are well.
It should be mentioned here
for information that others models such as exponential,
logistic, quadratic, cubic, biquadratic were also applied
to fit model to these data but those are not fit well
due to shrinkage and proportion of variance explained.
Thereafter, the forecasted values are estimated using
these fitted time trend regression models that are presented
in Table 3. It is found from the Table 3 that l2 values
are increasing, i.e., upward trend due to time during
the forecasted period 2008-2031.
CONCLUSION
In this study it is found that
l2 values for male and female population
of Bangladesh follow a simple linear regression model.
Then these are forecasted using these statistical models
during 2008– 2031. These might be used as predicted
l2 values for male and female population
of Bangladesh for 2008–2031 for further higher study
as these may be used in the application of Orphanhood
method, Widowhood method and other indirect techniques
for the estimation of demographic parameters for the
forecasted period 2008– 2031.
REFERENCES
Ali, M. Korban. (1994) Modeling of Labour
Force Dynamics in Bangladesh: An Evidence from 1981
Census, The Rajshahi University Studies, Part-B, Vol.
22: 259-266.
Gujarati, Damodar N. (1998). Basic Econometrics,
Third Edition, McGraw Hill, Inc., New York.
Islam, Md. Rafiqul (2003). Modeling of
Demographic Parameters of Bangladesh-An Empirical Forecasting,
Unpublished Ph.D. Thesis, Rajshahi University.
Islam, Md. Rafiqul, Islam, Md. Nurul, Ali,
Md. Ayub & Mostofa, Md. Golam. (2003).
Construction of Male Life Table from Female Widowed
Information of Bangladesh, International Journal of
Statistical Sciences, Dept. of Statistics, University
of Rajshahi, Bangladesh, Vol. 2: 69-82.
Islam, Md. Rafiqul. (2005). Construction
of Female Life Table from Male Widowed Information of
Bangladesh, Pakistan Journal of Statistics, Vol. 21(3):
275-284.
Islam, Md. Rafiqul, Islam, Md. Nurul, Ali,
M. Korban & Mondal, Md. Nazrul Islam (2005). Indirect
Estimation and Mathematical Modeling of Some Demographic
Parameters of Bangladesh, The Oriental Anthropologist,
Vol. 5(2): 163 - 171.
Islam, Md. Rafiqul (2006). Construction
of Abridged Life Tables and Indirect Estimation of Some
Mortality Measures of Bangladesh in 2005, Journal of
Population, Indonesia, Vol.11 (2):117-130.
Islam, Md. Rafiqul (2007). Estimation of
Some Mortality Measures, Modeling of lx and Age Associated
with Force of Mortality of Bangladesh in 2006, Middle
East J. of Age and Ageing, Vol. 4(5):23-28.
Islam, Md. Rafiqul (2007). Estimation of
Some Mortality Measures, Modeling of lx and Age Associated
with Force of Mortality of Bangladesh in 2007, Middle
East J. of Age and Ageing, Vol. 4(6): 32-38.
Khan, Md. Atikur Rahman and Ali, Md. Ayub
(2004). Dynamics of Mean Age at Marriage, TFR and NRR
in Bangladesh, International J. of Statistical Sciences,
Dept. of Statistics, University of Rajshahi, Vol. 3(Special
Issue): 297-309.
Stevens, J. (1996). Applied Multivariate
Statistics for the Social Sciences, Third Edition, Lawrence
Erlbaum Associates, Inc., Publishers, New Jersey.
Velleman, P. F. (1980). Definition and
Comparison of Robust Nonlinear Data Smoothing Algorithms,
Journal of the American Statistical Association, Vol.
75 (371): 609-615.
|