Upload
rafael-lustosa
View
214
Download
0
Embed Size (px)
Citation preview
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 1/23
Estimating Negative Binomial Demand for Retail Inventory
Management with Unobservable Lost Sa le s
Narendra Agrawal and Stephen A . Smith
Decision
and Information Sciences, Santa Clara University, San ta C lara,
Calijbrnia
95053
The importance of effective inventory management has greatly increased for many major
retailers because of more intense competition. Retail inventory management methods often
use assumptions and demand distributions that were developed for application areas other
than retailing. For example, it is often assumed that u n m e t demand is backordered and that
demand
is
Poisson or normally distributed. In retailing, unmet demand is often lost and
unobserved. Using sales data from a major retailing chain, our analysis found that the neg-
ative binomial
fit
significantly better than the Poisson or the normal distribution. A param-
eter estimation methodology that compensates for unobserved lost sales is developed for the
negative binomial distribution. The method's effectiveness is demonstrated by comparing
parameter estimates from the complete data set to estimates obtained by artificially trun-
cating the data to simulate lost sales.
996
John Wiley Sons. Inc.
1.
INTRODUCTION
Achieving
a
high level of customer service is one of the most important objectives for
firms that sell both durable and nondurable goods. Increased competition has made the
retail inventory management system a key strategic weapon for large retailers such as su-
permarkets and depar tment stores (see [ 6 ] ) . For high-priced durable goods such as furni-
ture or appliances, unmet demand is typically backordered and thus can be observed; how-
ever, for low-cost, nondurable merchandise, sales are typically lost and not reported when
the items are out of stock. Most inventory management methods d o not explicitly account
for lost sales in updating demand forecasts. This can lead to systematic understocking of
items that are in high demand. Industry surveys [141have revealed that for certain popular
items, retail in-stock positions are actually no more than 85 , despite the fact that the
retailers' targets are typically 95-9970. Fisher, Hammond, Obermeyer, and Raman [61 also
note the strategic importance of estimating lost sales.
Demand forecasting based on a parametric distribution requires statistical estimation of
its mean and variance o r other parameters. In the case of unobserved lost sales, the pa-
rameter estimates must be adjusted appropriately to account for the unobserved compo-
nent of demand. Most inventory methods use either the Poisson or the normal, which are
analytically convenient distributions for modeling the demand per period. In general, the
normal is preferred when the demand per cycle is relatively large, while the Poisson is
better for low-demand items because it is discrete. When all demand is observed, the actual
frequency distribution of historical data might also be used. However, when demand ob-
' A n
exception is catalog sales, for which unmet demands can be recorded.
Nuval Rcseurch
Logislics,Vol. 43, pp. 839-861
( 1
996)
Copyright 996 by John Wiley &Sons, Inc.
CCC 0894-069X/96/060839-23
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 2/23
servations are consistently truncated at the same value, nonparametric estimation methods
have no basis for estimating the demand distribution beyond the truncation point. This is
a crucial shortcoming of nonparametric methods because inventory stocking criteria rely
on the tail of the demand distribution.
The negative binomial distribution fits our data significantly better than e ither the Pois-
son or the normal, and is only slightly less manageable analytically than the Poisson. The
negative binomial also has the advantage of providing a single discrete distribution for all
SKUs (stock keeping units) with a wide range of demand rates, removing the need for two
separate distributions. Additionally, from a practical standpoint, the negative binomial is
an efficient (and analytically tractable) distribution to represent the high variability in de-
mand that occurs
i n
retailing environments due to weather. competitors' promotions, and
other random fluctuations. Consequently, using the negative binomial distribution for in-
ventory management decisions will lead to more reliable levels of customer service and
lower costs.
This article presents both theoretical and empirical support for the contention that the
negative binomial is an appropriate demand distribution for retail inventory management
applications and develops a parameter estimation methodology that compensates for the
effects of unobservable lost sales. The method's effectiveness is demonstrated by artificially
truncating sales data from a major retailing chain and comparing the resulting parameter
estimates to those obtained from t h e full data set. Sufficient conditions are provided for
convergence of this method.
The remainder of the article is organized as follows. The reasons for selecting the negative
binomial distribution are described in Section
2.
Section 3 contains a brief review of the
relevant literature. Analysis of the data used for this study is described
i n
Section 4. The
estimation methodology is presented in Section 5 . and theempirical validation of the mcth-
odology is i n Section 6. The details of the statistical analysis are in Appendix I and Iength-
ier proofs are deferred to Appendix
2 .
Conclusions are summarized in Section
7.
2. PROBABILITY DISTRIBUI'IONS FOR D E M A N D
The following notation will be used for our analysis:
I
p
cr
random demand per week at the SKU-store level
true mean of the demand distribution
true standard deviation
of
the demand distribution
Thc Poisson distribution ( 1 ) is often used to describe retail demand. because i t arises
from the assumption of independent random arrivals at a uniform rate:
with mean
=
variance = A.
When demand per period is large, the normal distribution ( 2
is
often used because it
approximates the Poisson well for large mean demand and because
it
allows independent
selection
of
t h e mean and variance. For the discrete case. the probability distribution of
demand can be approximated as follows:
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 3/23
Agrawal and Smith: Estimating Negative Binomial Demand
84 1
P{Demand = k }
x
@ ( k +
I y a)
@ ( k
- ;Iy,
a),
k =
0, 1 , 2 , . . .
( 2 )
where @ X I y,
a
= normal cumulative distribution with mean y and variance a*.
The normal distribution may fit low-demand items poorly, however, because it assigns
probability to negative values and because it must be symmetric about its mean. For the
retailing data analyzed in this article, the Poisson distribution did not fit low-demand data
well either, because its fixed variance to mean ratio of one is too small. There are some
practical reasons why actual demand may be more variable than the Poisson. Random
variations may occur in the underlying Poisson arrival rate due to the weather, competitors’
promotions, or special events that are not captured by the inventory system’s forecask2
Second, customers whose purchases are Poisson arrivals may introduce additional vari-
ability by purchasing multiple items of the same type.
The negative binomial distribution is capable of capturing either of these increased vari-
ability effects. The negative binomial distribution with parameters
N
and p has the follow-
ing discrete probability function:
fYD
=
k l N , P )
= h N , p )=
(”,’”; ) P ” ( l k
O < p < l ,
N > O ,
where the cumulative probability distribution is
N S j - 1
F , , ( N , P )=
c
N - ) P ” . ( l
- P Y .
J = O
The mean and variance are
. = I - l ) ,
k = 0 , I , . . . , (3)
( 4 )
The probability distribution function described in
( 3 )
can be interpreted as the proba-
bility of having exactly k failures before the Nth success with independent events where the
probability of success is p . Notice that the ratio of the variance to mean is 1 / p , which is
greater than
I
and can be arbitrarily large. This makes the negative binomial distribution
attractive for retailing applications, which tend to have high variability.
There are other ways to describe the genesis of the negative binomial distribution that
make it intuitively appealing to model the demand process in retailing applications. For
example,
(3)
is equivalent to the probability of
k
arrivals when customer arrivals occur
Several corporate membersof the SantaClara Universi ty Retail Workbench
w h o hold such
promo-
tions reported to us that forecasting slow sellers was a particular problem
for
their inventory man-
agement systems because of larger
than
expected variability in demand.
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 4/23
according
to a
Poisson process with a random arrival rate
A,
which is sampled from a
gamma distribution of the form [8, p. 1241
where,
E [ ] = N / x ,
Var[
A ]
= N i x 2 ,
and
p = x / ( l + x )
Alternatively, if customer purchases occur according to a Poisson distribution, and the
quantity of each purchase is described by a logarithmic series distribution, the resulting
demand distribution is also negative binomial [ 8 , p. 1251. Multiple items per purchase
could be expected to occur in a wide variety of retail merchandise. including apparel,
housewares, and grocery items. Retail data captured currently generally do not allow the
items per customer to
be
determined. However, the logarithmic distribution appears to be
at least a reasonable candidate distribution for purchase quantity. Boswell and Patil
[21
give
I 5
different derivations
of
the negative binomial distribution.
T hus ,
it can be seen
that this distribution has a
good
intuitive basis as a model for the demand process
in
re-
tailing. Finally, the negative binomial is superior from a practical standpoint because it
accurately describes both low-demand and high-demand items, eliminating the need for
two distributions.
3.
LITERATURE REVIEW
In this section, we briefly review some of the literature
on
distributions used to model
the demand process for inventory systems. We also highlight the approaches that have
been
used to estimate the parameters of these distributions.
Most inventory research
has
focused on systems where unmet demand is backordered,
in
part because the resulting models are simpler
to
analyze, and because many inventory
models have evolved from durable goods and military applications, where demand is back-
ordered.
A
survey of research on single- and multiechelon inventory systems that consider
lost sales can be found in Nahmias and Smith
[
12. 131. Estimation methods that include
lost sales are reviewed in Nahmias
[
1 I]. Most commonly used methods for unobserved
lost sales assume that the demand process follows either a Poisson or a normal distribution.
For example, Conrad [4 ] considers a single-period newsboy model
in
which the excess
demand is lost. Under the assumption of i.i.d. Poisson demand, a maximum-likelihood
estimator
for
the mean
of
the demand distribution
is
derived by observing actual
sales
in
a
certain number of time periods. Hill [7] develops procedures to derive the moments of
customer demand based on the data obtainable from point-of-sale scanning systems. The
customer arrival rate is assumed to be Poisson and the model was not tested empirically.
Wecker
[ 18)
considers the effect of stockouts on forecasting bias. Demand
in
any period
is assumed to be generated by an autoregressive process, where the error term has a normal
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 5/23
Agrawal and Smith: Est imat ing Negative Binomial Dem and
843
distribution with zero mean and a given standard deviation. An unbiased estimator for the
forecast is determined under the assumption that ( i ) exactly one stockout has occurred
i n
the most recent sales period, and (ii) exactly one stockout has occurred in some prior
period. When more than one stockout has occurred, this approach leads to cumbersome
algebra.
Sarhan and Greenberg
[
15, 161 develop estimators for the normal distribution for the
case in which the samples may be doubly censored, that is, when the r , smallest and r,
largest observations are unobserved. Braden and Freimer
[
31 characterize the class of dis-
tributions for which sufficient statistics exist when observations are truncated from the top.
These distributions are termed newsboy distributions because of their ease of application
in inventory systems, but the negative binomial does not have this property. Nahmias [ 1
11
develops an estimator for truncated demand data
in
the normal-demand case, which is a
simple algebraic function of the sample data. He also shows how the estimator can be
incorporated into sequential updating routines. A nonparametric estimation method for
truncated data was developed by Kaplan and Meier [
93,
and maximum-likelihood esti-
mates are obtained. However, when sales data are always truncated at the same threshold
(the base stock level), this method provides no information regarding the shape of the tail
of the distribution, which is needed for analyzing inventory stocking policies. Anraku and
Yanagimoto [ 11and Van De Ven [171 describe estimation methodologies for the parame-
ters
of
the negative binomial distribution, but do not consider truncated data.
In
the statistical literature, an iterative estimation algorithm, known as the EM method,
has been developed for obtaining maximum-likelihood estimates from data that has been
censored in any known manner [ 5 ] It has been noted, however, that this method can
become computationally cumbersome unless the maximum-likelihood estimates for the
uncensored case are easily computed
[101.
In the case of the negative binomial, maximum-
likelihood estimates in the uncensored case require an iterative solution of an infinite series
relationship [8, p. 1321. Johnson and Kotz
[
81 present alternative simple formulas for the
uncensored case, which we use as the basis for our method.
The approach in this article differs from the existing literature in several respects. It first
compares the effectiveness of three common distributions: normal, Poisson, and negative
binomial,
in
fitting actual sales data and finds that the negative binomial
is
clearly superior
for this application. A parameter estimation method is developed for the negative binomial
with unobserved lost sales, and is empirically validated with the use of the actual data. In
most cases, this method requires a single one-dimensional search. Because our method has
minimal data requirements and is relatively simple to solve, it is attractive for use in in-
ventory replenishment applications.
4. A N A LY Z IN G T H E S A M P L E D A T A
A
comparison of the negative binomial, the Poisson, and the normal distributions was
made using sales and inventory data from a major retailer. The data contained 5 2 weeks of
unit sales at each store for a particular type of men’s slacks at a major retailing chain.
Based on the various combinations of sizes and styles, there were 4 1 different SKU s for this
product (with average weekly sales per store ranging from 0.03 to 1.27) and 24 different
stores (with average weekly sales per SKU ranging from 0.03 to 2.04). These items were
stocked at very high levels relative to sales in that an ending stock level of zero occurred in
less than
0.0
1 of the samples. This made it possible to assume that reported sales are equal
to actual demand. This product was considered a basic item; that is, it is sold throughout
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 6/23
844
Naval R m w r h Logistics,
Vol. 43 ( 1996)
8
7
0
the year. Replenishment was done once per week. This retailer uses media advertising to
boost its overall sales level, but does not use periodic price promotions-thus the product
price was constant throughout the year.
In order to compare the goodness of fit for alternative frequency distributions, it is nec-
essary to obtain sample sizes large enough to estimate the probabilities in the tail of the
demand distribution. For most retail sales data this is not a straightforward task. Replen-
ishment cycle times of one week or less are common for many department-store chains
and apparel merchants. Average sales per week at the store-SKU level tend to be low, that
is, usually in the range of 1-5, and less than
0.1
for slow-moving SKUs (see Figure 1
).
Demand per period needs to be forecasted at the store and SKU level, because that is the
level at which base stock levels must be determined. Sales rates
for
each class of item tend
to fluctuate by season in predictable ways. However, this means that the sales data must
either be deseasonalized
or
partitioned into peak and off-peak seasonal times.
Although data for any one store-SKU combination are quite limited, there are abundant
data for similar SKUs and stores. That is, department-store chains have hundreds of stores
carrying the same merchandise, thus offering potential groupings of stores that have similar
sales volumes and seasonal patterns. Also, retailers have tens
or
hundreds
of
thousands of
different SKUs, providing many sets of SKUs that have similar demand patterns. Because
the data analyzed for this article had these characteristics, it was decided
to
group sales data
in order to obtain sufficient sample sizes for goodness-of-fit comparisons.
We partitioned the
5
1,168 combinations of stores, SKUs, and weeks of the season into
eight categories and then treated all samples in each category as multiple observations of
the same demand distribution. For inventory management, all SKU-store combinations
in the same category would use the same base stock level. The grouping methodology was
subjective, based on a graphical analysis of the data along the following two dimensions:
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 7/23
Agrawal and Smith: Estimating Negative Binomial Demand
845
1.80
1.60
40
1.20
00
0
v
-
m
0.80
0.60
0.40
0.20
0.00
Pea..
Weeks rl
- m = ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ f i k F
u * u m
Week
of
Season
Figure 2. Weekly sales for average SKU-Store.
1 .
Various SKU-store combinat ions were ranked into the top 30, the next
90,
the
next
270,
and the lowest
594
by weekly sales averages (Figure
1 ) .
2.
The weeks of the year were separated into two categories: Peak ( top
1
I
weeks)
or off-peak (next 4 1 weeks) based on total sales volume per week (Figure 2 ) .
Because of the highly skewed distribution of demand per week, the four categories based
on weekly sales volume in Figure 1 were spaced logarithmically. That is, the number of
SKU-store combinations in each pair of adjacent groups has an approximate ratio ofthree,
which is the best integer choice for achieving four logarithmically spaced groups. This spac-
ing has the effect of dividing the SKU-store categories in such a way that each category has
approximately an equal fraction o fthe total weekly demand. This means that each category
would have approximately equal financial significance.
We found that the
1
I
peak weeks of the year, which were the back-to-school and Christ-
mas seasons, had a significantly higher mean than the remaining weeks of the year. These
two groups were selected as the peak and off-peak groupings. The peak weeks fell into two
contiguous periods, one in August and a second in November and December (see Figure
2 ) .
For convenience, the same SKU-store groupings were used for both the peak and off-
peak weeks.
The characteristics of the eight categories are summarized in Table
1.
Th e sample size is
determined by multiplying the number of SKU-store combinations in each category by the
number ofweeks. These categories passed a goodness-of-fit test using the negative binomial
distribution, but failed when the normal and Poisson distributions were used. A detailed
discussion of the goodness-of-fit testing is presented in Appendix I .
Figure 3 compares the estimated distributions obtained with the negative binomial, Pois-
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 8/23
846
Naval
Rcscarc~lr
Logislicx. Vol. 43 ( 1996)
Table I
Data groupings based on SKU-store and peak/off-peak combinations.
Peak Off Peak
No.
of
No. of
30 Mean
=
5.7 330 Mean
=
2.4 1.230
90 Mean = 2.5 990 Mean
=
1.19
3,690
2
70 Mean = 1.16
2,970 Mean
=
0.51 1 1,070
594 Mean = 0.28 6,534 Mean
=
0. I2 24,354
SKU-store Statistics observations Statistics observations
Var = 13.4
Var = 4.8
Var = 1.60
Var
=
0.35
Var = 4.6
Var = 1.61
Var = 0.60
Var
=
0.13
son, and normal demand assumptions to the actual data for the high demand, peak case.
The figure illustrates why the fit obtained with the negative binomial distribution is sub-
stantially better-the Poisson tends to understate the demand in the right tail of the distri-
bution, and the mode of the normal is shifted to the right.
The nature of these categories allows the implementation of peak and off-peak stock
levels for each given store and S K U combination because there are only two transitions
between peak and off-peak periods. This is consistent with practices observed at this re-
tailer. as well as at a vendor of men's slacks with whom we have discussed this research.
0.18 -
I .
-
0.16
0.14
0.1
2
0
,s 0.08
-
d
0 06
0.04
0 02
~
, egat ive B inomia l
-Data
Po i sson
- _
0
0
2 4
6 8
10 12
14
16
18 20
Sales Per
Week
Figure 3. Comparing distribution models to data (high, peak, mean
=
5.7, Var = 13.4).
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-6750281996092… 9/23
Agrawal
and
Smith: Estimating
Negative Binomial
Demand
847
Both use two separate inventory plans, one for the peak period beginning in August and
extending through Christmas, and the second for the remaining off-peak period.
For retailers that are promotional, for example, one week out of each month has lower
prices and increased advertising, the peak and off-peak weeks alternate. This leads to a
more complex inventory decision problem, because higher inventory stocked in prepara-
tion for a peak week may remain unsold during the off-peak weeks.
To
our knowledge,
no specific optimization methodologies (other than dynamic programming) have been
developed for solving the alternating peak and off-peak inventory problem.
5. THE PARAMETER ESTIMATION METHODOLOGY
The following additional notation will be used for our analysis:
mean of the distribution with observed demand truncated at
s
sample mean
sample mean with demand truncated at
s
observed frequency of demand k ,
k =
0,
1 , . . .
s
1
observed frequency that demand is less than or equal to L
estimator for
N
estimator for p
base stock level
When all demands are observed, a common method for deriving the estimators Na n d b
is to match the sample mean Yto
p
and the sample frequency of
0
demand tofo(N,
p ) .
(See Johnson and Kotz [
8,
pp. 131- 1351). This gives the following two equations to solve
for
k
and i
:
Y = N i - 1
( 7 )
When the base stock level is s, all demands greater than swill be reported ass . However,
the observed frequencies for demand values smaller than s will be unbiased estimates of
the true probabilities, and the observed frequency that demand is less than or equal to s is
an unbiased estimate of
1 F , N ,p ) .
The mean
p , in
the truncated case can be expressed
as follows:
which involves o n l y frequency values less than
s
This last formula allows us
to
calculate
the observed sample mean for the truncated distribution with the use of o n l y the observed
frequencies, { , fyh ’ ) k = 0,
1,
. . . s I , which are unbiased estimators of the true fre-
quencies:
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 10/23
848
Nuvul
Rescurch
Logi.sIic.s.
Vol.
43 (
1996)
\ -
I
x = k s f p +
s
h=O
We obtain one relationship for the est imators6 and Nby matching the truncated sample
mean X o the mean p, ( 8 ,6) from (
9
) :
The other relationship for the estimators is obtained by matching the observed frequency
F:~hsor some L < s to the formula for Fl, N , p ) ; hat is,
Solving the Esiimuior fiqirutions.
The two relationships
(
12) and
(
13) for N and I;
cannot be solved in closed form. However, the following properties of the relationships
lead to a solution procedure that is quite efficient compared to a general two-dimensional
search. We first note that for each N , 1 3 ) has a unique solution p N ) defined by
where
p
N s increasing in
N .
This fact follows from Lemma
I .
LEMMA
1:
For every value of
N ,
and
L
<
s , there is
a
unique p =
p N )
defined by
F,
N , N ) )=
Fyha.
PROOF:
Recall that F h ( N ,p ) can be interpreted as the probability that k or fewer
failures are required to obtain N successes. Therefore, it follows that FA
, p )
s strictly
increasing in p and strictly decreasing in N . (Noninteger values for N are allowed as well.)
lf we take the total derivative of
(
14), it follows that
thus establishing monotonicity. QED
Using the relationship in
( 14) ,
we let functions with a single argument denote the sub-
stitution of
p N
or
p .
That is,
and
FA
N )
=
FA
N ,
?(N ) ) .
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 11/23
Agrawal and Smith: Estimating Negative Binomial Demand
849
For any N , the pair
{
N ,
p ( N )
can be substituted into
(
1 2 ) to compute a value p s N )
defined by
The solution then follows from a one-dimensional search on N to find the value such
that p , ( N ) closely matches the observed value X, . or L = 0, the uniqueness of the solution
to p5 N )
= X,
is guaranteed by Theorem I which is proved in Appendix 2.
THEOREM 1:
b y F I ~
N )= FFhs .
For L = 0, p,\ N )
s
monotone decreasing in N , where p (
N )
s defined
For
L
=
0,
a necessary and sufficient condition for the existence of a solution to p.\ N )
=
X,
is given by Theorem 2.
THEOREM
2:
When L = 0, the algorithm will converge to a solution that satisfies ( 12)
and ( 1 3 ) if and only
if
the observed sample mean
h;)
satisfies the following condition:
e-yyh
-
I
h = O k
h- 7
x >
c ( k - s ) - + s = lim p l ( N ) ,
and
where y = -In
f g h r .
The above procedure is a generalization of an iterative procedure proposed by Johnson
and Kotz [8, p. 13 ] .
I t
makes use of only the truncated mean, and matches the cumulative
probability F'y.hs,rather than only
J:hr
When
L
=
0
is used, our procedure matches./"(N)
to the observed frequency f : .However, when
f z h s
is too small, this estimator tends to
be very noisy, because it is based on too few observations. Consequently, the resulting
estimators for Na nd p also become noisy. T o overcome this difficulty, we sometimes used
larger values of L to obtain more stable and accurate estimators during our empirical vali-
dation. In six of eight cases tested (see Table
2 ) ,
our data met the conditions necessary to
guarantee convergence of our method.
The following procedure was successful in determining an appropriate value of L > 0.
First, ifJ':h5 < 0. L could be increased until FYb5 0.1. Second, we performed agoodness-
of-fit test to compare the frequencies estimated using a given value of L to the frequencies
observed in the truncated sample. If the result ingp value was not satisfactory, the value of
L was increased further. This is illustrated in Section 6, where the empirical validation of
our methodology is discussed. If no value of
L
< s can be found that provides a successful
fit, it indicates that the negative binomial is not appropriate for the given set of data or that
s is too small. It should also be noted that if the variance in the sample is smaller than the
mean. then the use of the negative binomial distribution is not appropria te. However, for
the retail sales data we have observed to date. the variance has been greater than the mean.
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 12/23
850
Naval Rcstw rch Lo& ic.s, Vol. 43 ( 1996)
Table 2. Results ofgoodness-of-fittests for all SKU groupings.
Chi- Estimated Estimated
SKUgrouping
s L
(fitted) dof value meana variance Mean Var
s o 9
square p distribution distribution Sampleb Sampleb
High, Peak
High,
Off
Peak
MedHi. Peak
MedHi. Off Peak
MedLo. Peak
McdLo.
Off
Peak
Low,
Peak
Low.
Off Peak
10 2
7 3
2 0
3 0
I 0
1 0
I 0
1 0
20.29
26.93
17.49
11.36
5.41
2.38
2.87
4.38
20 0.44 5.77
17
0.06 2.42
16 0.35 2.62
9 0.25 1.19
10 0.86 1.17
6 0.88 0.5
5 0.72 0.29
4 0.36 0.12
15.12
5.60
5.27
I
.60
1.71
0.59
0.37
0. I4
5.70 13.40
10
2.40 4.60
5
2.50 4.80 5
1.19 1.61
3
1.16 1.60 3
0.5 0.60
I
0.28 0.35 1
0.12 0.13
I
a
These statistics are
for
the original, untruncated data set.
These estimates
used
only the truncated data.
6.
EMPIRICAL VALIDATION
OF
THE ESTIMATION METHODOLOGY
As described previously, the base-stock levels in the data are high enough that essentially
all demands were observed. The effect of
lost
sales for different choices of s was simulated
by modifying the data as follows:
modified demand = max { actual demand, s 3 .
The modified data were used to determine values of Y-,nd FY.hs,which serve as inputs to
the algorithm. The algorithm then determines the parameter estimates for
a
negative bino-
mial fit.
The fitted distributions were compared to the observed frequencies using the p value of
a chi-square statistic. The following expression was used to compute the chi-square statistic
where
.fi = predicted frequency of demand k from the negative binomial model
z = total number of periods for which data were observed
Y
= number of entries k withji
0.000
I
dof
=
degrees of freedom for the
xz
est
=
Y
1 .
The correspondingp values are then computed to determine the likelihood that data could
have been generated by the fitted model.
Our analysis was performed for the eight data categories described earlier. and is shown
in Table 2. For each category, the smallest value of s (the second column in Table 2 ) was
determined that resulted in a chi-squarep value of at least 0.10.The high-demand, off-peak
case was an exception. The best p value we could obtain in this case was 0.06, which is
smaller than the other cases, but is still considered acceptable.
The sample mean and variance refer to the statistics for the entire sample (before
truncation). The estimated mean and variance, calculated from the truncated sample, are
referred to as the estimated distribution mean and variance. The last column shows the
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 13/23
Agrawal and Smith: Estimating Negative B inomial Demand
85
I
Table 3.
Results ofgoodness-of-fit tests for high-peak grouping.
Estimated distribu tion
vs
Estimated distribution vs
observed truncated observed complete
distribution distribution
stimated Estimated
distribution distribution
L s
meana variance” Chi-square dof
p
value Chi-square
dof p
value
0 10 5.34
9.44 25.03
10 0 62.00 20 0
1 10 5.77
15.12 6.99 10
0.73
20.29 20
0.44
2
10
5.77 15.12
6.99
10 0.73
20.29 20 0.44
~
a These estimates used only the truncated da ta.
stock level
(so.u)
needed
to
provide a 90 service level. T he values for so.y n the las t colum n
are greater tha n
or
equal to the required truncation levels in Co lum n 2 in all cases but on e.
Th us, it app ear s tha t the typical base stock levels used in practice would provide sufficient
da ta for param eter estimation in the case of unobserved lost sales.
In
six of the eight cases in Table
2,
L
=
0
yielded a
p
value
greater than
0.1.
In the high-deman d, peak case, a marginally acce ptable p value of 0.025
was obtained for L
= 0.
T he algorithm was then tried for
L =
1 and
L = 2,
as shown in
Table 3. Th e frequencies from th e estimated distribution were comp ared t o the observed
frequencies from the trun cated sample, as well as from th e complete (u ntr un ca ted ) sample.
Th e results of the goodness-of-fit tests are sho wn in T ab le 3. For
s = 10, L = 1
and
2
yielded
identical estimated parameters and p values. Th e p values indicate th at t he fits were good.
For the high-off-peak case, a similar procedure was used to select the best L, which resulted
in
L = 3,
which had the best
p
value of
0.06.
Som e of the com mo n inventory mana gemen t metr ics were also
calculated
for
the fi ts obtain ed from th e trunca ted data. In particular, the me an, variance,
an d the e xpected lost sales
(E[
Lost Sales]) for the fitted distribution were co m pu ted for a
range of trunc atio n levels ob tain ed by assu m ing different values of stock levels. T he
fol-
lowing expression was used fo r
E [
Lost Sales]
:
Obtaining
valiies,fi,r
L.
Culculation
o f E r r o r s .
T he calculated values were compa red to the values tha t would be obtain ed withou t trun-
cation, tha t is,
s
= co, n Figures 4 and 5. Th e horizontal axes in these figures correspond
to the trunca tion level,
or
base stock level, expressed a s
a
multiple of th e stock level needed
to provide 90 service (see last colum n in Ta ble 2 ) . Because errors will only m atter if
dem an d exceeds the stock level, this seem s appropriate.
A
value of 1 on the horizontal axis
corresponds
to
the base stock
level
tha t yields 90 service, an d higher values correspond
to
higher base stock levels. T h e vertical axes me asure t he e rrors, again expressed as multiples
of the stock level needed fo r 90 service.
Figure 4 shows the error in the m ean e st imate due to t runcat ion. T he range of the per-
centage error is from to
2.4%.
W hen th e stock level equals that needed for
90%
cus-
tom er service, the ab solute errors are less tha n
2 .
Fo r stock levels tha t would yield cus-
tom er service levels higher than 90 , the ab solute errors are less tha n 1 . Figure 5 shows
that th e range of errors in E [Lost Sales]
is
from
-0.6% to 2.3%.
Fo r stock levels tha t yield
90 cus tom er service, the absolute errors are less tha n
1 ,
an d for stock levels that would
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 14/23
852
8
A
0
8
m
0
0
0
A
A
~ _ _ _ _ _ _
m
High-Peak
~ o High-Off Peak
MedHigh-Peak
o MedHigh-Off Peak
MedLow-Peak
MedLow-Off Peak
Low-Peak
o Low -Off Peak
.
A
0
0
0
0
0
A
0
0
0
A
- -
0.5 1 5 2 2.5 3
Truncation Level
I
(Stock Level
for
90
Service)
Figure4. Error in computed mean versus t runca t ion level.
yield custo me r service levels higher tha n 90%, he absolute errors are less than 0.5%.Sim-
ilarly, for the st and ard deviation, the errors are relatively sm all, and range fro m
.3
t o
2.9 . As expected,
i n
each case,
as
the stock level increases, smaller errors result, because
mo re de m and information is available.
7 CONCLUSION
In
retailing, increased competitive pressure has forced m an y com pan ies to improve the
efficiency of their inventory m anag em ent systems. For basic merchandise, the large num -
ber
of
low-cost suppliers has made supply chain management
a
focal point for
gaining
competitive advantage through providing the highest level
of
customer service with the
least
inventory investment. Accurate methods for estimation of customer demand and
cus tom er service level achieved are therefore crucial to m ajor retailers. Th is article identi-
fies an d addresses two significant practical sho rtcom ings of most estimation me thods used
for retail inventory m anag em ent. First, the negative binomial de ma nd d istribution
was
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 15/23
Agrawal
and Smith: Estimating Negative Binomial Demand
853
2 -
8
0
A
A
8
High-Peak
High-Off Peak
+ MedHigh-Peak
o
MedHigh-Off Peak
A MedLow-Peak
A
MedLow-Off Peak
Low-Peak
o
Low-Off Peak
B
A
A
d
-0.5
-
~~~ ~~~ ~
~ ~ _ _ ~
.. .
____
-1
0 0.5 1
1.5 2 2.5 3
Truncation Level /(Stock Level for
90
Service)
Figure
5.
Error
in
computed
E [
ostsale]
versus truncation level.
found to be significantly better than either the Poisson
or
the normal, which are the pre-
dominant distribution choices for inventory models. Second, this article develops parame-
ter estimation methods for the case of unobservable lost sales, which
are
prevalent in re-
tailing, but are omitted from most demand models.
The choice of negative binomial distribution is supported in two ways. First, behavioral
assumptions for the underlying demand process suggest the use
of
the negative binomial.
Second, using the
p
value
of
the chi-square statistic to compare the fits of the three distri-
butions to the actual frequency distribution of the data found that with the negative bino-
mial all groups of data but one had p values substantially greater than 0.1, whereas for the
Poisson and the normal, all groups had
p
values less than 0.005 except one. These obser-
vations indicate that the negative binomial is far superior to the other distribution choices
for these data. Consequently, the normal and Poisson models can lead to stocking levels
that are significantly in error for most commonly used customer service levels in retailing.
However, when the required service levels are low ( e g , 70 or 8 0 ) , he stock levels tend
to be the same for all three distributions.
Estimation methods were developed for the parameters of the negative binomial distri-
bution, using demand data truncated at the base stock level. It was shown, using base stock
levels sufficient to provide at least a
90%
customer service level, that these methods yielded
satisfactory parameter estimates, based on the chi-square
p
values obtained from compar-
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 16/23
854 Naval Rrseurch Logistics, Vol. 43 ( 1996)
ing the model to the actual frequency distribution of the data. Furthermore, these param-
eter estimates led to small errors in estimates
for
the mean and variance of demand and
expected lost sales.
We have worked with the members of Santa Clara University’s Retail Workbench in
obtaining, analyzing, and interpreting the store-level data to test
our
methodology and
verify our conclusions. We believe that these research results can lead to more effective
applications of inventory management methods for retailers based on improved demand
models. Second, we believe that the demand models and estimation methods developed
here can provide other researchers with a more appropriate representation of customer
demand for the development of new inventory management methods for retailing ap-
plications.
One area seems particularly important for future research. Cyclical demand fluctuations
due to periodic retailer promotions commonly occur. Because excess inventory from peak
periods carries forward, a multiperiod optimal inventory policy is required for these appli-
cations. Optimization methods that determine multiple base stock levels or perhaps
multiple replenishment frequencies for cyclical random demand patterns with an underly-
ing seasonal variation would be particularly valuable.
APPENDIX 1 . S A M P L E DATA
A N A L Y S I S
A variety of ranking methods were explored to identify groupings for the data. S K Us
were ranked based
o n
average weekly sales, and stores were ranked by sales volume. Store
and SKU combinations were ranked together as in Figure 1 . Ultimately, the SKU-store
rankings in Figure 1 were separated into the top 30, the next 90, the next
270,
and the lowest
594, by selecting breakpoints on the graph of ranked sales averages. These breakpoints are
spaced approximately logarithmically. Larger numbers of logarithmically spaced points
were also tested, but the goodness-of-fit results were not as good. Next, the average weekly
sales rankings
in
Figure
2
were used to separate the weeks into two classifications of weeks,
peak and off peak, corresponding to the top 1 1 and the bottom
41
weeks, respectively.
All eight of the resulting clusters yielded chi-square p values of at least 0.10 when com-
pared to the frequency values that would be generated by a negative binomial distribution
with the same mean and variance. Several aggregations of sales by S K U and store were
tested without separating weeks of the season into separate categories. All of these gave p
values
of
less than
0.
.
Because
of
the simplicity of using just eight groups of data, subse-
quent analyses were performed using the four groups of SKU-store classifications together
with the peak and off-peak weekly classifications.
Table 4 compares the
p
values for the three distributions using the eight groupings of
data in Table
1 .
When compared to the actual frequency distribution of the data, the neg-
ative binomial resulted in chi-squarep values greater than 0.1 for all eight groups. Both the
Poisson and normal resulted in p values significantly less than those obtained with the
negative binomial. Indeed, all
p
values but two for these two alternative distributions were
less than 0.005.
Note that the degrees of freedom in this table differ from those in Table 2 because of
the difference in estimation methodologies for the frequencies
{
,h
1 . In each case, data
corresponding to,fi
<
0.000 were dropped for calculating the chi-square value in ( 18). In
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 17/23
Agrawal and Sm ith: Estimating Negative Binomial Dem and
855
Table
4. Chi-square errors and p values for Poisson. normal, and negative binomial distribution
fits.
x2 p alue) x2
p
alue) x 2 p alue)
SK U Cluster Poisson Normal N.B in dof Mean Var
High , peak
High.
off
peak
Med Hi, peak
MedHi, off peak
MedL o, peak
MedLo.
off
peak
Low . peak
Low,
off
Deak
35 (0.02)
569
(0.00)
266 (0.00)
I35
(0.00)
204 (0.00)
143
(0.00)
192(0.00)
100(0.00)
24 (0.24)
293
(0.00)
I59
(0.00)
753
(0.00)
1004(0.00)
262 I
(0.00)
I237
(0.00)
960
(0.00)
22.2 (0.33)
27.2
(0.13)
16.5 (0.42)
1 I .9 (0.22)
4.82 (0.90)
4.78 (0.57)
1.06 (0.96)
1.98 (0.85)
20 5.7
20 2.4
16 2.5
9 1.19
10
1.16
6
0.5 1
5
0.28
5
0.12
13.4
4.60
4.80
1.61
1.60
0.60
0.35
0.13
Table 4, he
{h .}
ere obtained by matching the mean and variance to the actual data,
whereas in Table 2, they were obtained by using our estimation methodology.
Because the intended application for these fitted distributions is inventory management,
it is natural to ask what impact the choice of distribution has on the selection of base stock
levels. The recommended inventory levels associated with each of the three distributions
are shown for
a
variety ofsample cases in Table 5. These are compared to the best inventory
policy based on the frequency distribution of the actual sales data. Service levels of 99%,
95%, and 90% were considered, because these are typical target levels used by retailers.
That is, the stock level
s
must be chosen so that P { Demand } is greater than a target
service level, for the appropriate demand distribution.
It is clear that the negative binomial is much closer to the stock level that would have
been selected using the actual demand distribution. The normal and Poisson stock levels
are consistently low, when in error. The mean absolute percentage errors (MAPEs) for all
stock levels considered in Table 5 are 18.2% for the Poisson, 19.2% for the normal, and
0.3%
for the negative binomial. The results show that the normal and Poisson models can
lead to stock levels that are significantly in error in some cases, and are inferior to the
negative binomial on average.
Table
5.
Optimal base stock levels
for
various dem and distributions. Order of
the figures in table: actual distribution. negative binomial. normal. Poisson.
Target Service Level
S K U
Cluster
99% 95% 90
High, peak
High,
off
peak
MedHi, peak
MedHi, off peak
MedLo, peak
MedLo, off peak
Low, peak
Low,
off
peak
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 18/23
APPENDIX 2. PROOFS O F THEOREMS
1
AND 2
Th e proof of Theorem
1
requires the following key result, which is stated and proved in
Lem ma 2.
LEMM A 2: For any
j >
L ,
we have
(
N
+
1
)
>
F,
) ,
or
all
N .
PROOF: Define
w h e r e a ( N ) = [ l
- p N +
l ) ] / [ l - p ( N ) ] < l , s i n c e p ( N + l ) > p ( N ) ( f ro m L e m m a 1 ) .
Using (2 0 ) and the fact that
==,J i N ) =
1, for each N , w e have
Similarly, since 1
F , ( N )
= 1 F 1 - ( N + 1 ), it follows that
It can be further shown that ( 2 0 ) - ( 23 lead to the following two conditions:
i )
rh(N ) s unimod al in k , and ,
i i ) there exists k2(N ) > L such that
r h ( N )
1,
for 0 < k
I
, ( N )
and
r A ( N )
<
1 ,
f o r k
>
k , ( N ) .
( 2 4 )
In order t o establish ( i ) , we show that the derivative of r h ( N ) in ( 2 0 ) changes s ign at
mo st once from positive to negative.
If
we hold N fixed an d tre at
k
as a c ontin uou s variable
for simplicity, it can be seen that
d r k ( N ) / d k= r k ( N ) [ l n ( a ( N ) )+ l / ( N + k ) ] .
( 2 5 )
Because In(a
N ) )
< 0 a n d
1
/ (N
+
k ) > 0 an d is decreasing in k , there is at m ost on e value
o f k , denoted by k , ( N ) , uch that
d r k ( N ) / d k 2 0 , f o r k s k , (N )
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 19/23
Agrawal
an d Smith: Estimating Negative Binomial Dem and
857
and
wh ere, if it exists, is unique ly defined by
k , ( N )
=
- N - l / l n ( a ( N ) ) . ( 2 6 )
This establishes condition ( i ) .
In order to establish ( ii ) , consider first the case when
L =
0, so that
Yo
N )
= 1
for all N .
From
the preceding discussion, there m ust exist a finite positive v alue k , N ) ; therw ise all
values
of rh N )
1. Because r,: N )
s
unimodal , YO
N ) =
1 , a n d
(
2
1 )
holds, there mu st be
values of
k
> k , N ) uch tha t rA N )
<
1. Thus , because r,: N ) s unimodal, k 2 (N ) xists for
L = 0.
For
L
>
0, we first show tha t rl.(
N ) > 1.
Suppose
Y, ( N ) 1 . If
k l
N )
, hen
r k ( N )
<
I
f o r a l l k >
L,contradicting(23).lfkl(N)>
L , t h e n r k ( N ) < f o r a l l k s L ,c on tra dic tin g
(
22 ). Therefore
rI N ) 1
mu st hold.
Given that
r12(
)
>
1,
(
23
)
implies tha t
r k
N )<
1
for som e
k
> L . Because r h ( N )
s
also
un im oda l, this establishes the existence of k 2 (N ) or L > 0 as well. Figure 6 illustrates the
tw o cases graphically.
Now, the lem m a is proved by considering the following two cases: I f L < j <
k 2 ( N ) ,
we
have
Th e first sum in
( 2 7 )
is zero in view
of
(2 2 ) , whereas
all
terms in the second sum must be
positive bec ause rh ( N )>
I
for L
+ I
k <
k 2 (
N ) . T h u s F, N
+ I ) >
F,( N ) n this case. If
.j 2 k2(N ) , using ( 2 ) to rewrite the sum s, we have
rr w
F , ( N +
1 )
F , ( N )
= c [ . / i N . L N + 1 ) 1 = z: . / i (N)[ l - k ( N I 1 ( 2 8 )
h = j + l h = J i I
Because
r h ( N )<
1 when k k 2 ( N ) , t again follows tha t
F, N
+
1 ) > Fl
N ) .
QED
T H E O R E M
1:
For
L =
0, p5
N )
s m ono ton e decreasing in N , w h e r e p ( N )
is
defined by
F L ( N )
=
F i b s .
PROOF:
Using summation by parts, it can be verified that
p s ( N )
an be written as
follows:
When = 0, I
F o ( N )=
1 - f , ( N ) =
1
’gh’, which
is
a constant .
For k
> 0, all other
terms in the sum in ( 2 9 ) are decreasing in N by L emm a 2. Thus, for L
=
0, the result is
proved.
QED
For L
>
0,
the following sufficient con ditio n can be stated for the theorem:
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 20/23
858
1 2
1 1
1
rh(N)
0 9
0 8
0 7
0 6
0 5
0 4
L=2
=O
Figure
6 Ratio of probability densities for ( N
+
1 )
and
N
If some value K
2
L exists such that
C o
[ 1 FA N ) ] s decreasing in N , then
p,JN ) s decreasing in N for all s
2
K .
The proof of this condition again follows from Lemma
2,
because
all
F A
N )
are increas-
ing inNfork> L .
T H E O R E M 2:
When L = 0, the algorithm will converge
to
a solution that satisfies ( 12)
and ( 13) if and only
if
the observed sample mean
X,)
atisfies the following condition:
e-7yl’
-
I
h = O k h a
x >
c ( k - s ) -
+ s = lirn p , ( N )
and
where = -In f b s
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 21/23
Agrawal and Smith: Estimating Negative Binomial Demand
859
PROOF: For ease of exposition, we will drop the superscript from f Z h s n the remain-
When
L =
0, from Eq. ( 13
),
we know that for a given value of N , p N ) can be calculated
der of this proof. We will first establish the first part of the condition specified in ( 3 0 ) .
as follows:
Using this relation and the definition ofa negative binomial distribution function, we have
Multiplying and dividing this expression by
Nl ' ,
we get
The term N ( 1
N ) )
can now be written as follows, with the use of the relationship forp
given above:
Using L'H6pital's rule, the limit of N ( 1
N ) )
an be found as
N
tends to infinity:
lim
N (
1
N ) )
=
-Info.
K-. a
Thus, the limit of./; ( N ) s N tends to infinity is calculated as follows:
From Eq. ( 1 2 ) , we now have
3-
1
lim p 5 N ) C
( k
) lim j i ( N ) + s
N+
a
h = O
K+
Substituting the result from ( 3 2 ) in
( 3 3 ) ,
we get
( 3 3 )
Because p s N ) converges monotonically down to this limit by Theorem 1 it follows that if
X, is strictly greater than the lower limit, finite convergence must occur. This proves the
first part of the condition specified in ( 3 0 ) .
In order to prove the second part of the condition specified in 3 0 ) ,we begin by noting
that the expression for p 5 N ) iven in ( 1 2 ) can be rewritten as
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 22/23
860
Nuvul Rescwrcli
Logi.slic:v.
Vol.
43 ( 1996)
From (3 ), we can obtain the following expressions:
and
In
3 ),
taking the limit ofp( N) as
N
tends to 0, we get
lim p N ) =
0.
N-0
Also, in( 37 ), takingthelimit of.f; (N)asNtends oo , weget
l im
1;
N ) = 0.
fi-0
Now, from ( 3 6 ) and (39), the limit ofh.(N ) can be obtained as follows:
limfi.(N)
= 0.
h-0
With the use of the result in
(40) ,
he limit of
p , ( N ) [
from 3 5 ) ] s
N
tends to
0
is
lim p , ( N )
=
s( 1 -1; .
N-0
(37)
( 3 8 )
( 3 9 )
Because monotonicity of p,,
N )
with respect to
N
has already been established in Theo-
rem
I Eq. (4 )
provides an upper bound for the value of
p , N )
calculated from the fitted
distribution. Therefore,
ifx,
is strictly less than this upper bound, finite convergence must
occur, thus establishing the second part of the condition in
(30).
QED
ACKNOWLEDGMENTS
The authors thank the corporate sponsors
of
the Santa Clara University Retail
Work-
bench for providing the data for this analysis and for their valuable insights and financial
support for this research. The opinions expressed in this article are those of the authors
and d o not necessarily reflect those of the sponsors. We also thank Dale Achabal, Shelby
McIntyre, and Steven Nahmias
for
their valuable comments during the course of this re-
search. Comments by two anonymous referees and the Associate Editor have led to sig-
nificant improvements
in
this article.
A n y
remaining errors are the authors’ responsibility.
REFERENCES
[
1
] Anraku ,
K.,
and Yanag imoto . T., “Est imat ion for the Negative Binomial Distribution Based
o n
t he
Conditional Likelihood,”
Comnzinicuriom
in Sluli.s/ic.s-Sirnrr/ulrion
und
C‘cmprrlufion.
19(
3 ) . 77
1-786
(
1990).
8/17/2019 Naval Research Logistics (NRL) Volume 43 Issue 6 1996 [Doi 10.1002%2F%28sici%291520-6750%28199609%294…
http:///reader/full/naval-research-logistics-nrl-volume-43-issue-6-1996-doi-1010022f28sici291520-675028199609… 23/23
Agrawal a n d Smith: Est imating Negative Binomial D em and
86 1
[21
Boswell , M.T. , an d Pat i l , G.P. , “Chance Me chanis ms Gen erat ing the Negative Binomial Dis-
tributions,” in G.P. Patil (Ed.),
Random Counts in Scientif ic
Work. The Pennsylvania State
University Press, University Park , PA, 197
I ,
pp. 3-22.
[31 Braden,
D.,
an d Freimer, M., “Informational Dyn amic s of Censored Observat ions,” Munage-
ment Science,37, 1390- 1404 ( I99 I ) .
[41 Con rad, S.A. , “Sales Dat a and Est imation of Deman d,” 0pw ation.s Research Quurter/-v, 27,
[ 51
Dempster , A.P., Laird, N.M., an d Rub in, D.B. , “M axim um Likel ihood Est imation from In-
complete Data via the EM Method,”
Journal of thLJ Royal Statistical
Society, B-39,
1-38
( 1 9 7 7 ) .
[ 6 ]
Fisher , M.L., H am m on d, J .. Obermeyer. W., and R am an , A., “Mak ing Supply Meet Dem and
in an Uncertain W orld,” Hu rvurd Bi is inrss Revicw. 72,83-93 ( 1994) .
[
71
Hill . R.M., “Parameter Est imation and Performance Measurement in Lost Sales Inventory
Systems,” International
Joiirnal OfProdiiction Economics, 2 8 , 2
1
1-2
15
( 1992) .
[81
John son, N.L. , an d Kotz,
S . Discrete Distribzitions ,
Ho ugh ton Mifflin. Boston.
1969.
[
91
Kaplan. E.L. , and Meier ,
P.,
“No nparam etr ic Est imation from Inco mple te Observat ions,”
[
101
Meilijson,
I .
“ A
Fast Improvem ent to the EM Algor ithm on I ts Own T erms,” Joztr-nu1ofthe
[ 1 1 ] Nahmias ,
S. ,
“D em and Est imation in Lost Sales Inventory Systems,”
Nuvul Rcscwch
Logis-
[
121
Nahmias , S. , an d Sm ith, S.A., “Mathem atical Models of Retailer Inventory Systems: A Re-
view,” in R.L. Sarin
(
Ed.),
Pwspcctives in Opr ra / io t i. v Mun agm wnt .
Kluwer Academic, Nor-
well, MA,
1993.
[
131
Nahmias, S. ,an d Sm ith, S.A., “Optimizing Inventory Levels in
a
Tw o Echelon R etailer System
with P artial Lost Sales,” ManugKonmt Sciencc, 40( 5
),
582-596
(
1994) .
[
141
“Quick Response: What it is: What it’s Not.”
Chuin SIorP‘4gc
E.uwit ivc>, p 4B-20B
(
March
1991) .
[
151
Sarha n, A.E., a nd Greenberg, B.G., “Est imation of Locat ion and Scale Param eters by Order
Statistics from Singly and Doubly Censored Samples. Part 1. The No rmal Dis t r ibu t ion u p to
Samp les of Size 10,”
Annals of’Muthernuf icd luti.stic~.s,7,427-45 ( 1956) .
[
16)
Sarha n, A.E. , and G reenberg. B.G.. “Est imation of Locat ion an d Scale Parameters by O rder
Statistics from Singly an d D oubly Cen sored Samples, P art
2.
Tables for the N orm al D istr ibu-
tion for Sam ples of Size
I I i 15.” Annuls
o~’Mall~onu/iccr/Slati.slic:s,
8, 79-105 ( 1957) .
[ 171 Van De V en, R. , “Est imating the Sha pe Parameter for the Negative Binomial Distr ibut ion,”
Joiit-nu1o/’Sturi.sficalCornprrtution and Sitnidution.46,
1
I 1- 23 ( 1993) .
[ IS] Wecker, W.E., “Predicting D ema nd from Sales Data in the Presence of Stockouts ,” Managr -
r n ~ i i / ‘cic~ncr. 4, 1043-1054 ( 1978) .
123-127
(
1976) .
Atncrican Staiistical Associulion Jo iimcil, 53,457-48 1 ( I958
).
R ~ ~ ~ ~ l S I ~ i i s t i c u l S ~ c i ~ ~ ~ ~ ~ ,
-51,
127-138
(
1989) .
tics, 41, 739-757 ( 1994) .
M a n u s c r i p t re c ei v ed A u g u s t
1994
R e v i s ed m a n u s c r i p t r e c ei v ed F e b r u a r y
1996
A c c e p t e d March 6, I996