ciuupi: An R package for Computing Confidence Intervals that Utilize Uncertain Prior Information
Abstract:
We have created the R package ciuupi to compute confidence intervals that utilize uncertain prior information in linear regression. Unlike post-model-selection confidence intervals, the confidence interval that utilizes uncertain prior information (CIUUPI) implemented in this package has, to an excellent approximation, coverage probability throughout the parameter space that is very close to the desired minimum coverage probability. Furthermore, when the uncertain prior information is correct, the CIUUPI is, on average, shorter than the standard confidence interval constructed using the full linear regression model. In this paper we provide motivating examples of scenarios where the CIUUPI may be used. We then give a detailed description of this interval and the numerical constrained optimization method implemented in R to obtain it. Lastly, using a real data set as an illustrative example, we show how to use the functions in ciuupi.
Suppose that
is a random -vector of responses, is a known
matrix with linearly independent columns,
is an unknown parameter -vector and
,
where is unknown. Let be a specified nonzero
-vector and suppose that the parameter of interest is
. We refer to the
model
as the “full” model and the confidence interval for based on
this model as the “standard” confidence interval. It is often the case
that applied statisticians want to utilize uncertain prior information
in their analysis. This uncertain prior information may arise from
previous experience, scientific background or expert opinion. For
example, a common assumption in a factorial experiment is that higher
order interaction terms are equal to zero. We consider the uncertain
prior information that
, where
is a specified nonzero -vector that is linearly
independent of and is a specified number. Our
interest lies in computing (within a reasonable amount of time) a
confidence interval for , with minimum coverage probability
, that utilizes the uncertain prior information that
.
One could incorporate uncertain prior information in statistical
inference using a Bayesian approach. In other words, a
credible interval for could be constructed using an informative
prior distribution for . However, the
ciuupi package uses a
frequentist approach to utilize the uncertain prior information that
. Utilizing uncertain prior information in frequentist
inference has a distinguished history, which includes (Hodges and Lehmann 1952),
(Pratt 1961), (Stein 1962), (Cohen 1972), (Bickel 1984), Kempthorne (1983) ((1983),
(1987), (1988)), Casella and Hwang (1983) ((1983), (1987)),
(Goutis and Casella 1991), (Tseng and Brown 1997), and (Efron 2006).
The standard confidence interval has the desired coverage probability
throughout the parameter space. However, it does not utilize the
uncertain prior information that . One may attempt to utilize
this uncertain prior information by carrying out a preliminary
hypothesis test of the null hypothesis against the
alternative hypothesis . This attempt is based on the
following two hopes. Firstly, if the prior information is correct then
this test will lead to a confidence interval that is narrower than the
standard confidence interval. Secondly, if this prior information
happens to be incorrect then this test will effectively lead to the
standard confidence interval. Unfortunately, this attempt fails
miserably because, for certain values of ,
and , this post-model-selection
confidence interval has minimum coverage probability far below
(see e.g. Kabaila and Giri (2009b), (2009b)), making it unacceptable.
(Kabaila and Giri 2009a) proposed a family of confidence intervals, with minimum
coverage probability , that utilize the uncertain prior
information that as follows. This family of confidence
intervals have expected length that is less than the expected length of
the standard interval when the prior information is correct and maximum
(over the parameter space) expected length that is not too much larger
than the expected length of the standard confidence interval. In
addition, these confidence intervals have the same expected length as
the standard confidence interval when the data strongly contradict the
prior information. The admissibility result of (Kabaila et al. 2010) implies
that a confidence interval with the desired minimum coverage probability
and expected length that is less than that of the standard confidence
interval when the prior information is correct, must have an expected
length that exceeds that of the standard interval for some parameter
values.
Unfortunately, computing these confidence intervals is quite time
consuming. Furthermore, there is no existing R package to compute these
confidence intervals. Thus, if one wants to compute the confidence
interval proposed by (Kabaila and Giri 2009a) and originally computed using MATLAB
programs, they may have to write their own programs to do so. The time
and skill required to write such programs present large barriers to the
use of this confidence interval in applied statistics.
(KaMa2017?) ((KaMa2017?), Appendix A) described the family of confidence
intervals proposed by (Kabaila and Giri 2009a) when is known. Each
confidence interval in this family is specified by a different tradeoff
between its performance when the prior information is correct and its
performance when this prior information happens to be incorrect.
(KaMa2017?) ((KaMa2017?)) then specified an attractive tradeoff that
leads to a unique confidence interval. This interval and its coverage
probability and expected length properties can now be easily and quickly
computed using the R package ciuupi.
This confidence interval has the following three practical applications.
Firstly, if has been accurately estimated from previous data,
as in the factorial experiment example described later, then it may be
treated as being effectively known. Secondly, for sufficiently
large (, say), if we replace the assumed known value of
by its usual estimator in the formula for the confidence
interval then the resulting interval has, to a very good approximation,
the same coverage probability and expected length properties as when
is known. Thirdly, some more complicated models (including
those considered by (KaMa2017?), (KaMa2017?)) can be approximated by the
linear regression model with known when certain unknown
parameters are replaced by estimates.
The only information needed to assess the coverage probability and
expected length of the confidence interval that utilizes uncertain prior
information (CIUUPI) are the values of ,
, and . We stress that this
assessment does not use the observed response . Indeed,
if we want to choose between the CIUUPI and some other confidence
interval, such as the standard confidence interval, then this choice
must be made prior to any examination of the observed response
.
In this paper we provide motivating examples of scenarios where this
confidence interval may be used. We then describe, in detail, the CIUUPI
computed by the ciuupi package and the numerical constrained
optimization method implemented to obtain it. We contrast and compare
the CIUUPI with a credible interval for
constructed using an informative prior distribution for . Lastly,
instructions on how to use the functions in ciuupi are given, using a
real data set, from a factorial experiment, as an illustrative example.
We hope that, by making ciuupi freely available, statisticians who
have uncertain prior information of the type that we specify and wish to
utilize it will be encouraged to use the CIUUPI instead of the
potentially misleading post-model-selection confidence interval.
2 Motivating examples
The following motivating examples are provided by (Kabaila and Giri 2013). These are
examples of scenarios where the ciuupi package may be used to find a
confidence interval for the parameter of interest that utilizes
the uncertain prior information that .
Pooling of normal means. Suppose that
for and
for
, where the ’s are
independent and identically distributed (i.i.d.)
. The parameter of interest is
and we have uncertain prior information that .
One-way analysis of variance for two treatments. Suppose that
for and
, where the ’s are i.i.d.
. The parameter of interest is
and we have uncertain prior information that
.
A factorial experiment with two or more replicates. The
parameter of interest is a specified contrast. For
factorial experiments it is commonly believed that higher order
interactions are negligible. Suppose that the uncertain prior
information is that the highest order interaction is zero.
One-way analysis of covariance with two treatments and normal
errors. The parameter of interest is the difference in
expected responses for the two treatments, for a specified value of
the covariate. The uncertain prior information is that the
hypothesis of ‘parallellism’ is satisfied.
Polynomial regression. Suppose that
for , where the ’s are i.i.d.
. The parameter of interest is the
expected response for a specified value of the explanatory variable
. The uncertain prior information is that .
Linear regression with at least one interaction term. The parameter
of interest is a given linear combination of the regression
parameters. The uncertain prior information is that a specified
interaction term is 0.
In addition to the above examples, (KaMa2017?) have used the ciuupi
package to aid in the computation of a confidence interval that utilizes
uncertain prior information in the following more complicated scenario
that arises in the analysis of both clustered and longitudinal data.
Suppose that
for and , where
, the ’s are i.i.d.
, and the ’s are i.i.d.
. The parameter of interest is
and we have uncertain prior information that
.
3 The confidence interval that utilizes uncertain prior information computed by ciuupi
Let
,
the least squares estimator of . Then
and
are the least squares estimators of and , respectively.
Now let
and
.
The known correlation between and is
.
Let , a scaled version of
, and
, an
estimator of . Assume that is known.
The confidence interval that utilizes uncertain prior information about
has the form
where is an odd continuous function and
is an even continuous fuction. In
addition, and for all ,
where the quantile is defined by for
. The functions and are fully specified by the
vector as follows.
By assumption, , and
. The values of and
for any are found by cubic spline interpolation
for the given values of and for
. The functions and are
computed such that has minimum coverage probability
and the desired expected length properties. This numerical
computation method is described in detail in the next section. Note that
the functions and are computed assuming that is
known.
As stated in the introduction, for sufficiently large
(, say), if we replace the assumed known value of
by
in the formula for then the resulting interval has, to a very
good approximation, the same coverage probability and expected length
properties as when is known. In ciuupi, if no value of
is supplied then the user is given the option of replacing
by , with a warning that needs to
be sufficiently large (, say).
Numerical constrained optimization method used to compute the vector
Let
and
where
for . A computationally convenient
expression for the coverage probability of is
where denotes the pdf. This coverage probability depends
on the unknown parameter , the functions and , the known
correlation and the desired minimum coverage probability
. (Giri 2008) has shown that is an
even function of .
Define the scaled expected length of to be the expected
length of divided by the expected length of the standard
confidence interval, given by
This scaled expected length of is given by
This scaled expected length depends on the unknown parameter ,
the function , the known correlation and the desired minimum
coverage probability . (Giri 2008) has shown that
is an even function of .
We compute the functions and such that has minimum
coverage probability and the desired expected length
properties as follows. For given , we minimize
the objective function
with respect to the vector
, subject
to the coverage constraint for all
. Equivalently, minimize the objective function
subject to this constraint, where . A
computationally convenient formula for the objective function
(4) is
Since we are minimizing this objective function, we can leave out the
constant at the front of the integral.
When is large, this numerical computation recovers the
standard confidence interval (3) for .
As decreases towards 0, this computation puts increasing
weight on achieving a small value of , i.e. an
improved confidence interval performance when the uncertain prior
information that is correct. However, as decreases,
increases, i.e. the performance of
the confidence interval when the prior information happens to be
incorrect is degraded. Following (KaMa2017?), we choose such
that the “gain” when the prior information is correct, as measured by
is equal to the maximum possible “loss” when the prior information
happens to be incorrect, as measured by
We denote this value of by . Our computational
implementation of the constraint for all
is to require that for all
. By specifying constraints on
the coverage probability for such a fine grid of
nonnegative values of , we ensure that, to an exceedingly good
approximation, for all values
of .
In summary, we compute the vector
by
minimizing (6), where is chosen such that
(7) = (8), subject to the constraints
for all
. Once
has been computed
in this way, we can easily compute the confidence interval that utilizes
the uncertain prior information (CIUUPI) for observed response
.
This constrained optimization procedure is carried out using the slsqp
function in the nloptr
package (see Johnson (2014), (2014)). Perhaps surprisingly, the large number of
constraints on the coverage probability is
handled well by the slsqp function. The integrals in (2)
and (6) are computed as follows. For greater accuracy,
each integral is split into a sum of six integrals, with lower and upper
endpoints consisting of successive knots. Each of these integrals is
then computed using Gauss Legendre quadrature with five nodes. Gauss
Legendre quadrature was found to be both faster and more accurate than
the R function integrate. This quadrature is carried out using the
gauss.quad function in the
statmod package (see
Smyth (2005), (2005)).
4 A comparison of the CIUUPI with a Bayesian interval estimator
(Kabaila and Dharmarathne 2015) compare Bayesian and frequentist interval estimators for
in the linear regression context considered in this paper when
is unknown. They find that the Bayesian and frequentist
interval estimators differ substantially. In this section we compare a
credible interval for with the CIUUPI, assuming
that is known.
For ease of comparison of the CIUUPI with a credible interval, we
re-express the regression sampling model as follows. Let the
matrix be obtained from
using the transformation described in Appendix B of
(Kabaila and Dharmarathne 2015). The attractive property of is
that
We re-express the regression sampling model as
,
where ,
and . Obviously,
.
Let
denote the least squares estimator of
. Note that
.
Clearly, has a bivariate
normal distribution with mean and covariance
matrix and, independently,
.
Dividing the endpoints of the CIUUPI by , we
obtain the following confidence interval for :
where the functions and have been obtained using the constrained
optimization described in the previous section.
The uncertain prior information that implies the uncertain
prior information that . The properties of a Bayesian
credible interval depend greatly on the prior distribution
chosen for . We have chosen a
prior distribution that leads to a credible interval with some
similarities to the CIUUPI. Assume that the prior probability density
function of is proportional to
, where
and denotes the Dirac delta
function. In other words, we assume an improper prior density for
that consists of a mixture of an infinite rectangular unit-height ‘slab’
and a Dirac delta function ‘spike’, combined with noninformative prior
densities for the other parameters. This prior density is a Bayesian
analogue of the weight function used in the weighted average over
, (5). It may be shown that the marginal
posterior density of is
where
and denotes the pdf. We
note that this posterior density is a mixture of two normal probability
density functions, such that the weight given to the posterior density
centred at increases with increasing
, when . It is evident from
(10) that the highest posterior density Bayesian
credible interval may consist of the union of two disjoint
intervals. For this reason, we consider the shortest
credible interval.
Note that the graph of the function (10) of
consists of the graph of the function
shifted to the right by . We can therefore express
the shortest credible interval for in the form
,
for the appropriate functions and . We compare this interval with
the frequentist confidence interval
(9) as follows. Let
and
.
Then
is equal to
which has a similar form to (9), but with and
replaced by and respectively. Therefore, we may compare
the interval (9) with (11) by
comparing the functions and with the functions and ,
respectively. We will also compare the interval (9)
with (11) by comparing the frequentist coverage
probability function of (11).
5 Using the ciuupi package
In this section we use a real data set to illustrate how each of the six
functions in ciuupi works. Table 1 below gives the
name of each of the functions and a short description of what it does.
In the following subsections we show how the functions in Table
1 are used in R.
Table 1: Functions in the ciuupi package
Function
Description
bsciuupi
Compute the optimized vector
bsspline
Evaluate and at
cpci
Evaluate at
selci
Evaluate at
ciuupi
Compute the CIUUPI, i.e. compute
cistandard
Compute the standard confidence interval
Factorial experiment example
Consider the factorial experiment described by Kabaila and Giri (2009a)
(Discussion 5.8, (2009a)), which has been extracted from a real
factorial data set provided by (Box et al. 1963). The two factors are
the time of addition of HNO and the presence of a ‘heel’. These
factors are labelled A and B, respectively. Define and
for “Time of addition of HNO” equal to 2 hours and 7
hours, respectively. Also define and for “heel
absent” and “heel present”, respectively. Assume a model of the form
where . This model can be written in
matrix form as
where
,
and .
According to (Box et al. 1963), a very accurate estimate of ,
obtained from previous related experiments, is 0.8.
Suppose that the parameter of interest is expected
response when factor A is high and factor B is lowexpected
response when factor A is low and factor B is low. In other
words, , so that
, where
. Our aim is to find a confidence
interval, with minimum coverage 0.95, for . We suppose that
there is uncertain prior information that the two-factor interaction is
zero. In other words, we suppose that there is uncertain prior
information that . The uncertain prior information is,
then, that ,
where and . Now that we have
specified , and , we
can compute
.
Evaluating the confidence interval (no examination of the observed response)
First suppose that we have not yet examined the observed response
and that we are interested in knowing how the
confidence interval that utilizes uncertain prior information (CIUUPI)
performs for given values of , ,
and . We begin by storing the values of
, , and in R
as follows.
# Specify alpha, a, c and x.alpha <-0.05a <-c(0, 2, 0, -2)c <-c(0, 0, 0, 1)x <-cbind(rep(1, 4), c(-1, 1, -1, 1), c(-1, -1, 1, 1), c(1, -1, -1, 1))
Next we use the numerical constrained optimization to compute the values
at the knots of the functions and that define the CIUUPI. We
must specify whether natural cubic spline interpolation (natural = 1) or
clamped cubic spline interpolation (natural = 0) is used in the
description of these functions. In the case of clamped cubic spline
interpolation the first derivatives of and are set to zero at
and . Natural cubic spline interpolation is the default, and is
carried out using splinefun in the stats package. Clamped cubic
spline interpolation is carried out using cubicspline in the pracma
package. The nonlinear constrained optimization using natural cubic
spline interpolation for the description of the functions and is
much faster and results in a coverage probability that is slightly
closer to throughout the parameter space. For this example
we are able to obtain the vector
in 6.56 minutes
when using natural cubic spline interpolation and in 21.27 minutes when
using clamped cubic spline interpolation. This computation was carried
out on a PC with an Intel i7-7500 CPU (3.4GHz) and 32GB of RAM. The
following code is used to obtain the vector
that
specifies the CIUUPI, which is obtained from the numerical constrained
optimization that uses natural cubic spline interpolation for the
description of the functions and .
# Compute (b(1), b(2), ..., b(5), s(0), s(1), ..., s(5)) that specifies the CIUUPIbsvec <-bsciuupi(alpha, a = a, c = c, x = x)bsvec
Alternatively, since we know that , we could obtain the
vector
that specifies the CIUUPI using the code
# Compute (b(1), b(2), ..., b(5), s(0), s(1), ..., s(5)) that specifies the CIUUPI,# given rhobsvec2 <-bsciuupi(alpha, rho =-0.707)
Now that we have the vector
that
specifies the CIUUPI, we can graph the functions and using the
following code:
# Compute the functions b and s that specify the CIUUPI on a grid of valuessplineval <-bsspline(seq(0, 8, by =0.1), bsvec, alpha)# The first 5 values of bsvect are b(1),b(2),...,b(5).# The last 6 values are s(0),s(1),...,s(5).xseq <-seq(0, 6, by =1)bvec <-c(0, bsvec[1:5], 0)svec <-c(bsvec[6:11], qnorm(1- alpha/2))# Plot the functions b and s plot(seq(0, 8, by =0.1), splineval[, 2], type ="l", main ="b function",ylab =" ", las =1, lwd =2, xaxs ="i", col ="blue", xlab ="x")points(xseq, bvec, pch =19, col ="blue")plot(seq(0, 8, by =0.1), splineval[, 3], type ="l", main ="s function",ylab =" ", las =1, lwd =2, xaxs ="i", col ="blue", xlab ="x")points(xseq, svec, pch =19, col ="blue")
Figure 1 shows the graphs of the functions and
that specify the CIUUPI, when these functions are described using
natural cubic spline interpolation, for this example. For comparison,
Figure 2 shows the graphs of the functions and
that specify the CIUUPI, when these functions are described using
clamped cubic spline interpolation. These figures are quite similar;
there is a small difference in both the and functions near
.
We can also use the vector
that
specifies the CIUUPI to evaluate and then plot the coverage probability
and scaled expected length
as functions of . This is done using the
following code.
# Compute the coverage probability and scaled expected for a grid of values of gammagam <-seq(0, 10, by =0.1)cp <-cpciuupi(gam, bsvec, alpha, a = a, c = c, x = x)sel <-selciuupi(gam, bsvec, alpha, a = a, c = c, x = x)# Plot the coverage probability and squared scaled expected lengthplot(gam, cp, type ="l", lwd =2, ylab ="", las =1, xaxs ="i",main ="Coverage Probability", col ="blue", xlab =expression(paste("|", gamma, "|")), ylim =c(0.9495, 0.9505))abline(h =1-alpha, lty =2)plot(gam, sel^2, type ="l", lwd =2, ylab ="", las =1, xaxs ="i",main ="Squared SEL", col ="blue", xlab =expression(paste("|", gamma, "|")), ylim =c(0.83, 1.17))abline(h =1, lty =2)
Figure 1: Graphs of the functions and for the factorial
experiment example for the CIUUPI, with minimum coverage probability
0.95, when they are described using natural cubic spline interpolation,
for the factorial experiment
example.Figure 2: Graphs of the functions and for the factorial
experiment example for the CIUUPI, with minimum coverage probability
0.95, when they are described using clamped cubic spline interpolation,
for the factorial experiment
example.
Figure 3 shows the graphs of
and the square of for the CIUUPI (where the
functions and have been specified by natural cubic spline
interpolation) produced by this code.
Figure 3: Graphs of the and the square of
functions for the CIUUPI, with minimum
coverage probability 0.95, where the functions and are described
by natural cubic spline interpolation, for the factorial experiment
example.
We can see from Figure 3 that, regardless of the value
of , the coverage probability of the CIUUPI is extremely close
to . We can also see that the expected length of the CIUUPI
is less than the expected length of the standard confidence interval
when is small, with the minimum scaled expected length achieved
when . For moderate values of , the expected
length of the standard interval is less than the expected length of the
CIUUPI. However, for large , the expected length of the CIUUPI
is essentially the same as the expected length of the standard interval.
For comparison, Figure 4 shows the graphs of
and the square of for
the CIUUPI when the functions and are described by clamped cubic
spline interpolation.
Figure 4: Graphs of the and the square of
functions for the CIUUPI, with minimum
coverage probability 0.95, where the functions and are described
by clamped cubic spline interpolation, for the factorial experiment
example.
Figures [fig:bs_comp] and
[fig:BCI_cov] show the differences between the Bayesian 95%
credible interval and the 95% CIUUPI. Figure
[fig:bs_comp] shows the graphs of the and functions
(left panel), and the and functions (right panel), for the
factorial experiment example. Note that, similarly to and ,
is an odd continuous function and is an even continuous function.
Figure [fig:BCI_cov] shows the graph of the frequentist coverage
probability of the Bayesian 95% credible interval, for the factorial
experiment example. This coverage probability is also an even function
of . Unlike the coverage probability of the CIUUPI, the minimum
over of the frequentist coverage probability of the Bayesian
95% credible interval is substantially less than 0.95.
Figure 5: Graphs of the and functions (left panel), and the
and functions (right panel), for the factorial experiment
example.Figure 6: The frequentist coverage probability of the Bayesian 95%
confidence interval, for the factorial experiment
example.
Computing the confidence interval (using the observed response)
The observed response for the factorial experiment example data is
and is assumed to
take the value 0.8. We use the function ciuupi to return the
confidence interval (1) for that utilizes the
uncertain prior information that . Continuing from the
previous example, this is done in R as follows:
# Using the vector (b(1),b(2),...,b(5),s(0),s(1),...,s(5)), compute the CIUUPI# for this particular datat <-0y <-c(87.2, 88.4, 86.7, 89.2)ci <-ciuupi(alpha, a, c, x, bsvec, t, y, natural =1, sig =0.8); ci
We obtain the output
lower upperciuupi -0.77107553.218500
For comparison purposes, the function standard_CI will return the
standard confidence interval (3) for .
The code
# Compute the standard confidence intervalcistandard(a = a, x = x, y = y, alpha = alpha, sig =0.8)
will return
lower upperstandard -1.0174463.417446
The 95% confidence interval that utilizes uncertain prior information
is much shorter than the standard confidence interval
. These are observed values of confidence intervals that
have, to an excellent approximation, the same coverage probability. For
comparison, a 95% Bayesian credible interval for is
. Although this interval is shorter than the CIUUPI, it
can be seen from Figure
[fig:BCI_cov] that the minimum over of the
frequentist coverage of the Bayesian credible interval is substantially
less than 0.95.
6 Discussion
It is very common in applied statistics to carry out preliminary
data-based model selection using, for example, hypothesis tests or
minimizing a criterion such as the AIC. As pointed out by Leamer (1978)
((1978), chapter 5), such model selection may be motivated by
the desire to utilize uncertain prior information in subsequent
statistical inference. He goes even further when he states, on p.123,
that “The mining of data that is common among non-experimental
scientists constitutes prima facie evidence of the existence of prior
information”. One may attempt to utilize such prior information by
constructing confidence intervals, using the same data, based on the
assumption that the selected model had been given to us a priori, as
the true model. This assumption is false and it can lead to confidence
intervals that have minimum coverage probability far below the desired
minimum coverage (see e.g. Kabaila (2009), (2009), Leeb and Pötscher (2005),
(2005)), making them invalid.
A numerical constrained optimization approach to the construction of
valid confidence intervals and sets that utilize uncertain prior
information has been applied by (Farchione and Kabaila 2008), (Kabaila and Giri 2009a),
(Kabaila and Giri 2013), (Kabaila and Giri 2014), (Kabaila and Tissera 2014) and (Abeysekera and Kabaila 2017). In each
case, numerical constrained optimization was performed using programs
written in MATLAB, restricting the accessibility of these confidence
intervals and sets. The R package ciuupi is a first step in making
these types of confidence intervals and sets more widely accessible.
This article is converted from a Legacy LaTeX article using the
texor package.
The pdf version is the official version. To report a problem with the html,
refer to CONTRIBUTE on the R Journal homepage.
Footnotes
References
W. Abeysekera and P. Kabaila. Optimized recentered confidence spheres for the multivariate normal mean. Electronic Journal of Statistics, 11: 1798–1826, 2017. URL https://doi.org/10.1214/17-EJS1272.
P. J. Bickel. Parametric robustness: Small biases can be worthwhile. The Annals of Statistics, 12: 864–879, 1984. URL https://doi.org/10.1214/aos/1176346707.
G. E. P. Box, L. R. Connor, W. R. Cousins, O. L. Davies, F. R. Hinsworth and G. P. Sillitto. The design and analysis of industrial experiments. 2nd ed Oliver; Boyd, London, 1963.
G. Casella and J. T. Hwang. Empirical Bayes Confidence Sets for the Mean of a Multivariate Normal Distribution. Journal of the American Statistical Association, 78: 688–698, 1983. URL https://doi.org/10.2307/2288139.
G. Casella and J. T. Hwang. Employing vague prior information in the construction of confidence sets. Journal of Multivariate Analysis, 21: 79–104, 1987. URL https://doi.org/10.1016/0047-259X(87)90100-X.
A. Cohen. Improved confidence intervals for the variance of a normal distribution. Journal of the American Statistical Association, 67: 382–387, 1972. URL https://doi.org/10.2307/2284389.
B. Efron. Minimum volume confidence regions for a multivariate normal mean vector. Journal of the Royal Statistical Society B, 68: 655–670, 2006. URL https://doi.org/10.1111/j.1467-9868.2006.00560.x.
D. Farchione and P. Kabaila. Confidence intervals for the normal mean utilizing prior information. Statistics & Probability Letters, 78: 1094–1100, 2008. URL https://doi.org/10.1016/j.spl.2007.11.003.
K. Giri. Confidence intervals in regression utilizing prior information. 2008.
C. Goutis and G. Casella. Improved invariant confidence intervals for a normal variance. The Annals of Statistics, 19: 2015–2031, 1991. URL https://doi.org/10.1214/aos/1176348384.
J. L. Hodges and E. L. Lehmann. The use of previous experience in reaching statistical decisions. Annals of Mathematical Statistics, 23: 396–407, 1952. URL https://doi.org/10.1214/aoms/1177729384.
S. G. Johnson. The NLopt nonlinear-optimization package. 2014.
P. Kabaila and G. Dharmarathne. A Comparison of Bayesian and Frequentist Interval Estimators in Regression That Utilize Uncertain Prior Information. Australian & New Zealand Journal of Statistics, 57(1): 99–118, 2015. URL https://doi.org/10.1111/anzs.12104.
P. Kabaila and K. Giri. Confidence intervals in regression utilizing prior information. Journal of Statistical Planning and Inference, 139: 3419–3429, 2009a. URL https://doi.org/10.1016/j.jspi.2009.03.018.
P. Kabaila and K. Giri. Further properties of frequentist confidence intervals in regression that utilize uncertain prior information. Australian & New Zealand Journal of Statistics, 259–270, 2013. URL https://doi.org/10.1111/anzs.12038.
P. Kabaila and K. Giri. Simultaneous confidence intervals for the population cell means, for two-by-two factorial data, that utilize uncertain prior information. Communications in Statistics – Theory and Methods, 43: 4074–4087, 2014. URL https://doi.org/10.1080/03610926.2012.718846.
P. Kabaila and K. Giri. Upper bounds on the minimum coverage probability of confidence intervals in regression after model selection. Australian & New Zealand Journal of Statistics, 51: 271–287, 2009b. URL https://doi.org/10.1111/j.1467-842X.2009.00544.x.
P. Kabaila, K. Giri and H. Leeb. Admissibility of the usual confidence interval in linear regression. Electronic Journal of Statistics, 4: 300–312, 2010. URL https://doi.org/10.1214/10-EJS563.
P. Kabaila and D. Tissera. Confidence intervals in regression that utilize uncertain prior information about a vector parameter. Australian & New Zealand Journal of Statistics, 56: 371–383, 2014. URL https://doi.org/10.1111/anzs.12090.
P. J. Kempthorne. Controlling risks under different loss functions: The compromise decision problem. The Annals of Statistics, 16: 1594–1608, 1988. URL https://doi.org/10.1214/aos/1176351055.
P. J. Kempthorne. Minimax-Bayes Compromise Estimators. Proc. Bus. and Econ. Statist. Sec. Amer. Statist. Assoc., 563–573, 1983.
P. J. Kempthorne. Numerical specification of discrete least favorable prior distributions. Society for Industrial and Applied Mathematics. Journal on Scientific and Statistical Computing, 8: 171–184, 1987. URL https://doi.org/10.1137/0908028.
E. E. Leamer. Specification searches: Ad hoc inference with nonexperimental data. John Wiley & Sons, 1978.
C. M. Stein. Confidence sets for the mean of a multivariate normal distribution. Journal of the Royal Statistical Society. Series B (Methodological)., 24: 365–396, 1962. URL https://doi.org/10.1111/j.2517-6161.1962.tb00458.x.
Y. L. Tseng and L. D. Brown. Good exact confidence sets for a multivariate normal mean. The Annals of Statistics, 5: 2228–2258, 1997. URL https://doi.org/10.1214/aos/1069362396.
Reuse
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
Citation
For attribution, please cite this work as
Mainzer & Kabaila, "ciuupi: An R package for Computing Confidence Intervals that Utilize Uncertain Prior Information", The R Journal, 2019
BibTeX citation
@article{RJ-2019-026,
author = {Mainzer, Dr. Rheanna and Kabaila, Dr. Paul},
title = {ciuupi: An R package for Computing Confidence Intervals that Utilize Uncertain Prior Information},
journal = {The R Journal},
year = {2019},
note = {https://rjournal.github.io/},
volume = {11},
issue = {1},
issn = {2073-4859},
pages = {323-336}
}