Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Introduction
First, the value πi is bounded between zero and one whilst β0 + β1 xi is unbounded,
secondly a graph of πi against xi is usually S–shaped rather than linear and thirdly,
if the number of individuals possessing the attribute is modelled to be binomial, the
variances of the proportions are not constant for the different values of the predictor.
The model can often be linearized by using the logistic or logit transformation. Instead
of fitting the straight line πi = β0 + β1 xi the proportions are transformed to values πi′
where π
′ i
πi = ln
1 − πi
and we fit the straight line
πi′ = β0 + β1 xi .
Once the linear model has been fitted, predictions for the original proportions can be
obtained using the equation
exp(β̂0 + β̂1 xi )
π̂i = .
1 + exp(β̂0 + β̂1 xi )
Although the logit transformation linearizes the model, the variances are still not equal
and for any individual whose response is coded as 0 or 1, the logit of the response
is either minus infinity or plus infinity and so the model is fitted using an iterative
maximum likelihood method. Due to the computational complexity of this method, it
is only available in larger statistical packages.
8.1
LOGISTIC REGRESSION Example
In logistic regression, the coefficient β1 also has a interpretation different from that
in simple linear regression. The ratio πi /(1 − πi ) is called the odds in favour of the
attribute or condition being present. The difference between the logarithm of the odds
at a value x of the predictor and a value x + 1 of the predictor is
odds at x + 1
= e β1 .
odds at x
The estimate of eβ1 is called the odds ratio and is the increase in the odds for a unit
increase in the predictor variable.
Example
Samples of fifty insects are exposed to different concentrations of an insecticide and the
number of insects who have died recorded after 24 hours with the results shown below.
0.0 50 2
2.2 50 4
5.7 50 5
8.4 50 10
16.0 50 36
20.5 50 44
8.2
LOGISTIC REGRESSION Minitab Analysis
With the above observations being stored in columns named Concentration, Sample
and Killed in a Minitab worksheet, Minitab is used to fit the logistic model using the
Regression menu.
Log-Likelihood = -112.346
Test that all slopes are zero: G = 158.588, DF = 1, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P
Pearson 1.539 4 0.820
Deviance 1.484 4 0.829
Hosmer-Lemeshow 1.539 4 0.820
Group
Value 1 2 3 4 5 6 Total
Success
Obs 2 4 5 10 36 44 101
Exp 1.5 2.7 6.4 11.5 34.7 44.1
Failure
Obs 48 46 45 40 14 6 199
Exp 48.5 47.3 43.6 38.5 15.3 5.9
Total 50 50 50 50 50 50 300
8.3
LOGISTIC REGRESSION Conclusions
Conclusions
The odds ratio is 1.31. That is, for every unit increase in concentration, the odds of the
insect being killed is multiplied by 1.31, or with every unit increase in concentration,
the odds of killing the insect increase by 31%.
A measure of the fit of the model is provided by the deviance which has a chi–squared
distribution with degrees of freedom equal to the number of sample points minus the
number of parameters in the model. In this case, the deviance is small (1.484, χ24 ) and
so the model fits the observations.
e−0.246
π̂ =
1 + e−0.246
= 0.44
An approximate estimate of the LD50 concentration, the concentration which will kill
50% of the population can be obtained by solving
0 = −3.444 + 0.267x
as the logit of π = 0.5 is 0. This gives an estimate of the LD50 concentration as 12.9.
8.4
LOGISTIC REGRESSION Discussion and Other Link Functions
assumes that no πi is either zero or one. If some πi are zero or one or take values very
close to zero or one the usual modification in the transformation is to take
1
πi = if πi = 0
2ni
1
πi = 1 − if πi = 1.
2ni
exp(β0 + β1 x)
π̂ = .
1 + exp(β0 + β1 x)
Other transformations which produce response functions with similar shapes are the
probit transformation which transforms the πi using the cumulative normal distri-
bution where
′ −1
πi = Φ πi
The probit transformation was introduced by Bliss and is widely used in estimating
LD50 dosages. Because of the computational complexity, before the availability of
computer packages tables to aid computation were prepared and other methods of
calculating LD50 dosages were proposed. These alternate methods to probit analysis
still provide quick and easy estimates of LD50 dosages.
8.5
ESTIMATION OF LD50 DOSES Method of Extreme Lethal Dosages
Assumptions
Estimate
The estimate of log(LD50) is the mean of highest dosage which fails to kill and lowest
dosage which does kill.
Example
λ x = log(λ)
d 1.9 + 1.5
log(LD50) =
2
= 1.7
d = 50
LD50
8.6
ESTIMATION OF LD50 DOSES Behran’s Method
Assumptions
Estimate
The estimate of log(LD50) is the value of x for which Sx− (r) = Sx+ (n − r) where
Example
d 6
log(LD50) = 0.64 + 0.01
11
= 0.645
d = 4.42
LD50
8.7
ESTIMATION OF LD50 DOSES Kärber’s Method
Assumptions
Estimate
d 1X
log(LD50) = (xi + xi+1 )(pi+1 − pi )
2
Example
xi +xi+1
Concentration log(concentration) Number killed in 2 pi+1 − pi
groups of 100
λ x = log(λ)
d
log(LD50) = 0.647
d = 4.44
LD50
8.8
PROBIT ANALYSIS Minitab Analysis
Using Minitab with the probit link function on the numerical example for Behran’s
method gives the following output
Log-Likelihood = -394.154
Test that all slopes are zero: G = 459.197, DF = 1, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P
Pearson 14.0430 7 0.050
Deviance 21.8610 7 0.003
Hosmer-Lemeshow 14.0430 7 0.050
Group
Value 1 2 3 4 5 6 7 8 9 Total
Event
Obs 0 12 22 35 51 61 75 88 100 444
Exp 3.8 9.1 18.4 32.2 48.9 65.9 80.1 90.0 95.7
Non-event
Obs 100 88 78 65 49 39 25 12 0 456
Exp 96.2 90.9 81.6 67.8 51.1 34.1 19.9 10.0 4.3
Total 100 100 100 100 100 100 100 100 100 900
8.9
PROBIT ANALYSIS References
8.10