Está en la página 1de 18

University of Wollongong Thesis Collections

University of Wollongong Thesis Collection


University of Wollongong Year
A predictive GIS methodology for
mapping potential mining induced rock
falls
Hani Zahiri
University of Wollongong
Zahiri, Hani, A predictive GIS methodology for mapping potential mining induced rock
falls, M.Eng thesis, School of Civil, Mining and Environmental Engineering, University of
Wollongong, 2006. http://ro.uow.edu.au/theses/494
This paper is posted at Research Online.
http://ro.uow.edu.au/theses/494



NOTE

This online version of the thesis may have different page formatting and pagination
from the paper copy held in the University of Wollongong Library.




UNIVERSITY OF WOLLONGONG

COPYRIGHT WARNING

You may print or download ONE copy of this document for the purpose of your own research or
study. The University does not authorise you to copy, communicate or otherwise make available
electronically to any other person any copyright material contained on this site. You are
reminded of the following:

Copyright owners are entitled to take legal action against persons who infringe their copyright. A
reproduction of material that is protected by copyright may be a copyright infringement. A court
may impose penalties and award damages in relation to offences and infringements relating to
copyright material. Higher penalties may apply, and higher damages may be awarded, for
offences and infringements involving the conversion of material into digital or electronic form.



Chapter 3
Weights-of-Evidence Method




21
Chapter 3
Weights-of-Evidence Method
3.1 Introduction

According to Sawatzky et al., (2004) the Weights-of-Evidence (WofE) method is
a quantitative method for combining evidences in support of a hypothesis. In the
simple terms, the weights-of-evidence method constructs a probabilistic model in
using known occurrences of a feature under consideration (eg. rock fall). The
known occurrences of the feature are commonly referred to as the training points.
The controlling or predictive factors, termed as evidential themes
1
, are the
measures of the datasets that cause the feature to occur (Kemp et al., 2001 and
Sawatzky et al., 2004). The weights-of-evidence method combines different
datasets as controlling factors of occurrence of a particular phenomenon by
weighting each factor using a log-linear form of a Bayesian based model
(Sawatzky et al., 2004 and Spiegelhalter, 1986). The technique calculates the
weights for each controlling or predictive factor based on its dominance in the
training point theme (D) units within the area of a predictor (B) (Bonham-Carter,
1994). Equations 3-1 and 3-2 represent the general formulation of the weights in
WofE method:

} {
} {
ln
D B P
D B P
W =
+
(3-1)

} {
} {
ln
D B P
D B P
W =

(3-2)

1
- This term is defined in Arcview as a synonym of map or layer
22
where
P = probability of object occurrence;
ln = the natural log;
B = controlling (predictor) theme;
D = training point theme unit;
W
+
= the level of positive correlation between the presence of the
evidential (controlling) theme and the training points;
W
-
= the level of negative correlation between the presence of the
evidential (controlling) theme and the training points;

The overall level of spatial association between the evidential theme and the
training points is given by the parameter of Contrast, C, which is defined by
Equation 3-3 (Bonham-Carter et al., 1989):

C = W
+
- W
-
(3-3)

The weights-of-evidence method was first applied in quantitative analysis for
medical purposes in disease diagnosis. The symptoms were considered as
evidences. Based on the defined hypothesis this patient has disease x, two
weights were calculated for each symptoms, one for presence and one for absence
of each symptom. The Contrast of weights measures the relationship between
disease and the particular symptom. In the cases of other patients, the weights
could then be used to calculate the probability of the occurrence of particular
diseases based on the presence or absence of symptoms (Lusted, 1968; Aspinall
and Hill, 1983; Spiegelhalter and Knill-Jones, 1984).
In geological application, the weights-of-evidence method was originally
developed in a non-spatial mode (Reboh and Reiter, 1983; McCammon, 1989),
using the Bayesian approach to combine controlling evidences for mineral
23
prospects in an expert system. Campbell et al. (1982) applied this method to
combine evidences in map form and successfully predicted the extension of the
Mount Tolman molybdenum deposit in Washington. Reddy et al. (1992) applied
the same approach for the prediction of base metal deposits in a greenstone belt.
Bonham-Carter used weights-of-evidence with the combination of spatial data
analysis in the mapping of mineral potential (Bonham-Carter and Thomas 1973;
Bonham-Carter and Chung 1983; Bonham-Carter et al. 1988, 1989). Bonham-
Carter et al. (1989) and Bonham-Carter (1994) further described a Bayesian based
method through which the quantitative relationship between an indicator map and
a mineral deposit pattern was analysed. After the development of Arc-WofE and
Arc-SDM extensions in 2001, the weights-of-evidence method has been widely
used in various fields of study using the GIS based approach.

3.2 The Bayesian concept

The Bayes law provides the basis for the weights-of-evidence method. To
illustrate this concept, the likelihood of rain for tomorrow example described by
Bonham-Carter (1994) is used. Suppose we want to predict the likelihood of rain
on the next day. According to historical data, it is known that on average 100 days
of the year has rainfall. A reasonable estimate of the prior probability of rain in
the next day might be the average proportion, 100/365. This initial or prior
estimate can be modified if other relevant information is available. For example,
an important factor that influences rainfall is the time of year. The prior
24
probability would be updated by multiplying by a factor that varies with the time
of year. This can be expressed as:

Factor year of Time Rain P year of Time Rain P _ _ _ * } { } _ _ { = (3-4)

where
} {Rain P = prior probability, and
} _ _ { year of Time Rain P = posterior probability, which is the conditional
probability of rain given the time of year, or prior probability multiplied
by a factor.

The Time-of-year factor would vary depending on the month of the year. Another
factor that affects the likelihood of rain could be whether it has rained today or
not. These two factors or sources of evidence for the proposition it will rain
tomorrow can then be combined by the expression:

factor today Rain factor year of Time Rain P evidences Rain P _ _ * _ _ _ * } { } { = (3-5)

where Rain_today_factor is determined from historical data, and the effect of
combined factors of evidence is the product of two factors. Several sources of data
that provide predictors (evidence) regarding tomorrows weather can be used to
update prior probability. The prior probability can be successively updated with
the addition of new predictor evidence, so that the posterior probability based on a
piece of evidence can be used as the prior for using a new piece of evidence.
Fundamental to the understanding of weights-of-evidence is the concept of prior
and posterior probabilities. The prior probability defines the expected outcome of
25
an event in the absence of evidence, whereas the posterior probability is defined
as:

Posterior probability = Prior probability * factor for each predictor (evidential themes)
(3-6)

Let us further consider the problem of determining the probability of rock fall
occurrence within a specified region T, made up of u unit cells (areas). In practice,
the size of the unit areas depends on the quality of available data and spacing
between known rock fall locations, termed as training points. As a rule, the
method assumes that each training points occupies only one small unit area hence
the probability of a point can be defined as probability per unit area. If N{D} is
the number of unit cells containing rock falls and N{T} is the total number of unit
cells in an area of study (Figure 3.1), the prior probability of the rock falls
occurrence in the study area is
} {
} {
T N
D N
.
Steep areas are usually prone to rock falls. The slope of an area is an important
controlling factor in estimating rock fall probability in an area. Slope factor or
evidence can be used to modify prior probability of rock fall occurrence. Figure
3.1 shows a hypothetical spatial relationship between an area T, training theme D
and predictor map B (or evidential theme).






26






Figure 3.1 Binary map showing the location of rock falls and Venn diagram summarising
the spatial overlaps relationship between the map pattern and the rock fall pattern (After
Harris et al., 2000)

The probability of rock fall occurrence, given the presence of evidence B (eg.
steep slope), can be expressed by Equation 3-7:


} {
} {
} {
B P
B D P
B D P

= (3-7)

To obtain an expression for the posterior probability of rock fall occurrence in
terms of the prior probability and a multiplication slope factor we note that the
conditional probability on the binary slope map, B, given the presence of a rock
fall is defined as:

Please see print copy for Figure 3.1
27

} {
} {
} {
D P
D B P
D B P

= (3-8)

In order to obtain an expression indicating the relationship between posterior
probability and prior probability and an evidence factor, Equations 3-7 and 3-8
can be combined to solve for } { B D P , showing the requested relationship:


} {
} {
} { } {
B P
D B P
D P B D P = (3-9)

Hence the posterior probability of a rock fall, given the presence of the evidence
equals the prior probability of rock fall P{D} multiplied by the factor
} {
} {
B P
D B P
.
Similarly, the posterior probability of rock fall occurrence given the absence of
evidence (flat areas) can be expressed by Equation 3-10:


} {
} {
} { } {
B P
D B P
D P B D P = (3-10)


3.2.1 Odds and likelihood ratios
According to Bonham-Carter (1994), the probability that an event will occur
divided by the probability that it will not occur is defined as a ratio called odds.
The weights-of-evidence method uses the natural logarithm of odds, known as the
log odds or Logits. From Equation 3-10 the odds ratio can be expressed as:


} { } {
} { } {
} {
} {
B P B D P
D B P D P
B D P
B D P
= (3-11)

28
From the definition of conditional probability:


} {
} { } {
} {
} {
} {
B P
D B P D P
B P
B D P
B D P =

= (3-12)

Substituting this expression for } { B D P into the numerator of the right side of
Equation 3-12, and rearranging terms yields the following:


} {
} {
.
} {
} {
.
} {
} {
} {
} {
D B P
D B P
B P
B P
D P
D P
B D P
B D P
= (3-13)

The odds of a rock fall O{D} are equal to
} { 1
} {
D P
D P

, or
} {
} {
D P
D P
; Substituting the
definition of odds into Equation 3-13 reduces to:


} {
} {
} { } {
D B P
D B P
D O B D O = (3-14)

where } { B D O is the conditional (posterior) odds of D given B, } {D O is the prior
odds of D and
} {
} {
D B P
D B P
is known as Sufficiency Ratio (LS) (Bonham-Carter,
1994). Taking the natural logarithm of both sides of Equation 3-14 gives log
e
LS,
the positive weight-of-evidence W
+
:


+
+ = W D Logit B D Logit } { } { (3-15)

Similar algebraic expressions lead to the derivation of an odds expression for the
conditional probability of D given the absence of the evidence (see Equation 3-
16).
29

} {
} {
} { } {
D B P
D B P
D O B D O = (3-16)
The term
} {
} {
D B P
D B P
is called the Necessity Ratio (LN) (Bonham-Carter, 1994). In
weights-of-evidence terminology, W
-
is the natural logarithm of LN, or log
e
LN
that is expressed as:

+ = W D Logit B D Logit } { } { (3-17)



Both LS and LN are also called likelihood ratio and can be calculated from the
available field data (see Figure 3.2)









Figure 3.2 Diagrammatic representation of the weights calculation in weights-of-
evidence method (After Harris et al., 2000)

3.2.2 Combining Datasets
The conditional probability of a rock fall occurring, given the presence of two
predictive map patterns, B
1
(eg. slope) and B
2
(eg. cliff height), can be expressed
as:

Please see print copy for Figure 3.2
30

} {
} {
} {
2 1
2 1
2 1
B B P
B B D P
B B D P


= (3-18)
which can be expressed as


} { } { } { } {
} { } {
} {
} { } {
} {
2 1 2 1
2 1
2 1
2 1
2 1
D P D B B P D P D B B P
D P D B B P
B B P
D B B P D P
B B D P
+


=
(3-19)

Equation 3-19 is the Bayes Rule for two exclusive hypotheses, D and D, with
1 } { } { = + D P D P . If B
1
andB
2
are conditionally independent then:

} { } { } {
2 1 2 1
D B P D B P D B B P = (3-20)

and Equation 3-19 reduces to:


} {
} {
.
} {
} {
} { } {
2
2
1
1
2 1
B P
D B P
B P
D B P
D P B B D P = (3-21)

Note that Equation 3-21 is similar to Equation 3-9, except that the multiplying
factors for the two themes are used to update the prior probability to give the
posterior probability. Using the odds formulation, the conditional or posterior
odds can be expressed for two themes B
1
and B
2
as:


2 1 2 1
* * } { } { LS LS D O B B D O = (3-22)


31
or the log-linear weights-of-evidence form:


+ +
+ + =
2 1 2 1
} { } { W W D Logit B B D Logit (3-23)

where the subscripts 1 and 2 refer to the likelihood ratio or weights determined
independently for evidential themes 1 and 2 respectively. The weights for each of
the two themes (predictors) are calculated in exactly the same way as the weights
for a single theme.
As pointed out by Bonham-Carter (1994), there are now four different ways of
combining two binary map patterns. In addition to Equation 3-23, three other
ways could be formulated as below:

+
+ + =
2 1
2
1
} { } { W W D Logit B B D Logit (3-24)


+
+ + =
2 1 2
1 } { } { W W D Logit B B D Logit (3-25)



+ + =
2 1
2 1 } { } { W W D Logit B B D Logit (3-26)


With 3 datasets as evidence, there are 2
3
, or 8, possible combinations. In general
with n map there are 2
n
possible combinations. The general expression for
combining i=1, 2, 3 n maps is:

=
=
n
i
i n
LS D O B B B B D O
1
3 2 1
* } { } ..... { (3-27)
for the likelihood ratio, and

=
+
+ =
n
i
i n
W D it B B B B D Logit
1
3 2 1
} { log } ..... { (3-28)
for the weights.
32
Equations 3-27 and 3-28 are the computing formulae for combining a set of
evidential themes with the Bayes model which is implemented in GIS
environment. In general, if the i-th evidence is absent instead of presence, the LS
becomes LN and W
+
becomes W
-
. Where data is missing for a particular theme in
some locations, the likelihood ratio is set to1, or the weight is set to 0.

3.3 Implementation of weights-of-evidence method

Applying weights-of-evidence method to map rock fall potential will entail the
following steps:
Choose a series of maps that are useful for providing evidential themes for
rock falls occurrence. These may include maps of cliff heights and slope.
For each map, determine the optimum reclassification scheme to convert
the themes to binary or ternary format. This maximises the spatial
association between the map and the occurrence points.
Calculate the weights for each predictive map or evidential theme.
Check for conditional independence between the evidential datasets. In
some cases, maps may have to be combined or rejected to avoid
conditional independency.
Combine the evidential themes using calculated weights.
Generate result (response) theme showing posterior probabilities.
Figure 3.3 is a schematic of the main steps of the GIS based WofE method
implementation. The implementation steps are also summarised in Figure 3.4.

33






































Figure 3.3 Schematic GIS based weights-of-evidence method procedures
Selecting evidential themes and providing GIS-based inventory
Weighting evidential themes with respect to the spatial association with training points
Combining weighted evidential themes and generate response theme
Posterior Probability
Response theme
Weighting Table
Training point theme
34
































Figure 3.4 General procedure for weights-of-evidence implementation


Generate response themes
and
Interpretation
Check conditional independence
Secondary weighting evidential themes
Theme reclassification
Primary weighting evidential themes
Setting of initial parameters
(Define area of study, unit cell
and training points)
Data editing, manipulation
and formating in GIS framework
Selection of controlling factors
Problem definition
35
Among the stages, in first stages the geometry of the problem, goals, available
models, available data, limitations and creation a prevision of the results, are the
main issues. Selecting reasonable controlling factors to contribute to the model as
evidential themes is the next step. This is followed by compilation of GIS based
data inventory including data transferring, editing and manipulation. During the
primary weighting step, weights are calculated for each evidential theme with
respect to the spatial association between the training points and the evidential
themes. Themes are then reclassified based on primary weighting results in order
to convert from multi-class themes to binary or ternary pattern Weighted themes
are then combined together using calculated weights for each one. The response
theme shows the probability of rock fall occurrence within the area of study.

También podría gustarte