LEARNING MODULES
Module 6 – The Normal Distribution
Objectives:
- Discuss what a normal distribution is;
- Discuss the important aspects of normal distribution; and
- Apply the normal distribution to data.
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Different curves to be shown and indicated on the module. Different problems and scenarios regarding normal distributions on the module.
Output: Paper on the different curves and their analysis and interpretation.
6 Examination
7 Module 8 – Introduction to Inferential Statistics
Objectives: At the end of this module study, you will be able to:
- differentiate inferential statistics from descriptive statistics
- describe the terms:
-population
-sample
-parameter
-statistic
-hypothesis testing
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Module exercises and activities on descriptive statistics and inferential statistics, and differentiate one from the other.
Output: Paper on a sample comparison and differentiation of descriptive and inferential statistics.
Module 9 – Random Sampling Designs
Objectives: At the end of this module study, you will be able to:
- Explain the reasons why we study samples instead of populations
- Discuss the different types of sampling method such as:
-simple random sampling with replacement / without replacement
-systematic random sampling
-cluster sampling
-stratified random sampling
-multistage random sampling
- Discuss sampling distributions
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Apply an appropriate sampling design to your projected research. How did you fare in your responses when compared with the ASAQs? See module.
Output: Paper showing the application of the sampling design.
8 At the end of this module study, you will be able to:
Activities and exercises on the applications of all the previous modules.
Application of inferential statistics in a study.
12 Examination
13 Module 14 – Introduction to Correlations
Objectives: After studying this module, you will be able to:
- draw graphical representation of relationships of two variables;
- explain the use of correlation and regression analyses; and
- determine the different correlation analyses appropriate for different levels of measurement of data.
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Give a certain study with collected data and construct the scattergram. Give a brief interpretation taking into consideration the form, direction, and precision of the graph. Other activities embedded in the module.
Output: Graphs with interpretation.
15 Module 18 – Simple Linear Regression
Objectives: After studying this module, you will be able to:
- Discuss the uses and properties of simple linear regression analysis;
- Explain the assumptions in applying simple linear regression analysis;
- Compute for estimates of simple linear regression; and
- Test the significance of the regression equation.
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Give studies that would require simple regression analysis computation. Compute manually and using SPSS. Other activities can be found in the module.
Output: Manual and SPSS output.
16-17 Module 19–20 – Multiple Regression
Objectives: Given a data set, you will be able to:
- Select appropriate correlation techniques to answer specific hypotheses in a study;
- Report correlation in terms of statistical significance and meaningfulness;
- Select appropriate regression models suited for the data; and
- Interpret the statistics given by regression procedures.
Delivery: Presentation of the module via Google Meet. Discussion via online class using Google Meet.
Activities: Given a data sheet for multiple regression, compute, analyze, and interpret.
Output: Interpreted output from SPSS and manually treated data.
18 Examination
GRADING SYSTEM
INTRODUCTION
Welcome to the world of statistics! You are about to encounter numbers, tables, names, graphs,
probabilities, and trends; in other words, all about statistics.
The module will teach you what descriptive statistics is all about. Statistics is an orderly science;
hence it can be understood easily. This module gives a conceptual understanding of the statistical procedures used in
nursing as well as the computational skills to carry out these procedures. At the
end of the module, some activities and exercises are given. Please do the activities and answer the
questions because they will enhance your mastery of the lesson. Approach this module with an open
and positive mind. You will like statistics because it is a very useful course.
OBJECTIVES
Statistics is the science of data. It is a meaningful and useful science whose broad scope of application
to nursing and other health sciences, to government, to business, and to other physical and
biopsychosocial sciences is limitless. What about you? What comes to mind when you think of
statistics? Does it bring into your mind unemployment figures, election returns, or basketball scores?
Or is it simply a graduate course requirement you have to complete?
Statistics is logical. It has a key role in critical thinking in the classroom, in the hospital, on the job, or
in everyday life. Thus, the time you spend in studying the subject will repay you in many ways later.
Each of us has a built-in system of reference that helps us make decisions. However, we also have a
built-in set of prejudices that may affect our decisions. One definite advantage of statistics is that it can
help us make decisions without prejudice. Moreover, statistics can be used for making decisions when
faced with uncertainties. For example, suppose you want to estimate the proportion of
nurses enrolled in this course who will finish the course on time. You would need statistics to
predict the number of those who will finish versus those who will not.
The general prerequisite for statistical decision-making is the gathering of numerical facts or
information. Procedures for evaluating numerical data, together with rules of inference, are prime
topics in the study of statistics.
In this line of work, statisticians are trained in collecting, evaluating, and drawing conclusions from
numerical information. More importantly, statisticians determine what information is relevant in
a given problem and whether the conclusions drawn from the study are to be trusted.
Statistical methods by themselves have no power to work miracles; however, these methods can help
us make some decisions. Furthermore, the statistical results should be interpreted by one who
understands not only the methods but also the subject matter, especially the conceptual or theoretical
framework to which statistics have been applied.
Thus, statistics is the science of data that involves collecting, classifying, summarizing, organizing,
analyzing, and interpreting numerical information or data.
Statistical methods are useful for studying, analyzing, and learning about populations. A population is
a set of units, such as people, objects, transactions, or events, that we are interested in studying. For
example, populations may include:
1. People
1.1 all Filipino women working in foreign countries
1.2 all registered nurses in the Philippines
1.3 everyone who is enrolled in nursing in the WCC Antipolo.
2. Objects
2.1 all theses and dissertations done in 1998
2.2 all stores selling Filipino products
2.3 all shoes manufactured in Marikina
3. Transactions
3.1 all memos of agreement signed by the WCC Antipolo administration in 1998
3.2 all sales of Jollibee foods delivered to the WCC College of Nursing from Antipolo
branch in January-February 1999
3.3 all promotions of the WCC Antipolo faculty in 1997
4. Events
4.1 all victims of fireworks accidents brought to PGH emergency room in December 1998
and January 1999
4.2 all birthday celebrations of graduating students in April 1999
4.3 all births registered at all Manila hospitals on February 14, 1999
In the above examples, you will notice that each set includes all the units in the population.
According to McClave and Sincich (1997), it is possible to measure a characteristic for every unit in
the population if the population you wish to study is small. For example, if you are measuring the
high school GPA of all incoming first year students at WCC Antipolo, it is feasible to obtain these
data. When we measure a characteristic for every unit of a population, the result is a census of the
population.
Oftentimes it is not feasible to study the entire population. For instance, how would you measure the
weight and height of each 5-year-old boy in the Philippines? For such a population, conducting a
census would be prohibitively time-consuming and very costly. A reasonable alternative is to select
and study a subset or a portion of the population.
A sample is a subset of a population. It is a finite number of units selected from the population. Thus,
a sample is simply a part of the population. But not every sample is representative of a population. To
be representative, the sample must be selected randomly. A random sample is determined
completely by chance. According to Brase and Brase (1983), in simple random sampling every
member or unit of the population has an equal probability or chance of being included in the sample.
For example, instead of polling all 139,000 registered nurses in the Philippines regarding who they
voted for during the 1998 presidential election, a pollster can just randomly select a sample of 1,000
registered nurses to represent all the registered nurses in the Philippines.
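The pollster example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the ID numbers 1 to 139,000 stand in for a roster of registered nurses that the module does not provide, and the function name draw_simple_random_sample is hypothetical.

```python
import random

# ASSUMPTION: ID numbers 1..139,000 stand in for the roster of registered
# nurses mentioned in the text; the actual roster is not given.
population_ids = range(1, 139_001)


def draw_simple_random_sample(frame, n, seed=None):
    """Return n distinct units, each unit being equally likely to be chosen."""
    rng = random.Random(seed)      # a seed only makes the draw repeatable
    return rng.sample(frame, n)    # sampling without replacement


sample_ids = draw_simple_random_sample(population_ids, 1_000, seed=1998)
print(len(sample_ids))             # number of nurses selected
print(len(set(sample_ids)))        # same number: no nurse is chosen twice
```

Because the selection is left entirely to the random number generator, no nurse is favored over another, which is what makes the sample a simple random sample.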
In studying a population, we focus on one or more characteristics or properties of the units in the
population. Such characteristics are called variables.
Example 1
A PhD student in Nursing investigated the number of children per household in Quezon City.
A sample of 500 households in Quezon City was randomly selected to determine the number of
children per family.
a. Describe the population
b. Describe the sample
c. Describe the variable of interest
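Solution
a. The population is the set of all households in Quezon City.
b. The sample is the 500 households that were randomly selected.
c. The variable of interest is the number of children per household.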
1.2.3 Measurement
Statistics can be applied in the analysis of a variable only when the variable can be represented numerically. We
do this through the process of measurement. Measurement is the process we use to assign numbers
to variables of individual population units. For example, we can measure the teaching performance of
a faculty member by asking all his/her students to rate his/her performance on a scale from 1 to 10. Or,
we can measure research assistants’ ages by simply asking them their actual age. To gather data for a
variable we can use either quantitative measurements or qualitative measurements.
Quantitative measurements use a naturally occurring numerical scale to describe the size of a
particular data value.
Examples:
1. The temperature (in degrees Celsius) at which 20 pieces of heat-resistant plastic begin to
melt.
2. The current unemployment rate (measured as a percentage) for each province and city of
the Philippines.
3. The scores of a sample of 150 medical school applicants on the NMAT, which is administered
nationwide.
4. The number of master’s students who successfully finished the degree over a ten-year period.
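Qualitative measurements classify each unit into one of a set of categories rather than placing it on a
numerical scale.
Examples:
1. The political party affiliation (Lakas-NUCD, Laban, Peoples’ Party, Masang Makabayan,
or Independent) of 100 voters from Parañaque.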
2. The academic status (pass or fail) on the comprehensive exam of 20 doctoral students.
3. The size of the refrigerators (big, medium, small) rented by each of a sample of 30 transient
boarders.
4. A taste tester’s ranking (best, worst, average) of four brands of salad dressing, for each of a panel of
10 testers.
After the variables of interest for every unit in the sample or population are measured, the data are
analyzed either by descriptive or inferential statistical methods.
Descriptive statistics utilizes numerical and graphical methods to look for patterns in a data set, to
summarize the information in a convenient form.
Inferential statistics utilizes sample data to make estimates, decisions, predictions, or other
generalizations about a population. In this unit, we will only focus on descriptive statistics.
Let us now pause for some activities and exercises. Compare your responses with the answers given at
the end of this module. Do not skip these exercise questions; they are important.
SAQ 1-1
Define statistics. Why is it a science?
SAQ 1-2
SAQ 1-3
What is the guideline we should have in interpreting results?
SAQ 1-4
Chemical and manufacturing plants sometimes discharge toxic-waste materials such as
chlorofluorocarbons (CFCs) into nearby rivers and streams. These toxins can adversely affect the plants and
animals inhabiting the river and riverbank. The Philippine Army Corps of Engineers recently
conducted a study of fish in Dicayo River in Zamboanga del Norte and its three tributary creeks:
Biniray Creek, Bolarot Creek, and Matam Creek. A total of 144 fish were captured and the following
variables were measured for each:
SAQ 1-5
A group of students from UP Manila is concerned about the rising student fees at universities and
colleges nationwide. So the group selected a random sample of 30 colleges and universities throughout
the country to obtain information about their respective student fees.
a. What is the population?
b. What is the sample?
ACTIVITY 1-1
The student’s report should reflect the various ways in which statistics can authenticate reports:
through percentages, frequencies, and averages.
As evidenced by media today, there is a need to evaluate the flood of information reaching our
homes. Each day the media present us with published results on economic, health, social and other
concerns. The growth in data collection associated with scientific phenomena, business operations,
and government activities (quality control, statistical auditing, forecasting, etc.) has been remarkable
in the 1990s. This scenario demands that each of us develop a discerning sense: an ability to
use rational thought to interpret the meaning of data. This ability can help us think critically and make intelligent
decisions, inferences, and generalizations. This is possible with the use of statistics.
Statistical thinking involves applying rational thought to assess data and the inferences made from
them critically.
Are you still with me? Let us pause and do some activities.
SAQ 1-6
Pollsters regularly conduct opinion polls to determine the popularity rating of the current president.
Suppose a poll is to be conducted tomorrow in which 2,000 individuals (18 years old and above) will be
asked whether the president is doing a good job in running the country. The 2,000 individuals will be
selected by random-digit telephone dialing and asked the question over the phone.
SAQ 1-7
What is statistical thinking?
1.4 SUMMATION NOTATION
In statistics, it is necessary to work with sums of numerical values. To express these, we make use of
standard notation. Let us consider the exam scores of Bertha Pila on 9 statistics exams.
In mathematical notation, letter X denotes a score in a data set. From Bertha’s scores, we have the
following data:
$X_1$ = score on Exam 1 = 88
$X_2$ = score on Exam 2 = 6
$X_3$ = score on Exam 3 = 46
$X_4$ = score on Exam 4 = 55
$X_5$ = score on Exam 5 = 28
$X_6$ = score on Exam 6 = 9
$X_7$ = score on Exam 7 = 78
$X_8$ = score on Exam 8 = 64
$X_9$ = score on Exam 9 = 16
The numbers 1 to 9 written beside the Xs are called subscripts. They represent the first to the 9th
observed score in a given data set. In this case, $X_1$ represents Bertha’s score on the first exam while $X_9$
represents her score on the ninth exam. In general, $X_i$ denotes the ith value in a data set. Using this
notation, the sum of Bertha’s exam scores can be expressed symbolically as:
$X_1 + X_2 + X_3 + X_4 + X_5 + X_6 + X_7 + X_8 + X_9$
But instead of writing down all these Xs, we can simply express this sum as $\sum_{i=1}^{9} X_i$, where the
symbol $\sum$ (Greek capital letter “sigma”) is the summation notation used in statistics.
Thus, $\sum_{i=1}^{9} X_i$ tells us to get the sum of the first, second, third, and so on up to the ninth values.
In statistics, we always compute for the total sum and not for the partial sum, and so $\sum_{i=1}^{9} X_i$ can be further
simplified to $\sum X$, which means “summation of all the scores” in a data set.
$\sum X = X_1 + X_2 + X_3 + X_4 + X_5 + X_6 + X_7 + X_8 + X_9$
$= 88 + 6 + 46 + 55 + 28 + 9 + 78 + 64 + 16$
$= 390$
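The same total can be checked with a short Python sketch. The list name scores is only an illustrative choice; the values are Bertha’s nine exam scores as listed in the text.

```python
# Bertha Pila's nine exam scores, X_1 through X_9, as listed in the text.
scores = [88, 6, 46, 55, 28, 9, 78, 64, 16]

# Explicit loop over the subscript i = 1, ..., 9, mirroring the sigma notation.
total = 0
for i in range(1, len(scores) + 1):
    total += scores[i - 1]   # scores[i - 1] holds X_i (Python lists start at 0)

print(total)         # 390
print(sum(scores))   # the built-in sum gives the same total
```

The loop spells out what the summation sign abbreviates: start at the first score, add each score in turn, and stop after the ninth.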
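Rule 1: $\sum XY$ is not equal to $\sum X \sum Y$
Example:
X    Y    XY
1    4    4
2    5    10
3    6    18
$\sum X = 6$    $\sum Y = 15$    $\sum XY = 32$
Steps:
Multiply each X value by its paired Y value.
Get the sums $\sum XY$, $\sum X$, and $\sum Y$.
Check whether $\sum XY$ is equal to $\sum X \sum Y$:
$\sum XY = \sum X \sum Y$
$32 = (6)(15)$
$32 \neq 90$
Therefore, $\sum XY \neq \sum X \sum Y$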
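Rule 2: $\sum (X + C)$ is not equal to $\sum X + C$
Check whether $\sum (X + C) = \sum X + C$:
$36 = 21 + 5$
$36 \neq 26$
Therefore, $\sum (X + C) \neq \sum X + C$. The constant is added once for every score, so $\sum (X + C) = \sum X + NC$.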
Rule 3: $(\sum X)^2$ is not equal to $\sum X^2$
Example:
X    $X^2$
2    4
4    16
6    36
$\sum X = 12$    $\sum X^2 = 56$
Steps:
Multiply each X value by itself.
Get $\sum X$ and $\sum X^2$.
Check whether $(\sum X)^2 = \sum X^2$:
$(\sum X)^2 = \sum X^2$
$(12)^2 = 56$
$(12)(12) = 56$
$144 \neq 56$
Therefore, $(\sum X)^2 \neq \sum X^2$
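The three summation rules can be verified numerically with a small Python sketch. The data for Rules 1 and 3 come from the worked examples in the text. For Rule 2 the text shows only the totals (36, 21, and 5), so the three scores used below are an assumption chosen to reproduce those totals, and all variable names are illustrative.

```python
# Data taken from the worked examples in the text.
x_rule1 = [1, 2, 3]
y_rule1 = [4, 5, 6]
x_rule3 = [2, 4, 6]

# Rule 1: the sum of products is not the product of the sums.
sum_xy = sum(x * y for x, y in zip(x_rule1, y_rule1))   # 4 + 10 + 18
product_of_sums = sum(x_rule1) * sum(y_rule1)           # 6 * 15

# Rule 2: adding a constant C to every score adds N * C to the total.
# ASSUMPTION: x_rule2 is hypothetical; the text gives only sum X = 21 and C = 5.
x_rule2 = [6, 7, 8]
c = 5
sum_x_plus_c = sum(x + c for x in x_rule2)              # 11 + 12 + 13
sum_x_then_c = sum(x_rule2) + c                         # 21 + 5

# Rule 3: the square of the sum is not the sum of the squares.
square_of_sum = sum(x_rule3) ** 2                       # 12 ** 2
sum_of_squares = sum(x * x for x in x_rule3)            # 4 + 16 + 36

print(sum_xy, product_of_sums)        # 32 90
print(sum_x_plus_c, sum_x_then_c)     # 36 26
print(square_of_sum, sum_of_squares)  # 144 56
```

In each pair the two numbers differ, which is the point of the rules: the order in which you multiply, add a constant, or square relative to summing changes the answer.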
SUMMARY
In this module, we saw that statistics is the study of how to collect, organize, analyze, and interpret
numerical information. We investigated some types of problems where statistics can be used. In these
situations, we saw examples of populations and samples. It is important to remember that the main role
of inferential statistics is to draw conclusions about a population based on information obtained from a
sample, whereas the main role of descriptive statistics is to present or summarize a large mass of data
in a manageable form. We also saw in this module the elements of statistics and, finally, the
role of statistics in critical thinking. With all this, let us cultivate a liking for this course. We shall
learn more as we study the other modules. Keep up the good work of reading your modules. Statistics
is a skill, and you will soon have it.
2
Frequency Distributions
INTRODUCTION
The initial step in the descriptive process, that is, describing the data and the cases that are represented by
those data, is the organization of otherwise disorganized information and the condensation of
otherwise unmanageably large quantities of information.
The large mass of data may be organized by creating a frequency distribution table containing the
following components: frequency, percentage, cumulative frequency, and cumulative percentage. This
module discusses first the ungrouped frequency distributions and later, the grouped.
OBJECTIVES
Basically, frequency distributions show in tabular form the number of times each score or category appears
in a data set. Scores in their original form are called raw scores or raw data. Raw scores are usually not
arranged in any particular order, thus making it difficult for readers to see clearly the features of the
data. See for example Table 2.1, which lists the raw scores of 40 master’s students in their statistics
final examination for their N-298 class in UP Manila. These scores are not arranged in any particular
order, making it hard to examine clearly how well students performed as a group, or how varied the
scores are from one student to the next.
TABLE 2.1 Raw Scores on the Statistics Final Examination of Masters’ Students
81 94 90 80 87 80 85 95
83 92 87 70 96 76 87 89
86 79 75 83 84 75 81 81
81 84 70 78 96 94 88 78
80 77 93 87 77 78 79 72
Table 2.2, on the other hand, presents another version of the data in Table 2.1. Notice that the final
examination scores are now arranged in order from highest to lowest in the first column, labeled X.
Frequencies are then listed in the second column, labeled f, showing how many students received each
listed score. When data are organized this way, we can see at a glance that the scores ranged from a
low of 70 to a high of 96, or that four students had a score of 81 and another four had a score of 87.
Such a presentation is called an ungrouped frequency distribution. Ungrouped frequency distributions
begin the process of organizing the data into a meaningful form. You can incorporate in the ungrouped
frequency distribution table columns for raw score (X), frequency (f), percentage (%), cumulative
frequency (cf), and cumulative percentage (c%).
2.1.1 Frequencies
To determine the frequencies of the scores in the data set, first arrange the raw scores in ascending or
descending order (as shown in Table 2.2). Then, under the f column, indicate the number of times
each score appeared in the data set (see Table 2.1). Notice that the sum of all the frequency values ($\sum f$)
is equal to N, the total number of observations or scores in the data set.
TABLE 2.2 Ungrouped Frequency Distribution of the Statistics Final Examination Scores of 40
Master’s Students
X f % cf c%
96 2 5.0 40 100.0
95 1 2.5 38 95.0
94 2 5.0 37 92.5
93 1 2.5 35 87.5
92 1 2.5 34 85.0
91 0 0.0 33 82.5
90 1 2.5 33 82.5
89 1 2.5 32 80.0
88 1 2.5 31 77.5
87 4 10.0 30 75.0
86 1 2.5 26 65.0
85 1 2.5 25 62.5
84 2 5.0 24 60.0
83 2 5.0 22 55.0
82 0 0.0 20 50.0
81 4 10.0 20 50.0
80 3 7.5 16 40.0
79 2 5.0 13 32.5
78 3 7.5 11 27.5
77 2 5.0 8 20.0
76 1 2.5 6 15.0
75 2 5.0 5 12.5
74 0 0.0 3 7.5
73 0 0.0 3 7.5
72 1 2.5 3 7.5
71 0 0.0 2 5.0
70 2 5.0 2 5.0
$\sum f = N = 40$
The percentage associated with each score can be computed using this equation:
$\text{Percentage (\%)} = \frac{f}{N} \times 100$
where f = each score’s frequency of occurrence
N = total number of scores in the distribution
Percentages have one advantage over frequencies. It is often easier to compare two or more
percentages than frequencies. This is particularly true in instances when 2 or more different
distributions have different sample sizes.
The cumulative percentage for any given score is computed using this equation:
$c\% = \frac{cf}{N} \times 100$
where cf = the cumulative frequency listed for a score
N = total number of scores in the distribution
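The columns of an ungrouped frequency distribution can also be computed directly from the raw scores. The sketch below is one possible way to do it in Python, not the module’s prescribed procedure. It uses the 40 scores of Table 2.1 and the two equations above; the function name ungrouped_distribution is hypothetical.

```python
from collections import Counter

# Raw scores from Table 2.1 (40 students).
raw_scores = [
    81, 94, 90, 80, 87, 80, 85, 95,
    83, 92, 87, 70, 96, 76, 87, 89,
    86, 79, 75, 83, 84, 75, 81, 81,
    81, 84, 70, 78, 96, 94, 88, 78,
    80, 77, 93, 87, 77, 78, 79, 72,
]


def ungrouped_distribution(scores):
    """Return rows (X, f, %, cf, c%) from the highest score down to the lowest."""
    n = len(scores)
    counts = Counter(scores)
    rows = []
    cf = 0
    # Accumulate from the lowest score upward so cf counts scores at or below X.
    for x in range(min(scores), max(scores) + 1):
        f = counts.get(x, 0)          # scores that never occur get f = 0
        cf += f
        rows.append((x, f, 100 * f / n, cf, 100 * cf / n))
    return rows[::-1]                 # highest score listed first


for x, f, pct, cf, cpct in ungrouped_distribution(raw_scores):
    print(f"{x:>3} {f:>2} {pct:>5.1f} {cf:>3} {cpct:>6.1f}")
```

Each printed row gives X, f, %, cf, and c% in that order, so the output can be compared line by line with a table built by hand.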
ACTIVITY 2-1
Below are scores of 60 students in Mathematics.
19 31 36 26 34 32
44 33 37 39 45 21
24 38 40 42 39 32
43 18 24 32 49 33
33 33 40 24 46 22
29 33 37 30 43 43
26 39 57 30 40 33
25 33 48 39 34 29
29 37 39 35 41 29
23 32 48 28 45 19
To construct a grouped frequency distribution for the data set in Table 2.1, do the following steps:
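$i = \frac{R}{\text{number of class intervals}} = 4.5 \text{ or } 5$
where i is the size of each class interval and R is the range of the scores.
4. Determine f, %, cf, and c%.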
TABLE 2.3 Grouped Frequency Distribution of Statistics Final Exam Scores of 40 Nursing Master’s
Students
Class Interval f % cf c%
95-99 3 7.5 40 100.0
90-94 5 12.5 37 92.5
85-89 8 20.0 32 80.0
80-84 11 27.5 24 60.0
75-79 10 25.0 13 32.5
70-74 3 7.5 3 7.5
In comparing Table 2.2 with Table 2.3, it is shown that the grouped frequency distribution table has
class intervals while the ungrouped has none. Furthermore, grouped frequency distributions provide a
simpler, more economical description of the data than do the ungrouped frequency distributions. By
combining several scores into one class interval, grouped frequency distributions reduce the total
amount of information that must be digested by someone reading the table.
Again, take a look at the class intervals in Table 2.3. Each class interval is bounded by numbers called
real limits or exact limits. Thus, each class interval has a lower and an upper exact limit. For example, the lower
and upper exact limits of the class interval 85-89 are 84.5 and 89.5, respectively. Furthermore, each class interval
can be represented by one value, and that is the midpoint. A midpoint is the middle value in a class
interval. For the class interval 80-84, the midpoint is 82.
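A grouped frequency distribution, together with the exact limits and midpoint of each class interval, can be sketched in Python as follows. This is an illustration under stated assumptions: the lowest class limit (70) and the interval size (5) are read off Table 2.3, the raw scores are those of Table 2.1, and the function name grouped_distribution and its parameters are hypothetical.

```python
# Raw scores from Table 2.1 (40 students).
raw_scores = [
    81, 94, 90, 80, 87, 80, 85, 95,
    83, 92, 87, 70, 96, 76, 87, 89,
    86, 79, 75, 83, 84, 75, 81, 81,
    81, 84, 70, 78, 96, 94, 88, 78,
    80, 77, 93, 87, 77, 78, 79, 72,
]


def grouped_distribution(scores, lowest_limit, width):
    """Return one dict per class interval, highest interval first."""
    n = len(scores)
    rows = []
    cf = 0
    lower = lowest_limit
    while lower <= max(scores):
        upper = lower + width - 1                       # stated upper limit
        f = sum(lower <= x <= upper for x in scores)    # scores in this class
        cf += f
        rows.append({
            "interval": f"{lower}-{upper}",
            "f": f,
            "pct": 100 * f / n,
            "cf": cf,
            "cpct": 100 * cf / n,
            "exact_limits": (lower - 0.5, upper + 0.5), # half a unit beyond each stated limit
            "midpoint": (lower + upper) / 2,            # middle value of the class
        })
        lower += width
    return rows[::-1]


table = grouped_distribution(raw_scores, lowest_limit=70, width=5)
for row in table:
    print(row["interval"], row["f"], row["pct"], row["cf"], row["cpct"],
          row["exact_limits"], row["midpoint"])
```

The exact limits extend half a unit below and above the stated limits, which is why the interval 85-89 runs from 84.5 to 89.5, and the midpoint is the average of the two stated limits.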
ACTIVITY 2-2
Construct a grouped frequency distribution table for the data set in Activity 2-1. Include columns for f,
%, cf, c%, exact limits, and midpoints.
SAQ 2-1
Why is it important to have frequency distributions? In how many ways can we present a data set?
ACTIVITY 2-3
At the World Citi Colleges, College of Nursing, 25 faculty members gave the following information
about the total number of hours they spent on various committee meetings. The summary hours are
computed within a month’s time.
20 22 18 16 25 15 23
21 22 22 20 23 25 22
20 18 18 22 24 25
25 24 16 25 10
SAQ 2-2
What’s the advantage of creating a grouped frequency distribution table over an ungrouped
one?
SUMMARY
This module showed you the importance of arranging data and presenting them in distribution tables
that show the frequency, percentage, cumulative frequency, and cumulative percentage.
One application of a frequency distribution is that it can give us an idea of how many students
performed below a given passing score. It can give us the picture of how well or how badly a student
performed in a class relative to the scores of the other students.
In the succeeding modules, you will have more of this frequency distribution theme presented in
graphs, histograms, and other position measures. I wish to encourage you to go on – statistics is not
really hard because it is a science of order and logic.
So, until next time, keep on doing the activities because they will build your statistical skills.