Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Assessment
A process where test results are subject to critical study according to established
measurement principles.
Assessment decisions could substantially improved student performance, guide the
teachers in enhancing the teaching-learning process and assist policy makers in
improving the educational system.
Assessment
Instructional Decisions
Grading Decisions
Diagnostic Decisions
Selection Decisions
Placement Decisions
Counseling and Guidance Decisions
Program or Curriculum Decisions
Administrative Policy Decisions
Roles of Assessment
Summative tries to determine the extent to which the learning objectives for a course
are met and why.
Diagnostic determining the gaps in learning or learning processes, hopefully to be able
to bridge these gaps.
Formative allows the teacher to redirect and refocus the course of teaching a subject
matter.
Placement plays a vital role in determining the appropriate placement of a student both
in terms of achievement and aptitude.
Approaches in Assessment
Assessment of Learning
Assessment for Learning
Assessment as Learning
Modes of Assessment
Traditional
Alternative
Authentic
NRT
CRT
50%
80%
Performance of other
students
Broad, covers many
objective
Shallow, usually one or two
items per objective
The more spread of scores,
the better
Items are chosen to
promote variance or
spread. One aim is to
produce good distracter
options.
Percentile rank and
standard scores used
Standards indicative of
mastery
Narrow, covers a few
objectives
Narrow, covers a few
objective
Variability may be minimal
Items are chosen to reflect
the criterion behavior
Number succeeding or
failing or range or
acceptable performance
used
Completion Items
Essay Items
Use several short essay questions rather than a long one.
Provide a clear focus for students questions.
Indicate limitations or scoring criteria to pupils.
Item Analysis a numerical method for analyzing test items employing student
response alternatives or options.
Criteria in determining the desirability and undesirability of an item:
a. Difficulty of an item
b. Discriminating power of an item
c. Measures of attractiveness
Difficulty Index (P)
proportion of the number of students in the upper and lower groups who answered
an item correctly.
P = UL + LL
2n
Level of Difficulty of an Item
Index Range
Difficulty Level
Recommendation
0.00- 0.20
Very difficult
NA
0.21 0.40
Difficult
LA
0.41- 0.60
Moderately Difficulty VA
0.61 0.80
Easy
LA
0.81 1.00
Very Easy
NA
Item
Evaluation
Recommendation
VA
0.30 0.39
0.20 - 0.29
Below 0.19
Poor Item
NA
Evaluation
VA
VA
VGI (RET)
GI
(RET)
LA
(RET)
LA
F(
RET)
NA
LA
LA
BF
(REV)
NA
NA
NA
(REJ)
Validity
Reliability
Fairness
Practicality and Efficiency
Ethics in Assessment
Validity
The degree to which a test or measuring instrument measures what it intends to
measure.
soundness (what the test measures and how well it could be applied)
Types of Validity
Content Validity- the extent to which the content or topic of the test is truly
representative of the course.
Depends on the relevance of the individuals responses to the behavior area
under consideration rather on the apparent relevance of item content.
Commonly used in evaluating achievement test.
Appropriate for the criterion-referenced measure.
Concurrent Validity - the degree to which the test agrees or correlates with a
criterion set up as an acceptable measure.
Applicable to test employed for the diagnosis of existing status rather than for the
prediction of further outcome.
E.g. validating a test made by the teacher by correlating with a previously proven
valid test.
Predictive Validity determined by showing how well predictions made from the test
are confirmed by evidence gathered at some subsequent time
Construct Validity the extent to which the test measures a theoretical trait.
Reliability
The extent to which a test is dependable, self-consistent and stable.
It is concerned with the consistency of responses from moment to moment.
A reliable test may not always be valid.
Methods in Testing the Reliability of Good Measuring In
Test-Retest Method the same measuring instrument is administered twice to
the same group of students and the correlation coefficient is determined.
Limitations: time interval, environmental conditions
Spearman rank correlation coefficient or Spearman rho is a statatistical tool used
to measure the relationship between paired ranks assigned to individual scores
on two variables, x and y.
rs = 1 rs
6 D
N3 - N
= Spearman rho
3. Set up the classes by adding C/2 to the highest score as the upper class limit
of the highest class and subtract C/2 to the highest score as the lower class
limit of the highest class. Set up the real and integral limits.
4. Tally the score.
5. Determine the Cumulative Frequency and the Cumulative Percentage
Frequency distributions.
6. Present the frequency polygon and histogram.
Mode- Refers to the score or scores that occurred most in the distribution.
Unimodal
Bimodal
multimodal
Properties of Mode
It is the score/s occurring most frequently
Nominal average
It can be used for qualitative and quantitative data
Not affected by extreme values
Measures of Variability
Refers to a single value that is used to describe the spread out of the scores in a
distribution, that is above or below the measures of central tendency.
Range, quartile deviation, sd