Home Table of Contents Orientation Support Lessons
Navigation Tabs
Divider bar space Previous Page  17 of 45  Next Page space
space line
Glossary Graphic Glossary for Reliability
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Coefficient (or Cronbach´s) alpha: An internal consistency measure of a test that is the average of all split-half reliabilities.

Inter-rater reliability: The correlation between two different scorers´ scores on the same test. This is a measure of agreement or consistency between different test scorers.

Internal consistency reliability: The degree of consistency in responses across the many items of a single test or the consistency with which those items measure a single dimension.

Judgmental review: A method for detecting bias that uses the opinions of individuals representing relevant subgroups of the population of potential examinees to assess the fairness of items on a test.

Objective Scoring: Scoring systems which do not require any expertise or opinion. If a test is computer scorable, it is objectively scored.

Parallel-forms reliability: The correlation between test scores from two versions (forms) of a test that are presumed to have the same measurement characteristics.

Percent Correct: A common scoring system which divides the total points received by the total points possible. That proportion is then multiplied by 100 to get a percentage.

Random error: Error that occurs by chance and is not consistent or predictable. Contrasts with systemic error.

Rubric: Organized set of performance criteria associated with a range of point values often used for scoring performance-based assessments, constructed response items and other forms of supply items.

Split-half reliability: An internal consistency measure found by correlating two even halves of a test (such as correlating the odd numbered problems with the even number problems).

Standard error of measurement: A statistical index of measurement error which gives an estimate of how much an examinee´s score might vary across multiple administrations of the same test. The standard error also indicates how close a person´s score probably is to their typical level of performance on that test.

Subjective Scoring: Scoring systems which require some expertise or experience. If two people following the same scoring key and instructions might disagree on the correct number of points to assign, then the scoring is subjective.

Test-retest reliability: The correlation between a student´s scores on the same test taken at two different points in time. For example, a student takes an exam during week one of school and then retakes the same exam during week three.

Principles of Measurementspace
Divider bar space Previous Page Top of Page Next Page space
space line