classification of standardized achievement test tests

Classification of Standardized achievement test Tests

Numerous tests are used in the schools, which can be classified into different

types or categories. One way to classify tests is by the construct the test purports

to measure. Using this method, tests can be organized into achievement tests,

aptitude or intelligence tests, personality inventories, projective techniques,

interest inventories, attitude measures, and so forth. Each of these types of tests

may be further divided into subcategories. For example, individual tests are

designed for administration in a one-on-one situation. Group tests are designed

for group administration.

Individual tests are typically used for clinical purposes, such as making a

diagnosis of a disability or disorder or determining strengths and weaknesses in

a specific area of functioning (e.g., intelligence, achievement). These tests are

administered to one student at a time. The test user should have considerable

training in test administration, scoring, and interpretation. Group tests, on the

other hand, are designed primarily as instruments for mass testing (Anastasi &

Urbina, 1997). They are largely pencil-and-paper measures suitable for

administration to large or small groups of students at the same time. The

majority of tests used in schools are group tests. The recent large-scale tests

used by states are also group tests. Group tests are fairly easy to administer and

score, and their use does not require much special training on the part of the

examiners. Some group tests may also be computer administered and scored.

1

Tests may also be grouped into speed tests and power tests. A speed test is

designed to measure the speed or rate of performance, rather than the

acquisition or mastery of knowledge. Typically, speed tests include very easy

items so every test taker knows the answers. This way, only speed of response is

measured by the test. On the other hand, a power test is designed to measure the

knowledge of the test taker, regardless of his or her speed of performance.

Power tests contain items with varying degrees of difficulty and allow enough

time for test takers to attempt all items. Performance is based on how well a

student can answer the items, instead of how fast he or she can perform.

However, most tests used in the schools measure both knowledge and speed as

factors on test performance. Specifically, these tests are designed to measure

students' knowledge in a domain of content and a time limit is set for the

students to complete the test. In other words, a student's score is influenced by

both accuracy and speed of his or her answers. On such tests, although a student

may have the necessary knowledge required to answer the test items correctly,

the student may not receive a high score if he or she works in a slow speed and

is unable to complete many items within the time limit.

Another way to classify tests is by the measurement theory that underlies a test.

Based on this consideration, tests may be categorized into norm-referenced tests

(NRT) and criterion-referenced tests (CRT). The major difference between NRT

and CRT lies in the standard used in interpreting test performance. In NRT, a

2

student's performance is compared to that of other students in a group.

Specifically, interpretation of scores is made by comparing the student's

performance to the average of performance of other similar students on the test

or to the norm of a standardization sample. It indicates the student's status in a

group or how well the student has performed with respect to the rest of the

group. NRT is by far the most common approach to test interpretation.

However, it should be noted that because NRT compares a student's

performance to the normative group, language, culture, and socioeconomic

status differences between the student and the normative group could affect the

student's score (Kubiszyn & Borich, 2003). CRT, also known as domain-

referenced tests, employs a different frame of reference in test interpretation

than does NRT. In this approach, a performance standard called criterion is

established prior to testing to indicate mastery of the specific content domain

covered by the test. A student's performance is compared to the pre established

criterion, rather than to the performance of other students. Interpretation of CRT

results yield specific information regarding the student's proficiency in or

mastery of the measured skills. In recent years, states have begun to develop

minimum competence tests to assess students' performance, which are designed

and used based on the theory of criterion-referenced measurement.

3

The Metropolitan Achievement Tests

The Metropolitan Achievement Tests (MAT7), a series of norm-referenced,

objectives-based tests, were administered in October 2004 to students in grades

3 and 5.

4

These tests are designed to measure achievement in the basic skills commonly

found in state and district curricula. The content areas measured were Reading,

Mathematics, Language, Science and Social Studies. Also administered was the

Otis-Lennon School Ability Test (OLSAT), which measures the student’s

ability to cope with school learning tasks in verbal and nonverbal areas.

The percentile ranks compare the students’ performance in Braintree with mean

scores of students in the same grade across the nation.

The Peabody Individual Achievement Test

The Peabody Individual Achievement Test (PIAT-R) is the revised version of

the now classic Peabody Individual Achievement Test (PIAT), an individually

administered, norm-referenced measure of academic achievement. The test was

designed to provide a wide-range screening measure in six content areas that

can be used with students in kindergarten through the 12th grade.

Nelson-Denny Reading Test

he Nelson-Denny Reading Test was created in 1929 by M.S. Nelson and E.C.

Denny, both of whom were on the faculty of Iowa State Teacher's College. The

purpose of the test is to measure reading ability among high school and college

students. It is not appropriate for the clinical evaluation of reading disorders,

5

http://en.wikipedia.org/wiki/Reading_(process)

http://en.wikipedia.org/wiki/Iowa_State_University

however it may be used to identify students in need of remedial reading

instruction. The Nelson-Denny includes two

subtests(Vocabulary and Comprehension) both with multiple choice questions,

and yields four scores. The most recent revision was published in 1993 and is

available from Riverside Publishing in Itasca, Illinois.

The Nelson-Denny can be administered in a group format. Total administration

time is about 45 minutes which includes time spent instructing examinees

6

http://en.wikipedia.org/wiki/Itasca,_Illinois

http://en.wikipedia.org/wiki/Riverside_Publishing

http://en.wikipedia.org/wiki/Multiple_choice_question

http://en.wikipedia.org/wiki/Reading_comprehension

http://en.wikipedia.org/wiki/Vocabulary

http://en.wikipedia.org/wiki/Remedial_instruction

http://en.wikipedia.org/wiki/Remedial_instruction

classification of standardized achievement test tests

Documents