domingo, 25 de septiembre de 2011

Assessing English Language Learners

Introductory chapter

Evaluation is the systematic acquisition and assessment of information to provide useful feedback about some object.

Assessment is the process of documenting, usually in measurable terms, knowledge, skills, attitudes and beliefs. Assessment can focus on the individual learner, the learning community (class, workshop, or other organized group of learners), the institution, or the educational system as a whole. Studies of a theoretical or empirical nature (including case studies, portfolio studies, exploratory, or experimental work) addressing the assessment of learner aptitude and preparation, motivation and learning styles, learning outcomes in achievement and satisfaction in different educational contexts are all welcome, as are studies addressing issues of measurable standards and benchmarks.

It is important to notice that the final purposes and assessment practices in education depends on the theoretical framework of the practitioners and researchers, their assumptions and beliefs about the nature of human mind, the origin of knowledge and the process of learning.

A test or an examination (or "exam") is an assessment intended to measure a test-taker's knowledge, skill, aptitude, physical fitness, or classification in many other topics

Types of Tests

Norm-Referenced
Standardized tests compare students' performance to that of a norming or sample group who are in the same grade or are of the same age. Students' performance is communicated in percentile ranks, grade-equivalent scores, normal-curve equivalents, scaled scores, or stanine scores.
Criterion-Referenced
A student's performance is measured against a standard. One form of criterion-referenced assessment is the benchmark, a description of a key task that students are expected to perform.
Survey
Survey tests typically provide an overview of general comprehension and word knowledge.
Diagnostic Tools
Diagnostic tests assess a number of areas in greater depth.

Formal Tests
Formal tests may be standardized. They are designed to be given according to a standard set of circumstances, they have time limits, and they have sets of directions which are to be followed exactly.
Informal Tests
Informal tests generally do not have a set of standard directions. They have a great deal of flexibility in how they are administered. They are constructed by teachers and have unknown validity and reliability.
Static (Summative) Tests
Measures what the student has learned.
Dynamic (Formative) Tests
Measures the students' grasp of material that is currently being taught. Can also measure readiness. Formative tests help guide and inform instruction and learning.

The Cornerstones of Testing

Language testing at any level is a highly complex undertaking that must be based on theory as well as practice. Although focuses on practical aspects of classroom testing, an understanding of the basic principles of good testing is essential. The guiding principles that govern good test design, development and analysis are validity, reliability, practicality, washback, authenticity, transparency and security. Constant references to these important "cornerstones" of language testing will be made throughout the workshop.
cornerstones checklist

Validity

The term validity refers to the extent to which a test measures what it says it measures. In other words, test what you teach, how you teach it! Types of validity include content, construct, and face. For classroom teachers, content validity means that the test assesses the course content and outcomes using formats familiar to the students. Construct validity refers to the "fit" between the underlying theories and methodology of language learning and the type of assessment. For example, a communicative language learning approach must be matched by communicative language testing. Face validity means that the test looks as though it measures what it is supposed to measure. This is an important factor for both students and administrators. Other types of validity are more appropriate to large-scale assessment.


Reliability

Reliability refers to the consistency of test scores. It simply means that a test would give similar results if it were given at another time. Three important factors effect test reliability. Test factors such as the formats and content of the questions and the length of the exam must be consistent. For example, testing research shows that longer exams produce more reliable results than very brief quizzes. Administrative factors are also important for reliability. These include the classroom setting (lighting, seating arrangements, acoustics, lack of intrusive noise etc.) and how the teacher manages the exam administration. Affective factors in the response of individual students can also affect reliability. Test anxiety can be allayed by coaching students in good test-taking strategies.


Practicality

Classroom teachers are well familiar with practical issues, but they need to think of how practical matters relate to testing. A good classroom test should be "teacher-friendly". A teacher should be able to develop, administer and mark it within the available time and with available resources. Classroom tests are only valuable to students when they are returned promptly and when the feedback from assessment is understood by the student. In this way, students can benefit from the test-taking process.


Washback

Washback refers to the effect of testing on teaching and learning. Unfortunately, students and teachers tend to think of the negative effects of testing such as "test-driven" curricula and only studying and learning "what they need to know for the test". Positive washback, or what we prefer to call "guided washback" can benefit teachers, students and administrators. Positive washback assumes that testing and curriculum design are both based on clear course outcomes which are known to both students and teachers/testers. If students perceive that tests are markers of their progress towards achieving these outcomes, they have a sense of accomplishment. In short, tests must be part of learning experiences for all involved.


Authenticity

Language learners are motivated to perform when they are faced with tasks that reflect real world situations and contexts. Good testing or assessment strives to use formats and tasks that mirror the types of situations in which students would authentically use the target language. Whenever possible, teachers should attempt to use authentic materials in testing language skills.




Transparency

Transparency refers to the availability of clear, accurate information to students about testing. Such information should include outcomes to be evaluated, formats used, weighting of items and sections, time allowed to complete the test, and grading criteria. Transparency dispels the myths and mysteries surrounding secretive testing and the adversarial relationship between learning and assessment. Transparency makes students part of the testing process.


Security

Most teachers feel that security is an issue only in large-scale, high-stakes testing. However, security is part of both reliability and validity. If a teacher invests time and energy in developing good tests that accurately reflect the course outcomes, then it is desirable to be able to recycle the tests or similar materials. This is especially important if analyses show that the items, distractors and test segments are valid and discriminating. In some parts of the world, cultural attitudes towards "collaborative test-taking" are a threat to test security and thus to reliability and validity. As a result, there is a trade-off between letting tests into the public domain and giving students adequate information about tests. 
 




No hay comentarios:

Publicar un comentario