The discourse of assessment has been an important issue in education. Assessment is one of the basic foundations for effective teaching and learning process since it is a way of monitoring and evaluating student achievement and performance during and after the learning process. Nitko & Brookhart (2007, p. 4) defined assessment as “a process for obtaining information that is used for making a decisions about students; curricula, programs, and schools”. Similarly, Smith and Lovat (2003, p. 175) defined assessment as “placing an interpretation on measurement information concerning a student’s or students’ performance”. Thus, it can be synthesized that assessment is fundamental to make decisions and policies related to the improvement of student performance and achievement. The results of assessment can be used as a basis for future improvement and development of teaching and learning process.
Considering the importance of assessment in measuring students’ learning, this paper will discussed several assessment methods in language learning, including advantages and disadvantages, the possible adaptation of each assessment method in language learning. Finally, this paper will sum up a brief synthesis based on the whole aspects.
B. STUDENT ASSESSMENT METHODS IN LANGUAGE LEARNING
1. Multiple choice test
Multiple-choice test is one of the most widely used in achievement test of all types of assessment method. Multiple choice items consist of a stem and a set of options or choices. It is primarily used to assess student learning outcomes at factual and comprehension levels (Airasian, 2005). Thus, multiple-choice can be used to ask questions which asses student ability at knowledge and understanding level.
The use of multiple-choices offers several advantages for both teachers and students. A clear advantage is the scoring system is standardized and uniform in the sense of the availability of designated correct answer. Thus, assessors can reduce their potentiality of being subjectivity and bias in scoring (Athanasou & Lamprianou, 2002; Swartz, 2006; William B. Walstad & Becker, 1994). Furthermore, with little response time required of the students to answer questions, multiple-choice test provide broader content coverage and better content validity than most alternative methods allow (Bacon, 2003; McDonald, 2007; Tanner, 2003).
Despite of its strengths, multiple-choice tests are also characterized with several limitations. Kuechler and Simkin (2003) and Airasian (2005) multiple-choice does not give any opportunity for students to construct, organize and present their own response to questions. It has probability of making guess for answering questions. Rogers & Ndalichako (2000) argued that multiple-choice test cannot differ between correct answers derived from student knowing the answer those derived from student guessing the answer. In addition, multiple-choice test tend to measure only a very narrow sample of content at a point of time and it usually require only superficial recognition of knowledge to answer correctly (Valencia, 1999).
Multiple-choice test is one of the most widely used test forms for language learning. It has been used as formal test such as new student entrance test, placement test and even international test such as Test of English as Foreign Language (TOEFL) and International English Language Testing System (IELTS). There are several reasons for using this test. Firstly, multiple-choice test is commonly used if the cognitive domain which is going to be tested is at knowledge and comprehension level. Secondly, considering its advantages as easy to score and provide broader content coverage and better content validity, multiple-choice test is used for large-scale assessment which involve large number of people.
Example of Multiple Choice Test
1. When did Indonesia proclaim its independence?
a. August 15th 1954 c. August 15th 1945
b. August 17th 1954 d. August 17th 1945
2. Which of the following countries are not in Asian continent?
a. Singapore c. India
b. Malawi d. Iran
3. The process by which plants use light to make glucose is called?
a. respiration c. energizing
b. photosynthesis d. growing
2. Short-answer test and completion test
The short-answer test and completion test are categorized as supply-type test items. They are essentially the same except their method of presenting the problem (Linn, Miller, & Gronlund, 2005). Completion item is an assessment method in which students are assessed to recall knowledge by simply asking questions and require students to answer it by completing incomplete statements while short-answer test uses a direct question. Thus, the students are required to provide a word or a phrase to complete the statement and to answer direct question. These methods can be used to assess student ability at knowledge and comprehension level.
McMillan (1997) identified several main advantages of using these two methods. He argued that both items are easy to construct, broad coverage of facts because of short response time, higher reliability of scoring and quicker to mark compared to essay tests. Nitko & Brookhart (2007) identified another benefit in which students have lower opportunity to guess the answers correctly compared to true-false and multiple-choice tests. In addition, it requires students to produce an answer instead of simply selecting from a list of potential answers (McDonald, 2007). However, McMillan (1997) argued these methods also have two main limitations. Firstly, it takes more time to mark completion items compared with selected response items such as multiple-choice test. Secondly, if it is not constructed well, there is a possibility of having more than one answer which leads to difficulty to score the results.
Short-answer test and completion test are the assessment methods that can be used to test language learning. For example, to test student ability in vocabularies and verb forms. In some cases, this type of assessment is also used in other subjects to assess lower level thinking skills such as recall and comprehension of information. Considering one of its advantages of giving lower level of opportunity for students to guess the correct answers, both methods can be used as an alternative assessment for multiple-choice tests to assess students at lower order of thinking skills. Furthermore, short-answer test and completion test can be combined with other assessment methods which assess higher level of thinking skills such as tests to provide students with sounder tests. In this case, a test assesses both low level and higher level of thinking skills. By doing it, it is expected that it will raise the standards of the student achievement.
Example of Completion test
1. The name of the president who proclaimed Indonesian was ……………
2. The distance between Makassar and Bali is ……. miles.
3. Indonesia proclaimed its independence in the year ……..
Example of Short-answer test
1. Name the country in which Borobudur Temple is located.
2. In what year did the president Soekarno proclaim Indonesian independence?
3. Name three characteristics of mammals?
3. Essay writing test
Essay test is employed when more complex knowledge is going to be tested (Wragg, 1997). In this case, an essay question requires students to demonstrate a higher order thinking skill to respond to problems presented in questions. Essay test can be divided into two categories: restricted response and extended response (McDonald, 2007). The first category implies that students’ answers are limited. It poses a problem and requires students to present direct and specific response to the problem. The latter category implies that students’ answers are not limited. In this case, they have more freedom and opportunity to present and organize their own ideas as response to the problems. Thus, essay writing can be used to ask questions which assess student ability at more complex level than objective tests such as true-false and multiple-choice do.
According to Swartz (2006) and (Conderman, 2001), Essay test provides more opportunity to assess more higher levels of attribute achievement. Essay test is typically easy to construct and it can be used to evaluate student ability to organize, express and depend their ideas, including their critical thinking ability (McDonald, 2007). Another advantage is that the assessment format allows students to have more freedom in answering questions. This freedom allow students to formulate their response that may display higher order of thinking skills (William B Walstad, 2006).
Some of the disadvantages of applying essay test are it requires more time to administer and to mark (Valencia, 1996), it only covers limited content domain in one test time (Bridgeman & Morgan, 1996), it is less reliable compared to objective tests, it provides wide opportunity for bluffing for students who have limited idea of the materials (Hanna & Dettmer, 2004).
Essay test is one of the most widely used test forms for language learning. The use of this method is usually combined with multiple-choice test. The main reason for using this type of test is its usefulness to test student learning outcomes in more complex level such as organizing, analyzing and synthesizing ideas to response to questions. This idea is in line with one of the targets of curriculum and education philosophy in which students are expected to analyze and synthesize problems to create their genuine ideas. However, there is still one main weakness of using this method in which its scoring system is subjective. As stated by many researcher and academics that one of the limitations of essay test is the subjectivity of scoring system (McDonald, 2007). Nitko & Brookhart (2007) suggested some alternative strategies to minimize this limitation such as having two lecturers to assess student essay, creating scoring rubrics and correcting one question at a time.
Example of Essay writing test
1. Describe the strengths and weaknesses of the use information and communication technology in assessment?
2. Compare the beneficial use of journal and project as assessment methods?
3. Explain why you agree or disagree with the use of ICT in assessment?
5. Portfolio
Portfolio can be defined as a collection of students’ works demonstrating progress and accomplishment (Brady & Kennedy, 2005; Gülbahar & Tinmaz, 2006; Hanna & Dettmer, 2004). Nitko and Brookhart (2007) argued portfolio is used as formative evaluation tool. Thus, it can be synthesized that portfolio is a collection of students’ works which serve as tool to judge the students’ performance. Portfolio as formative tool can be used when the teachers want to assess the students’ performance in more holistic way. In this case it does not simply assess students’ factual knowledge as in multiple-choice or true-false tests but also self reflection, problem solving and critical thinking. Robins (2006) and Klenowski (2000) claimed portfolio promotes more complete portrait and valid inferences of student achievement.
There are several advantages of using portfolio assessment. Hayes (1997) claimed that the benefits related to the use of portfolio assessment are finding the need of students, reflection for teachers and students, enhancing instructional methods and evidence of learning. Portfolio provides opportunity for students, teachers, parents and other to get more holistic view o changes in student performance over time (Cizek, 1997). Portfolio provides opportunity for students to collect their series of ‘real world’ performance from certain period of time (Cole, Struyk, Kinder, Sheehan, & Kish, 1997; Wiggins, 1998). Thus, it can be inferred that portfolio help the students to build metacognitive perspective to learning through reflection and independent learning.
On the other hand, portfolio has also some disadvantages. According to McMillan (1997) scoring system in terms of reliability is the major drawbacks which is similar to other performance-based assessment. He also added that sampling of student products may lead to weak generalization. In addition, portfolio is very time consuming and labor-intensive for teachers (Lambdin & Walker, 1994) Thus, the main drawbacks of portfolio assessment are low reliability of scoring system, big probability of weak generalization and very time consuming.
Portfolio is useful technique because it encourages reflective thinking and self discovery through reflective writing. This kind of assessment method is especially useful to assess students’ progress in their writing skill. Paesani (2006) several reasons for using portfolio are it is consistent with process-oriented tasks. Portfolio also provides a sense of accomplishment for the students since they may see the collection of their work gathered from time to time. Finally, portfolio educates students to be more independent learner. In addition, portfolio can also be a useful means communication among students, parents and teachers (Lambdin & Walker, 1994; Lamme & Hysmith, 1991).
Example of Portfolio
Students are asked to create portfolio of their progress for their writing course
Each activity in student guide identifies a design skill and a life skill that students will be practicing. Students are performing realistic tasks which involve meaningful application of knowledge and skill in writing. There is usually a product – sentences, paragraphs, stories, etc. – that is completed by doing the activity. Students are asked to place these in their portfolio. This becomes the evidence that the work was completed and how well it was done. These samples are examples of student ability to apply relevant knowledge and skills. Their portfolios tell the story of their growth and learning.
6. Projects
A project can be defined as “a substantial piece of work on a designated topic, involving the student in researching and organizing information for presentation. The project may be completed individually or groups, and the presentation predominantly involves written work supplemented by pictures, tables, diagrams or graphs” (Brady & Kennedy, 2005, p. 65). Project is a type of product assessment (Brady & Kennedy, 2005) or performance assessment (McDonald, 2007) which may have a strong process dimension.
Brady and Kennedy (2005) argued the main benefits of the use of project as an assessment method are that it shows student ability to collect and organize data in an integrated ways and it assess students’ learning outcomes through research. However, this type of assessment method also involves several limitations as other product assessment such as portfolio, exhibition and journal. Brady and Kennedy (2005) identified the limitation of project assessment as vague assessment criteria and subjectivity of marking the outcomes, and the tendency for students to focus more on surface report such as good picture and diagram rather than on substantive outcomes of the projects.
Project may be used as assessment method in language learning such as giving them project to analyze certain language phenomenon. This test offers many benefits especially in relation to research skills. Students can be trained to collect and analyze data in integrated and systematic ways through this process. These sorts of skills are needed in their real life. This implies that the use of project can raise the standard of student achievement through life skills in research.
Example of Projects
A. Conduct a research project on factors influencing student achievement at UNM Makassar.
B. Conduct a research project on the relationships between assessment methods with the effectiveness of teaching and learning process.
C. Conduct a research project on the use of information and communication technology to raise the standards of the student achievement at UNM Makassar.
G. Journal
A journal is a type of self assessment in which students reflect on their learning. Journal does not only function as a means for students to express their thinking but also as a means for teachers to know what their students think. Journal entries are usually made several times within a week and it is a useful way of communication between teachers and students (Brady & Kennedy, 2005).
According to Brady and Kennedy (2005) the implementation of journal as an assessment method offers several benefits. Students are given more opportunity to express their perception of the process and the product of learning, and to reflect on their own work. It also offers potentiality of collaborative assessment. Similarly, Liebars (1999) argue the use of journal writing in classroom can provide a wonderful opportunity for students to try the ideas that they learn in class and to reflect on themselves.
Brady and Kennedy (2005), however, also identified a major weakness of this assessment method as the subjectivity of scoring especially given the highly impressionistic nature of some writings. In addition, claimed the use of journal as an assessment technique can be very time consuming especially in assessing student writing entries (Woodward, 1998).
Considering the discussion above, it is clear that journal is useful technique to assess student achievement especially in relation to student self reflection. It is also useful for teachers to know student perception about their teaching.
Example of Journals
The following is an example of journal writing
Name : …………….
Student ID : ……………
Please fill out this self-evaluation. Think back through reflective journal for today/ this week/this semester/ in the Subject Assessment in Education and comment on the following items in terms of your own learning.
• Make three statements about noteworthy events you have experienced today/this week/this semester that relate to teaching and learning.
• What do you perceive as your strengths as you study about the materials?
• What concerns do you still have?
• What steps do you want to take next as development and improvement for your future learning?
C. CONCLUSION
In a higher education institution, students are required to process a large amount of complex knowledge and information and to perform responsibly upon that information. Assessment method should be provided to reflect the level of achievement at which students are expected to practice. Assessment in the form of questions should be congruent with faculty expectations of student learning.
Good assessment is indicated by the use of balanced assessment methods. What an assessor need to assess has to be clear before deciding the best way to assess it. In this case, simple learning outcomes involve simple assessment methods to be used and complex learning outcomes involves complex assessment methods to be used. For example: objective test such as multiple-choice, true false and matching test can be used to measure student knowledge and simple skills. Performance-based assessment can be used to measure student ability in more complex skills such as speech, critical thinking and portfolio. Therefore, it should be kept in mind that integrating several assessment methods is useful since it will cover higher learning levels and competencies.
REFERENCES
Cizek, J. G. (1997). Learning, Achievement, and Assessment: Constructs at a Crossroads. In Phyye, G. D. (eds), Handbook of Classroom Assessment: Learning, Adjustment and Achievement, 1-32, California: Academic Press.
Phyye, G. D. (1997). Classroom Assessment: A Multidimensional Perspective. In Phyye, G. D. (eds), Handbook of Classroom Assessment: Learning, Adjustment and Achievement, 33-51, California: Academic Press.
Lester, K. L., Lambdin, D. V., & Preston, R. V. (1997). A New Vision of the Nature and Purposes of Assessment in the Mathematics Classroom. In Phyye, G. D. (eds), Handbook of Classroom Assessment: Learning, Adjustment and Achievement, 287-319, California: Academic Press.
Airasian, P. W. (2005). Classroom Assessment: Concepts and Applications (5 ed.). New York: Mc Graw Hill.
Athanasou, J. A., & Lamprianou, I. (2002). A Teacher's Guide to Assessment. Australia: Social Science Press.
Ayers, S. F. (2001). Developing Quality Multiple-Choice Tests for Physical Education. Journal of Physical Education, Recreation & Dance, 72(6), 23-29.
Bacon, D. R. (2003). Assessing Learning Outcomes: A Comparison of Multiple-Choice and Short-Answer Questions in Marketing Context. Journal of Marketing Education, 25(1), 31-36.
Brady, L., & Kenedy, K. (2003). Curriculum Construction (2 ed.). New South Wales: Pearson Education Australia.
Brady, L., & Kennedy, K. (2005). Celebrating Student Achievement: Assessment and Reporting (2 ed.). Australia: Pearson Education Australia.
Bridgeman, B., & Morgan, R. (1996). Success in College for Students With Discrepancies Between Performance on Multiple-Choice and Essay Tests. Journal of Educational Psychology, 88(2), 333-340.
Clarridge, P. B., & Whitaker, E. M. (1997). Rolling the Elephant over: How to Effect Large-Scale Change In Reporting Process. Portsmouth: Heinemann.
Cole, K. B., Struyk, L. R., Kinder, D., Sheehan, J. K., & Kish, C. K. (1997). Portfolio Assessment: Challenges in Secondary Education. The High School Journal, 80(4), 261-272.
Conderman, G. (2001). Program Evaluation: Using Multiple Assessment Methods to Promote Authentic Student Learning and Circular Change. Teacher Education and Special Education, 24, 391-394.
Curtis, D. D., & Keeves, J. P. (2000). The Course Experience Questionnaire as an Institutional Performance Indicator. International Education Journal, 1(2), 73-82.
Gronlund, N. E. (1985). Measurement and Evaluation in Teaching (5 ed.). New York: McMillan Publishing Company.
Gülbahar, Y., & Tinmaz, H. (2006). Implementing Project-Based Learning And E-Portfolio Assessment In an Undergraduate Course. Journal of Research on Technology in Education, 38(3), 309-327.
Hanna, G. S., & Dettmer, P. A. (2004). Assessment for Effective Teaching: Using Context-Adaptive Planning. New York: Pearson Education, Inc.
Hayes, E. (1997). Portfolio Assessment in Adult Basic Education: A Pilot Study. Adult Basic Education, 7(3), 165-188.
Kane, M., Crooks, T., & Cohen, A. (1999). Validating Measures of Performance. Educational Measurement: Issues and Practices, 18(2), 5-17.
Khattri, N., Kane, M. B., & LReeve, A. (1995). How performance assessments affect teaching and learning. Educational Leadership, 53(3), 80-83.
Klenowski, V. (2000). Portfolios: Promoting Taching. Assessment in Education, 7(2), 215-236.
Kuechler, W. L., & Simkin, M. G. (2003). How Well Do Multiple Choice Tests Evaluate Student Understanding in Computer Programming Classes. Journal of Information Systems Education, 14(4), 389-399.
Lambdin, D. V., & Walker, V. L. (1994). Planning for Classroom Portfolio Assessment. The Arithmetic Teacher, 41(6), 318-324.
Lamme, L. L., & Hysmith, C. (1991). One School's Adventure into Portfolio Assessment. Language Arts, 68(8), 629-640.
Liebars, C. S. (1999). Journals and Portfolios: Alternative Assessment for Preservice Teachers. Teaching Children Mathematics, 6(3), 164-167.
Linn, R. L., Miller, M. D., & Gronlund, N. E. (2005). Measurement and Assessment in Teaching (9 ed.). New Jersey: Pearson Education Inc.
McDonald, M. E. (2007). The Nurse Eucator's Guide to Assessing Learning Outcomes (2 ed.). Massachusetts: Jones and Bartlett Publishers.
McMillan, J. H. (1997). Classroom Assessment: Principle and Practice for Effective Instruction. United States of America: Allyn and Bacon.
Nitko, A. J., & Brookhart, S. M. (2007). Educational Assessment of Students (5 ed.). United States of America: Pearson Education, Inc.
Paesani, K. (2006). Exercices de Style: Developing Multiple Competencies Through a Writing Portfolio. Foreign Language Annals, 39(4), 618-639.
Petress, K. C. (2003). An Educational Philosophy Guides the Pedagogical Process. College Student Journal, 37(1), 128-134.
Robins, J. (2006). Electronic Portfolios as a Bridge. Intervention in School and Clinic, 42(2), 107-113.
Rogers, W. T., & Ndalichako, J. (2000). Number-Right, Item Response, and Finite-States Scoring: Robustness with Respect to Lack of Equally Classifiable Option and Item Option Independence. Educational and Psychological Measurement, 60, 5-9.
Smith, D. L., & Lovat, T. J. (2003). Curriculum: Action on Reflection (4 ed.). New South Wales, Australia: Social Science Press.
Stefanou, C., & Parkes, J. (2003). Effects of Classroom Assessment on Student Motivation in Fifth-Grade Science. The Journal of Educational Research, 96(3), 152-162.
Swartz, S. M. (2006). Acceptance and Accuracy of Multiple Choice, Confidence-Level, and Essay Question Format for Graduate Students. Journal of Education for Business, 81(4), 215-220.
Tanner, D. E. (2003). Multiple-Choice Items: Pariah, Panacea, or Neither of the above? American Secondary Education, 31(2), 27-36.
Valencia, S. W. (1996). A Correlational Analysis of Multiple-Choice and Essay Assessment Measures. Research in Education, 56, 73-84.
Valencia, S. W. (1999). Adding Authenticity to Traditional Multiple Choice Test Formats. Education, 120(1), 82-87.
Walstad, W. B. (2006). Testing for Depth of Understanding in Economics Using Essay Questions. Journal of Economic Education, 37(1).
Walstad, W. B., & Becker, W. E. (1994). Achievement Differences on Multiple-Choice and Essay Tests in Economics. The American Economic Review, 84(2), 193-196.
Wiggins, G. (1998). Educative Assessment: Designing Assessment to Inform and Improve Student Performance. California: Jossey-Bass Inc.
Woodward, H. (1998). Reflective Journals and Portfolios: Learning Through Assessment. Assessment and Evaluation in Higher Education, 23(4), 415-423.
Wragg, E. C. (1997). Assessment and Learning in the Secondary School. London: Routledge.
Zinn, L. M. (1994). Lifelong options. Moorhead Circle, Boulder: CO.