Return to Blogs Main Page

The Influence of Student Engagement on Test Scores

By Steve Wise on September 14, 2012 at 7:54 am

Login to recommend this post »

Please login to like this post. 9

Obtaining a valid test score from a student requires more than just a good test.  It also requires an engaged student who devotes enough effort to the test to demonstrate his true level of proficiency. Without sufficient effort, regardless how good the test is, a student’s test score is likely to markedly underestimate what he knows and can do.  In this case, the student’s score is said to reflect low individual score validity (ISV) due to low test-taking effort.  This means that the score should not be considered a trustworthy indicator of the student’s proficiency level.  It also means that any growth score that is calculated using a low ISV score will be distorted and should be interpreted with caution.

No assessment is immune to this problem.  And despite every effort made to ensure the highest quality of assessments, we can’t control the behavior of the students who are testing.  Instead, we try to identify instances of students not giving their best efforts.  This allows us to identify scores with low ISV.

What Is Known About Non-Effortful Test Taking in MAP Assessments

Researchers at NWEA have pioneered methods for identifying test events that reflect low ISV.  Using these methods, we have established and refined criteria for classifying scores as reflecting low ISV.  Based on these criteria, we have discovered a number of important relationships about student engagement.  These include findings that:

  • Boys exhibit scores with low ISV about twice as often as do girls.  For some reason, this finding surprises no one.
  • Low ISV is about twice as likely to occur on a reading test event than on a math test event.  This is because reading test items tend to contain more words, which students view as requiring more effort to answer.
  • Scores with low ISV are more common during fall testing than during spring testing.  This is a bit counterintuitive, as some partners believe that spring MAP testing (coinciding with the state accountability testing and other end-of-school-year demands) is more vulnerable to low student effort.
  • The prevalence of low ISV scores increases with grade.  In one study, we found that about 1% of the MAP scores show low ISV at grade 2, with a gradual increase to about 15% in grade 9.
  • The time of day that MAP is given is important.  The percentage of low ISV scores gradually increase during the day, with students tested late in the day exhibiting low ISV about twice as often as those tested at the beginning of the day.

What Can Educators Do About the Problem?

Educators should be mindful that how, when, and where a student is administered MAP matters.  Students should be given encouragement from their teachers to give their best effort, and the importance of getting valid MAP scores should be emphasized.  MAP should be administered in a quiet setting that is free of distractions, and there should not be incentives to students to rush through the test (such as going to lunch or recess immediately afterward).  Ideally, students should be testing at the beginning of the day (though we realize that this will not always be possible).  The general goal is to administer MAP in such a way that maximizes student engagement throughout their test events.

What Can NWEA Do About the Problem?

We at NWEA are committed to providing the best possible information to educators about their students’ academic growth.  Part of that commitment involves our being able to identify low ISV scores.  Such scores should be identified on our score reports so that our partners are aware of them.  In addition, we are also working on developing methods of adjusting RIT scores for the amount of non-effortful behavior that occurs during test events.  Finally, we are trying to develop a “smarter” type of MAP that can preempt disengaged test taking by identifying it as it happens and notifying proctors or by displaying encouraging messages to the student.

 

Steve Wise is a Senior Research Fellow at NWEA.

12 comment(s) - you must be logged in to comment

Michael LoCascio's picture
I would also point out that NWEA already does provide information to school districts which can be used to identify possible scores which reflect low student engagement. The comprehensive data file is available through the NWEA reports site for school districts. In this file there is a record of the test duration and the percent correct for each child, I believe within 24 hours after upload. We filter this information down and select all students who are below 35% correct in grades 2-10 and 25% correct in grades K-1 as possible students for retesting. We also examine all students who took less than 15 minutes in grades 2-10 and 8 minutes in grades K-1. We then ask the schools to review these students and determine if the scores appear inaccurate. The downside to this approach is the heavy amount of retesting, which slows down the analysis of district data and lengthens the test window for the schools. Since we place a heavy emphasis on the analysis of student growth, however, we feel this extra effort is worthwhile

Michael LoCascio's picture
Very interesting information and I agree that this is something that is universal to any assessment. There is often a lot of grumbling among educators about the inconsistencies and discrepancies seen between MAP results and other student performance data, whether this is from other commercial assessments, state assessments, classroom grades, or teacher input. Unlike most assessments, however, NWEA is very open about the limitations of MAP--try pressing your state to release the reliability of their high stakes assessments. That's not always easy information to come by. The reality is that no assessment is 100% reliable, and student engagement is a variable that is difficult to predict and control. All the more reason why we need to use multiple data points when making instructional decisions on our children or on the need to ensure the integrity of our data when evaluating schools, programs, and staff. I like the ideas which Steve is proposing, especially the notion of identifying low ISV scores on the reports, but the other ideas, especially the RIT adjustment and the “smarter” MAP sound as if they are years away. Jhowell brings an interesting idea as well, one I hope NWEA listens to—the idea of motivating students during the test is intriguing and would likely have a noticeable impact on student engagement. Let’s face it, the MAP assessments are free of bells and whistles, which isn’t necessarily bad, but from the perspective of a child, they are not exactly engaging and exciting.

Yvonne Fawcett's picture
If you are using the Web-Based Platform, the Grade Report indicates the test duration for each student taking each of the assessments...very helpful!

David Cosme Jr.'s picture
One of the ways in which we combat this is to provide value to the students for the test. The teachers explain how the test is being used, and make connections to what they are learning in class by saying this like, "This is an area that came up low in your MAP test." as they deliver the lesson. They also have goal setting sessions where they learn about the norm and typical growth so students can set challenging and realistic goals. This has not rid us of this problem, but we have seen a dramatic improvement in student engagement during the assessments.

Margaret Wilson's picture
Some of these students are in special education. They need the tests read to them, but by middle school, they refuse to stay on the same question so I can read each question once, and usually race through the test. One student commented that he was going to try for the lowest score in the group. Why can't the tests have the questions read with the students using headphones?

Melinda Connon's picture
The prevalence of 15% ISV in high school students is one of the reasons we need to be cautious about evaluating teachers based on test scores. As a casual observation, my math students that take longer get higher scores -- taking longer often includes actually using the scratch paper. Because MAP comes closest of our standardized instruments to my upper level math courses, I offer extra credit for my personal final if they raise their scores from the previous year (fall-fall and spring-spring)....this gives the former ISV student some motivation and reward for those who usually try their best.

Jeffrey Dungan's picture
I found this article interesting especially the findings about ISV decreasing with grade level and the time of day. We try to conduct all MAP testing before 12 noon each day and we test HS students in small groups which we hope decreases test anxiety about being the last to finish. In the ES and MS we provide a mandatory 10 minute break in the middle of Math and Reading tests. In regards to this topic I have always been a little dubious of the Standard Deviation measure in MAP results as a comprehensive measure of a students effort. It may show if a student is randomly clicking through a test but it will not show students who are rushing over test materials. Personally, as a MAP coordinator I would like more data concerning student testing characteristics by grade to see how better to structure testing at different grade levels

Pamela Cole's picture
LLHowell, I am not familiar with the quiz on moodle, is this through your district or is it something I can use on the NWEA site. I know our district uses moodle but I was not aware of any quiz the students could take, thanks in advance for your assistance.

Linda Howell's picture
We are new to MAP testing but are seeing what has been described in the article above. I do like the idea of incorporating encouraging messages to students - especially the middle school age population I serve. I have used the quiz feature in Moodle and students seem to like the feedback/messages they encounter as they take the quiz. I am also wondering if some type of message at the end - when student preliminary results are given - would also help with student motivation and effort.

Pamela Cole's picture
We just had this discussion at our staff meeting. We were looking at scores as a staff and sharing concerns about student with low scores. Our 4/5 teacher shared that one of her students had a very low score yet in the classroom she was one of her better reading students. As a staff we thought the teacher could share the scores with her students on an individual basis and discuss the scores recieved and let the students know what the score indicates about their abilities based on the test data. We then decided as a staff we need to share with students more about the assessment and what it acutually tells the student about their abilities.

Marg Robertson's picture
I agree that it would be nice to have this added to the report. We do record this ourselves and use it as a factor in evaluating student scores.

Sarah Hale's picture
I think it would be helpful if NWEA reports included a field that recorded the test duration for each student so we could track this factor as an indicator of effort. The duration is noted in the score window at the end of the test, but I have not found it recorded in any reports.

Join the Community

Create Account

Already a member? Sign in

Login to Your Account

Ask a Question

Draw on the knowledge, experience and innovation of your peers across the country and around the world.

Ask a question now!