{"id":52,"date":"2015-09-17T19:00:00","date_gmt":"2015-09-17T19:00:00","guid":{"rendered":"http:\/\/proftesting.com\/blog\/?p=52"},"modified":"2016-01-29T14:08:45","modified_gmt":"2016-01-29T14:08:45","slug":"2015917testing-essentials-scoring-qc","status":"publish","type":"post","link":"https:\/\/www.proftesting.com\/blog\/2015\/09\/17\/2015917testing-essentials-scoring-qc\/","title":{"rendered":"Testing Essentials: Scoring QC"},"content":{"rendered":"<p><a href=\"\/contributors\/reed-castle\">Reed Castle, Ph.D.<\/a><\/p>\n<p>The primary goal of any testing program is to treat test takers in a fair and equitable manner and to assure that inferences made from an exam score are appropriate. To do so, one component is to maintain proper scoring and reporting procedures. For those failing, diagnostic score reports should be provided in a meaningful and reliable manner. The presentation covers Scoring QC and reporting scores in a reliable manner.<\/p>\n<p>There is a wonderful <a href=\"https:\/\/www.google.com\/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=1&amp;cad=rja&amp;uact=8&amp;ved=0CDAQFjAAahUKEwjavebI6fvHAhWGQpIKHWMzC9k&amp;url=http%3A%2F%2Fwww.ncme.org%2Fncme%2FAsiCommon%2FControls%2FBSA%2FDownloader.aspx%3FiDocumentStorageKey%3D2ef52eca-91e6-49bf-8f85-737708cb12a9%26iFileTypeCode%3DPDF%26iFileName%3DQuality%2520Control%2520Procedures%2520in%2520the%2520Scoring%2520Equating%2520and%2520Reporting%2C%25202007%2C%252026(1)%2520Avi%2520Allalouf&amp;usg=AFQjCNEvN4q3tytW2Sin5EEwonE0X7lGxw&amp;sig2=3GGtY_6Yfr-7KZmmVi3MRQ\">journal article<\/a> by Avi Allalouf that discusses QC procedures for scoring. In it, he outlines why scoring mistakes can happen and creates six broad areas. They are:<\/p>\n<p><!--[if !supportLists]-->1.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>No standards have been established<\/strong>. In this category, one could have people conducting the same scoring activity but doing it their own way or making judgments inconsistent with another person conducting the same job. To reduce the probability of this happening, scoring standards need to be established.<\/p>\n<p><!--[if !supportLists]-->2.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>Practices are undocumented.&nbsp; <\/strong>Standards can vary from examination to examination or administration to administration. Having documentation for each program is critical. The documentation should be comprehensive so different people scoring should be able to replicate others\u2019 work.<\/p>\n<p><!--[if !supportLists]-->3.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>Standards are not followed.<\/strong> The author wisely chooses the term \u201coverconfidence\u201d to describe this type of mistake.&nbsp; Reviewing and applying the standards at each scoring opportunity will help reduce this mistake from occurring. Being comfortable doing it isn\u2019t good enough.<\/p>\n<p><!--[if !supportLists]-->4.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>Standards are not updated.<\/strong> Standards need to be updated to prevent potential mistakes. An example could be a change in the number of scored items or applying multiple keys to an item.<\/p>\n<p><!--[if !supportLists]-->5.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>Scheduling and financial pressures.<\/strong> Anyone who has taken an exam realizes the urgency in reporting scores. Testing agencies have a responsibility to get accurate scores back to organizations in a reasonable amount of time. Better to be late and accurate than to be early and inaccurate.<\/p>\n<p><!--[if !supportLists]-->6.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]--><strong>Responsibility.<\/strong> <strong>&nbsp;<\/strong>Who is ultimately responsible for the scores? If multiple people are part of the process, there needs to be an independent audit of the work.<\/p>\n<p>Documented standards that are routinely updated help reduce the likelihood of a QC mistake. A chain of command should be in place to help assure multiple reviews of the scoring process. Prior to reporting scores, data forensics should be conducted to identify any cheating (copying) or trend of item drift, which may suggest compromised items.<\/p>\n<p><strong>Score Reporting<\/strong><\/p>\n<p>Once data forensics have been applied and the scores are ready, a few decisions need to be made. They include:<\/p>\n<p><!--[if !supportLists]-->1.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Raw versus scaled scores<\/p>\n<p><!--[if !supportLists]-->2.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Mail versus electronic score notification<\/p>\n<p><!--[if !supportLists]-->3.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->For computer-based exams, should scores be instant?<\/p>\n<p><!--[if !supportLists]-->4.<span style=\"font-size:7pt\">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/span><!--[endif]-->Should diagnostic scores be reported? My colleagues do a wonderful job of discussing this <a href=\"http:\/\/www.fromtheitembank.com\/home\/2015\/6\/30\/revision-of-the-standards-an-advisory-note-on-subscore-reporting\">here<\/a> and <a href=\"http:\/\/www.clearhq.org\/page-1721345#spring15\">here<\/a>.<\/p>\n<p>The scoring process is a critical step in the examination process. The goal of any testing program is to assure that test inferences are appropriate and fair.&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Reed Castle, Ph.D.<\/p><p>The primary goal of any testing program is to treat test takers in a fair and equitable manner and to assure that inferences made from an exam score are appropriate. To do so, one component is to maintain proper scoring and reporting procedures.&nbsp;<\/p>","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[26,25,27],"class_list":["post-52","post","type-post","status-publish","format-standard","hentry","category-industry-news","tag-clear-conference","tag-scoring","tag-testing"],"_links":{"self":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/52","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/comments?post=52"}],"version-history":[{"count":1,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/52\/revisions"}],"predecessor-version":[{"id":172,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/52\/revisions\/172"}],"wp:attachment":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/media?parent=52"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/categories?post=52"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/tags?post=52"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}