{"id":37,"date":"2015-06-24T17:00:00","date_gmt":"2015-06-24T17:00:00","guid":{"rendered":"http:\/\/proftesting.com\/blog\/?p=37"},"modified":"2016-05-16T17:12:18","modified_gmt":"2016-05-16T17:12:18","slug":"2015623computer-adaptive-testing-an-overview-and-considerations","status":"publish","type":"post","link":"https:\/\/www.proftesting.com\/blog\/2015\/06\/24\/2015623computer-adaptive-testing-an-overview-and-considerations\/","title":{"rendered":"Computer Adaptive Testing: An Overview and Considerations"},"content":{"rendered":"<p>Computer Adaptive Testing (CAT) is a testing methodology that weds two processes\u2014adaptive testing and computer administration\u2014for efficient measurement and administration. Compared with fixed-length exams delivered linearly on a computer, CAT exams measure a candidate\u2019s ability with fewer exam questions and higher precision. Higher precision here means that measurement error is reduced and reliability is increased.<\/p>\n<p>What allows us to adapt tests to match candidate ability? Items are paired with candidates using Item Response Theory (IRT), a modern measurement theory that allows for the development of exams that match item difficulty to a candidate\u2019s ability.<\/p>\n<p>For a CAT to adapt to a candidate\u2019s ability, it relies on item selection algorithms. These algorithms estimate candidate ability, match the items delivered to that ability, prevent overexposure of items, and align item delivery with the exam blueprint or content requirements. Because of this, large item pools are needed to ensure that exams can be delivered at all ability levels across all content areas.<\/p>\n<p>One advantage of IRT-based CAT is more reliable exam scores, which results in higher confidence in the pass\/fail consistency estimates. 
With credentialing exams, the primary reliability estimates we seek are those associated with making consistent pass\/fail decisions. A second possible advantage is the reduction in exam administration time. As a real-life example, a two-day paper exam with limited exam sites was reduced to an exam that can be taken in one sitting. In addition, the number of exam sites grew exponentially, allowing candidates to register within an exam date window and take the exam closer to their location. As one could imagine, the benefits of this transition were significant.<\/p>\n<p>Example<\/p>\n<p>In the example below, a CAT was administered and the pass\/fail decision was reached in 23 items. The cut-score was 600 on a scale range of 0 to 1000. Three values are shown for each item: the lower standard error bound (SE Low), the observed score (Score), and the upper standard error bound (SE High). In essence, this is a confidence interval: a band within which we can be highly confident that the person\u2019s true score resides. The stopping rule was that the entire confidence band falls on one side of the passing score.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"557\" height=\"360\" class=\"alignleft size-full wp-image-516\" src=\"http:\/\/www.proftesting.com\/blog\/wp-content\/uploads\/2015\/06\/Adaptive.jpg\" alt=\"Adaptive\" srcset=\"https:\/\/www.proftesting.com\/blog\/wp-content\/uploads\/2015\/06\/Adaptive.jpg 557w, https:\/\/www.proftesting.com\/blog\/wp-content\/uploads\/2015\/06\/Adaptive-250x162.jpg 250w, https:\/\/www.proftesting.com\/blog\/wp-content\/uploads\/2015\/06\/Adaptive-120x78.jpg 120w\" sizes=\"auto, (max-width: 557px) 100vw, 557px\" \/><\/p>\n<p>As can be seen, the error band narrows with each item administered. After each item is completed, the process selects the next item to minimize error and match the item\u2019s difficulty with the candidate\u2019s ability. 
Because of this, a candidate\u2019s proficiency can be identified with fewer items.<\/p>\n<p>While CAT is an attractive option for programs, it requires a robust item bank, which can increase expenses. It also requires moderate to large testing samples (the number of candidates taking the exam), because a large number of candidates are needed for the IRT calibrations. CAT is an attractive solution for robust exam programs that have lengthy tests, where the shorter exam seat time can result in large cost savings.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Computer Adaptive Testing (CAT) is a testing methodology that weds two processes\u2014adaptive testing and computer administration\u2014for efficient measurement and administration. 
Compared with fixed-length exams delivered linearly on a computer, CAT exams measure a candidate\u2019s ability with fewer exam questions and higher precision. Higher precision here means that measurement error is reduced and reliability is increased.<\/p>","protected":false},"author":5,"featured_media":39,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-37","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry-news"],"_links":{"self":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/37","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/comments?post=37"}],"version-history":[{"count":2,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/37\/revisions"}],"predecessor-version":[{"id":517,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/posts\/37\/revisions\/517"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/media?parent=37"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/categories?post=37"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.proftesting.com\/blog\/wp-json\/wp\/v2\/tags?post=37"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}