ATP’s new best practice guidelines for Technology-Based Assessment—a summary


They say that if you ask two psychometricians a question, you’ll get at least three opinions. So what happens if you try to work with 100 assessment experts?

On November 14, the Association of Test Publishers (ATP) and the International Test Commission launched a major set of guidelines for technology-based assessment (which you can download for free here).

At 175 pages, the guidelines cover most aspects of using technology in testing.

Let me share a little about them.

Some background

Preparing the guidelines was a substantial project that took four years and involved over 100 contributors. It was led by two eminent psychometricians: John Weiner of PSI and Steve Sireci of the University of Massachusetts Amherst.

Each chapter section was authored by an expert, reviewed by several other experts and then combed through many times. There has also been a public review process and a legal review.

As one of the contributors, I authored the chapter on privacy and a couple of smaller sections, and led the review of the chapter on test delivery. I am also on the ATP Committee on Technology-Based Assessment, which will work on taking forward future iterations of the guidelines.

The guidelines are broken down into 11 chapters:

  1. Test development and authoring (including gamification and technology-enhanced items)
  2. Test design and assembly—linear or adaptive
  3. Test delivery environments (including web, mobile, offline, locked-down browsers, disruptions and interoperability)
  4. Scoring—automated and technology assisted
  5. Digitally based results reporting
  6. Data management (storage, maintenance, integrity, integration)
  7. Psychometric and technical quality
  8. Test security
  9. Data privacy in technology-based assessment
  10. Fairness and accessibility
  11. Global testing considerations including translation

Some sections are focused on psychometrics, ensuring that technology is used in a way that’s consistent with the psychometric principles of validity, reliability and fairness. 

Other sections are more focused on technology pragmatics and good practice. Although the guidelines cover tests of all stakes, they focus more on issues relating to summative tests with medium or high stakes than on formative or low-stakes tests.

Key takeaways

Here are a few examples from the guidelines that give a clear idea of the kind of thing they cover.

Technology-enhanced items

A technology-enhanced item (or TEI) is defined as a test item that incorporates media or additional functionality that is only available through electronic means.


The guidelines suggest that the use of such items can better measure constructs (the knowledge or skill that the test seeks to measure) by increasing the scope of a test or exam—for example, by using audio or video or animations/graphics. Such TEIs can also make the assessment more authentic and give face validity (stakeholder buy-in). These items can increase learner engagement, which in turn increases learner effort, which in turn can make test results more valid.

Classify, match & order is part of Learnosity’s extensive range of TEIs.

The practical guidance suggests that: 

  • When designing TEIs, start from the analysis of what you are seeking to test (the construct)—for example, from skills maps or content blueprints. 
  • To produce high-quality TEIs, it’s important to have item-writing guidelines that focus authors on what works well for each item type and give consistency between items.
  • Make sure to check the operation of TEIs on the devices that test takers will use to ensure they work well (e.g. they do not need too much scrolling).
  • It’s important to give learners tutorials or practice in technology-enhanced item formats before the test to ensure that test takers are familiar with them.
  • Such steps will reduce the risk of “CIV”—construct irrelevant variance—caused, for example, by test takers being unable to demonstrate their skill in the item.
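
To make the guidance concrete, here is a minimal sketch of how a matching-style TEI and its automated scoring might be represented. The item structure and the partial-credit scoring rule are illustrative assumptions of mine, not Learnosity’s actual item format or API.

```python
# Hypothetical matching TEI with a simple partial-credit scoring rule
# (illustrative only -- real platforms use richer item models).

MATCH_ITEM = {
    "stem": "Match each term to its definition.",
    "pairs": {  # answer key: term -> correct definition
        "validity": "the test measures what it claims to measure",
        "reliability": "the test produces consistent results",
        "fairness": "the test is equitable across groups",
    },
}

def score_match(item: dict, response: dict) -> float:
    """Return the fraction of pairs the test taker matched correctly."""
    key = item["pairs"]
    correct = sum(1 for term, defn in response.items() if key.get(term) == defn)
    return correct / len(key)

# A fully correct response scores 1.0; one mismatch earns partial credit.
perfect = dict(MATCH_ITEM["pairs"])
print(score_match(MATCH_ITEM, perfect))  # 1.0

partial = dict(perfect)
partial["fairness"] = "the test produces consistent results"  # wrong match
print(round(score_match(MATCH_ITEM, partial), 2))  # 0.67
```

Starting from an explicit answer key like this also makes it easier to trace each TEI back to the construct it is meant to measure.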

For a wide range of TEI examples from Learnosity, click here. If followed correctly, the guidelines should help with the wider application of such items for good learning and measurement purposes.

Web vs local vs offline vs mobile delivery

A key chapter of the guidelines covers approaches for web-based delivery, local delivery, offline, and mobile delivery, offering pros and cons of the different approaches.

The guidelines suggest that whatever the delivery modality, test delivery systems “should be robust and secure, including capabilities for graceful degradation, encryption, auditing and meaningful system messaging”. 

There is good practice guidance on what to do in the event of Internet connectivity failure or other challenges, with recommendations that vendors should “perform thorough quality assurance on all delivery methods and combinations … on a wide range of devices and conditions … including stress tests on central (cloud) infrastructure in representative conditions before the testing event”.
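
One common pattern for graceful degradation is to queue responses locally when the connection drops and resend them in order once it returns. Here is a minimal sketch of that idea; the class and the simulated flaky connection are my own illustration, not code from the guidelines or any delivery platform.

```python
from collections import deque

class ResponseBuffer:
    """Sketch of graceful degradation for test delivery: if submitting
    a response fails, keep it in a local queue and retry in order once
    connectivity returns, so no test-taker work is lost."""

    def __init__(self, send):
        self.send = send          # callable that raises on network failure
        self.pending = deque()

    def submit(self, response):
        self.pending.append(response)
        self.flush()

    def flush(self):
        while self.pending:
            try:
                self.send(self.pending[0])
            except ConnectionError:
                return  # still offline; responses stay queued locally
            self.pending.popleft()

# Simulated flaky connection: offline at first, then recovers.
delivered, online = [], [False]
def send(resp):
    if not online[0]:
        raise ConnectionError("offline")
    delivered.append(resp)

buf = ResponseBuffer(send)
buf.submit({"item": "Q1", "answer": "B"})   # queued while offline
online[0] = True
buf.submit({"item": "Q2", "answer": "D"})   # both delivered, in order
print([r["item"] for r in delivered])       # ['Q1', 'Q2']
```

A real system would also persist the queue to disk and surface a meaningful message to the test taker, as the guidelines recommend.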

Approach to technology disruptions

There is some excellent related material on dealing with technology disruptions during assessments. We have all seen examples of exams going wrong due to failures of technology and it’s good to have some guidance on how to deal with such issues. To this end, the guidelines cover:

  • Preventing disruptions
  • Developing a response plan in the event of disruptions
  • Having clear policies on communication in the event of a disruption
  • Training personnel around disruptions
  • Planning for possible disruptions when setting up vendor contracts

The key message is that prevention is better than cure, but should test disruptions occur, it’s important to prepare well for them so that they can be ameliorated.


Vocabulary for test security solutions

The excellent chapter on test security gives a high-level overview of how to safeguard against potential security threats and focus effort on the most important strategies. It sets out three sets of solutions for test security, all of which should be considered:

  • Prevention
    Ways of preventing people from cheating at tests—e.g. randomizing questions or choice order to make it harder for people to copy from others or take advantage of published “cheat sheets”
  • Deterrence
    Effective communication to test takers to persuade them not to cheat.
  • Detection/response
    Find occurrences of cheating and respond appropriately to them—e.g. data forensics that can identify likely cheating via statistical analysis.
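
As a small illustration of the prevention idea, question or choice order can be shuffled deterministically per test taker, so each person sees a different sequence but the exact order can be reproduced later for review or rescoring. This sketch and its seeding scheme are my own assumption, not a prescription from the guidelines.

```python
import hashlib
import random

def shuffled_order(items: list, test_taker_id: str, form_id: str) -> list:
    """Shuffle item (or choice) order deterministically per test taker.
    Seeding from a hash of the form and taker IDs makes the order
    reproducible, which matters for audit and rescoring."""
    seed = hashlib.sha256(f"{form_id}:{test_taker_id}".encode()).hexdigest()
    rng = random.Random(seed)
    order = list(items)
    rng.shuffle(order)
    return order

questions = ["Q1", "Q2", "Q3", "Q4", "Q5"]
print(shuffled_order(questions, "taker-001", "formA"))
print(shuffled_order(questions, "taker-002", "formA"))  # likely differs
```

Because the shuffle is seeded rather than random each time, a proctor or psychometrician can reconstruct exactly what any test taker saw.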

I think it’s likely that the industry will coalesce around this categorization to help coordinate efforts against test fraud and cheating, and this could help improve communication and action to increase test security.

On data privacy

Last but not least is the chapter on data privacy (written by me, with contributions and review from several others).

At 10 pages, it gives a relatively concise introduction to why privacy is important for those delivering tests, along with good-practice guidance on what steps to take, both for legal compliance and to respect test-taker privacy and rights.

For example, one of the guidelines encourages pseudonymization, which is a core Learnosity practice. It says: “Where practical, personal data captured during the assessment process should be stored and transmitted in an encrypted and/or pseudonymized form to reduce the risk of unauthorized access or disclosure of personal data.”
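
One common way to pseudonymize an identifier is a keyed hash: the same test taker always maps to the same token, so records stay linkable, but the token cannot be reversed without the key. This is a minimal sketch of the general technique, not the specific method any vendor uses; the key and record shown are placeholders.

```python
import hmac
import hashlib

def pseudonymize(test_taker_id: str, secret_key: bytes) -> str:
    """Replace a direct identifier with an HMAC-SHA256 token.
    Stable for the same input (records remain linkable) but not
    reversible without the key, which should live in a separate
    secrets store, away from the assessment data."""
    return hmac.new(secret_key, test_taker_id.encode(), hashlib.sha256).hexdigest()

KEY = b"placeholder-key-use-a-secrets-manager"  # hypothetical key
record = {
    "test_taker": pseudonymize("jane.doe@example.com", KEY),
    "score": 87,
}
print(record["test_taker"][:12], record["score"])
```

Kept alongside encryption in transit and at rest, this reduces the harm if assessment data is ever exposed.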

As I’ve shared previously, test sponsors and vendors capture a great deal of data when delivering assessments, and with great amounts of data comes great responsibility. 

I think the assessment industry as a whole is becoming increasingly vigilant and respectful of test-taker privacy so data is only captured to enable good quality assessment for the benefit of stakeholders and society. I’m pleased this chapter sets out good practice for everyone to follow.

You can download the ATP guidelines in full here.

John Kleeman

EVP at Learnosity & Questionmark
