Statistical stopping criteria for automated screening in systematic reviews

Callaghan, Max W.; Müller-Hansen, Finn

doi:10.1186/s13643-020-01521-4

Item

ITEM ACTIONSEXPORT

DownloadE-Mail

Please note that a newer version of this item is available:
https://publications.pik-potsdam.de/pubman/item/item_25190_7

DetailsSummary

Previous Next of 0

Statistical stopping criteria for automated screening in systematic reviews

Callaghan, M. W., Müller-Hansen, F. (2020): Statistical stopping criteria for automated screening in systematic reviews. - Systematic Reviews, 9, 273.
https://doi.org/10.1186/s13643-020-01521-4

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://publications.pik-potsdam.de/pubman/item/item_25190 Version Permalink: https://publications.pik-potsdam.de/pubman/item/item_25190_3

Genre: Journal Article

Files

show Files

hide Files

25190oa.pdf (Publisher version), 3MB

View Save

File Permalink:
https://publications.pik-potsdam.de/pubman/item/item_25190_1/component/file_25191/25190oa.pdf

Name:
25190oa.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Callaghan, Max W.¹, Author
Müller-Hansen, Finn¹, Author

Affiliations:
1Potsdam Institute for Climate Impact Research, ou_persistent13

Content

show

hide

Free keywords: -

Abstract: Active learning for systematic review screening promises to reduce the human effort required to identify relevant documents for a systematic review. Machines and humans work together, with humans providing training data, and the machine optimising the documents that the humans screen. This enables the identification of all relevant documents after viewing only a fraction of the total documents. However, current approaches lack robust stopping criteria, so that reviewers do not know when they have seen all or a certain proportion of relevant documents. This means that such systems are hard to implement in live reviews. This paper introduces a workflow with flexible statistical stopping criteria, which offer real work reductions on the basis of rejecting a hypothesis of having missed a given recall target with a given level of confidence. The stopping criteria are shown on test datasets to achieve a reliable level of recall, while still providing work reductions of on average 17%. Other methods proposed previously are shown to provide inconsistent recall and work reductions across datasets.

Details

show

hide

Language(s):

Dates: Published Online: 2020-11-28Finally published : 2020-11-28

Publication Status: Finally published

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: DOI: 10.1186/s13643-020-01521-4
MDB-ID: No data to archive
PIKDOMAIN: RD3 - Transformation Pathways
Organisational keyword: RD3 - Transformation Pathways

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Systematic Reviews

Source Genre: Journal, SCI, Scopus, oa

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 9 Sequence Number: 273 Start / End Page: - Identifier: CoNE: https://publications.pik-potsdam.de/cone/journals/resource/systematic-reviews
Publisher: BioMed Central (BMC)