Evaluating the AI Tool “Elicit” as a Semi-Automated Second Reviewer for Data Extraction in Systematic Reviews: A Proof-of-Concept

Research output: Contribution to journalArticlepeer-review

Abstract

Systematic reviews are essential for evidence synthesis but often require extensive time and resources, especially during data extraction. This proof-of-concept study evaluates the performance of Elicit, an AI tool specifically developed to support systematic reviews, in the context of a systematic review on psychological factors in dermatological conditions. We compared Elicit’s automated data extraction with manually extracted data across 43 studies and 602 data points. Both were assessed against a consensus-based ground truth. Elicit achieved an overall accuracy of 81.4%, compared to 86.7% for human reviewers—a difference that was not statistically significant. In cases where Elicit and the human reviewer extracted the same information, this information was correct in 100% of instances, suggesting that agreement between human and machine may serve as a reliable proxy for validity. Based on these results, we propose a semi-automated workflow in which Elicit functions as a second reviewer, reducing workload while maintaining high data quality. Our results demonstrate that domain-specific AI tools can effectively augment data extraction in systematic reviews, especially in settings with limited time or personnel.
Original languageEnglish
JournalSocial Science Computer Review
DOIs
Publication statusE-pub ahead of print - 3 Dec 2025

Keywords*

  • systematic reviews
  • large language models
  • machine-assisted review
  • Elicit
  • data extraction
  • data collection
  • machine learning
  • evidence synthesis
  • semi-automated workflows

Field of Science*

  • 1.2 Computer and information sciences

Publication Type*

  • 1.1. Scientific article indexed in Web of Science and/or Scopus database

Fingerprint

Dive into the research topics of 'Evaluating the AI Tool “Elicit” as a Semi-Automated Second Reviewer for Data Extraction in Systematic Reviews: A Proof-of-Concept'. Together they form a unique fingerprint.

Cite this