ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials

Background: Screening for eligible patients continues to pose a great challenge for many clinical trials. This has led to a rapidly growing interest in standardizing computable representations of eligibility criteria (EC) in order to develop tools that leverage data from electronic health record (EH...

Verfasser: Rafee, Ahmed
Riepenhausen, Sarah
Neuhaus, Philipp
Meidt, Alexandra
Dugas, Martin
Varghese, Julian
FB/Einrichtung:FB 05: Medizinische Fakultät
Dokumenttypen:Artikel
Medientypen:Text
Erscheinungsdatum:2022
Publikation in MIAMI:25.09.2023
Datum der letzten Änderung:25.09.2023
Angaben zur Ausgabe:[Electronic ed.]
Quelle:BMC Medical Research Methodology 22 (2022), 141, 1-14
Schlagwörter:Eligibility screening; UMLS; LOINC; Data models; Medical informatics
Fachgebiet (DDC):000: Informatik, Wissen, Systeme
610: Medizin und Gesundheit
Lizenz:CC BY 4.0
Sprache:English
Förderung:Finanziert über die DEAL-Vereinbarung mit Wiley 2019-2022.
Format:PDF-Dokument
URN:urn:nbn:de:hbz:6-29908563159
Weitere Identifikatoren:DOI: 10.17879/99908474253
Permalink:https://nbn-resolving.de/urn:nbn:de:hbz:6-29908563159
Verwandte Dokumente:
  • ist identisch zu:
  • Onlinezugriff:10.1186_s12874-022-01611-y.pdf

    Background: Screening for eligible patients continues to pose a great challenge for many clinical trials. This has led to a rapidly growing interest in standardizing computable representations of eligibility criteria (EC) in order to develop tools that leverage data from electronic health record (EHR) systems. Although laboratory procedures (LP) represent a common entity of EC that is readily available and retrievable from EHR systems, there is a lack of interoperable data models for this entity of EC. A public, specialized data model that utilizes international, widely-adopted terminology for LP, e.g. Logical Observation Identifiers Names and Codes (LOINC®), is much needed to support automated screening tools. Objective: The aim of this study is to establish a core dataset for LP most frequently requested to recruit patients for clinical trials using LOINC terminology. Employing such a core dataset could enhance the interface between study feasibility platforms and EHR systems and significantly improve automatic patient recruitment. Methods: We used a semi-automated approach to analyze 10,516 screening forms from the Medical Data Models (MDM) portal’s data repository that are pre-annotated with Unified Medical Language System (UMLS). An automated semantic analysis based on concept frequency is followed by an extensive manual expert review performed by physicians to analyze complex recruitment-relevant concepts not amenable to automatic approach. Results: Based on analysis of 138,225 EC from 10,516 screening forms, 55 laboratory procedures represented 77.87% of all UMLS laboratory concept occurrences identified in the selected EC forms. We identified 26,413 unique UMLS concepts from 118 UMLS semantic types and covered the vast majority of Medical Subject Headings (MeSH) disease domains. Conclusions: Only a small set of common LP covers the majority of laboratory concepts in screening EC forms which supports the feasibility of establishing a focused core dataset for LP. We present ELaPro, a novel, LOINC-mapped, core dataset for the most frequent 55 LP requested in screening for clinical trials. ELaPro is available in multiple machine-readable data formats like CSV, ODM and HL7 FHIR. The extensive manual curation of this large number of free-text EC as well as the combining of UMLS and LOINC terminologies distinguishes this specialized dataset from previous relevant datasets in the literature.