Large language models (LLMs) in the evaluation of emergency radiology reports: performance of ChatGPT-4, Perplexity, and Bard

A. Infante, S. Gaudino, F. Orsini, A. Del Ciello, C. Gulli', Biagio Merlino, Luigi Natale, Roberto Iezzi, Evis Sala

Risultato della ricerca: Contributo in rivistaArticolo in rivista

Abstract

Large language models (LLMs), especially those based on the Generative Pre-trained Transformer (GPT) architecture, have becomewidely popularand have beenappliedinvarious fields due to their ability to provide written responses to a diverse range of queries swiftly and accurately. LLMs have demonstrated a transformative and potentially revolutionary capacity in multiple medical subfields, including radiology.1 A promising utilisation of these models is to streamline free-text radiology reports into concise or structured formats, 2,3 thereby enhancing accessibility and organisation of extensive information, potentially facilitating communication among medical professionals. Furthermore, incorporating automated radiological structured reporting systems could enhance clinical procedures, standardising language across institutions, promoting effective communication among healthcare experts, and improving the efficiency of data extraction for research purposes. The present authors share their preliminary results with three LLMs evaluating their accuracy in extracting emergency data recognition within a human-generated emergency radiology report.
Lingua originaleEnglish
pagine (da-a)102-106
Numero di pagine5
RivistaClinical Radiology
Volume79
Numero di pubblicazione2
DOI
Stato di pubblicazionePubblicato - 2024

Keywords

  • chatGPT

Fingerprint

Entra nei temi di ricerca di 'Large language models (LLMs) in the evaluation of emergency radiology reports: performance of ChatGPT-4, Perplexity, and Bard'. Insieme formano una fingerprint unica.

Cita questo