The third specialized seminar on data science and its applications , 2024-12-11

Title : ( Enhancing Text Extraction from Scanned Medical Documents Using Large Language Models )

Authors: Mahdi Nemati , Mahmood Amintoosi ,

Citation: BibTeX | EndNote

Abstract

Accurate text extraction from scanned medical documents is essential for data management and clinical decision-making. This study evaluates Large Language Models (LLMs) as an enhancement to traditional Optical Character Recognition (OCR) methods. By leveraging language and context, LLMs offer improved accuracy and relevance in text interpretation. We compared the EasyOCR model and the multimodal \\\"gpt-4o-mini\\\" LLM on a dataset of 110 medical transcript samples. Performance was assessed by comparing extracted texts against clinical data embeddings, using cosine similarity for semantic accuracy. The OCR model achieved an F1-score of 0.59, while the LLM scored 0.70, demonstrating LLMs\\\' potential to advance text extraction in healthcare

Keywords

, Optical Character Recognition, Large Language Models, Text Extraction, Medical Documents
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@inproceedings{paperid:1106439,
author = {مهدی نعمتی and Amintoosi, Mahmood},
title = {Enhancing Text Extraction from Scanned Medical Documents Using Large Language Models},
booktitle = {The third specialized seminar on data science and its applications},
year = {2024},
location = {IRAN},
keywords = {Optical Character Recognition; Large Language Models; Text Extraction; Medical Documents},
}

[Download]

%0 Conference Proceedings
%T Enhancing Text Extraction from Scanned Medical Documents Using Large Language Models
%A مهدی نعمتی
%A Amintoosi, Mahmood
%J The third specialized seminar on data science and its applications
%D 2024

[Download]