Speaker
Mr
Spiros Millas
(The Cyprus Institute)
Description
This presentation explores how Large Language Models (LLMs) enhance Optical Character Recognition (OCR) pipelines through contextual text correction, document understanding, semantic labeling, and information extraction. It will also highlight real-world use cases such as automated document processing, invoice and receipt parsing, identity verification, and multilingual text recognition.
By showcasing how LLMs add intelligence and context to OCR systems, this presentation illustrates how the combination of vision and language technologies is driving more accurate, efficient, and human-like understanding of text within images and documents.