Semantic and visual classification of digitized documents

Marçal Rusiñol
This paper presents an overview of the problem of automatically classifying digitized documents. We look at the options available for describing both the visual appearance and the textual and semantic contents of these documents, and review how these descriptions can be used for the classification, clustering, or retrieval of digitized documents. We summarise the state-of-the-art approaches from both the computer vision and natural language processing fields, and see how the latest breakthroughs in deep learning have revolutionized these fields.
Keywords
Document analysis, computer vision, natural language processing, machine learning, deep learning

How to Cite
Rusiñol, Marçal. “Semantic and visual classification of digitized documents”. Item: revista de biblioteconomia i documentació, vol. 2, no. 65-66, https://raco.cat/index.php/Item/article/view/353618.