Off-line identifying Script Writers by Swin Transformers and ResNeSt-50
Article Sidebar
Main Article Content
In this work, we present two advanced models for identifying script writers, leveraging the power of deep learning. The proposed systems utilize the new vision Swin Transformer and ResNeSt-50. Swin Transformer is known for its robustness to variations and ability to model long-range dependencies, which helps capture context and make robust predictions. Through extensive training on large datasets of handwritten text samples, the Swin Transformer operates on sequences of image patches and learns to establish a robust representation of each writer’s unique style. On the other hand, ResNeSt-50 (Residual Neural Network with Squeeze-and-Excitation (SE) and Next Stage modules), with its multiple layers, helps in learning complex representations of a writer’s unique style and distinguishing between different writing styles with high precision. The SE module within ResNeSt helps the model focus on distinctive handwriting characteristics and reduce noise. The experimental results demonstrate exceptional performance, achieving an accuracy of 98.50% (at patch level) by the Swin Transformer on the CVL database, which consists of images with
cursively handwritten German and English texts, and an accuracy of 96.61% (at page level) by ResNeSt-50 on the same database. This research advances writer identification by showcasing the effectiveness of the Swin Transformer and ResNeSt-50. The achieved accuracy underscores the potential of these models to process and understand complex handwriting effectively.
Article Details

Aquesta obra està sota una llicència internacional Creative Commons Reconeixement-NoComercial-SenseObraDerivada 4.0.
(c) Afef Kacem Echi, Takwa Ben Aïcha Gader, 2024
Articles més llegits del mateix autor/a
- Afef Kacem Echi, Asma Saïdani, Abdel Belaïd, How to separate between Machine-Printed/Handwritten and Arabic/Latin Words? , ELCVIA: electronic letters on computer vision and image analysis: Vol. 13 Núm. 1 (2014)
- takwa Ben Aïcha, Yahya Bouslimi, Afef Kacem Echi, Deep Learning Based Localisation and Segmentation of Prostate Cancer from mp-MRI Images , ELCVIA: electronic letters on computer vision and image analysis: Vol. 22 Núm. 1 (2023)
- Kawther Khazri Ayeb, Afef Kacem, Takwa Ben Aïcha Gader, Deep Learning based-framework for Math Formulas Understanding , ELCVIA: electronic letters on computer vision and image analysis: Vol. 23 Núm. 2 (2024)