A TaLISMAN: Automatic Text and LIne Segmentation of historical MANuscripts

dc.contributor.authorPintus, Ruggeroen_US
dc.contributor.authorYang, Yingen_US
dc.contributor.authorGobbetti, Enricoen_US
dc.contributor.authorRushmeier, Hollyen_US
dc.contributor.editorReinhard Klein and Pedro Santosen_US
dc.date.accessioned2014-12-16T07:29:36Z
dc.date.available2014-12-16T07:29:36Z
dc.date.issued2014en_US
dc.description.abstractHistorical and artistic handwritten books are valuable cultural heritage (CH) items, as they provide information about tangible and intangible cultural aspects from the past. Massive digitization projects have made these kind of data available to a world-wide population, and pose real challenges for automatic processing. In this scenario, document layout analysis plays a significant role, being a fundamental step of any document image understanding system. In this paper, we present a completely automatic algorithm to perform a robust text segmentation of old handwritten manuscripts on a per-book basis, and we show how to exploit this outcome to find two layout elements, i.e., text blocks and text lines. Our proposed technique have been evaluated on a large and heterogeneous corpus content, and our experimental results demonstrate that this approach is efficient and reliable, even when applied to very noisy and damaged books.en_US
dc.description.seriesinformationEurographics Workshop on Graphics and Cultural Heritageen_US
dc.identifier.isbn978-3-905674-63-7en_US
dc.identifier.issn2312-6124en_US
dc.identifier.urihttps://doi.org/10.2312/gch.20141302en_US
dc.identifier.urihttps://diglib.eg.org/handle/10.2312/gch.20141302.035-044
dc.publisherThe Eurographics Associationen_US
dc.subjectI.3.0 [Computer Graphics]en_US
dc.subjectGeneralen_US
dc.subjecten_US
dc.subjectI.3.3 [Computer Graphics]en_US
dc.subjectPicture/Image Generationen_US
dc.subjectDigitizing and scanningen_US
dc.subjectI.3.6 [Computer Graphics]en_US
dc.subjectMethodology and Techniquesen_US
dc.subjecten_US
dc.subjectI.3.8 [Computer Graphics]en_US
dc.subjectApplicationsen_US
dc.titleA TaLISMAN: Automatic Text and LIne Segmentation of historical MANuscriptsen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
035-044.pdf
Size:
3.28 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
paper1001_e-paper.pptx
Size:
4.66 MB
Format:
Microsoft Powerpoint XML