dhSegment: A Generic Deep-Learning Approach for Document Segmentation. Ares Oliveira, S., Seguin, B., & Kaplan, F. 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), August, 2018. Conference Name: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) ISBN: 9781538658758 Place: Niagara Falls, NY, USA Publisher: IEEE
dhSegment: A Generic Deep-Learning Approach for Document Segmentation [link]Paper  doi  abstract   bibtex   
In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.
@article{ares_oliveira_dhsegment_2018,
	title = {{dhSegment}: {A} {Generic} {Deep}-{Learning} {Approach} for {Document} {Segmentation}},
	shorttitle = {{dhSegment}},
	url = {https://ieeexplore.ieee.org/document/8563218/},
	doi = {10.1109/ICFHR-2018.2018.00011},
	abstract = {In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.},
	urldate = {2023-05-03},
	journal = {2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)},
	author = {Ares Oliveira, Sofia and Seguin, Benoit and Kaplan, Frederic},
	month = aug,
	year = {2018},
	note = {Conference Name: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
ISBN: 9781538658758
Place: Niagara Falls, NY, USA
Publisher: IEEE},
	pages = {7--12},
}

Downloads: 0