Document Layout Analysis


SnapVision contains a powerful image processing system that automatically identifies all the text regions within a document image. It will also measure the straightness of text regions with a line tracking function that fits splines to each text line and then creates a dewarped image by straightening each spline. By removing non text regions of the document image and by straightening curved or skewed text lines, OCR accuracy can be significantly increased. Document Layout Analysis can also in many instances detect if there are clipped regions that are not part of the main document image. The example images below illustrates how Document Layout Analysis enhances document images for OCR.


Input Image



Document Layout Analysis Output Image