Last Updated: 31 Oct 2018 08:06 by Ron
Created on: 03 Jan 2017 14:04
Category: PDFViewer
Type: Feature Request
PdfViewer: Allow customization of the tolerance used by TextFormatProvider when splitting text to lines
When text is exported from PDF document using the TextFormatProvider (it's also used internally for the Copy operation), it is automatically split to lines using the vertical position of the words on the page, and small tolerance. 

Currently the tolerance is hard-coded to 0.1 pixels, which is not suitable for documents which contains scanned and OCR-ed text, and there the text lines could be slightly inclined. The result is that words on one slightly inclined line are recognized as if they are on separate lines.
(Total attached files size should be smaller than 20mb. Allowed extensions: .zip, .rar, .jpg, .png, .gif)
1 comment
Posted on: 03 Jan 2017 20:52
This would be Extremely helpful in my Current Project.