Text copied from PDF report created on Linux is not correct

Progress® Telerik® Reporting Feedback Portal

Create an account Log In

Back to Feed

Request a Feature Report a Bug

Completed

Last Updated: 03 Dec 2024 13:53 by Stephan

Mark

Created on: 24 Oct 2019 10:04

Category: Reporting

Type: Bug Report

Text copied from PDF report created on Linux is not correct

When exporting a report to PDF on Linux the document looks correct. If you try to copy text from this PDF and paste it in a text editor, the text appears as random symbols, e.g. empty squares.

4 comments

Stephan

Posted on: 03 Dec 2024 13:53

Hi Todor,

thank you very much for your engagement!

ADMIN

Todor

Posted on: 02 Dec 2024 12:12

Hi Stephan,

The issue reoccurred with our latest version 2024 Q4 (18.3.24.1112). I cast a vote on your behalf for the new problem - 2024 Q4, pdf rendering made text extraction impossible.

We will fix the regression with priority, hopefully before the end of the year 2024.

Regards,
Todor
Progress Telerik

Stay tuned by visiting our roadmap and feedback portal pages, enjoy a smooth take-off with our Getting Started resources, or visit the free self-paced technical training at https://learn.telerik.com/.

Stephan

Posted on: 25 Nov 2024 15:26

Hello,

is this already fixed? I have the same issues with Skia engine.

Thanks in advance.

Mark

Posted on: 06 Nov 2019 13:21

Let me also elaborate in saying this is a much bigger problem that copy-pasting text. It is also an issue with searching for text in the PDF. It is an even bigger problem for document management systems that take the outputted PDFs, and search them for keywords or other text to extract as part of their processing of the PDFs.

We have attempted workarounds to run the PDFs through OCR software (pdfsandwich on Linux, which uses Tesseract OCR) but the end result is a much larger file due to the PDF being converted to an image-PDF, and unreliable text because OCR is not 100% accurate.

The only viable solution is for Telerik to fix this issue. Otherwise, the PDFs generated by Telerik Reporting for .NET Core will be largely useless in automated processing.