Summary: | PDF generated from LibreOffice Writer has a problem where text from the document, when copied gets mangled. | ||
---|---|---|---|
Product: | LibreOffice | Reporter: | Kevin Buchs <kevin.buchs> |
Component: | Printing and PDF export | Assignee: | Not Assigned <libreoffice-bugs> |
Status: | RESOLVED WORKSFORME | ||
Severity: | normal | CC: | vmiklos, vsfoote |
Priority: | medium | ||
Version: | unspecified | ||
Hardware: | x86-64 (AMD64) | ||
OS: | Windows (All) | ||
Whiteboard: | |||
Crash report or crash signature: | Regression By: | ||
Bug Depends on: | |||
Bug Blocks: | 103378 | ||
Attachments: |
Original Writer document
PDF exported by LibreOffice
Text document pasted from copy of PDF
My PDF export options
text file of corrupted strings dumped from PDF
text file extracted strings after font change to Default style and export to PDF |
Description
Kevin Buchs
2018-04-10 02:44:22 UTC
Created attachment 141239 [details]
Original Writer document
Created attachment 141240 [details]
PDF exported by LibreOffice
Created attachment 141241 [details]
Text document pasted from copy of PDF
Created attachment 141242 [details]
My PDF export options
Confirmed on Windows 10 Pro 64-bit en-US with
Version: 6.0.3.2 (x64)
Build ID: 8f48d515416608e3a835360314dac7e47fd0b821
CPU threads: 8; OS: Windows 10.0; UI render: default;
Locale: en-US (en_US); Calc: CL

So this is really weird. Is it a problem with the font, or with the export filter to PDF?

In the sample ODT, the Default paragraph style has been modified to use the Calibri font, and direct formatting is applied to increase the font size.

The PDF exported from the ODT renders with the correct appearance in various viewers (Adobe Reader, Firefox, Chrome). But as noted, a select, copy, and paste from the document garbles characters:
l -> a, a -> l, o -> i, i -> o, g -> " "

Dumping the text of the PDF with gs (i.e. 'gswin64c -sDEVICE=txtwrite -o output.txt Tue-1000-Apr-10-2018-Devotion-2.pdf') also gives corrupted strings.

If I change the Default style to use a font other than Calibri, e.g. Liberation Sans or Arial, the resulting PDF has no string corruption on copy/paste or when dumping the strings with gs.

For now, the workaround is to use a different font. But weird!

Created attachment 141247 [details]
text file of corrupted strings dumped from PDF
This text file is a gswin64c string dump of the PDF exported from the ODT with Calibri. The PDF is composed correctly, but the extracted strings have characters transposed. The same document, with the Default Paragraph style changed to use Arial or Liberation Sans and exported to PDF, also views correctly, and its extracted strings show no glitches (attached in the next post).
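For anyone who wants to pin down exactly which characters are swapped, the two string dumps can be compared position by position. The following is only a minimal Python sketch, not anything used in this report: the function name `derive_substitutions` and the sample string are hypothetical, and it assumes the good and garbled texts have the same length (true for a pure one-to-one mix-up, not if characters were dropped or inserted).

```python
from collections import Counter


def derive_substitutions(expected, extracted):
    """Map each expected character to the character most often extracted in its place.

    Assumes both strings line up position by position (same length).
    """
    if len(expected) != len(extracted):
        raise ValueError("texts must be the same length to compare by position")
    seen = {}
    for want, got in zip(expected, extracted):
        seen.setdefault(want, Counter())[got] += 1
    # Keep only characters whose most common extraction differs from the original.
    table = {}
    for want, counts in seen.items():
        got = counts.most_common(1)[0][0]
        if got != want:
            table[want] = got
    return table


# Hypothetical sample text, garbled with the swaps reported in this thread:
# l -> a, a -> l, o -> i, i -> o, g -> " "
good = "a long oil log"
bad = good.translate(str.maketrans({"l": "a", "a": "l", "o": "i", "i": "o", "g": " "}))
print(derive_substitutions(good, bad))
```

With real data, `expected` would come from the Arial or Liberation Sans dump and `extracted` from the Calibri dump, provided the two files really do align character for character.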
Created attachment 141248 [details]
text file extracted strings after font change to Default style and export to PDF
WFM in master. I guess it's a duplicate of Bug 115117. Please see LO 6.0.4. |
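To re-check an export from a newer build, the gs text dump used earlier in this report can be scripted. This is a rough Python sketch under some assumptions: the helper names `txtwrite_command` and `pdf_text_contains` are made up for illustration, `gswin64c` is assumed to be on the PATH, and the dump is assumed to be readable as UTF-8. Only the Ghostscript flags themselves are taken from the command quoted above.

```python
import subprocess


def txtwrite_command(pdf_path, out_path, gs_exe="gswin64c"):
    """Build the Ghostscript text-dump command quoted earlier in this report."""
    return [gs_exe, "-sDEVICE=txtwrite", "-o", out_path, pdf_path]


def pdf_text_contains(pdf_path, phrase, out_path="output.txt", gs_exe="gswin64c"):
    """Dump the PDF's text with Ghostscript and report whether phrase survives."""
    subprocess.run(txtwrite_command(pdf_path, out_path, gs_exe), check=True)
    # errors="replace" keeps the check going even if the dump has odd bytes.
    with open(out_path, encoding="utf-8", errors="replace") as fh:
        return phrase in fh.read()
```

A short single word containing "l", "a", "o", "i", or "g" is probably the safest phrase to look for, since the text dump may alter spacing and line breaks relative to the original document.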