PDF Incremental Updates Feature & PDF Text Extraction Error Reporting Implementation using Java

Aspose.Pdf for Java 17.12 implements text extraction error reporting for the TextAbsorber and TextFragmentAbsorber classes. It also adds incremental saving for cases where users load a PDF document from binary, manipulate it and save it to a different binary.
By: Aspose
 
LANE COVE, Australia - Jan. 26, 2018 - PRLog -- What's New in this Release?

The Aspose team is pleased to announce the release of Aspose.Pdf for Java 17.12.0. While investigating a scenario where a PDF document used PDF Type 3 fonts, it was observed that the TextAbsorber class was not retrieving the text correctly. The reason was that the fonts used in the PDF contained a different encoding, and text cannot be extracted from such documents even with Adobe Reader itself. The Aspose team therefore saw the need for the API to report such errors in a document, and text extraction error reporting has now been implemented for the TextAbsorber and TextFragmentAbsorber classes, available with Aspose.Pdf for Java 17.12.

It was also observed that when users loaded a PDF document from binary, manipulated it (e.g. added some annotations) and saved it to a different binary, the content of the PDF document used to be changed completely. To avoid such issues, an additional method, saveIncrementally(), has been added to the Document class. Users can now save a document into a Stream object using Incremental Updates.

It is always recommended to use the latest release of the API, as it includes the latest features, improvements and fixes for issues reported in earlier versions. Some important improved features included in this release are given below.

·         PDF incremental updates when loading a PDF document from binary

·         PDF to JPEG - Missing text in output JPG

·         PDF to HTML: text misplaced in resultant HTML

·         HTML to PDF - Conversion process hangs

·         PDF to HTML - Text changes its position

·         Text absorber retrieves the garbled text

·         PDF to Doc: Text in the Word document is wrapped one over another

·         PDF to XPS: colored images change to greyscale

·         PDF to PDF/A - Text starts appearing overlapped

·         Text replacement issue: Characters are missing in replaced text

·         PDF to DOCX - text is overlapping in resultant file

·         PDF to HTML: text shifted to left side

·         PDF to Excel - Blank File is Generated

·         Remove text underline in a PDF document

·         Opening a PDF file from a stream and adding an annotation invalidates the signature

·         PDF to PNG - invisible objects become visible
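
The incremental-update idea behind several of the items above can be illustrated without the Aspose API. In the PDF format, an incremental update appends new objects and a new cross-reference section after the existing bytes instead of rewriting the file, which is why the original content (and the byte range covered by an existing signature) can stay unchanged. The following is only a minimal sketch of that append-only property, not Aspose.Pdf code: the class and method names are hypothetical, and the byte contents are placeholders rather than a valid PDF.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration class; it is not part of Aspose.Pdf.
public class IncrementalUpdateSketch {

    // Builds the updated file by appending the update section after the
    // original bytes. The original bytes are copied unchanged.
    static byte[] appendUpdate(byte[] original, byte[] updateSection) {
        byte[] result = Arrays.copyOf(original, original.length + updateSection.length);
        System.arraycopy(updateSection, 0, result, original.length, updateSection.length);
        return result;
    }

    // Returns true when 'updated' begins with exactly the bytes of 'original',
    // i.e. the earlier revision is still present byte for byte.
    static boolean preservesOriginal(byte[] original, byte[] updated) {
        if (updated.length < original.length) {
            return false;
        }
        return Arrays.equals(original, Arrays.copyOf(updated, original.length));
    }

    public static void main(String[] args) {
        // Placeholder contents standing in for a real PDF and a real update
        // section (new objects, cross-reference section, trailer).
        byte[] original = "%PDF-1.7\n(original objects)\n%%EOF\n"
                .getBytes(StandardCharsets.US_ASCII);
        byte[] update = "(annotation object, xref, trailer)\n%%EOF\n"
                .getBytes(StandardCharsets.US_ASCII);

        byte[] updated = appendUpdate(original, update);
        System.out.println("Original preserved: " + preservesOriginal(original, updated));
    }
}
```

A full rewrite of the file gives no such guarantee, which matches the behaviour described above where content changed after a load, edit and save cycle.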

Newly added documentation pages and articles

Some new tips and articles have been added to the Aspose.Pdf for Java documentation that briefly guide you on how to use Aspose.Pdf for different tasks, such as the following.

-  Saving PDF to DOCX: https://docs.aspose.com/display/pdfjava/Convert+PDF+to+other+Formats#ConvertPDFtootherFormats-SavingtoDOCX

-  Convert PDF to HTML format: https://docs.aspose.com/display/pdfjava/Convert+PDF+to+HTML+format

Overview: Aspose.Pdf for Java

Aspose.Pdf is a Java PDF component for creating PDF documents without using Adobe Acrobat. It supports floating boxes, PDF form fields, PDF attachments, security, footnotes and endnotes, multiple-column documents, tables of contents, lists of tables, nested tables, rich text format, images, hyperlinks, JavaScript, annotations, bookmarks, headers, footers and many more. Users can create PDFs through the API or from XML and XSL-FO files. It also enables users to convert HTML, XSL-FO and Excel files into PDF.

More about Aspose.Pdf for Java

- Homepage of Aspose.Pdf for Java: http://www.aspose.com/products/pdf/java

- Download Aspose.Pdf for Java at: http://www.aspose.com/downloads/pdf/java

- Read online documentation of Aspose.Pdf for Java at: http://www.aspose.com/docs/display/pdfjava/Home

Contact Information

Aspose Pty Ltd

Suite 163, 79 Longueville Road

Lane Cove, NSW, 2066

Australia

http://www.aspose.com/

sales@aspose.com


Phone: 888.277.6734

Fax: 866.810.9465

Tags: extract text from PDF-document, Java PDF API, PDF to DOCX conversion
Industry: Software
Location: Lane Cove - New South Wales - Australia