En

Extract Text Areas from Document Pages using GroupDocs.Parser for .NET 18.7

Today, we are excited to announce the release of version 18.7 of GroupDocs.Parser for .NET. The latest release supports extracting text areas from document pages. This feature may help you getting data for text analysis. We recommend you to upgrade the API to the latest version and share your valuable feedback. Extracting Text AreasExtracting text areas is useful when you need to get the data for text analysis. To extract text areas, text extractors implement their own internal private class and provide DocumentContent property (see PdfTextExtractor as the sample).
July 10, 2018 · 2 min · Usman Aziz

Extract TOC from EPUB Documents using GroupDocs.Text for .NET 18.4

It gives us immense pleasure to announce the release of version 18.4 of GroupDocs.Text for .NET. The latest version allows extracting the table of contents from the EPUB documents. Furthermore, we have added the feature of detecting media type of .one file. Following sections provide details about the newly added features. Extracting TOC from EPUB Documents Using version 18.4, you can now extract TOC from the EPUB documents. To access the TOC, TableOfContents property of **EpubPackage **class is used.
April 18, 2018 · 3 min · Usman Aziz

Extract Formatted Text from CHM Documents using GroupDocs.Text for .NET 18.3

We keep looking forward to bringing you more features and therefore, we have released version 18.3 of GroupDocs.Text for .NET providing the support of extracting formatted text from CHM documents. The latest version also allows you to extract text by pages and extract table of content from CHM documents. The following sections will provide you the details about the new features of the API. Extracting Formatted Text from CHM Documents GroupDocs.
March 29, 2018 · 3 min · Usman Aziz