We’re happy to announce the GroupDocs.Parser for .NET 25.10 release, available as of October 2025. This update brings a new table‑extraction capability, several API refinements, and a critical fix for PDF pagination.

What’s new in this release

📄 Extract tables from any page (PARSERNET‑2745)

Added two overloads to GroupDocs.Parser.Parser that let you pull table structures directly from a document, without needing a template.

  • IEnumerable<PageTableArea> GetTables() – extracts tables from the whole document.
  • IEnumerable<PageTableArea> GetTables(int pageIndex) – extracts tables from a specific page.

Note: PageTableArea represents a detected table region together with its cells, page reference and bounding rectangle.

Example – Get tables from a single page

using (Parser parser = new Parser(filePath))
{
    int pageIndex = 0;               // first page (zero‑based)
    IEnumerable<PageTableArea> tables = parser.GetTables(pageIndex);

    // iterate over tables
    foreach (var table in tables)
    {
        // work with table.Cells, table.Page, etc.
    }
}

🔧 API redesign – internal PageTableArea constructor (PARSERNET‑2743)

The public constructor of GroupDocs.Parser.Data.PageTableArea has been removed and is now internal. Instances of this class are created exclusively by the Parser engine, preventing accidental misuse. This is a backward‑incompatible change for anyone who instantiated PageTableArea directly.

🐞 Fix – PDF parsing limited to first 4 pages (PARSERNET‑1871)

The parser now correctly processes all pages of a PDF document. Previously, only the first four pages were returned when calling the parsing API.

How to get the update

Resources