ParagraphAbsorber

ParagraphAbsorber

Overview

ParagraphAbsorber is a class in Aspose.PDF FOSS for .NET.

Absorbs text from PDF pages and organizes it into sections and paragraphs.

This class provides 6 methods for working with ParagraphAbsorber objects in .NET programs. Available methods include: ParagraphAbsorber, Visit. All public members are accessible to any .NET application after installing the Aspose.PDF FOSS for .NET package. Properties: IsMulticolumnParagraphsAllowed, PageMarkups, ParagraphAbsorberOptions, SectionsSearchDepth, TextReplaceOptions.

Properties

NameTypeAccessDescription
PageMarkupsList<PageMarkup>ReadThe page markups produced by Visit(Document) or Visit(Page).
ParagraphAbsorberOptionsParagraphAbsorberOptionsRead/WriteActive options bag.
SectionsSearchDepthintRead/WriteHow many nesting levels to descend when partitioning sections.
TextReplaceOptionsTextReplaceOptions?Read/WriteOptions that flow through to the inner TextFragmentAbsorber when re-emitting paragraphs.
IsMulticolumnParagraphsAllowedboolRead/WriteGets or sets whether multicolumn paragraph merging is enabled.

Methods

SignatureDescription
ParagraphAbsorber()Calls ParagraphAbsorber on this ParagraphAbsorber instance.
ParagraphAbsorber(paragraphAbsorberOptions: ParagraphAbsorberOptions)
ParagraphAbsorber(sectionsSearchDepth: int)
ParagraphAbsorber(sectionsSearchDepth: int, paragraphAbsorberOptions: ParagraphAbsorberOptions)
Visit(doc: Document)Visit an entire document — absorbs all pages.
Visit(page: Page)Visit a single page and absorb its text into sections/paragraphs.

See Also