ParagraphAbsorber
Overview
ParagraphAbsorber is a class in Aspose.PDF FOSS for .NET that absorbs text from PDF pages and organizes it into sections and paragraphs.
It exposes 6 public method signatures: four ParagraphAbsorber constructor overloads and two Visit overloads (one for a Document, one for a Page).
All public members are accessible to any .NET application after installing the Aspose.PDF FOSS for .NET package.
Properties: IsMulticolumnParagraphsAllowed, PageMarkups, ParagraphAbsorberOptions, SectionsSearchDepth, TextReplaceOptions.
Properties
| Name | Type | Access | Description |
|---|---|---|---|
| PageMarkups | List<PageMarkup> | Read | The page markups produced by Visit(Document) or Visit(Page). |
| ParagraphAbsorberOptions | ParagraphAbsorberOptions | Read/Write | Active options bag. |
| SectionsSearchDepth | int | Read/Write | How many nesting levels to descend when partitioning sections. |
| TextReplaceOptions | TextReplaceOptions? | Read/Write | Options that flow through to the inner TextFragmentAbsorber when re-emitting paragraphs. |
| IsMulticolumnParagraphsAllowed | bool | Read/Write | Gets or sets whether multicolumn paragraph merging is enabled. |
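
The read/write properties above can be set at construction time with a C# object initializer. The following is a minimal sketch, not an authoritative usage pattern: the `Aspose.Pdf.Text` namespace is an assumption (this page does not state it), and the values `2` and `true` are purely illustrative rather than recommended or default settings.

```csharp
// Assumption: ParagraphAbsorber lives in this namespace; adjust to match the installed package.
using Aspose.Pdf.Text;

// Uses the parameterless constructor listed under Methods,
// then assigns two of the read/write properties from the table above.
var absorber = new ParagraphAbsorber
{
    // Illustrative value: descend two nesting levels when partitioning sections.
    SectionsSearchDepth = 2,

    // Illustrative value: enable multicolumn paragraph merging.
    IsMulticolumnParagraphsAllowed = true
};
```

PageMarkups is read-only, so it does not appear in the initializer; per its description it holds the results produced by a Visit call.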
Methods
| Signature | Description |
|---|---|
| ParagraphAbsorber() | Initializes a new ParagraphAbsorber instance. |
| ParagraphAbsorber(paragraphAbsorberOptions: ParagraphAbsorberOptions) | Initializes a new instance with the specified options. |
| ParagraphAbsorber(sectionsSearchDepth: int) | Initializes a new instance with the specified sections search depth. |
| ParagraphAbsorber(sectionsSearchDepth: int, paragraphAbsorberOptions: ParagraphAbsorberOptions) | Initializes a new instance with the specified sections search depth and options. |
| Visit(doc: Document) | Visits an entire document and absorbs all pages. |
| Visit(page: Page) | Visits a single page and absorbs its text into sections and paragraphs. |
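
A typical flow is to construct an absorber, call Visit, and then read PageMarkups. The sketch below shows that flow under stated assumptions: the `Aspose.Pdf` and `Aspose.Pdf.Text` namespaces, a `Document` constructor that accepts a file path, and the file name `input.pdf` are not confirmed by this page and may differ in the FOSS package.

```csharp
using System;
// Assumptions: these namespaces are not stated on this page.
using Aspose.Pdf;
using Aspose.Pdf.Text;

// Assumption: Document can be opened from a file path; "input.pdf" is a placeholder.
var doc = new Document("input.pdf");

// Constructor overload taking only the sections search depth (value is illustrative).
var absorber = new ParagraphAbsorber(sectionsSearchDepth: 2);

// Absorb every page of the document.
absorber.Visit(doc);

// PageMarkups is a List<PageMarkup>, so Count reports how many entries were produced.
Console.WriteLine($"Page markups collected: {absorber.PageMarkups.Count}");
```

To process a single page instead, pass a Page to the Visit(page) overload; how a Page is obtained from a Document is not covered on this page.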