PdfDocumentProcessor.GetText(PdfDocumentPosition, PdfDocumentPosition) Method

Retrieves the text located between the specified document positions.

Namespace: DevExpress.Pdf

Assembly: DevExpress.Docs.v23.2.dll

NuGet Package: DevExpress.Document.Processor

Declaration

public string GetText(
    PdfDocumentPosition startPosition,
    PdfDocumentPosition endPosition
)

Parameters

Name            Type                  Description
startPosition   PdfDocumentPosition   A PdfDocumentPosition object that is the initial document position.
endPosition     PdfDocumentPosition   A PdfDocumentPosition object that is the final document position.

Returns

Type     Description
String   A String value that contains the text between the specified positions.

Remarks

The overloaded GetText method uses the page coordinate system. See the following help topic to learn more: Coordinate Systems.

If there is no text between the specified positions, this method returns text that is nearest to these positions.

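The following code snippet retrieves the text between two positions on page 1:

using System;
using DevExpress.Pdf;
// ...
using (PdfDocumentProcessor processor = new PdfDocumentProcessor())
{
    // Load the document to extract text from.
    processor.LoadDocument("TextExtraction.pdf");

    // Each position is a page number plus a point in page coordinates.
    PdfDocumentPosition startPosition = new PdfDocumentPosition(1, new PdfPoint(0, 0));
    PdfDocumentPosition endPosition = new PdfDocumentPosition(1, new PdfPoint(500, 500));

    // Retrieve the text located between the two positions.
    string pageText = processor.GetText(startPosition, endPosition);
    Console.WriteLine(pageText);
}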
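
Because the method uses the page coordinate system, hard-coded points such as (500, 500) may not match the size of a given page. The sketch below derives both positions from the page's crop box instead. It rests on assumptions that are not stated in this topic: that the page coordinate system places its origin at the bottom-left corner with the y-axis pointing up, that PdfPage.CropBox describes the visible page area in those coordinates, and that passing the top-left corner as the start and the bottom-right corner as the end covers the whole page. The class name WholePageText is hypothetical.

```csharp
using System;
using DevExpress.Pdf;

class WholePageText
{
    static void Main()
    {
        using (PdfDocumentProcessor processor = new PdfDocumentProcessor())
        {
            processor.LoadDocument("TextExtraction.pdf");

            // Assumption: the crop box is expressed in page coordinates.
            // Pages is zero-based, so Pages[0] is page number 1.
            PdfRectangle box = processor.Document.Pages[0].CropBox;

            // Assumption: y grows upward, so the top-left corner is (Left, Top)
            // and the bottom-right corner is (Right, Bottom).
            PdfDocumentPosition start = new PdfDocumentPosition(1, new PdfPoint(box.Left, box.Top));
            PdfDocumentPosition end = new PdfDocumentPosition(1, new PdfPoint(box.Right, box.Bottom));

            // Retrieve the text between the two corners of page 1.
            Console.WriteLine(processor.GetText(start, end));
        }
    }
}
```

If the goal is simply the text of an entire page or document, one of the other GetText overloads may be a better fit than computing positions by hand.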
See Also