Skip to main content
A newer version of this page is available. .
.NET Framework 4.5.2+

PdfDocumentProcessor.Text Property

Provides access to the PDF text.

Namespace: DevExpress.Pdf

Assembly: DevExpress.Docs.v19.1.dll

Declaration

public string Text { get; }

Property Value

Type Description
String

A String value, that is the target text.

Remarks

This tutorial describes how to extract the text of a PDF file at runtime using the PDF Document API.

To extract the text of a PDF file, do the following.

  1. Create a PdfDocumentProcessor.
  2. To open a PDF file, pass a stream that contains the document data to the PdfDocumentProcessor.LoadDocument method.
  3. After the document is loaded, you can extract its plain text using the PdfDocumentProcessor.Text property.

The following code implements this functionality.

string ExtractTextFromPDF(string filePath) {
    string documentText = "";
    try {
        using (PdfDocumentProcessor documentProcessor = new PdfDocumentProcessor()) {
            documentProcessor.LoadDocument(filePath);
            documentText = documentProcessor.Text;
        }
    }
    catch { }
    return documentText;
}

The following code snippet (auto-collected from DevExpress Examples) contains a reference to the Text property.

Note

The algorithm used to collect these code examples remains a work in progress. Accordingly, the links and snippets below may produce inaccurate results. If you encounter an issue with code examples below, please use the feedback form on this page to report the issue.

See Also