PdfDocumentProcessor.Text Property
Provides access to the PDF text.
Namespace: DevExpress.Pdf
Assembly: DevExpress.Docs.v24.2.dll
NuGet Package: DevExpress.Document.Processor
#Declaration
#Property Value
Type | Description |
---|---|
String | A String value that is the target text. |
#Remarks
The Text property obtains the document content clipped to the crop box. Use the PdfDocumentProcessor.GetText() method to obtain content without clipping.
#Text Normalization in PDF Document API
PdfDocumentProcessor applies FormKC normalization when the bidirectional or RTL text is processed. In other cases, no normalization is applied.
#Example
The code snippet below uses the PdfDocumentProcessor.Text
property to extract the text of a PDF file at runtime.
string ExtractTextFromPDF(string filePath)
{
string documentText = "";
try {
using (PdfDocumentProcessor documentProcessor =
new PdfDocumentProcessor())
{
documentProcessor.LoadDocument(filePath);
documentText = documentProcessor.Text;
}
}
catch { }
return documentText;
}
#Related GitHub Examples
The following code snippet (auto-collected from DevExpress Examples) contains a reference to the Text property.
Note
The algorithm used to collect these code examples remains a work in progress. Accordingly, the links and snippets below may produce inaccurate results. If you encounter an issue with code examples below, please use the feedback form on this page to report the issue.