.NET Framework 4.5.2+
.NET Standard 2.0+
.NET Core 3.0+

PdfPageWord Class

An individual word related to a specific PDF page.

Namespace: DevExpress.Pdf

Assembly: DevExpress.Docs.v19.2.dll

Declaration

public class PdfPageWord :
    PdfWord
Public Class PdfPageWord
    Inherits PdfWord

Remarks

The PdfPageWord class uses the page coordinate system. Refer to the Coordinate Systems topic for more information.

Use the PdfPageWord.PageNumber property to obtain the number of the page that contains a specific word. The inherited PdfWord.Characters property provides access to the word's characters and their settings.

In the PDF Document API, the PdfDocumentProcessor.NextWord and PdfDocumentProcessor.PrevWord methods return a PdfPageWord instance.
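As a minimal sketch of how PageNumber might be used, the following console program counts the words found on each page. The class name WordsPerPage and the file name "Document.pdf" are placeholders, and the loop assumes that NextWord returns null once no more words are available.

```csharp
using System;
using System.Collections.Generic;
using DevExpress.Pdf;

class WordsPerPage
{
    static void Main()
    {
        // Maps a page number to the number of words found on that page.
        var wordCounts = new SortedDictionary<int, int>();

        using (PdfDocumentProcessor processor = new PdfDocumentProcessor())
        {
            // "Document.pdf" is a placeholder path.
            processor.LoadDocument("Document.pdf");

            // Assumption: NextWord returns null when no more words are available.
            PdfPageWord word = processor.NextWord();
            while (word != null)
            {
                int count;
                // count stays 0 if the page has not been seen yet.
                wordCounts.TryGetValue(word.PageNumber, out count);
                wordCounts[word.PageNumber] = count + 1;
                word = processor.NextWord();
            }
        }

        foreach (KeyValuePair<int, int> entry in wordCounts)
            Console.WriteLine("Page {0}: {1} word(s)", entry.Key, entry.Value);
    }
}
```

A sorted dictionary is used here only so that the output is listed in page order; any other map type would work equally well.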

The code sample below shows how to use the NextWord method to retrieve the list of document fonts.

using System;
using System.Collections.Generic;
using DevExpress.Pdf;

class Program
{
    static void Main(string[] args)
    {
        // A set stores each font name only once.
        HashSet<string> fontNames = new HashSet<string>();

        using (PdfDocumentProcessor processor = new PdfDocumentProcessor())
        {
            processor.LoadDocument("Document.pdf");

            // Iterate over all words in the document.
            PdfWord currentWord = processor.NextWord();
            while (currentWord != null)
            {
                // Record the font name of each character in the current word.
                foreach (PdfCharacter character in currentWord.Characters)
                    fontNames.Add(character.Font.FontName);
                currentWord = processor.NextWord();
            }
        }
        Console.WriteLine("The loaded document contains the following fonts:");
        Console.WriteLine(string.Join("\r\n", fontNames));
    }
}

Inheritance

Object
PdfWord
PdfPageWord

See Also