Extract Text in React PDF Viewer component

25 Jan 2023 / 1 minute to read

The PDF Viewer library lets you extract the text of a page along with the bounds of that text. Text extraction uses the isExtractText property together with the extractTextCompleted event, whose arguments carry the extracted text.

The following example sets the isExtractText property and handles the extractTextCompleted event:

<PdfViewerComponent
    id="container"
    documentPath="PDF_Succinctly.pdf"
    serviceUrl="https://ej2services.syncfusion.com/production/web-services/api/pdfviewer"
    isExtractText={true}
    extractTextCompleted={this.extractTextCompleted}
    style={{ height: '640px' }}>
</PdfViewerComponent>

extractTextCompleted = (args) => {
    // Log the complete event arguments for the loaded document.
    console.log(args);
    // Log the entry at index 1 of the extracted text collection.
    console.log(args.documentTextCollection[1]);
    // Log the text data items of that entry.
    console.log(args.documentTextCollection[1][1].TextData);
    // Log the full text of the page.
    console.log(args.documentTextCollection[1][1].PageText);
    // Log the bounds of the first text item.
    console.log(args.documentTextCollection[1][1].TextData[0].Bounds);
};
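
The handler above reads a single, hard-coded entry. As a rough sketch, the same event arguments could be walked to gather the text and bounds of every page. The helper name collectPageText is hypothetical and not part of the PDF Viewer API, and the shape it assumes (each entry of documentTextCollection being an object keyed by page index that holds PageText and a TextData array whose items carry Bounds) is inferred only from the property accesses in the example, so verify it against the logged args before relying on it.

```javascript
// Hypothetical helper, not part of the PDF Viewer API.
// Assumption: args.documentTextCollection is an array whose entries are
// objects keyed by page index, each holding { PageText, TextData: [{ Bounds }] },
// as the accesses in the handler above suggest.
function collectPageText(args) {
    const pages = [];
    const collection = (args && args.documentTextCollection) || [];
    collection.forEach((entry) => {
        if (!entry) {
            return; // skip empty slots
        }
        Object.keys(entry).forEach((pageIndex) => {
            const page = entry[pageIndex] || {};
            pages.push({
                pageIndex: Number(pageIndex),
                pageText: page.PageText || '',
                // Keep only the bounds of each text item on the page.
                bounds: (page.TextData || []).map((item) => item.Bounds),
            });
        });
    });
    return pages;
}
```

Inside the extractTextCompleted handler, calling collectPageText(args) would return one object per page, which avoids indexing into the collection by hand and tolerates missing PageText or TextData fields.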

Find the sample on how to extract text.