OWLxtract

Updated on 26 Sep 2024

Available in OWL software editions: Advanced, Enterprise, Enterprise Plus, Enterprise Advanced, Smart City

OWLxtract extracts data from PDF and image files and stores it within the OWLdocs module. During an OWLdocs product search, the extracted data is analyzed, and any matches are shown on the results screen.

Steps to Perform OWLxtract:

Click OWLdocs.
OWLxtract can be performed on PDF and image files. For an existing file, open the action menu under the Action column and click OWLxtract.
The extraction popup opens.

The popup displays two extraction options: Text Detection and Text Analysis.
Text Detection: The OWL system detects the text in a document, along with the following information:
- The lines and words of detected text
- The relationships between the lines and words of detected text
- The page on which the detected text appears
- The location of the lines and words of text on the document page
Text Detection supports the following documents:
- PDF documents in English, French, German, Italian, Portuguese, and Spanish
- Handwritten documents in English
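To make the detection details listed above more concrete, the sketch below models a detection result as a small Python data structure. It is illustrative only: the class and field names (`DetectedBlock`, `child_ids`, `bbox`) and the sample values are assumptions, not OWL's actual output format, which this article does not describe.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class DetectedBlock:
    """One detected LINE or WORD (hypothetical structure, not OWL's real schema)."""
    block_id: str
    kind: str                                 # "LINE" or "WORD"
    text: str
    page: int                                 # page the text appears on (1-based)
    bbox: Tuple[float, float, float, float]   # left, top, width, height as page fractions
    child_ids: List[str] = field(default_factory=list)  # a LINE points to its WORDs


def words_of_line(line: DetectedBlock, index: Dict[str, DetectedBlock]) -> List[str]:
    """Follow the line-to-word relationships and return the word texts in order."""
    return [index[cid].text for cid in line.child_ids]


# Made-up sample: one line containing two words on page 1.
blocks = [
    DetectedBlock("l1", "LINE", "Invoice 2024", 1, (0.10, 0.05, 0.30, 0.03), ["w1", "w2"]),
    DetectedBlock("w1", "WORD", "Invoice", 1, (0.10, 0.05, 0.18, 0.03)),
    DetectedBlock("w2", "WORD", "2024", 1, (0.29, 0.05, 0.11, 0.03)),
]
index = {b.block_id: b for b in blocks}
print(words_of_line(blocks[0], index))
```

Each block carries its page and location, and a line lists the words it contains, which covers the four kinds of information named in the list.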
Text Analysis: OWL detects the text in a document, analyzes the document, identifies relationships among the detected text, and performs the following:
- Text Extraction: extracts the raw text from a document
- Form Extraction: extracts form data from a document as key-value pairs
- Table Extraction: extracts tables, table cells, and the items within table cells
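As a rough illustration of the three kinds of output, the sketch below shows how raw text, form key-value pairs, and a table might be represented for a simple one-page order form. The sample data and the helper `table_to_records` are hypothetical and only meant to show the shape of each result; OWL's actual result format is not documented here.

```python
from typing import Dict, List

# Text extraction: the raw text of the document (made-up sample).
raw_text = "Name: Ada Lovelace\nItem Qty\nPen 2\nInk 1"

# Form extraction: form data as key-value pairs.
form_fields: Dict[str, str] = {"Name": "Ada Lovelace"}

# Table extraction: a table as rows of cells, with the header row first.
table: List[List[str]] = [
    ["Item", "Qty"],
    ["Pen", "2"],
    ["Ink", "1"],
]


def table_to_records(rows: List[List[str]]) -> List[Dict[str, str]]:
    """Pair each body cell with its column header (hypothetical helper)."""
    header, *body = rows
    return [dict(zip(header, row)) for row in body]


records = table_to_records(table)
print(form_fields["Name"])
print(records)
```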
Once you have selected your extraction method, click Process to complete the action.
Once the OWLxtract process has started, you will receive an email notification that the text extraction is in progress.
When the text extraction is complete, you will receive another email confirming completion.
The file status then changes to Text Extraction Processed.

Now this file is ready to be used for OWLdocs product searches.
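The idea of searching extracted files can be sketched as follows, assuming plain case-insensitive substring matching. This is an assumption made for illustration: the article does not describe how OWLdocs actually matches search terms, and the file names, text, and the function `search_extracted` are hypothetical.

```python
from typing import Dict, List


def search_extracted(files: Dict[str, str], query: str) -> List[str]:
    """Return names of files whose extracted text contains the query (case-insensitive)."""
    needle = query.casefold()
    return sorted(name for name, text in files.items() if needle in text.casefold())


# Made-up extracted text keyed by file name.
extracted = {
    "permit-001.pdf": "Building permit issued for 12 Main Street",
    "scan-17.png": "Inspection report: Main Street bridge",
    "memo.pdf": "Quarterly budget summary",
}
matches = search_extracted(extracted, "main street")
print(matches)
```

Files that have not been through OWLxtract would have no extracted text to match against, which is why the extraction step comes first.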
You can also perform OWLxtract during the file upload process itself.
When uploading, select the OWLxtract checkbox in the upload file popup.
Two options are displayed: Text Detection and Text Analysis.
Select the appropriate method and click Upload.
This uploads the file and runs the extraction on it.
