This quick start guide will walk you through the Rossum app and show you all the basics. You will learn how to:
Create an account
To create an account, head to the Rossum registration page. In the first step, you need to select your region to define the list of Pre-trained fields that will be extracted (Please note: you can change it later).
Note: Although Rossum currently supports only pre-trained data fields for processing invoice and purchase orders, the technology is document agnostic and can extract data from any structured document including receipts, shipping documents, claims, packing lists, etc.
Next, enter your personal information, including First name, Last name, Company, and Phone number.
Then you can choose the business email address you want to associate with the account and a secure password. Your trial account is free for <300 documents per month and is fully configurable.
Note: Automated learning of your documents is available only in higher editions.
Welcome to the app
After logging in, you will see the main screen.
On the left side you will find a list of Queues. Each queue is an organizational unit that typically represents a specific type of document that needs to be processed. You can create, remove, or group queues as needed using the fully customizable left panel.
Read more: How to configure fields for data extraction
In the center of the screen, you can see a list of your documents with a preloaded sample invoice.
Upload your documents
You can now begin uploading your documents by clicking the “Import” button or sending them to the provided email address (this can also be done via the API). You can import PDF, PNG, JPEG, TIFF, XLSX, and DOCX files or scanned images of your documents.
When you click “Done“, the AI will automatically begin extracting the data fields. The AI extraction typically takes between 30-60 seconds.
After processing, you will find the documents in the “To review” tab in case human review is required.
Check out more information about the Rossum interface in this short video.
Validate the data
A grey checkmark means that an AI validated the field, while a green checkmark indicates that a human validated the field. Start with the fields that don’t have any check marks if you want to focus on the most problematic ones.
To check the fields one by one, press Tab, or Enter to skip the green and grey checkmark fields. Please remember that you must review and correct the red-flagged fields. You will not be able to export the data otherwise.
If you find that a data field was not extracted correctly, you can make adjustments by:
- Clicking and pointing
- Dragging the rectangle
- Typing the text directly
Export the data
After the validation, click on “Confirm“. The document will appear either in the Confirmed tab (you can enable/disable this view in the Queue’s Basic settings) or in the Exported tab on the main dashboard. There you can choose to download the data in different formats (XLSX, CSV, XML, JSON, via API)
Customize the data capture
Once you are confident with using Rossum, you can customize the captured Data fields. Read more about the out-of-the-box extracted fields and how to set up a custom field.
Note: Rossum returns confidence scores for missing fields for the header fields returned by the Generic Engine. It is worth noting, that even after this change, the logic of the automation settings remains the same, so you should see no difference in the system’s behavior. Please contact firstname.lastname@example.org if you want to automate missing fields based on confidence thresholds.