Preparing an Extraction schema is one of the most important tasks you need to do when configuring Rossum. You have a lot to define, including the names and formats of Data fields you want to extract, necessary value constraints, and, if you’re using them, enum options.
In order to correctly configure the Data fields, go to the Workspaces & Queues settings screen and select a Queue, then click on the “Edit schema” button in the upper right corner:
Exploring the Schema editor
You can now see the Extraction schema in JSON for your selected queue. The left sidebar shows a preview of your schema as it will appear in a user’s validation screen.
Using this editor, you can do all the extraction schema configuration: add custom fields, change label names, edit field types and formatting, and do other similar changes to any field attribute.
Extraction schema structure
When working with the JSON file, note the structure of the file and appropriate level of nesting. Be careful when distinguishing between the field types and their relations to each other.
The basic element is the Data field. However, Rossum enables to capture even more complex structures like Tables. In order to create a Table, add a new multivalue field (i.e. the grid) with tuples in it (i.e. rows). Moreover, the schema must contain at least one section, that works as a container for more Data fields.
Updating Extraction schema
As you edit the Extraction schema, the editor generates a preview in the sidebar in real time.
The Schema editor also displays syntax and semantic errors and constraints as you write so you can make corrections immediately.
Saving schema changes
Once you’re finished your updates, you can save your Extraction schema so that Rossum applies the updates to your selected Queue and you can start using it for extraction. You also have the option of discarding your new changes and resuming your updates from the previously saved version.
You can find out more information on how to customize schema in Rossum also in the following video.