Schema Configuration for Document Extraction

Define custom extraction schemas for any document type, including field types, validation rules, and line item extraction.

Last updated: April 8, 2026

Field types

Supported field types: text, number, currency, date, boolean, enum, and line_item. Each field has a name, type, description, and an extraction prompt that the model uses to find the value.
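
As a rough illustration, a field definition could be modeled as below. This is a sketch only: the class names `FieldType` and `Field` and the attribute name `extraction_prompt` are assumptions made for the example, not the product's actual schema format. Only the seven type names and the four per-field properties come from the description above.

```python
from dataclasses import dataclass
from enum import Enum


class FieldType(str, Enum):
    """The seven supported field types listed above."""
    TEXT = "text"
    NUMBER = "number"
    CURRENCY = "currency"
    DATE = "date"
    BOOLEAN = "boolean"
    ENUM = "enum"
    LINE_ITEM = "line_item"


@dataclass
class Field:
    """Hypothetical container for one schema field."""
    name: str
    type: FieldType
    description: str
    extraction_prompt: str  # instruction the model uses to find the value


# Example fields for an invoice schema (illustrative values only).
invoice_fields = [
    Field(
        name="invoice_number",
        type=FieldType.TEXT,
        description="Unique identifier printed on the invoice",
        extraction_prompt="Find the invoice number, usually near the top of the first page.",
    ),
    Field(
        name="total_due",
        type=FieldType.CURRENCY,
        description="Total amount payable",
        extraction_prompt="Find the final total due, after tax and discounts.",
    ),
]
```

Keeping the extraction prompt separate from the description lets the description stay short for human readers while the prompt carries the detail the model needs.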

Validation rules

Add per-field validation rules: regex patterns, min/max bounds, a required or optional flag, and custom evaluators. Failed validations surface as warnings on the Review & Edit screen.
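
The following sketch shows one way the four kinds of rule could be evaluated against an extracted value. The `Rules` class, the `validate` function, and the warning wording are all hypothetical, and the product's own rule syntax may differ. The point is that each failed rule yields a warning instead of rejecting the document.

```python
import re
from dataclasses import dataclass, field
from typing import Any, Callable, List, Optional


@dataclass
class Rules:
    """Hypothetical per-field validation rules."""
    required: bool = False
    pattern: Optional[str] = None        # regex the whole value must match
    min_value: Optional[float] = None
    max_value: Optional[float] = None
    # Custom evaluators return a warning message, or None if the value passes.
    evaluators: List[Callable[[Any], Optional[str]]] = field(default_factory=list)


def validate(name: str, value: Any, rules: Rules) -> List[str]:
    """Return a list of warnings for one extracted value (empty if it passes)."""
    warnings: List[str] = []
    if value is None or value == "":
        if rules.required:
            warnings.append(f"{name}: required value is missing")
        return warnings  # nothing else to check on an empty value
    if rules.pattern is not None and not re.fullmatch(rules.pattern, str(value)):
        warnings.append(f"{name}: does not match pattern {rules.pattern}")
    # Bounds apply to numeric values only (bool is excluded on purpose).
    if isinstance(value, (int, float)) and not isinstance(value, bool):
        if rules.min_value is not None and value < rules.min_value:
            warnings.append(f"{name}: {value} is below minimum {rules.min_value}")
        if rules.max_value is not None and value > rules.max_value:
            warnings.append(f"{name}: {value} is above maximum {rules.max_value}")
    for evaluator in rules.evaluators:
        message = evaluator(value)
        if message:
            warnings.append(f"{name}: {message}")
    return warnings


def whole_cents(value: Any) -> Optional[str]:
    """Example custom evaluator: flag amounts with more than two decimals."""
    return "more than two decimal places" if round(value, 2) != value else None


total_rules = Rules(required=True, min_value=0, evaluators=[whole_cents])
print(validate("total_due", -5.0, total_rules))
```

Returning a list of messages, instead of raising on the first failure, matches the behavior described above, where failures are shown to a reviewer as warnings.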

Line item extraction

For tabular data such as invoice line items or purchase order (PO) lines, define a line_item field with a sub-schema. The model extracts the table and returns one row per line item, with each sub-field populated.
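
To make the shape concrete, here is a sketch of a line_item field and the kind of result it might produce. The key names (`sub_schema`, `extraction_prompt`), the sample rows, and the helper `missing_sub_fields` are assumptions for illustration. The actual schema format and response payload may be structured differently.

```python
# Hypothetical line_item field with a four-column sub-schema.
line_items_field = {
    "name": "line_items",
    "type": "line_item",
    "description": "Rows of the invoice's itemized table",
    "extraction_prompt": "Extract every row of the line item table.",
    "sub_schema": [
        {"name": "description", "type": "text"},
        {"name": "quantity", "type": "number"},
        {"name": "unit_price", "type": "currency"},
        {"name": "amount", "type": "currency"},
    ],
}

# Illustrative extraction result: one dict per table row.
extracted_rows = [
    {"description": "Widget", "quantity": 2, "unit_price": 9.50, "amount": 19.00},
    {"description": "Shipping", "quantity": 1, "unit_price": 5.00, "amount": 5.00},
]


def missing_sub_fields(field, rows):
    """Map row index -> names of sub-fields absent from that row."""
    expected = [sub["name"] for sub in field["sub_schema"]]
    gaps = {}
    for index, row in enumerate(rows):
        missing = [name for name in expected if name not in row]
        if missing:
            gaps[index] = missing
    return gaps


print(missing_sub_fields(line_items_field, extracted_rows))
```

A check like this is a cheap way to confirm that every returned row carries all the sub-fields the sub-schema declares before the data moves downstream.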