Data Model

Raydocs organizes document extraction into a clear hierarchy. Understanding this structure helps you design effective workflows and use the API efficiently.

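Resource Hierarchy

A Workspace contains Extraction Templates and Extraction Sessions. Each Session uses one Template, contains Documents (which are parsed into Chunks), and produces Extraction Results.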

Core Resources

Workspace

The top-level organizational container. Workspaces group templates, sessions, and team members together. Each workspace has its own set of users, each assigned one role: admin, user, or readonly.

Field	Description
`id`	Unique identifier
`name`	Display name
`icon`	Emoji or icon

One workspace can contain multiple templates, unlimited sessions, and multiple team members.
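As a rough illustration, the workspace fields and roles above could be modeled on the client side as follows. The class names `Role`, `Member`, and `Workspace`, the `email` field, and the example ids are assumptions made for this sketch; they are not taken from the Raydocs API.

```python
from dataclasses import dataclass, field
from enum import Enum


class Role(Enum):
    # The three role names come from the workspace description above.
    ADMIN = "admin"
    USER = "user"
    READONLY = "readonly"


@dataclass
class Member:
    email: str  # hypothetical field, used only for illustration
    role: Role


@dataclass
class Workspace:
    id: str
    name: str
    icon: str
    members: list[Member] = field(default_factory=list)

    def members_with_role(self, role: Role) -> list[Member]:
        """Return the members holding the given role."""
        return [m for m in self.members if m.role == role]


workspace = Workspace(id="ws_1", name="Invoices", icon="📄")
workspace.members.append(Member("ana@example.com", Role.ADMIN))
workspace.members.append(Member("bo@example.com", Role.READONLY))
admins = workspace.members_with_role(Role.ADMIN)
```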

Extraction Template

Defines what data to extract. Templates contain the JSON schema that specifies which fields to extract from documents. Each template belongs to one workspace and can be used across many sessions.

Field	Description
`id`	UUID identifier
`name`	Template name
`description`	Optional description
`schema_json`	Extraction schema definition
`settings`	Parsing configuration
`workspace_id`	Parent workspace

Key relationships:

Belongs to one Workspace
Used by many Sessions
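To make the template fields concrete, here is a minimal sketch of a template record as a plain dictionary. The `schema_json` value is a generic JSON Schema fragment used purely as a stand-in; the actual Raydocs schema format is described in the Extraction Schema Guide and may look different. The workspace id and the empty `settings` object are likewise placeholders.

```python
import json
import uuid

# Stand-in schema: NOT the Raydocs schema format, just a generic placeholder.
placeholder_schema = {
    "type": "object",
    "properties": {"invoice_number": {"type": "string"}},
}

template = {
    "id": str(uuid.uuid4()),          # templates use UUID identifiers
    "name": "Invoice fields",
    "description": None,              # optional per the table above
    "schema_json": placeholder_schema,
    "settings": {},                   # parsing configuration; contents not specified here
    "workspace_id": "ws_1",           # hypothetical parent workspace id
}

# Round-trip through JSON, as a client would when sending or storing the record.
encoded = json.dumps(template)
decoded = json.loads(encoded)
```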

Extraction Session

An individual extraction job. Sessions are where actual extraction happens. You upload documents to a session, run the extraction, and retrieve results. Each session uses one template.

Field	Description
`id`	UUID identifier
`name`	Session name
`extraction_template_id`	Template to use
`status`	pending, processing, completed, failed

Key relationships:

Uses one Template
Contains many Documents
Produces Results
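Because a session moves through several statuses, client code typically waits for it to finish. The sketch below assumes that `completed` and `failed` are the terminal statuses and that the caller supplies a `fetch_status` function (hypothetical) returning the current status string; how that function talks to the API is left out on purpose.

```python
import time
from enum import Enum
from typing import Callable


class SessionStatus(Enum):
    PENDING = "pending"
    PROCESSING = "processing"
    COMPLETED = "completed"
    FAILED = "failed"


# Assumption: these two statuses mean the session will not change again.
TERMINAL = {SessionStatus.COMPLETED, SessionStatus.FAILED}


def wait_for_session(
    fetch_status: Callable[[], str],
    interval_seconds: float = 2.0,
    max_polls: int = 30,
) -> SessionStatus:
    """Poll until the session reaches a terminal status or polls run out."""
    for _ in range(max_polls):
        status = SessionStatus(fetch_status())
        if status in TERMINAL:
            return status
        time.sleep(interval_seconds)
    raise TimeoutError("session did not finish within the polling budget")


# Usage with a fake status source standing in for a real API call.
fake_statuses = iter(["pending", "processing", "completed"])
final = wait_for_session(lambda: next(fake_statuses), interval_seconds=0)
```

Passing the status source in as a function keeps the polling logic independent of any particular HTTP client.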

Document

A source file for extraction. Documents are PDFs, images, or other files uploaded to a session. After upload, documents are automatically parsed into chunks for extraction.

Field	Description
`id`	UUID identifier
`filename`	Original filename
`mime_type`	File type
`size`	Size in bytes
`status`	pending, parsing, parsed, failed

Key relationships:

Belongs to one Session
Contains many Chunks (after parsing)
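The document fields map naturally onto information a client already has before uploading. This sketch derives `mime_type` and `size` locally with Python's standard `mimetypes` module; the `DocumentInfo` class and the `describe_upload` helper are hypothetical names, and the service may determine these values itself.

```python
import mimetypes
from dataclasses import dataclass


@dataclass
class DocumentInfo:
    filename: str
    mime_type: str
    size: int  # size in bytes
    status: str = "pending"


def describe_upload(filename: str, payload: bytes) -> DocumentInfo:
    """Build the metadata a client might hold for a file before upload."""
    guessed, _ = mimetypes.guess_type(filename)
    return DocumentInfo(
        filename=filename,
        # Fall back to a generic binary type when the extension is unknown.
        mime_type=guessed or "application/octet-stream",
        size=len(payload),
    )


doc = describe_upload("contract.pdf", b"%PDF-1.7 example bytes")
```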

Chunk

A parsed segment of a document. When documents are processed, they’re split into chunks: meaningful segments of text with page references. The AI uses these chunks to find and extract data.

Field	Description
`id`	UUID identifier
`content`	Text content
`page_number`	Source page
`chunk_index`	Order within document
`metadata`	Additional parsing info

Key relationships:

Belongs to one Document
Referenced in extraction Results
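Since each chunk carries both a `page_number` and a `chunk_index`, the original reading order can be rebuilt on the client. The following sketch groups chunk text by page; the `Chunk` class mirrors the table above, while `text_by_page` and the sample contents are made up for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Chunk:
    id: str
    content: str
    page_number: int
    chunk_index: int
    metadata: dict = field(default_factory=dict)


def text_by_page(chunks: list[Chunk]) -> dict[int, str]:
    """Join chunk text per page, preserving document order via chunk_index."""
    pages: dict[int, list[str]] = defaultdict(list)
    for chunk in sorted(chunks, key=lambda c: c.chunk_index):
        pages[chunk.page_number].append(chunk.content)
    return {page: " ".join(parts) for page, parts in pages.items()}


# Chunks deliberately listed out of order to show the sort.
chunks = [
    Chunk("c3", "Total due: 120.00", page_number=2, chunk_index=2),
    Chunk("c1", "Invoice 0042", page_number=1, chunk_index=0),
    Chunk("c2", "Billed to Acme", page_number=1, chunk_index=1),
]
pages = text_by_page(chunks)
```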

Extraction Result

The extracted data output. Results contain the structured data extracted from session documents according to the template schema. Each result includes the extracted values and, optionally, AI reasoning traces.

Field	Description
`id`	UUID identifier
`status`	pending, processing, completed, failed
`data`	Extracted values
`reasoning`	AI reasoning (if enabled)

Key relationships:

Belongs to one Session
References source Documents/Chunks
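A result should only be read once its status is `completed`, and `reasoning` may be absent. The helpers below show one defensive way to handle both cases; the function names and the sample payloads are assumptions, and the real shape of `data` depends on your template schema.

```python
from typing import Any, Optional


def extracted_values(result: dict[str, Any]) -> Optional[dict[str, Any]]:
    """Return `data` only when the result is completed, else None."""
    if result.get("status") != "completed":
        return None
    return result.get("data", {})


def reasoning_or_none(result: dict[str, Any]) -> Optional[Any]:
    """`reasoning` is present only if enabled, so treat it as optional."""
    return result.get("reasoning")


# Hypothetical payloads for illustration.
done = {"id": "r1", "status": "completed", "data": {"invoice_number": "0042"}}
running = {"id": "r2", "status": "processing"}
```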

Typical Workflow

Create a Workspace

Set up a workspace for your project or team. Invite collaborators if needed.

Design an Extraction Template

Define what data you want to extract using the JSON schema format. Include field definitions, search queries, and extraction prompts.

See the Extraction Schema Guide for detailed schema documentation.

Create an Extraction Session

For each batch of documents you want to process, create a session linked to your template.

Upload Documents

Add your source documents (PDFs, images, etc.) to the session. Documents are automatically parsed into chunks.

Run Extraction

Execute the extraction. The AI searches relevant chunks and extracts data according to your schema.

Retrieve Results

Access the structured extraction results via the API, or export them to Excel or CSV.
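The six steps above can be sketched end to end with an in-memory stand-in. `FakeRaydocs` is not a real client and none of its method names come from the Raydocs API; it only mirrors the order of operations and the id links between resources (template to workspace, session to template, documents to session).

```python
import itertools


class FakeRaydocs:
    """In-memory stand-in mirroring the workflow steps. Not a real client."""

    def __init__(self) -> None:
        self._ids = itertools.count(1)
        self.sessions: dict[str, dict] = {}

    def _new_id(self, prefix: str) -> str:
        return f"{prefix}_{next(self._ids)}"

    def create_workspace(self, name: str) -> dict:
        return {"id": self._new_id("ws"), "name": name}

    def create_template(self, workspace_id: str, name: str, schema_json: dict) -> dict:
        return {"id": self._new_id("tpl"), "name": name,
                "schema_json": schema_json, "workspace_id": workspace_id}

    def create_session(self, template_id: str, name: str) -> dict:
        session = {"id": self._new_id("ses"), "name": name,
                   "extraction_template_id": template_id,
                   "status": "pending", "documents": [], "result": None}
        self.sessions[session["id"]] = session
        return session

    def upload_document(self, session_id: str, filename: str) -> dict:
        # The fake marks documents as parsed immediately.
        doc = {"id": self._new_id("doc"), "filename": filename, "status": "parsed"}
        self.sessions[session_id]["documents"].append(doc)
        return doc

    def run_extraction(self, session_id: str) -> None:
        session = self.sessions[session_id]
        # A real service would extract values here; the fake stores an empty payload.
        session["result"] = {"id": self._new_id("res"), "status": "completed", "data": {}}
        session["status"] = "completed"

    def get_result(self, session_id: str) -> dict:
        return self.sessions[session_id]["result"]


client = FakeRaydocs()
ws = client.create_workspace("Finance")                           # 1. workspace
tpl = client.create_template(ws["id"], "Invoices", schema_json={})  # 2. template
ses = client.create_session(tpl["id"], "March batch")             # 3. session
client.upload_document(ses["id"], "invoice-0042.pdf")             # 4. documents
client.run_extraction(ses["id"])                                  # 5. extraction
result = client.get_result(ses["id"])                             # 6. results
```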

Resource Limits

Resource	Limit
Workspaces per user	Based on plan
Templates per workspace	Unlimited
Sessions per template	Unlimited
Documents per session	100
File size	50 MB

Limits may vary based on your subscription plan. Contact support for enterprise limits.
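A client can check a batch against the documented limits before uploading. This sketch hard-codes the two numeric limits from the table and assumes that 1 MB means 1024 × 1024 bytes, which the documentation does not specify; since limits may vary by plan, treat the constants as defaults rather than guarantees. The `preflight` helper is a hypothetical name.

```python
MAX_DOCUMENTS_PER_SESSION = 100
MAX_FILE_SIZE_BYTES = 50 * 1024 * 1024  # assumes 1 MB = 1024 * 1024 bytes


def preflight(file_sizes: dict[str, int], already_uploaded: int = 0) -> list[str]:
    """Return human-readable problems; an empty list means the batch fits."""
    problems = []
    total = already_uploaded + len(file_sizes)
    if total > MAX_DOCUMENTS_PER_SESSION:
        problems.append(
            f"{total} documents exceeds the limit of {MAX_DOCUMENTS_PER_SESSION}"
        )
    for name, size in file_sizes.items():
        if size > MAX_FILE_SIZE_BYTES:
            problems.append(f"{name} is larger than 50 MB")
    return problems


issues = preflight({"small.pdf": 10_000, "huge.pdf": 60 * 1024 * 1024})
```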

API Navigation

Workspaces API

Manage workspaces and team members.

Templates API

Create and configure extraction schemas.

Sessions API

Run extraction jobs on documents.

Results API

Access extracted data and exports.
