Documents and RAG

Orlo supports task-scoped retrieval context through uploaded documents and retrieval chunks.

Supported document flows

POST /v1/tasks/:task_id/documents

Supported content types:

The API chunks the text immediately, stores the retrieval chunks, enqueues embedding jobs, and returns 202 Accepted.
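As a rough illustration of this flow, the sketch below builds (but does not send) a text upload request with Python's standard library. Only the endpoint path and the 202 response come from this page. The host `https://orlo.example`, the bearer-token header, and the body fields `content_type` and `text` are assumptions made for the example, so check them against the actual request schema.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical host; replace with your real Orlo base URL.
BASE_URL = "https://orlo.example"


def build_text_upload(task_id: str, text: str, token: str) -> urllib.request.Request:
    """Build a POST request for /v1/tasks/:task_id/documents without sending it."""
    # Assumed body shape: the field names below are illustrative only.
    body = json.dumps({"content_type": "text/plain", "text": text}).encode("utf-8")
    path = f"/v1/tasks/{urllib.parse.quote(task_id, safe='')}/documents"
    return urllib.request.Request(
        url=BASE_URL + path,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumed auth scheme; the source does not describe authentication.
            "Authorization": f"Bearer {token}",
        },
    )


def is_queued(status: int) -> bool:
    """A 202 means the upload was accepted and embeddings are still pending."""
    return status == 202
```

Passing the request to `urllib.request.urlopen` would send it. A 202 only confirms that the document was accepted, so a client should not assume the embeddings are ready when the call returns.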

POST /v1/tasks/:task_id/documents

Provide a chunks[] array to skip Orlo's inline chunking and go straight to embedding.
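A minimal sketch of client-side chunking for this flow follows. The fixed-size splitter with overlap is a generic technique chosen for illustration, not Orlo's own chunking logic. The `chunks` key comes from this page, while the per-chunk shape (`{"text": ...}`) is an assumption.

```python
def split_into_chunks(text: str, size: int = 200, overlap: int = 20) -> list[str]:
    """Split text into fixed-size character windows that overlap by `overlap`."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("size must be positive and overlap must be in [0, size)")
    step = size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        # Stop once this window reaches the end of the text.
        if start + size >= len(text):
            break
    return chunks


def build_prechunked_body(text: str, size: int = 200, overlap: int = 20) -> dict:
    """Wrap the chunks in a request body; the element shape is hypothetical."""
    return {"chunks": [{"text": piece} for piece in split_into_chunks(text, size, overlap)]}
```

Chunking on the client lets you control the boundaries (for example, splitting on headings instead of character counts) before Orlo embeds the result.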

POST /v1/tasks/:task_id/documents

PDF-specific notes:

For larger PDFs:

This is the preferred flow for larger PDFs.

Orlo resolves storage in this order:

PDF extraction is asynchronous.
Retrieval is task-scoped, so multiple documents under the same task can contribute context.
Direct PDF uploads are currently supported for born-digital PDFs first; OCR support depends on the worker runtime.
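To make the task-scoping note concrete, here is a toy in-memory model in which retrieval filters by task rather than by document. The `Chunk` type, the `retrieve` function, and the keyword-overlap scoring are hypothetical stand-ins. Orlo's actual retrieval uses embeddings, and its ranking is not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    task_id: str
    document_id: str
    text: str


def retrieve(chunks: list, task_id: str, query: str, limit: int = 3) -> list:
    """Return the best-matching chunks for a task, across all of its documents."""
    terms = set(query.lower().split())
    scored = []
    for chunk in chunks:
        # Scope by task only: any document under the task may contribute.
        if chunk.task_id != task_id:
            continue
        overlap = len(terms & set(chunk.text.lower().split()))
        if overlap:
            scored.append((overlap, chunk))
    # Stand-in ranking: more shared words first (not an embedding search).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:limit]]


store = [
    Chunk("t1", "docA", "refund policy lasts thirty days"),
    Chunk("t1", "docB", "refund requests need a receipt"),
    Chunk("t2", "docC", "refund rules for another task"),
]
hits = retrieve(store, "t1", "refund receipt")
```

Here `hits` draws on both `docA` and `docB` because they share task `t1`, while `docC` is excluded because it belongs to a different task.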