OCR

Not all agents can read text off a scanned page; this extracts it as layout-aware markdown, tables included.

Documents

Extracts text from scanned PDFs and images of documents using optical character recognition. Returns the recognized text as layout-aware markdown, preserving reading order and tables. Works on photographs of documents, scanned reports, and image-based PDFs.

Use when

You have a scanned PDF or a photo of a document with no selectable text layer and need its contents as text.

Not for

PDFs that already have a selectable text layer, answering questions about a document, describing photos, or translating.

What you can do

Each is a sub-skill of OCR; the router picks the right one for your request.

Extract text via OCR

Runs OCR over a scanned PDF or document image and returns layout-aware markdown per page, tables included.

~10 credits / page (up to 3000)

What you get back

information

Returns the recognized text as markdown per page, in reading order, with tables preserved. Character styling like bold or italic is not preserved.

When it checks with you first

The document is very long and you have not said which pages you need, so reading every page would cost far more than reading the few that matter.

Run it

Skills run through one gateway with your Faro token. Hand it an intent in plain language; Faro routes to the right sub-skill, runs it, and bills per call.

curl -X POST "https://skill.askfaro.com/skills/ocr/run" \
  -H "Authorization: Bearer $FARO_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"intent":{"prompt":"Extract the text from this scanned PDF"}}'

Example requests

›Extract the text from this scanned PDF

Create a free account