Skills/Content Moderation/Score for triage

Score for triage

0.1 credits / check

Runs the same content check but surfaces the full per-category score map as the primary output, ranked by confidence. Use this when you want to apply your own thresholds per category or sort a review queue by risk level rather than acting on the default flag.

Use when

You want to set your own per-category thresholds or rank a queue by confidence, not trust one flag.

Not for

A simple yes-or-no check where the default verdict is enough.

Cost

0.1 credits / check

What it accepts

Set these inside the intent when you run it.

textoptional

Text content to score. Provide text, an image URL, or both.

image_urloptional

A publicly reachable URL of an image to score alongside or instead of text.

What you get back

The full per-category calibrated score map (0 to 1) ranked by confidence, alongside the overall flag, so you can apply your own thresholds.

Run it

Run this sub-skill directly: pin it with operation and pass its inputs in the intent. (Omit operation and the Content Moderation skill will route from your intent instead.)

curl -X POST "https://skill.askfaro.com/skills/content-moderation/run" \
  -H "Authorization: Bearer $FARO_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"intent":{"operation":"triage","text":"This content might be borderline -- score it for me.","image_url":"https://example.com/review/image.jpg"}}'

Example requests

  • Score this batch of comments so I can rank them by risk level
  • Apply my own threshold of 0.8 for violence and 0.6 for hate speech to this post
  • Give me confidence scores per category for this piece of content