Submits a publicly reachable image URL to the content classifier and returns a verdict across image-capable harm categories: sexual content, violence, and self-harm. Each category includes a calibrated confidence score alongside a boolean flag.
You want to screen a picture for explicit, violent, or self-harm content before showing or storing it.
Reading text out of the image, detecting AI-generated or manipulated images, or general image labeling.
0.1 credits / check
Set these inside the intent when you run it.
A publicly reachable URL of the image to check. Must be under 20 MB and not behind a login.
An overall flagged verdict, a per-category flag map, and confidence scores for the image-applicable harm categories.
Run this sub-skill directly: pin it with operation and pass its inputs in the intent. (Omit operation and the Content Moderation skill will route from your intent instead.)
curl -X POST "https://skill.askfaro.com/skills/content-moderation/run" \
-H "Authorization: Bearer $FARO_TOKEN" \
-H "Content-Type: application/json" \
-d '{"intent":{"operation":"moderate_image","image_url":"https://example.com/user-upload/photo.jpg"}}'Example requests