Content moderation that follows your rules.
Write your moderation policy as a bullet list in plain English. Send text, images, audio or video to one API. Get back a clear allow / block / review decision with a per-rule breakdown — powered by frontier LLMs.
- dec_INIT00marketplace-listingsblockphone number leaked
- dec_INIT01user-biosblockexternal profile link
- dec_INIT02marketplace-listingsallow—
- dec_INIT03support-ticketsreviewlegal threat detected
The rule list is the policy. Same shape across every use case.
Each example below is a real rule set you might run in production — plus a sample piece of content and the exact response we'd return. Click between them to see different industries.
- No phone numbers in listings
- No external chat handles (WhatsApp, Telegram, Signal)
- No discriminatory language
- No prohibited categories (weapons, drugs, prescription meds)
- Flag listings priced > 50% below market for human review
/v1/moderate with just the content.- No phone numbers in listingsmatchedGerman-format mobile number detected in body.+49 176 8123 1188
- No external chat handlesmatchedReference to off-platform messenger.WhatsApp
- No discriminatory languageokNo protected-class references found.
- No prohibited categoriesokItem category appears to be consumer electronics.
- Flag listings priced > 50% below marketokAsking price is within expected market range.
Three steps. From signup to first decision in under five minutes.
Write rules in plain English
One bullet per rule. No regex, no taxonomy lookups, no model training. Update them anytime — changes are live in seconds.
Send content to one API
Text, images, audio, video — same endpoint, same shape. We handle vision, transcription, and language detection for you.
Get an actionable decision
allow · review · block. Plus a per-rule breakdown with evidence so you can audit, debug, or surface it to a human moderator.
A console your trust & safety team will actually want to use.
Rules, decisions, costs and latency — all in one place. No third-party dashboards, no spreadsheets, no Slack screenshots.
A control room for every piece of content that hits your platform.
Live decision feed, per-rule-set block rates, latency, and spend — all in real time. Drill into any decision in two clicks.
Plain-English rules. Hard, soft, and allow overrides.
Define each rule once. Mark whether matching it blocks, sends to human review, or explicitly allows the content. Threshold logic decides the final outcome.
Every decision is auditable. With evidence.
Open any historical decision and see exactly which rules matched, what the model considered as evidence, and the full JSON we returned to your backend.
Same API. Wildly different rule sets.
Marketplaces & classifieds
Block listings that leak contact info, hide prohibited categories, and stop scam patterns before they reach buyers.
- No phone numbers or external chat handles
- Detect fake / off-platform deals
- Block prohibited categories per country
Dating & social apps
Keep bios, photos and DMs aligned with your community guidelines — without paying a team of 50 human moderators.
- Block explicit content or solicitation
- Detect underage indicators
- Stop links to external monetisation
AI chat products
Wrap your LLM with a fast moderation layer that catches prompt injection, off-topic abuse, and policy violations.
- Pre- and post-generation checks
- Block prompt injection attempts
- Enforce brand-safe outputs
Customer support
Triage tickets by tone and risk. Auto-escalate legal threats. Let real abuse jump the queue.
- Detect legal threats for human review
- Allow profanity, block violence
- Sentiment-aware queue ordering
Stop forcing your business rules into someone else's 13 categories.
Comparison reflects publicly listed self-serve plans as of May 2026. Names and trademarks belong to their respective owners. Used here for comparison only.
Ship in an afternoon, not a sprint.
One call. Same response shape across every modality. SDKs for TypeScript and Python with full type definitions — or just hit the HTTP API directly. We don't reinvent webhooks, batching, or idempotency.
- Idempotency keys & per-rule-set throttling out of the box.
- Webhooks for async / high-volume jobs.
- Streaming responses for chat use cases.
curl https://api.simplemoderation.com/v1/moderate \
-H "Authorization: Bearer sk_live_..." \
-H "Content-Type: application/json" \
-d '{
"rule_set": "marketplace-listings",
"content": {
"type": "text",
"text": "Selling iPhone 15 Pro. WhatsApp +49 176 8123 1188"
}
}'Built like a piece of infrastructure. Priced like a side project.
Transparent pay-as-you-go. Per-token, never per-seat.
- 5,000 decisions / month included
- All modalities (text, image, audio, video)
- Up to 3 rule sets
- Email support
- 50,000 included decisions
- Unlimited rule sets & versions
- Slack support, 24h SLA
- EU or US data residency
- Volume discounts from 1M / mo
- Dedicated VPC + private model
- 99.99% SLA, 24/7 on-call
- Custom data processing agreement
Questions buyers actually ask.
How is this different from OpenAI’s free Moderation API?
OpenAI’s API returns 13 fixed categories like ‘sexual’, ‘violence’, ‘harassment’. It doesn’t know your business. We let you write ‘no phone numbers in marketplace listings’ or ‘flag legal threats for human review’ — and we return per-rule decisions with evidence. We’re also multimodal and don’t cap you on requests.
What about latency?
LLM-backed decisions on a rich rule set typically take a few seconds — that’s the cost of getting per-rule reasoning and evidence. For inline UX (e.g. chat) you can opt into a streaming “decision-only” mode that returns block/allow in under a second and posts the full breakdown to a webhook. Image and video are async by default.
Do you store our content?
Only if you explicitly enable decision logs, and only the content related to that decision. You can turn it off, configure per-rule-set retention, or pin storage to the EU. Nothing is used to train shared models.
Can I A/B test rule changes?
Yes. Every rule set is versioned. You can shadow-mode a new version against live traffic for a few days, compare disagreement rate against the production version, and promote when you’re confident.
What models do you use under the hood?
A pool of frontier LLMs (Anthropic, OpenAI, Google) for the heavy decisions, and small fine-tuned models for cheap pre-filtering. You don’t pick a model — we route based on rule complexity and your latency budget.
Can I try before I commit?
Every plan ships with a 14-day money-back guarantee — no questions asked, full refund if it’s not the right fit. You can also walk through the examples on this page or email [email protected] and we’ll set up a guided demo.
Stop policing user content with regex and Slack messages.
Ship in an afternoon. Iterate on your rules in seconds. Sleep at night knowing every decision is auditable.