$5 free credits when you sign up
Use Case · Customer Support

Support bots with guardrails built in.

Support traffic carries real customer data and the occasional hostile prompt. Nemo Router redacts PII and blocks injection on every request, keeps tone consistent with versioned templates, and caps spend with budgets.

support-bot · guardrail trace

Every message, screened first

Injection scanclean
PII detectedemail, phone
PII actionredacted
Prompt templatesupport-v4
Modelgemini-2.5-flash
Budget$18 / $50
PII-safeinjection-blockedbudgeted
PII categories
5

Email, phone, SSN, card, IP — redacted

Injection defense
Every request

Inbound jailbreak patterns blocked

Prompt tone
Versioned

One template drives the whole fleet

Spend
Budgeted

Per-key and per-team cost ceilings

Why Nemo for support

Safety, consistency, and cost — handled

A support bot is the part of your AI stack that talks to real customers. It needs to protect their data, resist abuse, stay on-brand, and not blow the budget.

PII redaction on every message

Support conversations carry real customer data — emails, phone numbers, account details. PII redaction runs as a guardrail before the prompt leaves your gateway, so customer data never lands unredacted in a provider log.

  • Detects email, phone, SSN, credit-card, and IP-address patterns
  • Masks PII before the prompt reaches the model
  • Runs in-process with low overhead — LLM time still dominates
  • Every redaction recorded in the guardrail log

Prompt-injection defense

A public-facing bot gets hostile inputs — jailbreak attempts, instruction-override prompts. The injection-detection guardrail inspects every inbound message and blocks a flagged request before it reaches the model.

  • Detects jailbreak and instruction-override patterns
  • Flagged requests blocked before the provider call
  • Blocked events surfaced in the guardrail log for review
  • Scope is per-key, per-team, or per-org

Versioned prompt templates

Tone, escalation rules, and brand voice live in a central template — not scattered across bot deployments. Reference a template by id, pass variables at call time, and every bot updates when you edit it.

  • System prompt held centrally, versioned with history
  • Bots reference nemo_prompt_template_id + variables
  • A/B test two template versions to compare resolution quality
  • Roll back to a previous version without a redeploy

Cost control for high-volume traffic

Support volume is bursty and large. Per-key and per-team budgets cap spend, caching absorbs repeated questions, and real-time cost tracking shows exactly where the budget goes.

  • Per-key and per-team budgets with a hard 402 ceiling
  • Caching serves repeated FAQ answers without a provider call
  • Real per-call cost from the LiteLLM response-cost header
  • Pick a fast, low-cost catalog model for routine tickets
How it works

A support message, screened end to end

Every inbound message passes the same gates: injection detection, then PII redaction, then the templated prompt. Guardrails run before the model ever sees the text.

Support request flow

  1. Customer message

    inbound to your bot

    Real names, account details, sometimes a hostile prompt.

  2. Injection check

    guardrail · pre-call

    Jailbreak / override patterns blocked before the model.

  3. PII redaction

    guardrail · pre-call

    Email, phone, SSN, card, IP masked from the prompt.

  4. Template + model

    prompt template id

    Central system prompt; routed to your chosen model.

  5. Reply + logged

    guardrail log

    Redactions, blocks, cost, and latency all recorded.

Guardrails run in-process before the provider call. A blocked or redacted message never reaches the model — and the event is logged so your trust and safety team can review it.

Guardrails

PII never leaves your gateway unredacted

PII redaction

Five categories, masked before the model

When a customer pastes an order number, an email, or a card number into a support chat, the redaction guardrail catches it. The masked prompt is what the model — and the provider’s logs — actually see. The original stays inside your gateway boundary.

  • Email, phone, SSN, credit-card, and IP-address detection
  • Masking applied before the prompt is forwarded
  • Per-key / per-team / per-org guardrail scope
  • Every redaction recorded in the guardrail log
guardrail · pii-redaction

Before vs. after

Raw inputjane@acme.com
Redacted[EMAIL]
Raw input+1 555 010 4477
Redacted[PHONE]
Sent to providermasked only
5 categoriespre-calllogged
The code

Reference a template, let guardrails run

Your bot calls the standard chat endpoint and points at a prompt template by id. Guardrails apply automatically — you do not wire them per request. These snippets are generated from the SDK examples the playground uses.

Installpip install openai
1# Cache: enabled (org default). Pass nemo_cache: false to skip.
2from openai import OpenAI
3import os
4
5client = OpenAI(
6 api_key=os.environ["NEMOROUTER_API_KEY"],
7 base_url="https://api.nemorouter.ai/v1",
8)
9
10response = client.chat.completions.create(
11 model="gemini-2.5-flash",
12 temperature=1,
13 max_tokens=1024,
14 top_p=1,
15 messages=[
16 {"role": "user", "content": "Hello! What models do you support?"},
17 ],
18 extra_body={
19 # "nemo_cache": False, # Uncomment to skip cache
20 },
21)
22
23print(response.choices[0].message.content)

Pass nemo_prompt_template_id in extra_body to drive tone from a central, versioned template.

FAQ

Common support-bot questions

Safe by default

Ship a support bot your security team signs off on

PII redaction, injection defense, versioned prompts, and budgets — all unlocked on every plan across 20+ models.