Zespan is an AI agent observability and engineering platform. It traces every agent decision, tool call, handoff, and delegation in production. It also provides prompt versioning, built-in LLM-as-judge evaluations, guardrails, cost optimization, and an AI ops assistant called ZespanPilot.

How do I instrument my AI agent with Zespan?

Zespan requires 2 lines of code. Import zespan and call zespan.init({ apiKey: process.env.LT_KEY }). This auto-patches OpenAI, Anthropic, Gemini, Bedrock, and Mistral. For framework-level tracing, add one handler: ZespanCallbackHandler for LangChain, ZespanCrewAIListener for CrewAI, or ZespanADKHandler for Google ADK.

Does Zespan support prompt versioning?

Yes. Zespan includes prompt management with versioning, a playground for iteration, and A/B testing to compare prompt versions against each other in production.

What evaluations does Zespan support?

Zespan ships 12 built-in LLM-as-judge evaluation templates including faithfulness, relevance, toxicity, groundedness, and more. Evaluations run automatically on every trace with no custom scoring functions required.

How does Zespan compare to Langfuse?

Zespan is agent-native: every span carries agent identity, delegations are first-class trace events, and an agent map is built automatically. Langfuse was built for LLM pipelines and extended to agents later. Zespan also ships 12 built-in eval templates (Langfuse has none), includes an AI cost optimizer, and ZespanPilot for AI ops. Langfuse has open-source self-hosting; Zespan does not.

What is the free tier for Zespan?

The free tier includes 10,000 traces per month, 14-day retention, 2 projects, and 1 seat. No credit card required.

Feature — Guardrails

Stop bad outputs before they reach users.

7 guardrail types run inline on every LLM request — block, warn, redact, or log. PII, toxicity, topic drift, format, cost ceiling, and custom rules.

Pre-call and post-call phases. Configurable per agent. Live test before deploying.

Start for free →Get a demo

zespan.com — guardrails

Works withPre-call phasePost-call phaseAgent filterRegexLLM judgePII redaction

types

actions

50ms

min latency cap

7 Guardrail Types

PII detects and redacts personal data. Toxicity blocks harmful content. Topic boundary prevents scope creep. Format enforces output structure. Cost ceiling blocks expensive requests. Custom LLM uses your own judge prompt. Regex handles exact pattern matching.

PII, Toxicity, Topic Boundary, Format, Cost Ceiling, Custom LLM, Regex
Phase: pre (before LLM call), post (after response), or both
Priority 0–100: controls execution order when multiple guardrails apply

7 guardrail types

Zespan guardrail configuration showing types, actions, and phase settings

4 Actions: Block, Warn, Redact, Log

Block rejects the request and returns a GuardrailBlockedError. Warn logs the issue and allows through. Redact removes matching content and allows the modified text through. Log records without interfering.

Block: throws GuardrailBlockedError — handle in your catch block
Redact: matching content removed, modified text returned in result.modifiedText
Warn / Log: zero user-visible impact, full audit trail

Live Test Before Deploying

Pass any draft guardrail config and arbitrary input text to the live test endpoint — no save required, no deployment needed. See exactly what would be blocked, warned, or redacted before it goes live.

Test with draft config: preview guardrail behavior without saving
Apply guardrails in Playground: validate prompt safety interactively
Per-guardrail latency cap (50ms–30s): slow guards never block requests

Execution Logs & Metrics

Every guardrail check is logged: slug, passed/failed, action taken, reason, modified text, and latency. Time-range metrics (pass/block/warn/redact rates) available for 24h, 7d, 30d. All config changes written to audit log.

Per-check log: queryable by guardrail ID, result, and time range
Result caching: repeated identical inputs skip re-evaluation via CacheLayer
Audit log: create/update/enable/disable events with actor user ID and IP

Get started

Set up in under 5 minutes

typescriptGuardrails

import { Zespan } from '@zespan/sdk';

const lt = new Zespan({ apiKey: process.env.ZESPAN_API_KEY });

// Guardrail check — configured in dashboard, enforced by SDK
try {
  const result = await lt.guardrails.check({
    input: userMessage,
    projectId: 'your-project-id',
  });
  // result.passed, result.action, result.modifiedText
} catch (err) {
  if (err instanceof GuardrailBlockedError) {
    return { blocked: true, reason: err.reason };
  }
}

Start for free →Get a demo

Frequently asked

Do guardrails add latency to my LLM calls?

Only pre-call guardrails add latency — they run before the LLM call. Post-call guardrails run after and don't affect your response time. For pre-call guards, you can configure a max latency cap (50ms–30s) so a slow guardrail never blocks the request.

What happens when a guardrail blocks a request?

The SDK throws a GuardrailBlockedError with a reason field. Catch this error in your application and handle it — return a fallback response, log it, or show the user an appropriate message.

Can I scope a guardrail to only apply to certain agents?

Yes. Each guardrail has an agent filter field — set it to specific agent names and that guardrail only runs for those agents. Different agents can have different safety rules on the same project.

What's the difference between a custom LLM guardrail and a regex guardrail?

Regex guardrails use pattern matching — they're fast (sub-millisecond) and deterministic, ideal for exact strings, known PII formats, or prohibited phrases. Custom LLM guardrails use an LLM as judge — slower but understand context, semantics, and nuance. Use regex for rules you can fully specify; use custom LLM for rules that require judgment.

Are guardrail results cached?

Yes. Guardrail results are cached by input hash via CacheLayer. If the same input is seen again, Zespan returns the cached result without re-running the check — saving latency and LLM judge costs on repeated inputs.

Explore more features

Setup takes under 5 minutes. Works with OpenAI, Anthropic, LangChain, and more.

Get started free →Get a demo

← All features