Pricing

The right plan for every team.

Start free. Add security and scale as you grow. Every plan includes full LLM observability.

Free

$0

Free forever

Start free

50,000 traces / month

1,000 evaluations / month

14-day trace retention

1 seat

  • fluiq.instrument(): full observability
  • Trace explorer & live dashboard
  • Streaming traces
  • Community support
  • fluiq.optimize(): response caching (🔒 Team+)
  • CI/CD eval gates (🔒 Team+)
  • Slack anomaly alerts (🔒 Team+)
  • fluiq.secure(): security scanning (🔒 Growth+)

Team

$49/mo

billed monthly

Start free

Unlimited traces

10,000 evaluations / month

90-day trace retention

Up to 5 seats

  • Everything in Free
  • fluiq.optimize(): response caching
  • CI/CD eval gates
  • Slack anomaly alerts
  • Email support (48h SLA)
  • fluiq.secure(): security scanning (🔒 Growth+)

Most Popular

Growth

$149/mo

billed monthly

Start free

Unlimited traces

100,000 evaluations / month

1-year trace retention

Up to 20 seats

  • Everything in Team
  • fluiq.secure(): security scanning
  • Prompt injection, PII, jailbreak & secret-leak protection
  • Custom eval thresholds
  • SSO (single sign-on)
  • Priority support (24h SLA)

Enterprise

Custom

Talk to us

Unlimited traces

Unlimited evaluations

Custom trace retention

Unlimited seats

  • Everything in Growth
  • VPC / on-prem deployment
  • Custom SLA & dedicated support
  • Audit logs & compliance exports
  • SAML / SCIM provisioning
  • Dedicated onboarding

Compare plans

Every feature, side by side.

Features                       Free        Team        Growth      Enterprise

Observability
Traces / month                 50K         Unlimited   Unlimited   Unlimited
Trace retention                14 days     90 days     1 year      Custom
Live dashboard                 Yes         Yes         Yes         Yes
Trace explorer                 Yes         Yes         Yes         Yes
Streaming traces               Yes         Yes         Yes         Yes

Evaluation
Evals / month                  1,000       10,000      100,000     Unlimited
LLM-as-judge metrics           Yes         Yes         Yes         Yes
CI/CD eval gates               -           Yes         Yes         Yes
Custom eval thresholds         -           -           Yes         Yes

Security: fluiq.secure()
Prompt injection blocking      -           -           Yes         Yes
PII detection & redaction      -           -           Yes         Yes
Jailbreak detection            -           -           Yes         Yes
Secret leak prevention         -           -           Yes         Yes
Indirect injection detection   -           -           Yes         Yes

Optimization: fluiq.optimize()
Response caching               -           Yes         Yes         Yes
Cache hit dashboard            -           Yes         Yes         Yes

Team & Access
Seats                          1           5           20          Unlimited
SSO                            -           -           Yes         Yes
SAML / SCIM                    -           -           -           Yes
Audit logs                     -           -           -           Yes

Support
Community support              Yes         Yes         Yes         Yes
Email support                  -           48h SLA     24h SLA     Dedicated
Slack alerts                   -           Yes         Yes         Yes
Dedicated onboarding           -           -           -           Yes

Deployment
Cloud (managed)                Yes         Yes         Yes         Yes
VPC / on-prem                  -           -           -           Yes

What you unlock

Two calls. Security and speed, handled.

fluiq.optimize() ships with Team, and fluiq.secure() unlocks on Growth.

fluiq.secure()

Included in Growth & Enterprise

One call. Full pipeline protection.

Wrap your pipeline with server-side security scanning that runs before any data is stored. Fluiq checks every prompt and response on the server, so its attack-detection patterns are never shipped in the public SDK.

  • PII Detection & Redaction: Names, emails, phone numbers, SSNs, and credit cards, detected and redacted before persistence.
  • Prompt Injection Blocking: Catches injection patterns, jailbreak attempts, and skeleton key attacks in real time.
  • Jailbreak & Semantic Attack Scoring: Semantic similarity scoring against known attack vectors, even when phrasing varies.
  • Secret Leak Prevention: Scans LLM outputs for leaked API keys, tokens, and high-entropy credential strings.
  • Indirect Injection Detection: Inspects tool outputs and context documents for second-order injection hidden in retrieved content.
  • Warn or Block mode: In warn mode (the default), Fluiq flags risks and attaches security metadata to the trace. In block mode, it intercepts the request before the LLM call and raises FluiqSecurityError.

Python
import fluiq

fluiq.instrument(api_key="fl_...")
fluiq.secure()  # warn mode (default): flags risks on the trace
# or: fluiq.secure(mode="block")  # block mode: raises FluiqSecurityError
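
The difference between the two modes can be pictured with a small stand-in. The sketch below does not call the Fluiq SDK: scan_prompt, guarded_call, the phrase list, the Trace class, and the locally defined FluiqSecurityError are hypothetical placeholders. They only illustrate the control flow, assuming warn mode annotates the trace and block mode raises before the model is called.

```python
from dataclasses import dataclass, field
from typing import Optional


class FluiqSecurityError(Exception):
    """Local stand-in for the exception named above (not imported from the SDK)."""


@dataclass
class Trace:
    prompt: str
    security: dict = field(default_factory=dict)
    response: Optional[str] = None


# Hypothetical, deliberately naive detector; the real scanning runs server-side.
SUSPICIOUS_PHRASES = ("ignore previous instructions", "reveal your system prompt")


def scan_prompt(prompt: str) -> str:
    """Return "HIGH" if a suspicious phrase is present, else "LOW"."""
    lowered = prompt.lower()
    return "HIGH" if any(p in lowered for p in SUSPICIOUS_PHRASES) else "LOW"


def guarded_call(prompt: str, llm, mode: str = "warn") -> Trace:
    """Run llm(prompt) under warn or block semantics."""
    trace = Trace(prompt=prompt, security={"risk": scan_prompt(prompt)})
    if mode == "block" and trace.security["risk"] == "HIGH":
        # Block mode: stop before the model is ever called.
        raise FluiqSecurityError(f"blocked HIGH-risk prompt: {prompt!r}")
    # Warn mode (or a low-risk prompt): call the model and keep the metadata.
    trace.response = llm(prompt)
    return trace
```

In warn mode the caller still gets a response and can inspect trace.security afterwards; in block mode the caller needs a try/except around the call.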

fluiq.optimize()

Included in Team & above

Serve repeated prompts from cache.

Fluiq analyses your historical traces to find which LLM calls repeat most often and provisions a dedicated Redis cache for your account. Repeated prompts are served instantly, saving both latency and cost.

  • Trace-Driven Cache Profiling: The backend mines your trace history to build a cache profile; no manual configuration is needed.
  • Automatic Cache Population: Real LLM responses are stored automatically on the first call; subsequent matches are served from Redis.
  • Cache mode: Full interception. Matching prompts never reach the LLM API.
  • Observe mode: Records what would have been a cache hit without intercepting, so you can review your savings before opting in.
  • Zero code changes: One fluiq.optimize() call after instrument(). The SDK handles connection, profiling, and cache lookup.
  • Cache hit dashboard: See hit rates, latency savings, and estimated cost savings in your Fluiq dashboard.

Python
import fluiq

fluiq.instrument(api_key="fl_...")
fluiq.optimize()  # cache mode: matching prompts are served from cache
# or: fluiq.optimize(mode="observe")  # observe mode: records would-be hits only
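
To make the cache and observe modes concrete, here is a minimal in-memory sketch. It is not Fluiq's implementation: PromptCache and its counters are hypothetical, a plain dict stands in for the Redis cache, and exact-match hashing is an assumption, since the page does not say how prompts are matched.

```python
import hashlib


class PromptCache:
    """In-memory stand-in for the Redis-backed cache described above."""

    def __init__(self, mode="cache"):
        if mode not in ("cache", "observe"):
            raise ValueError(f"unknown mode: {mode}")
        self.mode = mode
        self.store = {}        # prompt hash -> stored response
        self.hits = 0          # responses actually served from cache
        self.would_hit = 0     # observe mode: hits that were only recorded
        self.llm_calls = 0     # calls that reached the model

    @staticmethod
    def _key(prompt):
        # Assumption: exact-match key on the raw prompt text.
        return hashlib.sha256(prompt.encode("utf-8")).hexdigest()

    def complete(self, prompt, llm):
        key = self._key(prompt)
        if key in self.store:
            if self.mode == "cache":
                self.hits += 1
                return self.store[key]   # the LLM is never called
            self.would_hit += 1          # observe: record, then fall through
        self.llm_calls += 1
        response = llm(prompt)
        self.store[key] = response       # populate on the first real call
        return response
```

Comparing would_hit against llm_calls in observe mode gives a rough view of what cache mode would have saved.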

FAQ

Frequently asked questions

What counts as a trace?

One traced span, typically one LLM call, one retriever call, or one decorated function invocation. A single end-to-end agent run usually emits 5-20 traces depending on how many tools and LLM calls it makes. The Free plan includes 50,000 traces per month; Team and above are unlimited.
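
As a quick worked example of what the Free allowance covers, the arithmetic below uses only the figures quoted above (50,000 traces per month, 5 to 20 traces per agent run). The function name is illustrative.

```python
FREE_TRACES_PER_MONTH = 50_000   # Free plan allowance
TRACES_PER_RUN = (5, 20)         # typical range for one end-to-end agent run


def runs_within_allowance(allowance: int, traces_per_run: int) -> int:
    """Whole agent runs that fit in a monthly trace allowance."""
    return allowance // traces_per_run


light, heavy = TRACES_PER_RUN
print(runs_within_allowance(FREE_TRACES_PER_MONTH, heavy))  # 2500
print(runs_within_allowance(FREE_TRACES_PER_MONTH, light))  # 10000
```

So the Free plan covers roughly 2,500 to 10,000 agent runs per month, depending on how many spans each run emits.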

Which frameworks does Fluiq support?

Fluiq instruments at the function-call level and ships integrations for OpenAI, Anthropic, Gemini, LangChain, LangGraph, CrewAI, Google ADK, and raw HTTP calls via the @trace decorator. Streaming, tool calls, thinking tokens, and MCP servers are all captured automatically.
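
Function-call-level instrumentation can be pictured with a plain Python decorator. The sketch below is a generic illustration, not Fluiq's @trace: the trace function, the SPANS list, and the span fields are all hypothetical.

```python
import functools
import time

SPANS = []  # hypothetical in-memory sink; a real SDK would export spans


def trace(func):
    """Record one span (name, duration, status) per call of func."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        status = "ok"
        try:
            return func(*args, **kwargs)
        except Exception:
            status = "error"
            raise
        finally:
            # Runs on both success and failure, so every call yields a span.
            SPANS.append({
                "name": func.__name__,
                "duration_s": time.perf_counter() - start,
                "status": status,
            })
    return wrapper


@trace
def fetch_weather(city: str) -> str:
    # Placeholder for a raw HTTP call to some provider.
    return f"sunny in {city}"
```

Because the wrapper only sees a function call, the same pattern applies regardless of which framework or provider sits behind the function.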

What counts as an evaluation?

One LLM-as-judge scoring call: e.g. a hallucination check on an answer or a relevance score over a retrieved chunk set. Metrics include hallucination, faithfulness, relevance, toxicity, coherence, and completeness. Free includes 1,000 evals/month, Team 10,000, Growth 100,000, and Enterprise is unlimited.
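
For a rough sense of which plan fits a given workload, the sketch below encodes the trace, evaluation, and seat limits from the plan cards and returns the first plan that covers them. The PLANS table and smallest_plan function are illustrative helpers, not part of any Fluiq tooling.

```python
# Limits copied from the plan cards; None means unlimited.
PLANS = [
    ("Free",       {"traces": 50_000, "evals": 1_000,   "seats": 1}),
    ("Team",       {"traces": None,   "evals": 10_000,  "seats": 5}),
    ("Growth",     {"traces": None,   "evals": 100_000, "seats": 20}),
    ("Enterprise", {"traces": None,   "evals": None,    "seats": None}),
]


def smallest_plan(traces: int, evals: int, seats: int) -> str:
    """Return the first plan whose monthly limits cover the given usage."""
    usage = {"traces": traces, "evals": evals, "seats": seats}
    for name, limits in PLANS:
        if all(limits[k] is None or usage[k] <= limits[k] for k in usage):
            return name
    return "Enterprise"  # not reached: Enterprise has no limits


print(smallest_plan(traces=30_000, evals=800, seats=1))      # Free
print(smallest_plan(traces=200_000, evals=8_000, seats=4))   # Team
print(smallest_plan(traces=200_000, evals=40_000, seats=4))  # Growth
```

This looks at volume limits only. Feature requirements can push you higher: fluiq.secure(), for example, requires Growth regardless of usage.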

When do I need fluiq.secure()?

fluiq.secure() runs server-side security scanning: PII detection and redaction, prompt-injection and jailbreak blocking, secret-leak prevention, and indirect-injection detection. It is included on the Growth and Enterprise plans. In warn mode it flags risks on the trace without blocking; in block mode it raises FluiqSecurityError before a HIGH-risk prompt reaches the LLM.
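
To illustrate what detection and redaction before persistence means in practice, here is a deliberately simple regex-based sketch. It is not how Fluiq scans: the patterns, labels, and redact function are assumptions for illustration, and production-grade detectors need far more care than three regular expressions.

```python
import re
from typing import List, Tuple

# Illustrative patterns only; they will miss many real-world formats.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "CARD": re.compile(r"\b(?:\d{4}[ -]?){3}\d{4}\b"),
}


def redact(text: str) -> Tuple[str, List[str]]:
    """Replace matches with [LABEL] and return the labels that fired."""
    found = []
    for label, pattern in PII_PATTERNS.items():
        text, count = pattern.subn(f"[{label}]", text)
        if count:
            found.append(label)
    return text, found


clean, labels = redact("Mail jane@example.com, SSN 123-45-6789.")
print(clean)   # Mail [EMAIL], SSN [SSN].
print(labels)  # ['EMAIL', 'SSN']
```

Only the redacted text would be stored; the list of labels is the kind of security metadata that could be attached to a trace in warn mode.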

How does fluiq.optimize() work?

Available on Team and above. After you call fluiq.optimize(), the SDK fetches your trace-derived cache profile from the Fluiq backend, connects to a dedicated Redis instance provisioned for your account, and begins serving repeated prompts from cache. In observe mode it records what would have been a cache hit so you can review projected savings before enabling full interception.
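
The idea of a trace-derived cache profile can be sketched as a frequency count over historical prompts. This is an assumption about the general approach, not Fluiq's actual profiling logic: build_cache_profile, its parameters, and the whitespace and case normalization are all hypothetical.

```python
from collections import Counter


def build_cache_profile(prompts, min_repeats=2, top_n=3):
    """Return the most repeated prompts as (prompt, count) pairs."""
    # Light normalization so trivial whitespace or case changes still match.
    counts = Counter(" ".join(p.lower().split()) for p in prompts)
    return [(p, n) for p, n in counts.most_common(top_n) if n >= min_repeats]


history = [
    "What is your refund policy?",
    "what is your  refund policy?",
    "Summarize ticket 4821",
    "What are your opening hours?",
    "What are your opening hours?",
    "What is your refund policy?",
]
print(build_cache_profile(history))
# [('what is your refund policy?', 3), ('what are your opening hours?', 2)]
```

Prompts that appear only once, like the ticket summary here, are left out of the profile because caching them would never produce a hit.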

Do you support self-hosting?

Yes. VPC and on-prem deployments are available on the Enterprise plan. The SDK is a thin instrumentation layer and can be pointed at your own backend endpoint if you prefer full self-hosting.

Can I switch frameworks later?

Yes. Because Fluiq instruments at the call level, the same SDK works across all supported frameworks simultaneously. Switching from LangChain to LangGraph, or adding a new provider, requires no changes to your instrumentation.

Ship safer AI, faster.

Start free, then add security and scale when you need it.

No credit card required on Free and Team plans.