Name: Fluiq
Brand: Fluiq

Question 1

What counts as a trace?

Accepted Answer

One traced span, typically one LLM call, one retriever call, or one decorated function invocation. A single end-to-end agent run usually emits 5-20 traces depending on how many tools and LLM calls it makes. Trace ingestion is unlimited and free on every plan. The Free plan keeps a rolling 14-day history, while Team and above retain your traces indefinitely.

Question 2

Which frameworks does Fluiq support?

Accepted Answer

Fluiq instruments at the function-call level and ships integrations for OpenAI, Anthropic, Gemini, LangChain, LangGraph, CrewAI, Google ADK, and raw HTTP calls via the @trace decorator. Streaming, tool calls, thinking tokens, and MCP servers are all captured automatically.

Question 3

What counts as an evaluation?

Accepted Answer

One LLM-as-judge scoring call: e.g. a hallucination check on an answer or a relevance score over a retrieved chunk set. Metrics include hallucination, faithfulness, relevance, toxicity, coherence, and completeness. Free includes 100 evals/month, Starter 2,000, Team 10,000, Growth 50,000, and Enterprise is unlimited. Beyond your allowance you pay per evaluation, priced by depth: $0.007 for a single-judge metric up to $0.545 for a deep agentic run with a multi-model jury, because those are genuinely different amounts of work.

Question 4

When do I need fluiq.secure()?

Accepted Answer

fluiq.secure() runs server-side security scanning: PII detection and redaction, prompt-injection and jailbreak blocking, secret-leak prevention, indirect-injection and RAG-poisoning detection, tool-input exfiltration and allowlist enforcement, and multi-agent trust checks (cross-agent injection and trust-boundary escalation). It is available on every plan, including Free, and metered by scan volume rather than locked behind a tier: 1,000 scans a month on Free, up to 2,000,000 on Growth. Scanning is pattern and NER based with no LLM call, so it costs a fraction of an evaluation to run and we price it that way. In warn mode it flags risks on the trace without blocking; in block mode it raises FluiqSecurityError before a HIGH-risk prompt reaches the LLM.

Question 5

How does fluiq.optimize() work?

Accepted Answer

Available on Team and above. After you call fluiq.optimize(), the SDK fetches your trace-derived cache profile from the Fluiq backend, connects to a dedicated Redis instance provisioned for your account, and begins serving repeated prompts from cache. In observe mode it records what would have been a cache hit so you can review projected savings before enabling full interception.

Question 6

Do you support self-hosting?

Accepted Answer

Yes. VPC and on-prem deployments are available on the Enterprise plan. The SDK is a thin instrumentation layer and can be pointed at your own backend endpoint if you prefer full self-hosting.

Question 7

Can I switch frameworks later?

Accepted Answer

Yes. Because Fluiq instruments at the call level, the same SDK works across all supported frameworks simultaneously. Switching from LangChain to LangGraph, or adding a new provider, requires no changes to your instrumentation.

Question 8

How do I know if my LLM is actually working in production?

Accepted Answer

Fluiq traces every LLM call, tool call, and decorated function automatically, so you watch live token usage, latency (p50/p95/p99), USD cost per agent node, and pass/fail status stream onto the dashboard as runs complete. Automated LLM-as-judge evals score hallucination, faithfulness, relevance, and more on real production traffic, and Slack alerts fire the moment quality regresses or failure rates climb, so "is it working?" becomes a number you watch rather than a guess.

Question 9

What's the cheapest way to monitor my LLM application?

Accepted Answer

Start on Fluiq's Free plan: unlimited traces, 100 evaluations, and 1,000 security scans per month at no cost, with full tracing, cost attribution, and latency analytics included. Instrumentation is one line, fluiq.instrument(), so there's no agent to run and no infrastructure to host. Free keeps a rolling 14-day history; when you outgrow it, Starter and above retain traces indefinitely, and fluiq.optimize() caches repeated prompts to cut model spend, so monitoring can actually lower your bill instead of adding to it.

Question 10

Will adding observability slow down my LLM?

Accepted Answer

No. The Fluiq SDK is a thin instrumentation layer that records spans and ships them to the backend in the background, so it adds negligible overhead to the call itself. It is also fail-open by design: if the Fluiq backend is ever slow or unreachable, your application keeps running and never blocks waiting on a trace. You get full visibility without paying for it in latency.

Question 11

How do I know if my AI is actually secure?

Accepted Answer

fluiq.secure() scans both sides of every call. Before the model runs, pre-call scanning catches prompt injection, jailbreaks, and skeleton-key attacks; after it runs, post-call scanning redacts PII and secrets and inspects the whole trace tree for agentic threats: RAG poisoning, tool-input exfiltration, tools used outside their allowlist, and multi-agent trust attacks. Every risk is flagged on the trace with a severity and category, so security is something you see per request instead of assume.

Question 12

What happens when an AI system gets hacked?

Accepted Answer

Common attacks such as a jailbreak prompt, a poisoned retrieved document, or a tool tricked into leaking data all try to make your model ignore its instructions or exfiltrate sensitive information. With fluiq.secure() in block mode, Fluiq raises a FluiqSecurityError before a high-risk prompt ever reaches the model; in warn mode it records the attempt on the trace without interrupting traffic. Because scanning fails open, a scanner error degrades to observe-only instead of taking your app down, and every blocked or flagged event can alert your team in Slack.

Question 13

Can someone trick my LLM into giving away secrets?

Accepted Answer

That is exactly the class of attack Fluiq is built to stop. The response gate scans model output for PII and high-entropy secrets and redacts them before they are stored or returned, while tool-input exfiltration and allowlist checks catch sensitive data being smuggled out through tool calls. Pre-call injection detection blocks prompts engineered to override your system instructions, and post-call scanning reads retrieved docs and sibling spans to catch indirect injection planted in your RAG sources.

Features	Free	Starter	Team	Growth	Enterprise
Observability
Traces / month	Unlimited	Unlimited	Unlimited	Unlimited	Unlimited
Trace retention	14 days	Unlimited	Unlimited	Unlimited	Unlimited
Live dashboard & trace explorer
Per-node token & cost attribution
p50 / p95 / p99 latency tracking
Spend breakdown by provider & model
Multi-agent DAG rendering (LangGraph, CrewAI, ADK)
Agent summaries & per-run rollups
Streaming traces
Multimodal trace capture (images, audio)
Import from LangSmith, Langfuse, Phoenix, Braintrust
Tamper-evident audit log
Evaluation
Evals / month included	100	2,000	10,000	50,000	Unlimited
LLM-as-judge metrics
Agentic evaluation: tool selection & trajectory
Multi-agent coordination scoring
Depth control (fast / standard / deep)
Choose your judge model
Bring your own provider keys (BYOK)
Transparent judge prompts (exact prompt & version on every score)
Vision / multimodal judging
Warn & block eval modes
End-user feedback & team annotations
Multi-model judge jury with per-juror audit trail
CI/CD eval gates (python -m fluiq.ci)
Custom eval thresholds
Editable judge prompts (per-org overrides)
Custom client judges (your own prompt as a scorer)
Pay-as-you-go beyond the allowance					Committed
Security
Security scans / month included	1,000	50,000	500,000	2,000,000	Unlimited
Prompt injection detection
Jailbreak & skeleton-key detection
Semantic attack scoring
PII detection & redaction
Secret leak prevention
Indirect injection detection
RAG poisoning detection
Tool-input exfiltration & allowlist enforcement
Cross-agent injection & trust-boundary escalation
Image & multimodal scanning
Warn or block mode
Custom guardrail policies
Prompt management
Versioned prompt registry
Fetch by slug from the SDK
Version history & one-click restore
Environment deploys (dev / staging / prod)
Prompts reusable as custom judges
Datasets
Golden datasets built from traces
Whole-trajectory capture (steps, tools, MCP, media)
Connect Agents auto-sync
Batch eval & security runs over a dataset
Run-vs-run regression comparison
Per-run judge & jury selection
Optimization
Trace-driven cache profiling
Prompt response caching
Embedding caching
Observe mode (measure savings before intercepting)
Cache hit-rate dashboard
Optimization Insights: cache candidates & projected savings
Cost hotspots: slowest calls, error rates, top spenders
Team & Access
Seats	1	3	10	25	Unlimited
API keys	1	3	5	15	50
Multiple organizations
Teammate invitations & roles
SSO
SAML / SCIM provisioning
Compliance exports
Support
Community support
Slack alerts on eval & security events
Email support		72h SLA	48h SLA	24h SLA	Dedicated
Dedicated onboarding
Deployment
Cloud (managed)
VPC / on-prem

Features	Free	Starter	Team	Growth	Enterprise
Observability
Traces / month	Unlimited	Unlimited	Unlimited	Unlimited	Unlimited
Trace retention	14 days	Unlimited	Unlimited	Unlimited	Unlimited
Live dashboard & trace explorer
Per-node token & cost attribution
p50 / p95 / p99 latency tracking
Spend breakdown by provider & model
Multi-agent DAG rendering (LangGraph, CrewAI, ADK)
Agent summaries & per-run rollups
Streaming traces
Multimodal trace capture (images, audio)
Import from LangSmith, Langfuse, Phoenix, Braintrust
Tamper-evident audit log
Evaluation
Evals / month included	100	2,000	10,000	50,000	Unlimited
LLM-as-judge metrics
Agentic evaluation: tool selection & trajectory
Multi-agent coordination scoring
Depth control (fast / standard / deep)
Choose your judge model
Bring your own provider keys (BYOK)
Transparent judge prompts (exact prompt & version on every score)
Vision / multimodal judging
Warn & block eval modes
End-user feedback & team annotations
Multi-model judge jury with per-juror audit trail
CI/CD eval gates (python -m fluiq.ci)
Custom eval thresholds
Editable judge prompts (per-org overrides)
Custom client judges (your own prompt as a scorer)
Pay-as-you-go beyond the allowance					Committed
Security
Security scans / month included	1,000	50,000	500,000	2,000,000	Unlimited
Prompt injection detection
Jailbreak & skeleton-key detection
Semantic attack scoring
PII detection & redaction
Secret leak prevention
Indirect injection detection
RAG poisoning detection
Tool-input exfiltration & allowlist enforcement
Cross-agent injection & trust-boundary escalation
Image & multimodal scanning
Warn or block mode
Custom guardrail policies
Prompt management
Versioned prompt registry
Fetch by slug from the SDK
Version history & one-click restore
Environment deploys (dev / staging / prod)
Prompts reusable as custom judges
Datasets
Golden datasets built from traces
Whole-trajectory capture (steps, tools, MCP, media)
Connect Agents auto-sync
Batch eval & security runs over a dataset
Run-vs-run regression comparison
Per-run judge & jury selection
Optimization
Trace-driven cache profiling
Prompt response caching
Embedding caching
Observe mode (measure savings before intercepting)
Cache hit-rate dashboard
Optimization Insights: cache candidates & projected savings
Cost hotspots: slowest calls, error rates, top spenders
Team & Access
Seats	1	3	10	25	Unlimited
API keys	1	3	5	15	50
Multiple organizations
Teammate invitations & roles
SSO
SAML / SCIM provisioning
Compliance exports
Support
Community support
Slack alerts on eval & security events
Email support		72h SLA	48h SLA	24h SLA	Dedicated
Dedicated onboarding
Deployment
Cloud (managed)
VPC / on-prem

The right plan for every team.

Free

Starter

Team

Growth

Enterprise

Priced by what an evaluation actually costs

Every feature, side by side.

Two calls. Security and speed, handled.

fluiq.secure()

fluiq.optimize()

Questions about billing, evals, or security?

Ship safer AI, faster.