Optimization
fluiq.optimize() enables semantic caching on Fluiq-managed Redis. Calls with semantically similar prompts are served from cache — instant response, zero API cost.
One call to optimize() enables semantic caching for all traced OpenAI calls — chat, streaming, and embeddings.
Python
import openai
from fluiq import instrument, optimize
instrument(api_key="fl_...")
optimize() # semantic caching (Team+ plan)
client = openai.OpenAI()
# First call — hits OpenAI, response cached in Redis
r1 = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "What is machine learning?"}],
)
# Semantically similar call — served instantly from cache
r2 = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Explain machine learning to me"}],
)
# r2: same quality, zero latency, zero API cost