Skip to content

Cerebras

Cerebras offers the fastest LLM inference available — significantly faster than GPU-based providers, ideal for real-time applications.

Setup

Get an API key at cloud.cerebras.ai.

const agent = new Agent({
name: 'my-agent',
model: {
provider: 'cerebras',
model: 'llama3.3-70b',
},
systemPrompt: 'You are a helpful assistant.',
});
Terminal window
npx wrangler secret put CEREBRAS_API_KEY
ModelBest for
llama3.3-70bBest quality (recommended)
llama3.1-8bFastest, lightweight tasks

Environment variable

interface Env {
CEREBRAS_API_KEY: string;
}