Interface StreamOptions

Options for streaming generation.

interface StreamOptions {
    provider?: ProviderId;
    timeout?: number;
    retries?: number;
    localBaseUrl?: string;
    localApiKey?: string;
    localTimeout?: number;
    temperature?: number;
    creativity?: CreativityLevel;
    maxTokens?: number;
    topP?: number;
    stop?: string | string[];
    frequencyPenalty?: number;
    presencePenalty?: number;
    reasoning?: ReasoningLevel;
    webSearch?: boolean;
    thinkingBudget?: number;
    extra?: Record<string, unknown>;
    onDelta?: (delta: string) => void;
    onComplete?: (fullContent: string) => void;
    onError?: (error: Error) => void;
}

Hierarchy

GenerateOptions
- StreamOptions
  - GenStreamOptions

Properties

`Optional` provider

provider?: ProviderId

Override the provider (auto-detected from model by default).

`Optional` timeout

timeout?: number

Request timeout in milliseconds.

Default

`Optional` retries

retries?: number

Number of retries on failure.

Default

`Optional` localBaseUrl

localBaseUrl?: string

Base URL for the local provider. Overrides the LOCAL_BASE_URL environment variable. Required when using provider: 'local' without LOCAL_BASE_URL set.

Examples

'http://127.0.0.1:11434/v1'  // Ollama
'http://127.0.0.1:8765/v1'   // mlx-lm / oMLX

`Optional` localApiKey

localApiKey?: string

API key for the local provider. Overrides the LOCAL_API_KEY environment variable. Defaults to "local" for servers that don't validate keys.

`Optional` localTimeout

localTimeout?: number

Request timeout in milliseconds for the local provider. Defaults to 60,000 ms (60 seconds). Increase it for slow or large local models.
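The precedence described for the three local options (explicit option, then environment variable, then built-in default) can be sketched as follows. This is a minimal illustration, not the library's implementation: `resolveLocalConfig`, `LocalOptions`, and `ResolvedLocalConfig` are hypothetical names, and the environment is passed in as a plain object so the sketch stays self-contained.

```typescript
// Hypothetical subset of StreamOptions covering only the local-provider fields.
interface LocalOptions {
  localBaseUrl?: string;
  localApiKey?: string;
  localTimeout?: number;
}

// Hypothetical shape for the resolved settings; not part of the documented API.
interface ResolvedLocalConfig {
  baseUrl: string;
  apiKey: string;
  timeoutMs: number;
}

// Hypothetical helper: an explicit option wins over the environment variable,
// which wins over the documented default.
function resolveLocalConfig(
  opts: LocalOptions,
  env: Record<string, string | undefined>,
): ResolvedLocalConfig {
  const baseUrl = opts.localBaseUrl ?? env["LOCAL_BASE_URL"];
  if (!baseUrl) {
    // The docs state a base URL is required when LOCAL_BASE_URL is not set.
    throw new Error("localBaseUrl or LOCAL_BASE_URL is required for the local provider");
  }
  return {
    baseUrl,
    apiKey: opts.localApiKey ?? env["LOCAL_API_KEY"] ?? "local", // documented default key
    timeoutMs: opts.localTimeout ?? 60_000, // documented default timeout
  };
}

// Example: the option overrides the environment variable.
const localConfig = resolveLocalConfig(
  { localBaseUrl: "http://127.0.0.1:11434/v1" },
  { LOCAL_BASE_URL: "http://127.0.0.1:8765/v1" },
);
console.log(localConfig.baseUrl);
```

How the library reacts to a missing base URL (for example, which error it raises) is not stated in this page; the thrown `Error` above is an assumption.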

`Optional` temperature

temperature?: number

Sampling temperature (0-2). Lower values are more deterministic, higher values more creative. If undefined, the provider default is used.

Semantic presets are also available through the creativity option.

`Optional` creativity

creativity?: CreativityLevel

Semantic creativity level. Alternative to raw temperature values.

'precise': Temperature 0 (deterministic)
'balanced': Temperature 0.7 (default)
'creative': Temperature 1.0
'wild': Temperature 1.5

If both temperature and creativity are set, temperature takes precedence.
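The preset table and the precedence rule above can be illustrated with a short sketch. `resolveTemperature` and the local `CreativityLevel` type are hypothetical stand-ins written for this example; the library's own type and internal logic may differ. The sketch also assumes that when neither option is set the result stays undefined (provider default), as the temperature entry describes.

```typescript
// Hypothetical local copy of the preset names listed above.
type CreativityLevel = "precise" | "balanced" | "creative" | "wild";

// Temperatures taken from the documented preset list.
const CREATIVITY_TEMPERATURE: Record<CreativityLevel, number> = {
  precise: 0,
  balanced: 0.7,
  creative: 1.0,
  wild: 1.5,
};

// Hypothetical helper: an explicit temperature takes precedence over creativity.
function resolveTemperature(opts: {
  temperature?: number;
  creativity?: CreativityLevel;
}): number | undefined {
  // Compare against undefined so that an explicit temperature of 0 is respected.
  if (opts.temperature !== undefined) return opts.temperature;
  if (opts.creativity !== undefined) return CREATIVITY_TEMPERATURE[opts.creativity];
  return undefined; // assumption: fall through to the provider default
}

console.log(resolveTemperature({ creativity: "wild" }));
console.log(resolveTemperature({ temperature: 0.2, creativity: "wild" }));
```

Checking `!== undefined` rather than truthiness matters here, because `0` is a valid temperature and would otherwise be skipped in favor of the preset.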

`Optional` maxTokens

maxTokens?: number

Maximum tokens to generate. undefined = provider default.

`Optional` topP

topP?: number

Top-p (nucleus) sampling. undefined = provider default.

`Optional` stop

stop?: string | string[]

Stop sequences, given as a single string or an array of strings.

`Optional` frequencyPenalty

frequencyPenalty?: number

Frequency penalty (-2 to 2).

`Optional` presencePenalty

presencePenalty?: number

Presence penalty (-2 to 2).

`Optional` reasoning

reasoning?: ReasoningLevel

Unified reasoning level across providers. Maps automatically to provider-specific implementations:

'off': No reasoning (OpenAI: none, Anthropic: no thinking, xAI: *-non-reasoning)
'low': Light reasoning (OpenAI: low, Anthropic: 2048 tokens)
'medium': Moderate reasoning (OpenAI: medium, Anthropic: 8192 tokens)
'high': Deep reasoning (OpenAI: high, Anthropic: 32768 tokens, xAI: *-reasoning)

Note: Not all models support reasoning. For unsupported models, this is ignored.
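As a rough illustration of the mapping listed above, the sketch below encodes the OpenAI and Anthropic columns as lookup tables. The names `mapReasoning`, `OPENAI_EFFORT`, and `ANTHROPIC_THINKING_TOKENS` are hypothetical and exist only for this example; how the library actually builds provider requests is not shown in this page. The xAI column is left out because the list above only specifies it for 'off' and 'high'.

```typescript
// Hypothetical local copy of the documented levels.
type ReasoningLevel = "off" | "low" | "medium" | "high";

// OpenAI effort values as listed in the mapping above.
const OPENAI_EFFORT: Record<ReasoningLevel, string> = {
  off: "none",
  low: "low",
  medium: "medium",
  high: "high",
};

// Anthropic thinking budgets in tokens; undefined stands for "no thinking".
const ANTHROPIC_THINKING_TOKENS: Record<ReasoningLevel, number | undefined> = {
  off: undefined,
  low: 2048,
  medium: 8192,
  high: 32768,
};

// Hypothetical helper that reports what a level would translate to per provider.
function mapReasoning(level: ReasoningLevel): {
  openaiEffort: string;
  anthropicThinkingTokens: number | undefined;
} {
  return {
    openaiEffort: OPENAI_EFFORT[level],
    anthropicThinkingTokens: ANTHROPIC_THINKING_TOKENS[level],
  };
}

console.log(mapReasoning("medium"));
```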

`Optional` webSearch

webSearch?: boolean

Enable web search (xAI only). Ignored for other providers.

`Optional` thinkingBudget

thinkingBudget?: number

Thinking budget in tokens for local models that support it (e.g. Qwen3.5 via oMLX). When set, the model will produce reasoning/thinking content before the final answer. Thinking content is streamed separately via reasoningContent and does not mix with the visible response.

Only applies to local provider. Ignored for cloud providers.

`Optional` extra

extra?: Record<string, unknown>

Arbitrary additional options passed to the provider. Use for bleeding-edge features not yet in the typed interface.

`Optional` onDelta

onDelta?: (delta: string) => void

Callback for each content delta.

`Optional` onComplete

onComplete?: (fullContent: string) => void

Callback when streaming completes.

`Optional` onError

onError?: (error: Error) => void

Callback on error during streaming.
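To show how the three callbacks might fit together, the sketch below drives them from a simulated stream instead of a real provider call. `simulateStream` and `StreamCallbacks` are hypothetical names invented for this example, and the sketch assumes that the string passed to onComplete is the concatenation of all deltas; this page does not confirm that detail.

```typescript
// Hypothetical subset of StreamOptions containing only the callbacks.
interface StreamCallbacks {
  onDelta?: (delta: string) => void;
  onComplete?: (fullContent: string) => void;
  onError?: (error: Error) => void;
}

// Hypothetical stand-in for a streaming call: feeds fixed chunks to the callbacks.
function simulateStream(chunks: string[], callbacks: StreamCallbacks): void {
  let fullContent = "";
  try {
    for (const chunk of chunks) {
      fullContent += chunk;
      callbacks.onDelta?.(chunk); // one call per content delta
    }
    callbacks.onComplete?.(fullContent); // assumption: receives the joined deltas
  } catch (err) {
    // Normalize anything thrown into an Error before reporting it.
    callbacks.onError?.(err instanceof Error ? err : new Error(String(err)));
  }
}

// Usage: collect deltas as they arrive and keep the final text.
const deltas: string[] = [];
let finalText = "";
simulateStream(["Hel", "lo"], {
  onDelta: (delta) => { deltas.push(delta); },
  onComplete: (fullContent) => { finalText = fullContent; },
  onError: (error) => { console.error("stream failed:", error.message); },
});
console.log(deltas.length, finalText);
```

In real use the same three callbacks would be passed in the options object of the library's streaming call, alongside fields such as provider or temperature.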
