OptionalproviderOverride the provider (auto-detected from model by default).
OptionaltimeoutRequest timeout in milliseconds.
OptionalretriesNumber of retries on failure.
OptionaltemperatureSampling temperature (0-2). Lower = more deterministic, higher = more creative. undefined = provider default.
Can also use semantic presets via creativity option.
OptionalcreativitySemantic creativity level. Alternative to raw temperature values.
If both temperature and creativity are set, temperature takes precedence.
OptionalmaxMaximum tokens to generate. undefined = provider default.
OptionaltopTop-p (nucleus) sampling. undefined = provider default.
OptionalstopStop sequences.
OptionalfrequencyFrequency penalty (-2 to 2).
OptionalpresencePresence penalty (-2 to 2).
OptionalreasoningUnified reasoning level across providers. Maps automatically to provider-specific implementations:
Note: Not all models support reasoning. For unsupported models, this is ignored.
OptionalwebEnable web search (xAI only). Ignored for other providers.
OptionalcacheEnable prompt caching (Anthropic only). Marks the system prompt for caching.
OptionalextraArbitrary additional options passed to the provider. Use for bleeding-edge features not yet in the typed interface.
Generation options for text/structured output.