You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Change max thinking_budget from 24576 to 32768
- Support all Gemini 3 models (not just Pro) for thinking_level
- Make thinking config independent of include_thoughts
- Update documentation to clarify model-specific settings
Co-authored-by: owndev <69784886+owndev@users.noreply.github.com>
# Note: This setting is ignored for Gemini 3 Pro models which use GOOGLE_THINKING_LEVEL instead
132
+
# Note: Gemini 3 models use GOOGLE_THINKING_LEVEL instead
133
133
GOOGLE_THINKING_BUDGET=-1
134
134
135
-
# Thinking level for Gemini 3 Pro models only
135
+
# Thinking level for Gemini 3 models only
136
136
# Valid values: "low", "high", or empty string for model default
137
137
# - "low": Minimizes latency and cost, suitable for simple tasks
138
138
# - "high": Maximizes reasoning depth, ideal for complex problem-solving
139
139
# Default: "" (empty, uses model default)
140
-
# Note: This setting is ignored for non-Gemini 3 Pro models
140
+
# Note: This setting is ignored for non-Gemini 3 models
141
141
GOOGLE_THINKING_LEVEL=""
142
142
143
143
# Enable streaming responses globally
@@ -252,13 +252,16 @@ The Google Gemini pipeline supports advanced thinking configuration to control h
252
252
> [!Note]
253
253
> For detailed information about thinking capabilities, see the [Google Gemini Thinking Documentation](https://ai.google.dev/gemini-api/docs/thinking).
254
254
255
-
### Thinking Levels (Gemini 3 Pro only)
255
+
### Thinking Levels (Gemini 3 models)
256
256
257
-
Gemini 3 Pro models support the `thinking_level` parameter, which controls the depth of reasoning:
257
+
Gemini 3 models support the `thinking_level` parameter, which controls the depth of reasoning:
258
258
259
259
-**`"low"`**: Minimizes latency and cost, suitable for simple tasks, chat, or high-throughput APIs.
260
260
-**`"high"`**: Maximizes reasoning depth, ideal for complex problem-solving, code analysis, and agentic workflows.
261
261
262
+
> [!Note]
263
+
> Gemini 3 models use `thinking_level` and do **not** use `thinking_budget`. The thinking budget setting is ignored for Gemini 3 models.
0 commit comments