Complete reference for all agent configuration options
tools
: Registry of our tools the agent can call. Example for custom toolsbrowser
: Browser object where you can specify the browser settings.output_model_schema
: Pydantic model class for structured output validation. Exampleuse_vision
(default: True
): Enable/disable vision capabilities for processing screenshotsvision_detail_level
(default: 'auto'
): Screenshot detail level - 'low'
, 'high'
, or 'auto'
page_extraction_llm
: Separate LLM model for page content extraction. You can choose a small & fast model because it only needs to extract text from the page (default: same as llm
)initial_actions
: List of actions to run before the main task without LLM. Examplemax_actions_per_step
(default: 10
): Maximum actions per step, e.g. for form filling the agent can output 10 fields at once. We execute the actions until the page changes.max_failures
(default: 3
): Maximum retries for steps with errorsuse_thinking
(default: True
): Controls whether the agent uses its internal “thinking” field for explicit reasoning steps.flash_mode
(default: False
): Fast mode that skips evaluation, next goal and thinking and only uses memory. If flash_mode
is enabled, it overrides use_thinking
and disables the thinking process entirely. Exampleoverride_system_message
: Completely replace the default system prompt.extend_system_message
: Add additional instructions to the default system prompt. Examplesave_conversation_path
: Path to save complete conversation historysave_conversation_path_encoding
(default: 'utf-8'
): Encoding for saved conversationsavailable_file_paths
: List of file paths the agent can accesssensitive_data
: Dictionary of sensitive data to handle carefully. Examplegenerate_gif
(default: False
): Generate GIF of agent actions. Set to True
or string pathinclude_attributes
: List of HTML attributes to include in page analysismax_history_items
: Maximum number of last steps to keep in the LLM memory. If None
, we keep all steps.llm_timeout
(default: 90
): Timeout in seconds for LLM callsstep_timeout
(default: 120
): Timeout in seconds for each stepdirectly_open_url
(default: True
): If we detect a url in the task, we directly open it.calculate_cost
(default: False
): Calculate and track API costsdisplay_files_in_done_text
(default: True
): Show file information in completion messagescontroller
: Alias for tools
for backwards compatibility.browser_session
: Alias for browser
for backwards compatibility.