Skip to main content
  • search - Search queries (DuckDuckGo, Google, Bing)
  • navigate - Navigate to URLs
  • go_back - Go back in browser history
  • wait - Wait for specified seconds

Page Interaction

  • click - Click elements by their index
  • input - Input text into form fields
  • upload_file - Upload files to file inputs
  • scroll - Scroll the page up/down
  • find_text - Scroll to specific text on page
  • send_keys - Send special keys (Enter, Escape, etc.)

JavaScript Execution

  • evaluate - Execute custom JavaScript code on the page (for advanced interactions, shadow DOM, custom selectors, data extraction)

Tab Management

  • switch - Switch between browser tabs
  • close - Close browser tabs

Content Extraction

  • extract - Extract data from webpages using LLM

Visual Analysis

  • screenshot - Request a screenshot in your next browser state for visual confirmation

Form Controls

  • dropdown_options - Get dropdown option values
  • select_dropdown - Select dropdown options

File Operations

  • write_file - Write content to files
  • read_file - Read file contents
  • replace_file - Replace text in files

Task Completion

  • done - Complete the task (always available)
I