Navigation & Browser Control
search
- Search queries (DuckDuckGo, Google, Bing)navigate
- Navigate to URLsgo_back
- Go back in browser historywait
- Wait for specified seconds
Page Interaction
click
- Click elements by their indexinput
- Input text into form fieldsupload_file
- Upload files to file inputsscroll
- Scroll the page up/downfind_text
- Scroll to specific text on pagesend_keys
- Send special keys (Enter, Escape, etc.)
JavaScript Execution
evaluate
- Execute custom JavaScript code on the page (for advanced interactions, shadow DOM, custom selectors, data extraction)
Tab Management
switch
- Switch between browser tabsclose
- Close browser tabs
Content Extraction
extract
- Extract data from webpages using LLM
Visual Analysis
screenshot
- Request a screenshot in your next browser state for visual confirmation
Form Controls
dropdown_options
- Get dropdown option valuesselect_dropdown
- Select dropdown options
File Operations
write_file
- Write content to filesread_file
- Read file contentsreplace_file
- Replace text in files
Task Completion
done
- Complete the task (always available)