Navigation & Browser Control
search - Search queries (DuckDuckGo, Google, Bing)
navigate - Navigate to URLs
go_back - Go back in browser history
wait - Wait for a specified number of seconds
Page Interaction
click - Click elements by their index
input - Input text into form fields
upload_file - Upload files to file inputs
scroll - Scroll the page up or down
find_text - Scroll to specific text on the page
send_keys - Send special keys (Enter, Escape, etc.)
JavaScript Execution
evaluate - Execute custom JavaScript code on the page (for advanced interactions, shadow DOM, custom selectors, data extraction)
Tab Management
switch - Switch between browser tabs
close - Close browser tabs
Content Extraction
extract - Extract data from webpages using an LLM
Visual Analysis
screenshot - Request a screenshot in your next browser state for visual confirmation
Form Controls
dropdown_options - Get dropdown option values
select_dropdown - Select dropdown options
File Operations
write_file - Write content to files
read_file - Read file contents
replace_file - Replace text in files
Task Completion
done - Complete the task (always available)
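
As a quick reference, the categories and tool names listed above can be captured in a small lookup table. The sketch below is only an illustration: the dictionary and the helper names `category_of` and `allowed_tools` are hypothetical and are not part of any library API. The one behavior it encodes from the list is that `done` is always available, so it is assumed here that it cannot be excluded.

```python
# Hypothetical catalogue: names and categories are copied from the list above.
# The structure and helper functions are illustrative, not a library API.
TOOLS_BY_CATEGORY = {
    "Navigation & Browser Control": ["search", "navigate", "go_back", "wait"],
    "Page Interaction": [
        "click", "input", "upload_file", "scroll", "find_text", "send_keys",
    ],
    "JavaScript Execution": ["evaluate"],
    "Tab Management": ["switch", "close"],
    "Content Extraction": ["extract"],
    "Visual Analysis": ["screenshot"],
    "Form Controls": ["dropdown_options", "select_dropdown"],
    "File Operations": ["write_file", "read_file", "replace_file"],
    "Task Completion": ["done"],
}


def category_of(tool_name):
    """Return the category a tool belongs to, or None if it is not listed."""
    for category, names in TOOLS_BY_CATEGORY.items():
        if tool_name in names:
            return category
    return None


def allowed_tools(excluded=()):
    """Return every listed tool except the excluded ones.

    Assumption: 'done' is kept even if excluded, because the list
    describes it as always available.
    """
    blocked = set(excluded) - {"done"}
    return [
        name
        for names in TOOLS_BY_CATEGORY.values()
        for name in names
        if name not in blocked
    ]
```

For example, `category_of("find_text")` returns `"Page Interaction"`, and `allowed_tools(excluded=["evaluate", "done"])` drops `evaluate` but still contains `done`.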