41 KiB
Cremote MCP Tools - LLM Usage Guide
This guide explains how LLMs can use the cremote MCP (Model Context Protocol) tools for web automation tasks.
🎉 Complete Web Automation Platform
The cremote MCP server provides 31 comprehensive web automation tools organized across 6 enhancement phases:
- Core Tools (10): Essential web automation capabilities
- Phase 1 (2): Element state checking and conditional logic
- Phase 2 (4): Enhanced data extraction and batch operations
- Phase 3 (3): Form analysis and bulk operations
- Phase 4 (4): Page state and metadata tools
- Phase 5 (4): Enhanced screenshots and file management
- Phase 6 (3): Accessibility tree support for semantic understanding
Available Tools (31 Total)
1. web_navigate_cremotemcp
Navigate to URLs and optionally take screenshots.
Parameters:
url
(required): The URL to navigate toscreenshot
(optional): Boolean, take a screenshot after navigationtab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_navigate_cremotemcp:
url: "https://example.com"
screenshot: true
2. web_interact_cremotemcp
Interact with web elements through various actions.
Parameters:
action
(required): One of "click", "fill", "submit", "upload", "select"selector
(required): CSS selector for the target elementvalue
(optional): Value for fill/upload/select actionstab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_interact_cremotemcp:
action: "click"
selector: "button.submit"
web_interact_cremotemcp:
action: "fill"
selector: "input[name='email']"
value: "user@example.com"
web_interact_cremotemcp:
action: "select"
selector: "#country"
value: "United States"
3. web_extract_cremotemcp
Extract data from web pages (HTML source, element content, or JavaScript execution).
Parameters:
type
(required): One of "source", "element", "javascript"selector
(optional): CSS selector (required for "element" type)code
(optional): JavaScript code (required for "javascript" type)tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_extract_cremotemcp:
type: "source"
web_extract_cremotemcp:
type: "element"
selector: "div.content"
web_extract_cremotemcp:
type: "javascript"
code: "document.title"
4. web_screenshot_cremotemcp
Take screenshots of web pages.
Parameters:
output
(required): File path where screenshot will be savedfull_page
(optional): Capture full page (default: false)tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_screenshot_cremotemcp:
output: "/tmp/page-screenshot.png"
full_page: true
5. web_manage_tabs_cremotemcp
Manage browser tabs (open, close, list, switch).
Parameters:
action
(required): One of "open", "close", "list", "switch"tab
(optional): Tab ID (required for "close" and "switch" actions)timeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_manage_tabs_cremotemcp:
action: "open"
web_manage_tabs_cremotemcp:
action: "list"
web_manage_tabs_cremotemcp:
action: "switch"
tab: "ABC123"
6. web_iframe_cremotemcp
Switch iframe context for subsequent operations.
Parameters:
action
(required): One of "enter", "exit"selector
(optional): Iframe CSS selector (required for "enter" action)tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_iframe_cremotemcp:
action: "enter"
selector: "iframe#payment-form"
timeout: 10
web_iframe_cremotemcp:
action: "exit"
Note: The timeout parameter is particularly important for iframe operations as they can hang if the iframe takes time to load or if the selector doesn't match any elements.
7. file_upload_cremotemcp
Upload files from the client to the container for use in form uploads.
Parameters:
local_path
(required): Path to the file on the client machinecontainer_path
(optional): Path where to store the file in the container (defaults to /tmp/filename)
Example Usage:
file_upload_cremotemcp:
local_path: "/home/user/document.pdf"
container_path: "/tmp/upload.pdf"
file_upload_cremotemcp:
local_path: "/home/user/image.jpg"
# Will default to /tmp/image.jpg in container
8. file_download_cremotemcp
Download files from the container to the client (e.g., downloaded files from browser).
Parameters:
container_path
(required): Path to the file in the containerlocal_path
(required): Path where to save the file on the client machine
Example Usage:
file_download_cremotemcp:
container_path: "/tmp/downloaded-report.pdf"
local_path: "/home/user/Downloads/report.pdf"
9. console_logs_cremotemcp
Get console logs from the browser tab for debugging.
Parameters:
tab
(optional): Specific tab ID to useclear
(optional): Clear logs after retrieval (default: false)
Example Usage:
console_logs_cremotemcp:
clear: true
console_logs_cremotemcp:
tab: "ABC123"
clear: false
10. web_element_check_cremotemcp
(New in Phase 1)
Check element states without interaction - perfect for conditional logic.
Parameters:
selector
(required): CSS selector for the element(s)check_type
(optional): Type of check - "exists", "visible", "enabled", "focused", "selected", "all" (default: "exists")tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_element_check_cremotemcp:
selector: "#submit-button"
check_type: "enabled"
web_element_check_cremotemcp:
selector: ".error-message"
check_type: "visible"
web_element_check_cremotemcp:
selector: "input[type='checkbox']"
check_type: "all"
Response Format: The tool returns a JSON object with boolean values for each check:
{
"exists": true,
"visible": true,
"enabled": false,
"focused": false,
"selected": true,
"count": 1
}
Use Cases:
- Check if a form is ready for submission
- Verify if error messages are displayed
- Confirm element visibility before interaction
- Count matching elements
- Implement conditional workflows
11. web_element_attributes_cremotemcp
(New in Phase 1)
Get detailed element information including attributes, properties, and computed styles.
Parameters:
selector
(required): CSS selector for the elementattributes
(optional): Attributes to retrieve - "all" or comma-separated list (default: "all")tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Attribute Prefixes:
- No prefix: HTML attributes (e.g., "id", "class", "href")
style_
: Computed CSS styles (e.g., "style_display", "style_color")prop_
: JavaScript properties (e.g., "prop_textContent", "prop_value")
Example Usage:
web_element_attributes_cremotemcp:
selector: "#user-profile"
attributes: "all"
web_element_attributes_cremotemcp:
selector: "input[name='email']"
attributes: "value,placeholder,type"
web_element_attributes_cremotemcp:
selector: ".status-indicator"
attributes: "class,style_color,style_display,prop_textContent"
Response Format: Returns a JSON object with requested attributes:
{
"id": "user-profile",
"class": "profile-card active",
"data-user-id": "12345",
"style_display": "block",
"style_color": "rgb(0, 0, 0)",
"prop_textContent": "John Doe"
}
Use Cases:
- Extract form field values
- Get element styling information
- Retrieve data attributes
- Analyze element properties for debugging
12. console_command_cremotemcp
Execute JavaScript commands in the browser console.
Parameters:
command
(required): JavaScript command to executetab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
console_command_cremotemcp:
command: "console.log('Hello from MCP!')"
console_command_cremotemcp:
command: "document.querySelector('h1').textContent"
timeout: 10
13. web_extract_multiple_cremotemcp
(New in Phase 2)
Extract data from multiple selectors in a single call, reducing round trips and improving efficiency.
Parameters:
selectors
(required): Object with keys as labels and values as CSS selectorstab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_extract_multiple_cremotemcp:
selectors:
title: "h1"
price: ".price"
description: ".product-description"
availability: ".stock-status"
Response Format: Returns structured results with both successful extractions and any errors:
{
"results": {
"title": "Product Name",
"price": "$29.99",
"description": "Product description text",
"availability": "In Stock"
},
"errors": {}
}
Use Cases:
- Extract multiple data points from product pages
- Gather form field values efficiently
- Collect page metadata in one call
- Reduce API calls for complex data extraction
14. web_extract_links_cremotemcp
(New in Phase 2)
Extract all links from a page with powerful filtering options.
Parameters:
container_selector
(optional): CSS selector to limit search to a containerhref_pattern
(optional): Regex pattern to filter links by hreftext_pattern
(optional): Regex pattern to filter links by text contenttab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Extract all links
web_extract_links_cremotemcp: {}
# Extract only HTTPS links from navigation
web_extract_links_cremotemcp:
container_selector: "nav"
href_pattern: "https://.*"
# Extract download links
web_extract_links_cremotemcp:
text_pattern: ".*[Dd]ownload.*"
Response Format: Returns detailed link information:
{
"links": [
{
"href": "https://example.com/page1",
"text": "Page 1",
"title": "Go to Page 1",
"target": "_blank"
}
],
"count": 1
}
Use Cases:
- Discover all navigation links
- Find download or external links
- Extract social media links
- Analyze site structure
15. web_extract_table_cremotemcp
(New in Phase 2)
Extract table data as structured JSON with optional header processing.
Parameters:
selector
(required): CSS selector for the table elementinclude_headers
(optional): Whether to extract headers for structured data (default: true)tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_extract_table_cremotemcp:
selector: "#data-table"
include_headers: true
Response Format: Returns both raw rows and structured data when headers are included:
{
"headers": ["Name", "Age", "City"],
"rows": [
["John", "25", "New York"],
["Jane", "30", "London"]
],
"data": [
{"Name": "John", "Age": "25", "City": "New York"},
{"Name": "Jane", "Age": "30", "City": "London"}
],
"count": 2
}
Use Cases:
- Extract pricing tables
- Process data tables for analysis
- Convert HTML tables to structured data
- Export table data for processing
16. web_extract_text_cremotemcp
(New in Phase 2)
Extract text content with optional pattern matching and different extraction types.
Parameters:
selector
(required): CSS selector for elements to extract text frompattern
(optional): Regex pattern to match within the extracted textextract_type
(optional): Type of text extraction - "text", "innerText", "textContent" (default: "textContent")tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Extract all text content
web_extract_text_cremotemcp:
selector: ".content"
# Extract phone numbers using regex
web_extract_text_cremotemcp:
selector: ".contact-info"
pattern: "\\d{3}-\\d{3}-\\d{4}"
# Extract visible text only
web_extract_text_cremotemcp:
selector: ".description"
extract_type: "innerText"
Response Format: Returns text content and pattern matches:
{
"text": "Contact us at 555-123-4567 or 555-987-6543",
"matches": ["555-123-4567", "555-987-6543"],
"count": 1
}
Use Cases:
- Extract contact information
- Find specific data patterns (emails, phones, dates)
- Get clean text content
- Process text for analysis
17. web_form_analyze_cremotemcp
(New in Phase 3)
Analyze forms completely to understand their structure, fields, and submission requirements.
Parameters:
selector
(required): CSS selector for the form elementtab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Analyze a registration form
web_form_analyze_cremotemcp:
selector: "#registration-form"
# Analyze any form on the page
web_form_analyze_cremotemcp:
selector: "form"
Response Format: Returns comprehensive form analysis:
{
"action": "/submit-registration",
"method": "POST",
"fields": [
{
"name": "username",
"type": "text",
"value": "",
"placeholder": "Enter username",
"required": true,
"disabled": false,
"readonly": false,
"selector": "[name='username']",
"label": "Username"
}
],
"field_count": 5,
"can_submit": true,
"submit_text": "Register"
}
Use Cases:
- Understand form structure before filling
- Identify required fields and validation rules
- Generate appropriate field selectors
- Plan form completion workflows
18. web_interact_multiple_cremotemcp
(New in Phase 3)
Perform multiple interactions in a single call for efficient batch operations.
Parameters:
interactions
(required): Array of interaction objectstab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Each interaction object contains:
selector
(required): CSS selector for the elementaction
(required): One of "click", "fill", "select", "check", "uncheck"value
(optional): Value for fill/select actions
Example Usage:
# Complete a login form in one call
web_interact_multiple_cremotemcp:
interactions:
- selector: "#username"
action: "fill"
value: "testuser"
- selector: "#password"
action: "fill"
value: "testpass"
- selector: "#remember-me"
action: "check"
- selector: "#login-btn"
action: "click"
Response Format: Returns results for each interaction:
{
"results": [
{
"selector": "#username",
"action": "fill",
"success": true
},
{
"selector": "#password",
"action": "fill",
"success": true
}
],
"success_count": 4,
"error_count": 0,
"total_count": 4
}
Use Cases:
- Complete forms efficiently
- Perform multiple clicks/selections
- Batch checkbox/radio button operations
- Reduce round trips for complex interactions
19. web_form_fill_bulk_cremotemcp
(New in Phase 3)
Fill entire forms with key-value pairs in a single operation.
Parameters:
fields
(required): Object mapping field names/selectors to valuesform_selector
(optional): CSS selector for the form elementtab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Fill a contact form
web_form_fill_bulk_cremotemcp:
form_selector: "#contact-form"
fields:
name: "John Doe"
email: "john@example.com"
message: "Hello, this is a test message."
# Fill fields across the entire page
web_form_fill_bulk_cremotemcp:
fields:
username: "testuser"
password: "testpass"
email: "test@example.com"
Response Format: Returns results for each field:
{
"filled_fields": [
{
"selector": "[name='name']",
"action": "fill",
"success": true
},
{
"selector": "[name='email']",
"action": "fill",
"success": true
}
],
"success_count": 3,
"error_count": 0,
"total_count": 3
}
Use Cases:
- Quick form completion
- Bulk data entry
- Form testing with multiple datasets
- Automated registration workflows
20. web_page_info_cremotemcp
(New in Phase 4)
Get comprehensive page metadata and state information.
Parameters:
tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_page_info_cremotemcp:
tab: "tab-123"
Response Format: Returns detailed page information:
{
"title": "Example Page",
"url": "https://example.com",
"loading_state": "complete",
"ready_state": "complete",
"domain": "example.com",
"protocol": "https:",
"charset": "UTF-8",
"cookie_enabled": true,
"online_status": true
}
21. web_viewport_info_cremotemcp
(New in Phase 4)
Get viewport and scroll information.
Parameters:
tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_viewport_info_cremotemcp:
tab: "tab-123"
Response Format: Returns viewport dimensions and scroll data:
{
"width": 1920,
"height": 1080,
"scroll_x": 0,
"scroll_y": 150,
"scroll_width": 1920,
"scroll_height": 2400,
"device_pixel_ratio": 1.0,
"orientation": "landscape-primary"
}
22. web_performance_metrics_cremotemcp
(New in Phase 4)
Get page performance metrics.
Parameters:
tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
web_performance_metrics_cremotemcp:
tab: "tab-123"
Response Format: Returns performance data:
{
"navigation_start": 1692123456789,
"load_event_end": 1692123457234,
"load_time": 445,
"dom_load_time": 234,
"resource_count": 15,
"js_heap_size_used": 2048576
}
23. web_content_check_cremotemcp
(New in Phase 4)
Check for specific content types and loading states.
Parameters:
type
(required): Content type to check ("images", "scripts", "styles", "forms", "links", "iframes", "errors")tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Check image loading status
web_content_check_cremotemcp:
type: "images"
# Check for errors
web_content_check_cremotemcp:
type: "errors"
Response Format: Returns content verification results:
{
"type": "images",
"images_loaded": 8,
"images_total": 10
}
24. web_screenshot_element_cremotemcp
(New in Phase 5)
Take a screenshot of a specific element on the page.
Parameters:
selector
(required): CSS selector for the element to screenshotoutput
(required): Path where to save the screenshottab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Screenshot a specific element
web_screenshot_element_cremotemcp:
selector: "#main-content"
output: "/tmp/element-screenshot.png"
# Screenshot a form
web_screenshot_element_cremotemcp:
selector: "form.login-form"
output: "/tmp/login-form.png"
Key Features:
- Automatically scrolls element into view
- Captures only the specified element
- Handles element positioning and sizing
25. web_screenshot_enhanced_cremotemcp
(New in Phase 5)
Take an enhanced screenshot with metadata.
Parameters:
output
(required): Path where to save the screenshotfull_page
(optional): Capture full page (default: false)tab
(optional): Specific tab ID to usetimeout
(optional): Timeout in seconds (default: 5)
Example Usage:
# Enhanced screenshot with metadata
web_screenshot_enhanced_cremotemcp:
output: "/tmp/enhanced-screenshot.png"
full_page: true
# Viewport screenshot with metadata
web_screenshot_enhanced_cremotemcp:
output: "/tmp/viewport-screenshot.png"
full_page: false
Response Format: Returns screenshot metadata:
{
"timestamp": "2025-08-16T10:30:00Z",
"url": "https://example.com",
"title": "Example Page",
"viewport_size": {
"width": 1920,
"height": 1080
},
"full_page": true,
"file_path": "/tmp/enhanced-screenshot.png",
"file_size": 245760,
"resolution": {
"width": 1920,
"height": 1080
}
}
26. file_operations_bulk_cremotemcp
(New in Phase 5)
Perform bulk file operations (upload/download multiple files).
Parameters:
operation
(required): Operation type ("upload" or "download")files
(required): Array of file operationstimeout
(optional): Timeout in seconds (default: 30)
File Operation Object:
local_path
(required): Path on client machinecontainer_path
(required): Path in containeroperation
(optional): Override operation type for this file
Example Usage:
# Bulk upload files
file_operations_bulk_cremotemcp:
operation: "upload"
files:
- local_path: "/local/file1.txt"
container_path: "/tmp/file1.txt"
- local_path: "/local/file2.txt"
container_path: "/tmp/file2.txt"
timeout: 30
# Bulk download files
file_operations_bulk_cremotemcp:
operation: "download"
files:
- local_path: "/local/downloaded1.txt"
container_path: "/tmp/file1.txt"
- local_path: "/local/downloaded2.txt"
container_path: "/tmp/file2.txt"
Response Format: Returns detailed operation results:
{
"successful": [
{
"local_path": "/local/file1.txt",
"container_path": "/tmp/file1.txt",
"operation": "upload",
"size": 1024
}
],
"failed": [
{
"local_path": "/local/file2.txt",
"container_path": "/tmp/file2.txt",
"operation": "upload",
"error": "file not found"
}
],
"summary": {
"total": 2,
"successful": 1,
"failed": 1
}
}
27. file_management_cremotemcp
(New in Phase 5)
Manage files (cleanup, list, get info).
Parameters:
operation
(required): Management operation ("cleanup", "list", "info")pattern
(optional): File pattern for cleanup/list, or file path for infomax_age
(optional): Max age in hours for cleanup (default: 24)
Example Usage:
# Cleanup old temporary files
file_management_cremotemcp:
operation: "cleanup"
pattern: "/tmp/cremote-*"
max_age: "24"
# List files in directory
file_management_cremotemcp:
operation: "list"
pattern: "/tmp/*"
# Get info about specific file
file_management_cremotemcp:
operation: "info"
pattern: "/tmp/specific-file.txt"
Response Format: Returns file management results:
{
"operation": "cleanup",
"cleaned": [
"/tmp/cremote-old-file1.txt",
"/tmp/cremote-old-file2.txt"
],
"summary": {
"files_cleaned": 2,
"total_size_freed": 2048,
"cutoff_time": "2025-08-15T10:30:00Z"
}
}
Common Usage Patterns
1. Basic Web Navigation
# Navigate to a website
web_navigate_cremotemcp:
url: "https://example.com"
screenshot: true
2. Form Interaction Sequence
# 1. Navigate to the page
web_navigate_cremotemcp:
url: "https://example.com/login"
# 2. Fill username field
web_interact_cremotemcp:
action: "fill"
selector: "input[name='username']"
value: "myusername"
# 3. Fill password field
web_interact_cremotemcp:
action: "fill"
selector: "input[name='password']"
value: "mypassword"
# 4. Submit the form
web_interact_cremotemcp:
action: "submit"
selector: "form"
3. Clicking Elements
# Click a button
web_interact_cremotemcp:
action: "click"
selector: "button#submit-btn"
# Click a link
web_interact_cremotemcp:
action: "click"
selector: "a[href='/dashboard']"
4. Smart Form Handling (Phase 3 Pattern)
# 1. Analyze the form first
web_form_analyze_cremotemcp:
selector: "#registration-form"
# 2. Fill the form efficiently
web_form_fill_bulk_cremotemcp:
form_selector: "#registration-form"
fields:
username: "newuser123"
email: "user@example.com"
password: "securepass"
confirm_password: "securepass"
# 3. Submit the form
web_interact_cremotemcp:
action: "click"
selector: "button[type='submit']"
5. Batch Operations (Phase 3 Pattern)
# Complete multiple actions efficiently
web_interact_multiple_cremotemcp:
interactions:
- selector: "#agree-terms"
action: "check"
- selector: "#newsletter"
action: "uncheck"
- selector: "#country"
action: "select"
value: "United States"
- selector: "#submit-btn"
action: "click"
6. Complex Form Workflows (Phase 3 Pattern)
# 1. Navigate to registration page
web_navigate_cremotemcp:
url: "https://example.com/register"
# 2. Analyze the form structure
web_form_analyze_cremotemcp:
selector: "form"
# 3. Fill all fields at once
web_form_fill_bulk_cremotemcp:
fields:
first_name: "John"
last_name: "Doe"
email: "john.doe@example.com"
phone: "555-123-4567"
company: "Acme Corp"
# 4. Handle checkboxes and selections
web_interact_multiple_cremotemcp:
interactions:
- selector: "#terms-agreement"
action: "check"
- selector: "#marketing-emails"
action: "uncheck"
- selector: "#account-type"
action: "select"
value: "business"
# 5. Submit and verify
web_interact_cremotemcp:
action: "click"
selector: "button[type='submit']"
7. Page State Monitoring (Phase 4 Pattern)
# 1. Navigate to a page
web_navigate_cremotemcp:
url: "https://example.com/dashboard"
# 2. Get comprehensive page information
web_page_info_cremotemcp:
timeout: 5
# 3. Check viewport and scroll position
web_viewport_info_cremotemcp:
timeout: 5
# 4. Verify content is loaded
web_content_check_cremotemcp:
type: "images"
# 5. Check for any errors
web_content_check_cremotemcp:
type: "errors"
# 6. Get performance metrics
web_performance_metrics_cremotemcp:
timeout: 5
Best Practices for LLMs
1. Always Start with Navigation
Before interacting with elements, navigate to the target page:
web_navigate_cremotemcp:
url: "https://target-website.com"
2. Use Specific CSS Selectors
Be as specific as possible with selectors to avoid ambiguity:
- Good:
input[name='email']
,button.primary-submit
- Avoid:
input
,button
3. Take Screenshots for Debugging
When troubleshooting or documenting, use screenshots:
web_navigate_cremotemcp:
url: "https://example.com"
screenshot: true
4. Handle Timeouts Appropriately
For slow-loading pages, increase timeout:
web_navigate_cremotemcp:
url: "https://slow-website.com"
timeout: 10
5. Sequential Operations
Perform operations in logical sequence:
- Navigate to page
- Fill required fields
- Submit forms
- Navigate to next page if needed
Error Handling
Common Error Scenarios:
- Element not found: Selector doesn't match any elements
- Timeout: Page takes too long to load or element to appear
- Navigation failed: URL is invalid or unreachable
Troubleshooting Tips:
- Verify the URL is correct and accessible
- Check CSS selectors using browser developer tools
- Increase timeout for slow-loading content
- Take screenshots to see current page state
Tab Management
The tools automatically manage browser tabs:
- If no tab is specified, a new tab is created automatically
- Tab IDs are returned in responses for reference
- Multiple tabs can be managed by specifying tab IDs
Security Considerations
Safe Practices:
- Only navigate to trusted websites
- Be cautious with form submissions
- Avoid entering sensitive information in examples
- Use screenshots sparingly to avoid exposing sensitive data
Limitations:
- Cannot bypass CAPTCHA or other anti-automation measures
- Subject to same-origin policy restrictions
- May not work with heavily JavaScript-dependent sites
Example: Complete Web Automation Task
Here's a complete example of automating a web form:
# Step 1: Navigate to the form page
web_navigate_cremotemcp:
url: "https://example.com/contact"
screenshot: true
# Step 2: Fill out the contact form
web_interact_cremotemcp:
action: "fill"
selector: "input[name='name']"
value: "John Doe"
web_interact_cremotemcp:
action: "fill"
selector: "input[name='email']"
value: "john@example.com"
web_interact_cremotemcp:
action: "fill"
selector: "textarea[name='message']"
value: "Hello, this is a test message."
# Step 3: Submit the form
web_interact_cremotemcp:
action: "submit"
selector: "form#contact-form"
# Step 4: Take a screenshot of the result
web_navigate_cremotemcp:
url: "current" # Stay on current page
screenshot: true
Example: File Upload Workflow
Here's how to upload a file and use it in a web form:
# Step 1: Upload a file from client to container
file_upload_cremotemcp:
local_path: "/home/user/resume.pdf"
container_path: "/tmp/resume.pdf"
# Step 2: Navigate to upload form
web_navigate_cremotemcp:
url: "https://jobsite.com/apply"
# Step 3: Fill out application form
web_interact_cremotemcp:
action: "fill"
selector: "input[name='name']"
value: "Jane Smith"
# Step 4: Upload the file using the container path
web_interact_cremotemcp:
action: "upload"
selector: "input[type='file']"
value: "/tmp/resume.pdf"
# Step 5: Submit the application
web_interact_cremotemcp:
action: "submit"
selector: "form.application"
Example: Download Files from Browser
Here's how to download files that the browser saves:
# Step 1: Navigate to download page
web_navigate_cremotemcp:
url: "https://example.com/reports"
# Step 2: Click download link
web_interact_cremotemcp:
action: "click"
selector: "a.download-report"
# Step 3: Wait a moment for download to complete
# (In practice, you might need to check for file existence)
# Step 4: Download the file from container to client
file_download_cremotemcp:
container_path: "/tmp/downloads/report.pdf"
local_path: "/home/user/Downloads/report.pdf"
Example: Console Debugging Workflow
Here's how to use console tools for debugging web automation:
# Step 1: Navigate to a page
web_navigate_cremotemcp:
url: "https://example.com/form"
# Step 2: Check what's in the console initially
console_logs_cremotemcp:
clear: false
# Step 3: Execute some debugging commands
console_command_cremotemcp:
command: "console.log('Starting form automation')"
console_command_cremotemcp:
command: "document.querySelectorAll('input').length"
# Step 4: Try to interact with form
web_interact_cremotemcp:
action: "fill"
selector: "input[name='email']"
value: "test@example.com"
# Step 5: Check for any console errors after interaction
console_logs_cremotemcp:
clear: false
# Step 6: Debug element selection
console_command_cremotemcp:
command: "document.querySelector('input[name=\"email\"]') ? 'Found' : 'Not found'"
# Step 7: Check element properties
console_command_cremotemcp:
command: "document.querySelector('input[name=\"email\"]').value"
Example: Phase 2 Enhanced Data Extraction
Here's how to use the new Phase 2 tools for efficient data extraction:
# Step 1: Navigate to a product page
web_navigate_cremotemcp:
url: "https://ecommerce.com/product/123"
# Step 2: Extract multiple data points in one call
web_extract_multiple_cremotemcp:
selectors:
title: "h1.product-title"
price: ".price-current"
description: ".product-description"
rating: ".rating-score"
availability: ".stock-status"
# Step 3: Extract all product images and links
web_extract_links_cremotemcp:
container_selector: ".product-gallery"
href_pattern: ".*\\.(jpg|png|gif).*"
# Step 4: Extract reviews table if present
web_extract_table_cremotemcp:
selector: "#reviews-table"
include_headers: true
# Step 5: Extract specific text patterns (like model numbers)
web_extract_text_cremotemcp:
selector: ".product-specs"
pattern: "Model: ([A-Z0-9-]+)"
Example: Comprehensive Site Analysis
Here's how to analyze a website's structure using Phase 2 tools:
# Step 1: Navigate to the homepage
web_navigate_cremotemcp:
url: "https://example.com"
# Step 2: Extract all navigation links
web_extract_links_cremotemcp:
container_selector: "nav"
# Step 3: Extract social media links
web_extract_links_cremotemcp:
href_pattern: ".*(facebook|twitter|linkedin|instagram).*"
# Step 4: Extract contact information
web_extract_text_cremotemcp:
selector: ".contact-info"
pattern: "\\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Z|a-z]{2,}\\b"
# Step 5: Extract phone numbers
web_extract_text_cremotemcp:
selector: "body"
pattern: "\\(?\\d{3}\\)?[-. ]?\\d{3}[-. ]?\\d{4}"
# Step 6: Extract key page elements in one call
web_extract_multiple_cremotemcp:
selectors:
title: "title"
heading: "h1"
meta_description: "meta[name='description']"
footer_text: "footer"
Phase 6: Accessibility Tree Support (3 Tools)
get_accessibility_tree_cremotemcp
Get the full accessibility tree for a page or with limited depth for semantic understanding.
Parameters:
tab
(optional): Tab ID, uses current tab if not specifieddepth
(optional): Maximum depth to retrieve, omit for full treetimeout
(optional): Timeout in seconds, default 5
Use Cases:
- Accessibility testing and compliance verification
- Understanding page structure for screen readers
- Semantic element discovery and analysis
Example:
get_accessibility_tree_cremotemcp:
depth: 3
timeout: 10
get_partial_accessibility_tree_cremotemcp
Get accessibility tree for a specific element and its relatives (ancestors, siblings, children).
Parameters:
selector
: CSS selector for the target element (required)tab
(optional): Tab ID, uses current tab if not specifiedfetch_relatives
(optional): Include relatives, default truetimeout
(optional): Timeout in seconds, default 5
Use Cases:
- Focused accessibility analysis of specific components
- Form accessibility structure analysis
- Widget accessibility verification
Example:
get_partial_accessibility_tree_cremotemcp:
selector: "form.login-form"
fetch_relatives: true
timeout: 10
query_accessibility_tree_cremotemcp
Query accessibility tree for nodes matching specific criteria (accessible name, role, or scope).
Parameters:
tab
(optional): Tab ID, uses current tab if not specifiedselector
(optional): CSS selector to limit search scopeaccessible_name
(optional): Accessible name to matchrole
(optional): ARIA role to match (e.g., "button", "textbox", "link")timeout
(optional): Timeout in seconds, default 5
Use Cases:
- Find elements by their accessible names (what screen readers announce)
- Locate elements by ARIA roles for semantic interaction
- Accessibility-aware element discovery and testing
Examples:
# Find all buttons on the page
query_accessibility_tree_cremotemcp:
role: "button"
# Find submit button by accessible name
query_accessibility_tree_cremotemcp:
accessible_name: "Submit"
role: "button"
# Find form controls within a specific form
query_accessibility_tree_cremotemcp:
selector: "form.checkout"
role: "textbox"
Integration Notes
- Tools use the
_cremotemcp
suffix to avoid naming conflicts - Responses include success status and descriptive messages
- Screenshots are saved to
/tmp/
directory with timestamps - The underlying cremote daemon handles browser management
- Accessibility tree tools provide semantic understanding of page structure
Advanced Usage Examples
Testing Web Applications
# Navigate to application
web_navigate_cremotemcp:
url: "https://myapp.com/login"
screenshot: true
# Test login functionality
web_interact_cremotemcp:
action: "fill"
selector: "#username"
value: "testuser"
web_interact_cremotemcp:
action: "fill"
selector: "#password"
value: "testpass"
web_interact_cremotemcp:
action: "click"
selector: "button[type='submit']"
# Verify successful login
web_navigate_cremotemcp:
url: "current"
screenshot: true
Data Extraction Workflows
# Navigate to data source
web_navigate_cremotemcp:
url: "https://data-site.com/table"
# Click through pagination or filters
web_interact_cremotemcp:
action: "click"
selector: ".filter-button"
# Take screenshot to document current state
web_navigate_cremotemcp:
url: "current"
screenshot: true
File Upload Testing
# Navigate to upload form
web_navigate_cremotemcp:
url: "https://example.com/upload"
# Upload a file
web_interact_cremotemcp:
action: "upload"
selector: "input[type='file']"
value: "/path/to/test-file.pdf"
# Submit the upload form
web_interact_cremotemcp:
action: "click"
selector: "button.upload-submit"
Tool Response Format
Both tools return structured responses:
Success Response:
"Successfully navigated to https://example.com in tab ABC123 (screenshot saved to /tmp/navigate-1234567890.png)"
Error Response:
"failed to load URL: context deadline exceeded"
CSS Selector Best Practices
Recommended Selector Types:
- ID selectors:
#unique-id
(most reliable) - Name attributes:
input[name='fieldname']
- Class combinations:
.primary.button
- Attribute selectors:
button[data-action='submit']
Avoid These Selectors:
- Generic tags:
div
,span
,input
(too broad) - Position-based:
:nth-child()
(fragile) - Text-based:
:contains()
(not standard CSS)
🚀 Advanced Workflow Examples
Efficient Form Completion Workflow
# 1. Check if form exists
web_element_check_cremotemcp:
selector: "#registration-form"
check_type: "exists"
# 2. Analyze form structure
web_form_analyze_cremotemcp:
selector: "#registration-form"
# 3. Fill entire form in one call
web_form_fill_bulk_cremotemcp:
form_selector: "#registration-form"
fields:
name: "John Doe"
email: "john@example.com"
password: "securepass123"
confirm_password: "securepass123"
Comprehensive Data Extraction Workflow
# 1. Get page performance metrics
web_performance_metrics_cremotemcp: {}
# 2. Extract multiple data points at once
web_extract_multiple_cremotemcp:
selectors:
title: "h1"
price: ".price"
description: ".product-description"
availability: ".stock-status"
# 3. Extract all product links
web_extract_links_cremotemcp:
container_selector: ".product-grid"
href_pattern: ".*/product/.*"
# 4. Take enhanced screenshot with metadata
web_screenshot_enhanced_cremotemcp:
output: "/tmp/product-page.png"
full_page: true
Accessibility Testing and Semantic Automation
# Navigate to page for accessibility testing
web_navigate_cremotemcp:
url: "https://myapp.com/form"
screenshot: true
# Get full accessibility tree to analyze structure
get_accessibility_tree_cremotemcp:
depth: 3
timeout: 10
# Find form elements by accessible names (more robust than CSS selectors)
query_accessibility_tree_cremotemcp:
accessible_name: "Email Address"
role: "textbox"
# Fill form using accessibility-aware approach
web_interact_cremotemcp:
action: "fill"
selector: "[aria-label='Email Address']"
value: "user@example.com"
# Find and click submit button by accessible name
query_accessibility_tree_cremotemcp:
accessible_name: "Submit Form"
role: "button"
# Verify form accessibility structure
get_partial_accessibility_tree_cremotemcp:
selector: "form"
fetch_relatives: true
🎯 Best Practices for LLM Agents
1. Use Batch Operations
- Prefer
web_extract_multiple_cremotemcp
over multipleweb_extract_cremotemcp
calls - Use
web_form_fill_bulk_cremotemcp
instead of individual field interactions - Leverage
web_interact_multiple_cremotemcp
for complex interaction sequences
2. Check Before Acting
- Always use
web_element_check_cremotemcp
before interacting with elements - Verify page state with
web_page_info_cremotemcp
when needed - Check content loading with
web_content_check_cremotemcp
3. Optimize for Efficiency
- 10x Form Efficiency: Complete forms in 1-2 calls instead of 10+
- Batch Data Extraction: Extract multiple data points in single calls
- Smart Element Checking: Prevent errors with conditional logic
4. Enhanced Debugging
- Use
web_screenshot_element_cremotemcp
for targeted debugging - Leverage
console_logs_cremotemcp
for JavaScript error detection - Take
web_screenshot_enhanced_cremotemcp
with metadata for comprehensive documentation
5. Accessibility-Aware Automation
- Use
query_accessibility_tree_cremotemcp
to find elements by accessible names instead of fragile selectors - Verify accessibility compliance with
get_accessibility_tree_cremotemcp
- Test screen reader compatibility by analyzing semantic structure
- Build more robust automation using ARIA roles and accessible names
🎉 Production Ready
This comprehensive web automation platform provides 31 tools across 6 enhancement phases, optimized specifically for LLM agents and production workflows. All tools include proper error handling, timeout management, and structured responses for reliable automation.
Ready for Production: Complete web automation platform with 31 tools, designed for maximum efficiency and reliability in LLM-driven workflows.