Browser Tool

Learn how to use Browser Tool with Composio

Overview

Enum

BROWSER_TOOL

Description

Composio enables AI Agents and LLMs to authenticate and integrate with various tools via function calling.

Authentication Details

Actions

Copy currently selected text on the page to clipboard - ideal for extracting highlighted content, copying form data, or harvesting visible text selections.

Action Parameters

Action Response

data
object
error
string
successful
boolean
Execute precise drag and drop operations - essential for file uploads, list reordering, element moving, and complex ui interactions that require drag-based manipulation.

Action Parameters

button
stringDefaults to left
endX
integerRequired
endY
integerRequired
startX
integerRequired
startY
integerRequired

Action Response

data
object
error
string
successful
boolean
Your eyes: get page content for decision-making. use before: actions (find targets, understand state) use to verify: page transitions, major state changes, when actions seem to fail format: html=find elements | markdown=clean content | success rate: 99%

Action Parameters

format
stringDefaults to markdown
idleTtlSec
integer
newPage
boolean
returnPartialOnTimeout
booleanDefaults to True
url
string
wait
integerDefaults to 1000

Action Response

data
object
error
string
successful
boolean
Read current content from the system clipboard - essential for data transfer workflows, extracting copied text, and reading user-copied data for processing.

Action Parameters

Action Response

data
object
error
string
successful
boolean
Execute keyboard shortcuts and key combinations - essential for copy/paste, navigation, and application commands that agents need for efficient browser automation.

Action Parameters

holdTime
integer
keys
arrayRequired

Action Response

data
object
error
string
successful
boolean
Precision clicker: manual clicking with coordinates. pattern: fetchwebpage(html) → find element → estimate coordinates → click → verify hints: center buttons ~(640,350) | nav/header ~y=150 | content ~y=300-500 tip: try ±50px if first click misses | success rate: 85%

Action Parameters

button
stringDefaults to left
x
integerRequired
y
integerRequired

Action Response

data
object
error
string
successful
boolean
Execute a precise double click at specified screen coordinates - ideal for opening files, selecting text, or activating ui elements that require double click gestures.

Action Parameters

button
stringDefaults to left
x
integerRequired
y
integerRequired

Action Response

data
object
error
string
successful
boolean
Press and hold mouse button at coordinates - use for starting custom drag operations, text selections, or long-press interactions. must be followed by mouseup action to complete.

Action Parameters

button
stringDefaults to left
x
integerRequired
y
integerRequired

Action Response

data
object
error
string
successful
boolean
Move mouse cursor to precise coordinates without clicking - perfect for triggering hover effects, revealing tooltips, and positioning for subsequent interactions.

Action Parameters

x
integerRequired
y
integerRequired

Action Response

data
object
error
string
successful
boolean
Release mouse button at coordinates - completes drag operations, text selections, and long-press interactions. should be used after mousedown to finish mouse button sequences.

Action Parameters

button
stringDefaults to left
x
integerRequired
y
integerRequired

Action Response

data
object
error
string
successful
boolean
Always start here: creates browser session and navigates to url. workflow: navigate() → fetchwebpage() → manual interactions → verify print debugurl to user | success rate: 99%

Action Parameters

idleTtlSec
integer
url
stringRequired

Action Response

data
object
error
string
successful
boolean
Paste text content at the current cursor position - perfect for filling forms, inserting data into text fields, or quick content insertion at focused elements.

Action Parameters

text
stringRequired

Action Response

data
object
error
string
successful
boolean
Ai automation: complex workflows only. when to use: 10+ manual steps | dynamic/unpredictable content when to avoid: simple clicks | forms | navigation | payments strategy: try once → if fails, switch to manual immediately success rate: 40% - always have manual backup ready

Action Parameters

aiAgent
stringDefaults to browser-use
aiModel
stringDefaults to gpt-5-nano
aiProvider
stringDefaults to openai
highlightElements
booleanDefaults to True
idleTtlSec
integer
outputSchema
object
prompt
stringRequired
url
string

Action Response

data
object
error
string
successful
boolean
Capture high-quality screenshot of any webpage with extensive customization options - perfect for archiving, visual documentation, full-page captures, and cross-device viewport testing.

Action Parameters

captureFullHeight
boolean
height
integerDefaults to 720
idleTtlSec
integer
imageQuality
integerDefaults to 80
scrollAllContent
boolean
url
stringRequired
wait
integerDefaults to 1000
width
integerDefaults to 1280

Action Response

data
string
error
string
mimeType
string
successful
boolean
Page navigation: smooth scrolling. use: when target element not visible after fetchwebpage() distance: 200px=fine | 400px=sections | 800px=quick traverse always: scroll → fetchwebpage() → verify | success rate: 99%

Action Parameters

deltaX
integer
deltaY
integerRequired
steps
integerDefaults to 1
useOs
booleanDefaults to True
x
integerDefaults to 640
y
integerDefaults to 360

Action Response

data
object
error
string
successful
boolean
Store text content in the system clipboard for later paste operations - perfect for preparing data transfers, staging content for forms, or cross-application data sharing.

Action Parameters

text
stringRequired

Action Response

data
object
error
string
successful
boolean
Visual verification: capture screenshot of current browser viewport. use: debug ui issues, verify page state, document visual results renders: inline in mcp clients for immediate visual feedback tip: use after page changes to confirm they worked | success rate: 99%

Action Parameters

Action Response

data
object
error
string
successful
boolean
Controlled input: human-like typing. pattern: click to focus → typetext() → verify speed: delay=0 (fast) | delay=50 (human-like) | delay=100+ (careful) must focus input field first | success rate: 95%

Action Parameters

delay
integer
text
stringRequired

Action Response

data
object
error
string
successful
boolean