Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.
72
8 Tools
Version 4.43 or later needs to be installed to add the server automatically
Use cases
About
Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.
Attribute | Details |
---|---|
Docker Image | mcp/browserbase |
Author | browserbase |
Repository | https://github.com/browserbase/mcp-server-browserbase |
Attribute | Details |
---|---|
Dockerfile | https://github.com/browserbase/mcp-server-browserbase/blob/main/Dockerfile |
Docker Image built by | Docker Inc. |
Docker Scout Health Score | |
Verify Signature | COSIGN_REPOSITORY=mcp/signatures cosign verify mcp/browserbase --key https://raw.githubusercontent.com/docker/keyring/refs/heads/main/public/mcp/latest.pub |
Licence | Apache License 2.0 |
Tools provided by this Server | Short Description |
---|---|
browserbase_screenshot | Capture a full-page screenshot and return it (and save as a resource). |
browserbase_session_close | Close the current Browserbase session and reset the active context. |
browserbase_session_create | Create or reuse a Browserbase browser session and set it as active. |
browserbase_stagehand_act | Perform a single action on the page (e.g., click, type). |
browserbase_stagehand_extract | Extract structured data or text from the current page using an instruction. |
browserbase_stagehand_get_url | Return the current page URL (full URL with query/fragment). |
browserbase_stagehand_navigate | Navigate to a URL in the browser. |
browserbase_stagehand_observe | Find interactive elements on the page from an instruction; optionally return an action. |
browserbase_screenshot
Capture a full-page screenshot and return it (and save as a resource).
Parameters | Type | Description |
---|---|---|
name | string optional | The name of the screenshot |
browserbase_session_close
Close the current Browserbase session and reset the active context.
browserbase_session_create
Create or reuse a Browserbase browser session and set it as active.
Parameters | Type | Description |
---|---|---|
sessionId | string optional | Optional session ID to use/reuse. If not provided or invalid, a new session is created. |
browserbase_stagehand_act
Perform a single action on the page (e.g., click, type).
Parameters | Type | Description |
---|---|---|
action | string | The action to perform. Should be as atomic and specific as possible, |
i.e. 'Click the sign in button' or 'Type 'hello' into the search input'.
variables
|object
optional|Variables used in the action template. ONLY use variables if you're dealing
with sensitive data or dynamic content. When using variables, you MUST have the variable
key in the action template. ie: {"action": "Fill in the password", "variables": {"password": "123456"}}
browserbase_stagehand_extract
Extract structured data or text from the current page using an instruction.
Parameters | Type | Description |
---|---|---|
instruction | string | The specific instruction for what information to extract from the current page. |
Be as detailed and specific as possible about what you want to extract. For example:
'Extract all product names and prices from the listing page'.The more specific your instruction,
the better the extraction results will be.
browserbase_stagehand_get_url
Return the current page URL (full URL with query/fragment).
browserbase_stagehand_navigate
Navigate to a URL in the browser. Only use this tool with URLs you're confident will work and be up to date. Otherwise, use https://google.com as the starting point
Parameters | Type | Description |
---|---|---|
url | string | The URL to navigate to |
browserbase_stagehand_observe
Find interactive elements on the page from an instruction; optionally return an action.
Parameters | Type | Description |
---|---|---|
instruction | string | Detailed instruction for what specific elements or components to observe on the web page. |
This instruction must be extremely specific and descriptive. For example: 'Find the red login button
in the top right corner', 'Locate the search input field with placeholder text', or 'Identify all
clickable product cards on the page'. The more specific and detailed your instruction, the better
the observation results will be. Avoid generic instructions like 'find buttons' or 'see elements'.
Instead, describe the visual characteristics, location, text content, or functionality of the elements
you want to observe. This tool is designed to help you identify interactive elements that you can
later use with the act tool for performing actions like clicking, typing, or form submission.
returnAction
|boolean
optional|Whether to return the action to perform on the element. If true, the action will be returned as a string.
If false, the action will not be returned.
{
"mcpServers": {
"browserbase": {
"command": "docker",
"args": [
"run",
"-i",
"--rm",
"-e",
"BROWSERBASE_API_KEY",
"-e",
"BROWSERBASE_PROJECT_ID",
"-e",
"GEMINI_API_KEY",
"mcp/browserbase"
],
"env": {
"BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY_HERE",
"BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID_HERE",
"GEMINI_API_KEY": "YOUR_GEMINI_API_KEY_HERE"
}
}
}
}
Manual installation
You can install the MCP server using:
Installation for