Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, int...
4.1K
Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.
| Attribute | Details |
|---|---|
| Docker Image | mcp/browserbase |
| Author | browserbase |
| Repository | https://github.com/browserbase/mcp-server-browserbase |
| Attribute | Details |
|---|---|
| Dockerfile | https://github.com/browserbase/mcp-server-browserbase/blob/51f2175c7956033d7984d77fec311604b2ebaf7c/Dockerfile |
| Commit | 51f2175c7956033d7984d77fec311604b2ebaf7c |
| Docker Image built by | Docker Inc. |
| Docker Scout Health Score | |
| Verify Signature | COSIGN_REPOSITORY=mcp/signatures cosign verify mcp/browserbase --key https://raw.githubusercontent.com/docker/keyring/refs/heads/main/public/mcp/latest.pub |
| Licence | Apache License 2.0 |
| Tools provided by this Server | Short Description |
|---|---|
browserbase_screenshot | Capture a full-page screenshot and return it (and save as a resource). |
browserbase_session_close | Close the current Browserbase session and reset the active context. |
browserbase_session_create | Create or reuse a Browserbase browser session and set it as active. |
browserbase_stagehand_act | Perform a single action on the page (e.g., click, type). |
browserbase_stagehand_agent | Execute a task autonomously using Gemini Computer Use agent. |
browserbase_stagehand_extract | Extract structured data or text from the current page using an instruction. |
browserbase_stagehand_get_url | Return the current page URL (full URL with query/fragment). |
browserbase_stagehand_navigate | Navigate to a URL in the browser. |
browserbase_stagehand_observe | Find interactive elements on the page from an instruction; optionally return an action. |
browserbase_screenshotCapture a full-page screenshot and return it (and save as a resource).
| Parameters | Type | Description |
|---|---|---|
name | string optional | The name of the screenshot |
browserbase_session_closeClose the current Browserbase session and reset the active context.
browserbase_session_createCreate or reuse a Browserbase browser session and set it as active.
| Parameters | Type | Description |
|---|---|---|
sessionId | string optional | Optional session ID to use/reuse. If not provided or invalid, a new session is created. |
browserbase_stagehand_actPerform a single action on the page (e.g., click, type).
| Parameters | Type | Description |
|---|---|---|
action | string | The action to perform. Should be as atomic and specific as possible, |
i.e. 'Click the sign in button' or 'Type 'hello' into the search input'.
variables|object optional|Variables used in the action template. ONLY use variables if you're dealing
with sensitive data or dynamic content. When using variables, you MUST have the variable
key in the action template. ie: {"action": "Fill in the password", "variables": {"password": "123456"}}
browserbase_stagehand_agentExecute a task autonomously using Gemini Computer Use agent. The agent will navigate and interact with web pages to complete the given task.
| Parameters | Type | Description |
|---|---|---|
prompt | string | The task prompt describing what you want the sub-agent to accomplish. |
Be clear and specific about the goal. For example:
'Go to Hacker News and find the most controversial post from today, then summarize the top 3 comments'.
The agent will autonomously navigate and interact with web pages to complete this task.
browserbase_stagehand_extractExtract structured data or text from the current page using an instruction.
| Parameters | Type | Description |
|---|---|---|
instruction | string | The specific instruction for what information to extract from the current page. |
Be as detailed and specific as possible about what you want to extract. For example:
'Extract all product names and prices from the listing page'.The more specific your instruction,
the better the extraction results will be.
browserbase_stagehand_get_urlReturn the current page URL (full URL with query/fragment).
browserbase_stagehand_navigateNavigate to a URL in the browser. Only use this tool with URLs you're confident will work and be up to date. Otherwise, use https://google.com as the starting point
| Parameters | Type | Description |
|---|---|---|
url | string | The URL to navigate to |
browserbase_stagehand_observeFind interactive elements on the page from an instruction; optionally return an action.
| Parameters | Type | Description |
|---|---|---|
instruction | string | Detailed instruction for what specific elements or components to observe on the web page. |
This instruction must be extremely specific and descriptive. For example: 'Find the red login button
in the top right corner', 'Locate the search input field with placeholder text', or 'Identify all
clickable product cards on the page'. The more specific and detailed your instruction, the better
the observation results will be. Avoid generic instructions like 'find buttons' or 'see elements'.
Instead, describe the visual characteristics, location, text content, or functionality of the elements
you want to observe. This tool is designed to help you identify interactive elements that you can
later use with the act tool for performing actions like clicking, typing, or form submission.
{
"mcpServers": {
"browserbase": {
"command": "docker",
"args": [
"run",
"-i",
"--rm",
"-e",
"BROWSERBASE_API_KEY",
"-e",
"BROWSERBASE_PROJECT_ID",
"-e",
"GEMINI_API_KEY",
"mcp/browserbase"
],
"env": {
"BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY_HERE",
"BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID_HERE",
"GEMINI_API_KEY": "YOUR_GEMINI_API_KEY_HERE"
}
}
}
}
Content type
Image
Digest
sha256:544c304d4…
Size
108.1 MB
Last updated
17 days ago
Requires Docker Desktop 4.37.1 or later.
Pulls:
231
Feb 2 to Feb 8