mcp/browserbase

Verified Publisher

By mcp

•Updated 17 days ago

Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, int...

Image

Machine learning & AI

4.1K

View on MCP:Hub

Overview Tags

mcp/browserbase repository overview

⁠Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.

What is an MCP Server?⁠

⁠MCP Info

Attribute	Details
Docker Image	mcp/browserbase⁠
Author	browserbase⁠
Repository	https://github.com/browserbase/mcp-server-browserbase⁠

⁠Image Building Info

Attribute	Details
Dockerfile	https://github.com/browserbase/mcp-server-browserbase/blob/51f2175c7956033d7984d77fec311604b2ebaf7c/Dockerfile⁠
Commit	`51f2175c7956033d7984d77fec311604b2ebaf7c`
Docker Image built by	Docker Inc.
Docker Scout Health Score
Verify Signature	`COSIGN_REPOSITORY=mcp/signatures cosign verify mcp/browserbase --key https://raw.githubusercontent.com/docker/keyring/refs/heads/main/public/mcp/latest.pub`
Licence	Apache License 2.0

⁠Available Tools (9)

Tools provided by this Server	Short Description
`browserbase_screenshot`	Capture a full-page screenshot and return it (and save as a resource).
`browserbase_session_close`	Close the current Browserbase session and reset the active context.
`browserbase_session_create`	Create or reuse a Browserbase browser session and set it as active.
`browserbase_stagehand_act`	Perform a single action on the page (e.g., click, type).
`browserbase_stagehand_agent`	Execute a task autonomously using Gemini Computer Use agent.
`browserbase_stagehand_extract`	Extract structured data or text from the current page using an instruction.
`browserbase_stagehand_get_url`	Return the current page URL (full URL with query/fragment).
`browserbase_stagehand_navigate`	Navigate to a URL in the browser.
`browserbase_stagehand_observe`	Find interactive elements on the page from an instruction; optionally return an action.

⁠Tools Details

⁠Tool: `browserbase_screenshot`

Capture a full-page screenshot and return it (and save as a resource).

Parameters	Type	Description
`name`	`string` optional	The name of the screenshot

⁠Tool: `browserbase_session_close`

Close the current Browserbase session and reset the active context.

⁠Tool: `browserbase_session_create`

Create or reuse a Browserbase browser session and set it as active.

Parameters	Type	Description
`sessionId`	`string` optional	Optional session ID to use/reuse. If not provided or invalid, a new session is created.

⁠Tool: `browserbase_stagehand_act`

Perform a single action on the page (e.g., click, type).

Parameters	Type	Description
`action`	`string`	The action to perform. Should be as atomic and specific as possible,

  i.e. 'Click the sign in button' or 'Type 'hello' into the search input'.

variables|object optional|Variables used in the action template. ONLY use variables if you're dealing with sensitive data or dynamic content. When using variables, you MUST have the variable key in the action template. ie: {"action": "Fill in the password", "variables": {"password": "123456"}}

⁠Tool: `browserbase_stagehand_agent`

Execute a task autonomously using Gemini Computer Use agent. The agent will navigate and interact with web pages to complete the given task.

Parameters	Type	Description
`prompt`	`string`	The task prompt describing what you want the sub-agent to accomplish.

Be clear and specific about the goal. For example:
'Go to Hacker News and find the most controversial post from today, then summarize the top 3 comments'.
The agent will autonomously navigate and interact with web pages to complete this task.

⁠Tool: `browserbase_stagehand_extract`

Extract structured data or text from the current page using an instruction.

Parameters	Type	Description
`instruction`	`string`	The specific instruction for what information to extract from the current page.

Be as detailed and specific as possible about what you want to extract. For example:
'Extract all product names and prices from the listing page'.The more specific your instruction,
the better the extraction results will be.

⁠Tool: `browserbase_stagehand_get_url`

Return the current page URL (full URL with query/fragment).

⁠Tool: `browserbase_stagehand_navigate`

Navigate to a URL in the browser. Only use this tool with URLs you're confident will work and be up to date. Otherwise, use https://google.com⁠ as the starting point

Parameters	Type	Description
`url`	`string`	The URL to navigate to

⁠Tool: `browserbase_stagehand_observe`

Find interactive elements on the page from an instruction; optionally return an action.

Parameters	Type	Description
`instruction`	`string`	Detailed instruction for what specific elements or components to observe on the web page.

    This instruction must be extremely specific and descriptive. For example: 'Find the red login button
    in the top right corner', 'Locate the search input field with placeholder text', or 'Identify all
    clickable product cards on the page'. The more specific and detailed your instruction, the better
    the observation results will be. Avoid generic instructions like 'find buttons' or 'see elements'.
    Instead, describe the visual characteristics, location, text content, or functionality of the elements
    you want to observe. This tool is designed to help you identify interactive elements that you can
    later use with the act tool for performing actions like clicking, typing, or form submission.

⁠Use this MCP Server

{
  "mcpServers": {
    "browserbase": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "BROWSERBASE_API_KEY",
        "-e",
        "BROWSERBASE_PROJECT_ID",
        "-e",
        "GEMINI_API_KEY",
        "mcp/browserbase"
      ],
      "env": {
        "BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY_HERE",
        "BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID_HERE",
        "GEMINI_API_KEY": "YOUR_GEMINI_API_KEY_HERE"
      }
    }
  }
}

Why is it safer to run MCP Servers with Docker?⁠

Install from MCP:Hub

Browserbase

View and Install from MCP:Hub

Tag summary

Recent tags

Content type

Image

Digest

sha256:544c304d4…

Size

108.1 MB

Last updated

17 days ago

Run in Docker Desktop

Requires Docker Desktop 4.37.1 or later.

This week's pulls

Pulls:

231

Feb 2 to Feb 8

Learn more⁠