Browserbase

Browserbase

Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.

72

8 Tools

Packaged by
Requires Secrets
Add to Docker Desktop

Version 4.43 or later needs to be installed to add the server automatically

About

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.

What is an MCP Server?

MCP Info

Image Building Info

AttributeDetails
Dockerfilehttps://github.com/browserbase/mcp-server-browserbase/blob/main/Dockerfile
Docker Image built byDocker Inc.
Docker Scout Health ScoreDocker Scout Health Score
Verify SignatureCOSIGN_REPOSITORY=mcp/signatures cosign verify mcp/browserbase --key https://raw.githubusercontent.com/docker/keyring/refs/heads/main/public/mcp/latest.pub
LicenceApache License 2.0

Available Tools (8)

Tools provided by this ServerShort Description
browserbase_screenshotCapture a full-page screenshot and return it (and save as a resource).
browserbase_session_closeClose the current Browserbase session and reset the active context.
browserbase_session_createCreate or reuse a Browserbase browser session and set it as active.
browserbase_stagehand_actPerform a single action on the page (e.g., click, type).
browserbase_stagehand_extractExtract structured data or text from the current page using an instruction.
browserbase_stagehand_get_urlReturn the current page URL (full URL with query/fragment).
browserbase_stagehand_navigateNavigate to a URL in the browser.
browserbase_stagehand_observeFind interactive elements on the page from an instruction; optionally return an action.

Tools Details

Tool: browserbase_screenshot

Capture a full-page screenshot and return it (and save as a resource).

ParametersTypeDescription
namestringoptionalThe name of the screenshot

Tool: browserbase_session_close

Close the current Browserbase session and reset the active context.

Tool: browserbase_session_create

Create or reuse a Browserbase browser session and set it as active.

ParametersTypeDescription
sessionIdstringoptionalOptional session ID to use/reuse. If not provided or invalid, a new session is created.

Tool: browserbase_stagehand_act

Perform a single action on the page (e.g., click, type).

ParametersTypeDescription
actionstringThe action to perform. Should be as atomic and specific as possible,
  i.e. 'Click the sign in button' or 'Type 'hello' into the search input'.

variables|objectoptional|Variables used in the action template. ONLY use variables if you're dealing with sensitive data or dynamic content. When using variables, you MUST have the variable key in the action template. ie: {"action": "Fill in the password", "variables": {"password": "123456"}}


Tool: browserbase_stagehand_extract

Extract structured data or text from the current page using an instruction.

ParametersTypeDescription
instructionstringThe specific instruction for what information to extract from the current page.
Be as detailed and specific as possible about what you want to extract. For example:
'Extract all product names and prices from the listing page'.The more specific your instruction,
the better the extraction results will be.

Tool: browserbase_stagehand_get_url

Return the current page URL (full URL with query/fragment).

Tool: browserbase_stagehand_navigate

Navigate to a URL in the browser. Only use this tool with URLs you're confident will work and be up to date. Otherwise, use https://google.com as the starting point

ParametersTypeDescription
urlstringThe URL to navigate to

Tool: browserbase_stagehand_observe

Find interactive elements on the page from an instruction; optionally return an action.

ParametersTypeDescription
instructionstringDetailed instruction for what specific elements or components to observe on the web page.
    This instruction must be extremely specific and descriptive. For example: 'Find the red login button
    in the top right corner', 'Locate the search input field with placeholder text', or 'Identify all
    clickable product cards on the page'. The more specific and detailed your instruction, the better
    the observation results will be. Avoid generic instructions like 'find buttons' or 'see elements'.
    Instead, describe the visual characteristics, location, text content, or functionality of the elements
    you want to observe. This tool is designed to help you identify interactive elements that you can
    later use with the act tool for performing actions like clicking, typing, or form submission.

returnAction|booleanoptional|Whether to return the action to perform on the element. If true, the action will be returned as a string. If false, the action will not be returned.


Use this MCP Server

{
  "mcpServers": {
    "browserbase": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "BROWSERBASE_API_KEY",
        "-e",
        "BROWSERBASE_PROJECT_ID",
        "-e",
        "GEMINI_API_KEY",
        "mcp/browserbase"
      ],
      "env": {
        "BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY_HERE",
        "BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID_HERE",
        "GEMINI_API_KEY": "YOUR_GEMINI_API_KEY_HERE"
      }
    }
  }
}

Why is it safer to run MCP Servers with Docker?

Manual installation

You can install the MCP server using:

Installation for

Related servers