mcp/browserbase

Verified Publisher

By mcp

Updated 17 days ago

Image
Machine learning & AI

mcp/browserbase repository overview

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand for AI-powered web automation, intelligent data extraction, and screenshot capture.

MCP Info

Image Building Info

| Attribute | Details |
|---|---|
| Dockerfile | https://github.com/browserbase/mcp-server-browserbase/blob/51f2175c7956033d7984d77fec311604b2ebaf7c/Dockerfile |
| Commit | 51f2175c7956033d7984d77fec311604b2ebaf7c |
| Docker Image built by | Docker Inc. |
| Docker Scout Health Score | Docker Scout Health Score (badge) |
| Verify Signature | `COSIGN_REPOSITORY=mcp/signatures cosign verify mcp/browserbase --key https://raw.githubusercontent.com/docker/keyring/refs/heads/main/public/mcp/latest.pub` |
| Licence | Apache License 2.0 |

Available Tools (9)

| Tools provided by this Server | Short Description |
|---|---|
| browserbase_screenshot | Capture a full-page screenshot and return it (and save as a resource). |
| browserbase_session_close | Close the current Browserbase session and reset the active context. |
| browserbase_session_create | Create or reuse a Browserbase browser session and set it as active. |
| browserbase_stagehand_act | Perform a single action on the page (e.g., click, type). |
| browserbase_stagehand_agent | Execute a task autonomously using Gemini Computer Use agent. |
| browserbase_stagehand_extract | Extract structured data or text from the current page using an instruction. |
| browserbase_stagehand_get_url | Return the current page URL (full URL with query/fragment). |
| browserbase_stagehand_navigate | Navigate to a URL in the browser. |
| browserbase_stagehand_observe | Find interactive elements on the page from an instruction; optionally return an action. |
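
To illustrate how these tools might be chained, here is a minimal sketch that builds a sequence of MCP `tools/call` JSON-RPC requests. The tool names and parameter names come from the table and the details on this page; the particular ordering (create, navigate, extract, close), the helper names `tool_call` and `example_workflow`, and the example URL are assumptions for illustration, not something this page prescribes.

```python
import json
from itertools import count


def tool_call(request_id, name, arguments=None):
    """Build one MCP `tools/call` JSON-RPC request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


def example_workflow(url, instruction):
    """Return a hypothetical create -> navigate -> extract -> close sequence."""
    ids = count(1)  # JSON-RPC request ids: 1, 2, 3, ...
    steps = [
        ("browserbase_session_create", {}),
        ("browserbase_stagehand_navigate", {"url": url}),
        ("browserbase_stagehand_extract", {"instruction": instruction}),
        ("browserbase_session_close", {}),
    ]
    return [tool_call(next(ids), name, args) for name, args in steps]


if __name__ == "__main__":
    # Print each request as one JSON line, as a client might send it.
    for request in example_workflow(
        "https://news.ycombinator.com",
        "Extract the titles of the top 5 posts",
    ):
        print(json.dumps(request))
```

In practice an MCP client library usually constructs these messages for you; the sketch only makes the shape of each call explicit.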

Tools Details

Tool: browserbase_screenshot

Capture a full-page screenshot and return it (and save as a resource).

| Parameters | Type | Description |
|---|---|---|
| name | string (optional) | The name of the screenshot |

Tool: browserbase_session_close

Close the current Browserbase session and reset the active context.

Tool: browserbase_session_create

Create or reuse a Browserbase browser session and set it as active.

| Parameters | Type | Description |
|---|---|---|
| sessionId | string (optional) | Optional session ID to use/reuse. If not provided or invalid, a new session is created. |

Tool: browserbase_stagehand_act

Perform a single action on the page (e.g., click, type).

| Parameters | Type | Description |
|---|---|---|
| action | string | The action to perform. Should be as atomic and specific as possible, e.g. 'Click the sign in button' or 'Type "hello" into the search input'. |
| variables | object (optional) | Variables used in the action template. ONLY use variables if you're dealing with sensitive data or dynamic content. When using variables, you MUST have the variable key in the action template, e.g. {"action": "Fill in the password", "variables": {"password": "123456"}} |
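
The rule that every variable key must appear in the action template can be checked before the tool is called. The sketch below is one possible pre-flight check, not part of the server: the helper names `missing_variable_keys` and `act_arguments` are hypothetical, and because this page does not specify a placeholder syntax, the check only tests whether each key name occurs somewhere in the action text (case-insensitively).

```python
def missing_variable_keys(action, variables):
    """Return variable keys that do not appear anywhere in the action text."""
    lowered = action.lower()
    return [key for key in variables if key.lower() not in lowered]


def act_arguments(action, variables=None):
    """Build the arguments object for browserbase_stagehand_act.

    Raises ValueError if a variable key is not referenced in the action,
    mirroring the requirement stated in the parameter description.
    """
    arguments = {"action": action}
    if variables:
        missing = missing_variable_keys(action, variables)
        if missing:
            raise ValueError(f"variable keys not referenced in action: {missing}")
        arguments["variables"] = dict(variables)
    return arguments
```

For example, `act_arguments("Fill in the password", {"password": "123456"})` reproduces the arguments shown in the description, while passing the same variables with the action 'Click the sign in button' raises a `ValueError`.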


Tool: browserbase_stagehand_agent

Execute a task autonomously using Gemini Computer Use agent. The agent will navigate and interact with web pages to complete the given task.

| Parameters | Type | Description |
|---|---|---|
| prompt | string | The task prompt describing what you want the sub-agent to accomplish. Be clear and specific about the goal. For example: 'Go to Hacker News and find the most controversial post from today, then summarize the top 3 comments'. The agent will autonomously navigate and interact with web pages to complete this task. |

Tool: browserbase_stagehand_extract

Extract structured data or text from the current page using an instruction.

| Parameters | Type | Description |
|---|---|---|
| instruction | string | The specific instruction for what information to extract from the current page. Be as detailed and specific as possible about what you want to extract. For example: 'Extract all product names and prices from the listing page'. The more specific your instruction, the better the extraction results will be. |

Tool: browserbase_stagehand_get_url

Return the current page URL (full URL with query/fragment).

Tool: browserbase_stagehand_navigate

Navigate to a URL in the browser. Only use this tool with URLs you're confident will work and are up to date. Otherwise, use https://google.com as the starting point.

| Parameters | Type | Description |
|---|---|---|
| url | string | The URL to navigate to |
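
The fallback guidance above could be approximated on the caller's side as follows. This is only a sketch: the function name `start_url` is hypothetical, and the check is purely syntactic (an absolute `http` or `https` URL with a host). It cannot tell whether a URL actually works or is up to date, which remains a judgment for the caller.

```python
from urllib.parse import urlparse

# Starting point suggested by the tool description when no URL is trusted.
FALLBACK_URL = "https://google.com"


def start_url(candidate):
    """Return candidate if it is an absolute http(s) URL, else the fallback."""
    if candidate:
        cleaned = candidate.strip()
        parsed = urlparse(cleaned)
        if parsed.scheme in ("http", "https") and parsed.netloc:
            return cleaned
    return FALLBACK_URL
```

The result can be passed as the `url` argument of `browserbase_stagehand_navigate`.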

Tool: browserbase_stagehand_observe

Find interactive elements on the page from an instruction; optionally return an action.

| Parameters | Type | Description |
|---|---|---|
| instruction | string | Detailed instruction for what specific elements or components to observe on the web page. This instruction must be extremely specific and descriptive. For example: 'Find the red login button in the top right corner', 'Locate the search input field with placeholder text', or 'Identify all clickable product cards on the page'. The more specific and detailed your instruction, the better the observation results will be. Avoid generic instructions like 'find buttons' or 'see elements'. Instead, describe the visual characteristics, location, text content, or functionality of the elements you want to observe. This tool is designed to help you identify interactive elements that you can later use with the act tool for performing actions like clicking, typing, or form submission. |

Use this MCP Server

{
  "mcpServers": {
    "browserbase": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "BROWSERBASE_API_KEY",
        "-e",
        "BROWSERBASE_PROJECT_ID",
        "-e",
        "GEMINI_API_KEY",
        "mcp/browserbase"
      ],
      "env": {
        "BROWSERBASE_API_KEY": "YOUR_BROWSERBASE_API_KEY_HERE",
        "BROWSERBASE_PROJECT_ID": "YOUR_BROWSERBASE_PROJECT_ID_HERE",
        "GEMINI_API_KEY": "YOUR_GEMINI_API_KEY_HERE"
      }
    }
  }
}
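
In the configuration above, each `-e NAME` argument has no `=value`, so Docker forwards that variable from the environment of the `docker` process itself; the sketch below assumes the MCP client applies the `env` block to that process. It is a hypothetical pre-flight check (the names `forwarded_env_names` and `config_problems` are not part of any tool) that reports forwarded variables that are missing from `env` or still hold the `YOUR_..._HERE` placeholders.

```python
def forwarded_env_names(args):
    """Collect names passed as `-e NAME` (no `=value`) in a docker argument list."""
    names = []
    for flag, value in zip(args, args[1:]):
        if flag == "-e" and "=" not in value:
            names.append(value)
    return names


def config_problems(config, server="browserbase"):
    """Return human-readable problems found in one mcpServers entry."""
    entry = config["mcpServers"][server]
    env = entry.get("env", {})
    problems = []
    for name in forwarded_env_names(entry["args"]):
        value = env.get(name)
        if value is None:
            problems.append(f"{name}: forwarded with -e but missing from env")
        elif value.startswith("YOUR_") and value.endswith("_HERE"):
            problems.append(f"{name}: still set to the placeholder value")
    return problems
```

Load the configuration file with `json.load` and pass the resulting dict to `config_problems`; an empty list means every forwarded variable has a non-placeholder value.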

Tag summary

Content type: Image

Digest: sha256:544c304d4

Size: 108.1 MB

Last updated: 17 days ago

Requires Docker Desktop 4.37.1 or later.

This week's pulls: 231 (Feb 2 to Feb 8)