Browserbase is a developer platform to reliably run, manage, and monitor headless browsers.
Power your AI data retrievals with:
BROWSERBASE_API_KEY
, BROWSERBASE_PROJECT_ID
).You can load webpages into LangChain using BrowserbaseLoader
. Optionally, you can set text_content
parameter to convert the pages to text-only representation.
import os
from langchain_community.document_loaders import BrowserbaseLoader
load_dotenv()
BROWSERBASE_API_KEY = os.getenv("BROWSERBASE_API_KEY")
BROWSERBASE_PROJECT_ID = os.getenv("BROWSERBASE_PROJECT_ID")
loader = BrowserbaseLoader(
api_key=BROWSERBASE_API_KEY,
project_id=BROWSERBASE_PROJECT_ID,
urls=[
"https://example.com",
],
# Text mode
text_content=False,
)
docs = loader.load()
print(docs[0].page_content[:61])
Loader Options
urls
Required. A list of URLs to fetch.text_content
Retrieve only text content. Default is False
.api_key
Browserbase API key. Default is BROWSERBASE_API_KEY
env variable.project_id
Browserbase Project ID. Default is BROWSERBASE_PROJECT_ID
env variable.session_id
Optional. Provide an existing Session ID.proxy
Optional. Enable/Disable Proxies.RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4