ScraperAPI LlamaIndex Integration

We’re excited to introduce ScraperAPI’s official LlamaIndex tools integration — a powerful new way to bring real-time web data directly into your LLM workflows.

With the llama-index-tools-scraperapi package, you can equip your agents with the ability to:

  • Scrape any public webpage

  • Extract structured data from leading platforms

  • Power intelligent, data-rich pipelines

All through a single, unified tool interface.

Supported platforms include Amazon, Google, eBay, Walmart, and Redfin — making it easier than ever to build agents that interact with real-world data at scale.

What’s New in 1.0.0

Version 1.0.0 marks the first production-ready release of the integration, introducing a suite of purpose-built tools:

Available Tools

  • scrape: Fetch and return fully rendered HTML or text from any public URL.

  • Amazon: Extract rich product data including title, price, ratings, availability, and ASIN.

  • Google: Retrieve structured search results with titles, links, and snippets.

  • eBay: Access listing data such as item names, pricing, condition, and seller info.

  • Walmart: Pull structured product data from search results and product pages.

  • Redfin: Gather real estate insights including property details, pricing, and listing status.
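As a rough illustration of what a scrape-style request involves, the sketch below builds a request URL for ScraperAPI's public HTTP endpoint using only the standard library. It does not show how llama-index-tools-scraperapi is implemented. The helper name build_scrape_url and the choice to expose only the render flag are assumptions made for this example.

```python
from urllib.parse import urlencode

# ScraperAPI's public HTTP endpoint; the LlamaIndex package may call the
# service differently (assumption: this sketch bypasses the package entirely).
SCRAPERAPI_ENDPOINT = "https://api.scraperapi.com/"


def build_scrape_url(api_key: str, target_url: str, render: bool = False) -> str:
    """Return a ScraperAPI request URL for target_url (hypothetical helper)."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    params = {"api_key": api_key, "url": target_url}
    if render:
        # Ask the service to render JavaScript before returning the page.
        params["render"] = "true"
    # urlencode percent-encodes the target URL so its own query string survives.
    return SCRAPERAPI_ENDPOINT + "?" + urlencode(params)
```

Fetching the resulting URL with any HTTP client returns the page content. The request itself is left out here because it needs a valid key and network access.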

Package & Resources

  • PyPI: llama-index-tools-scraperapi

  • GitHub: github.com/scraperapi/llama-index-tools-scraperapi

  • MCP Server: https://mcp.scraperapi.com (optional hosted alternative; no installation needed)

Quick Start

Installation

Install the package from PyPI with pip: pip install llama-index-tools-scraperapi

Basic Usage

Spin up a LlamaIndex agent with ScraperAPI tools in just a few lines:


MCP Server Integration (No Package Required)

Prefer a hosted approach? You can connect to ScraperAPI through the remote MCP server at https://mcp.scraperapi.com; no additional package is needed.

Prerequisites

Example


Requirements

  • ScraperAPI API key (SCRAPERAPI_API_KEY or SCRAPERAPI_KEY)

  • Python 3.8+

  • LLM provider API key (e.g., OPENAI_API_KEY)
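A quick pre-flight check can confirm these requirements before an agent starts. The sketch below is a minimal example, assuming the two key names above are environment variables and that SCRAPERAPI_API_KEY should win when both are set; that lookup order and the helper names are assumptions, not documented behaviour of the package.

```python
import os
import sys
from typing import Mapping, Optional

# Names taken from the requirements list; the lookup order is an assumption.
SCRAPERAPI_ENV_VARS = ("SCRAPERAPI_API_KEY", "SCRAPERAPI_KEY")


def resolve_scraperapi_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the first non-empty ScraperAPI key found in env (hypothetical helper)."""
    env = os.environ if env is None else env
    for name in SCRAPERAPI_ENV_VARS:
        value = env.get(name, "").strip()
        if value:
            return value
    raise RuntimeError(
        "No ScraperAPI key found; set one of: " + ", ".join(SCRAPERAPI_ENV_VARS)
    )


def python_version_ok(version_info=sys.version_info) -> bool:
    """Check the interpreter against the stated Python 3.8+ requirement."""
    return tuple(version_info[:2]) >= (3, 8)
```

Calling resolve_scraperapi_key() with no arguments reads from the real environment; passing a plain dict keeps the check easy to unit test.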

Why This Matters

This integration unlocks a key capability for modern AI systems: reliable, structured access to live web data.

Whether you're building research agents, e-commerce intelligence tools, or real estate pipelines, ScraperAPI + LlamaIndex gives your models the context they need to act intelligently in the real world.

