AI Web Scrapers Documentation
This document provides an overview of the AI-powered web scrapers integrated into our system. These scrapers interact with advanced AI models to retrieve structured responses to user prompts, including relevant metadata such as sources, citations, and contextual information.
Overview
Our platform supports the following AI scrapers:
- ChatGPT Scraper
- Perplexity Scraper
- Copilot Scraper
- Gemini Scraper
Each scraper is designed to handle user prompts and return enriched responses, with varying levels of detail and supporting data.
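To make the idea of an "enriched response" concrete, the following is a minimal sketch of how a result from any of these scrapers could be modelled in client code. The class names, field names, and the `cited_urls` helper are assumptions made for illustration only; this document does not specify the actual response schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Citation:
    """A source reference attached to an answer (hypothetical shape)."""
    title: str
    url: str


@dataclass
class ScraperResponse:
    """Hypothetical common envelope for a scraper result."""
    scraper: str   # e.g. "chatgpt", "perplexity", "copilot"
    prompt: str    # the natural language prompt that was sent
    answer: str    # the answer text returned by the AI model
    citations: List[Citation] = field(default_factory=list)  # may be empty


def cited_urls(response: ScraperResponse) -> List[str]:
    """Return the unique citation URLs in first-seen order."""
    seen = set()
    urls = []
    for citation in response.citations:
        if citation.url not in seen:
            seen.add(citation.url)
            urls.append(citation.url)
    return urls


# Illustrative data only; not output from a real scraper.
example = ScraperResponse(
    scraper="perplexity",
    prompt="What is a SERP?",
    answer="A SERP is the page a search engine returns for a query.",
    citations=[
        Citation("Example source", "https://example.com/serp"),
        Citation("Example source (duplicate)", "https://example.com/serp"),
    ],
)
print(cited_urls(example))  # ['https://example.com/serp']
```

Defaulting `citations` to an empty list lets the same structure hold responses that come back without supporting data.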
How AI Scraping Differs from Traditional SERP Scraping
AI bot scraping offers a smarter, more efficient alternative to traditional SERP (Search Engine Results Page) scraping. SERP scraping extracts raw HTML from search engine result pages, which often requires complex parsing and frequent maintenance. AI bot scraping instead uses natural language models to understand the intent behind a query and return structured, context-rich answers.
Key advantages of AI bot scraping over SERP scraping:
- ✅ Understands query context and delivers direct answers
- ✅ Includes citations, sources, and enriched content (e.g., maps, product links)
- ✅ More resilient to UI changes and search engine updates
- ✅ Easier to integrate via official APIs
- ✅ Provides a better user experience with cleaner, more relevant results
In contrast, SERP scraping is limited to surface-level data, often lacks context, and may violate search engine terms of service. AI bot scraping is the modern, scalable solution for intelligent data extraction.
1. ChatGPT Scraper
The ChatGPT Scraper interfaces with OpenAI’s ChatGPT model to generate detailed responses to user queries. It is optimized for conversational answers and supports a wide range of content types.
Features:
- Accepts a natural language prompt
- Returns a comprehensive answer
Includes:
- Hyperlinks to referenced content
- Citations (when available)
- Product recommendations
- Embedded map data (e.g., locations, directions)
2. Perplexity Scraper
The Perplexity Scraper uses the Perplexity AI engine to return concise and well-sourced answers. It is particularly useful for research and fact-based queries.
Features:
- Accepts a natural language prompt
- Returns a direct answer
Includes:
- List of sources used to generate the response
- Hyperlinked citations for verification
3. Copilot Scraper
The Copilot Scraper leverages Microsoft’s Copilot AI (based on OpenAI’s models) to provide contextual answers with source references. It is ideal for productivity, technical, and business-related queries.
Features:
- Accepts a natural language prompt
- Returns a detailed answer
Includes:
- Referenced sources
- Hyperlinked citations
- Contextual explanations
4. Gemini Scraper
The Gemini Scraper integrates with Google’s Gemini AI (formerly Bard) to process natural language prompts and return informative, conversational responses. Depending on the prompt, Gemini may choose to search the web and include citations or sources in its answer.
Features:
- Accepts a natural language prompt
- Returns a detailed, conversational answer
May include:
- Web-based citations and sources (if Gemini chooses to search the web)
- Hyperlinked references (when available)
The decision to search the web is controlled internally by Gemini and cannot currently be forced or disabled via the scraper. Prompt phrasing may influence whether web search is triggered.
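Because web search cannot be forced, one possible client-side approach is to try several phrasings of the same question and keep the first response that comes back with citations. The sketch below assumes a response is a dictionary with a `citations` list and that the scraper is exposed as a callable taking a prompt. Both are hypothetical, as is the `fake_gemini_scrape` stand-in used to make the example runnable.

```python
from typing import Callable, Dict, Iterable, Optional

# Hypothetical response shape: a dict with "prompt", "answer", "citations".
Response = Dict[str, object]


def first_response_with_sources(
    scrape: Callable[[str], Response],
    phrasings: Iterable[str],
) -> Optional[Response]:
    """Try each phrasing in order; return the first response that has citations.

    Returns None if no phrasing produced a cited response.
    """
    for prompt in phrasings:
        response = scrape(prompt)
        if response.get("citations"):  # a missing or empty list counts as "no web search"
            return response
    return None


def fake_gemini_scrape(prompt: str) -> Response:
    """Stand-in for a real scraper call; cites a source only for 'latest' prompts."""
    if "latest" in prompt.lower():
        return {
            "prompt": prompt,
            "answer": "stub answer",
            "citations": ["https://example.com/news"],
        }
    return {"prompt": prompt, "answer": "stub answer", "citations": []}


result = first_response_with_sources(
    fake_gemini_scrape,
    ["Tell me about solar panels", "What are the latest solar panel prices?"],
)
print(result["prompt"] if result else "no cited response")
```

Each extra phrasing costs an additional scraper call, so a real integration would likely cap the number of attempts.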