AI-powered structured data extraction
Extract structured data from webpages using AI. Define your schema and get clean JSON. Perfect for price monitoring and building datasets.
Get started with a simple API call.
const response = await fetch('https://api.crawlkit.sh/v1/crawl/extract', {
method: 'POST',
headers: {
'Authorization': 'Bearer YOUR_API_KEY',
'Content-Type': 'application/json'
},
body: JSON.stringify({
url: 'https://example.com/products',
schema: {
type: 'array',
items: {
type: 'object',
properties: {
name: { type: 'string' },
price: { type: 'number' },
inStock: { type: 'boolean' }
}
}
},
options: {
maxDepth: 2,
timeout: 60000
}
})
});
const { data } = await response.json();
console.log(data.json);Everything you need for reliable extract extraction.
Define a JSON schema and get structured data automatically extracted using AI.
Crawl multiple pages with configurable depth, following links automatically to gather comprehensive data.
Guide the extraction with natural language prompts for complex or conditional data extraction.
Include or exclude specific URL paths to control which pages get crawled and extracted.
Automatically discover and crawl pages from sitemaps, or ignore them based on your needs.
Perform clicks, typing, scrolling, and other browser actions before extraction for dynamic content.
Get started in minutes with our simple API.
Sign up for free and get your API key instantly. No credit card required.
Use our simple REST API with your favorite programming language.
Receive clean JSON responses ready to use in your application.
Get clean, structured JSON data with every request. Our API returns comprehensive data including status codes, timing information, and your remaining credits.
{
"success": true,
"data": {
"json": [
{
"name": "Product 1",
"price": 29.99,
"inStock": true
},
{
"name": "Product 2",
"price": 49.99,
"inStock": false
}
],
"pages": [
{
"url": "https://example.com/products",
"markdown": "# Products\n\n...",
"json": { ... }
}
],
"stats": {
"pagesProcessed": 5,
"totalLinks": 25
},
"timing": { "total": 12500 },
"creditsUsed": 5,
"creditsRemaining": 95
}
}Common applications for Extract.
Extract product prices, stock status, and specifications from e-commerce sites automatically.
Build contact databases by extracting names, emails, phone numbers from business directories.
Collect articles, reviews, job listings, or any structured content from multiple sources.
Gather competitive intelligence by extracting product features, pricing, and reviews at scale.
Everything you need to know about the Extract API.
A JSON schema defines the structure of the data you want to extract. It specifies field names, types (string, number, boolean, array, object), and nested structures.
Our AI analyzes the page content and your schema definition to intelligently extract the requested data, even from complex or inconsistent page structures.
Yes, set maxDepth to control how many levels of links to follow. The system will automatically crawl and extract from all discovered pages.
Credits are based on the number of pages processed. Each page extraction costs credits, and the total depends on your maxDepth and link structure.
Get 100 free credits to test the API. No credit card required.