Search before crawl

Find candidates before spending crawl work

Search keeps agents from crawling blindly. Start from a query, pick the right channel, and return normalized result collections that can become crawl targets, review queues, or research inputs.

Recent sourcesPublisher contextFresh snippetsFollow-up crawl targets

curl

Run news search

curl -X POST "https://api.anycrawler.com/v1/search" \
  -H "Authorization: Bearer $ANYCRAWLER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "channel": "news",
    "query": "AI browser automation startup funding",
    "country": "us",
    "page": 1,
    "results_per_page": 10
  }'

Use the unified /v1/search endpoint and select the search type with the channel field.

Use cases

News Search API for Web Monitoring use cases

Prefer recent, reputable sources, then crawl only the articles that add new facts. Store screenshot evidence for sensitive stories or compliance workflows.

Use

Monitor market movement

Track launches, funding, incidents, regulation, or competitor coverage before crawling the highest-signal articles.

Use

Build alert workflows

Convert a query into recent source candidates that can feed Slack alerts, reports, or analyst review queues.

Use

Verify fast-changing claims

Pair fresh news results with render or screenshots when your workflow needs proof of what a source currently says.

How it works

Query, normalize, route the strongest results

The gateway handles request validation, account checks, search provider routing, response normalization, caching, and credit settlement.

1

Submit a query

Send a POST request with query, channel=news, and optional localization or pagination fields.

2

AnyCrawler routes the search

The gateway validates your API key, checks credits, applies cache policy, and sends the request to the selected search collection.

3

Results are normalized

Upstream results are returned in a predictable response envelope with channel, query, nested search parameters, credits, and cache fields.

4

You choose the next step

Use result links as crawl targets, visual review candidates, research inputs, or source lists for a larger agent workflow.

Response

Normalized news results

{
  "ok": true,
  "query": "AI browser automation startup funding",
  "credits_used": 20,
  "cache_timestamp": 0,
  "status_code": 200,
  "results": {
    "search_parameters": {
      "channel": "news",
      "country": "us",
      "page": 1,
      "query": "AI browser automation startup funding",
      "results_per_page": 10
    },
    "news": [
      {
        "title": "Startup raises Series A for AI browsing tools",
        "link": "https://example.com/news/ai-browser-funding",
        "source": "Example News",
        "date": "2026-06-05T09:30:00Z",
        "snippet": "Recent coverage for monitoring and follow-up crawling."
      }
    ]
  }
}

Result collections vary by channel, but the public response envelope stays consistent.

What you get

Search results that can become crawl targets

The news search response is built for routing. Use snippets for triage, links for follow-up crawl requests, and credit/cache fields for production observability.

  • Recent result links for source monitoring
  • Publisher and story context for triage
  • Fresh snippets for alert summaries
  • Candidate URLs for render or screenshot verification

Request fields

One search endpoint, explicit channel selection

The stable public contract focuses on the fields below. Undocumented passthrough fields are not part of new integrations.

Field Type Required Notes
channel "page", "images", "news", "videos", or "scholar" Yes Selects the search vertical on the unified public search endpoint.
query string Yes The search query to run.
country string No Optional country hint for localized search results.
page integer No Search result page to request.
results_per_page integer No Requested result count. Billing is calculated in blocks of 10 results.

Response fields

Fields your integration can rely on

AnyCrawler keeps the search envelope consistent while the result collection reflects the selected channel.

Field Meaning
ok Whether the gateway completed the search successfully.
query The query submitted in the request.
credits_used Credits settled for the request.
cache_timestamp Cache marker for reused responses. Search cache does not change pricing.
status_code The upstream search provider status code returned through the gateway.
results.search_parameters.channel The search vertical used for the request.
results.search_parameters.country The country hint used for localized search results.
results.search_parameters.query The query mirrored inside the normalized search parameters.
results.search_parameters.page The search result page returned.
results.search_parameters.results_per_page The requested result count used for billing.
results.news News results with recent source context for monitoring and follow-up crawling.

JavaScript

Use the unified endpoint

const response = await fetch("https://api.anycrawler.com/v1/search", {
  method: "POST",
  headers: {
    "authorization": `Bearer ${process.env.ANYCRAWLER_API_KEY}`,
    "content-type": "application/json",
  },
  body: JSON.stringify({
    channel: "news",
    query: "AI browser automation startup funding",
    country: "us",
    page: 1,
    results_per_page: 10,
  }),
});

const data = await response.json();
console.log(data.results);

Change channel to switch between page, images, news, videos, and scholar search.

Search vs crawl

Use news search when freshness is the ranking signal

Prefer recent, reputable sources, then crawl only the articles that add new facts. Store screenshot evidence for sensitive stories or compliance workflows. Once you have promising result links, call fetch or render to turn selected pages into Markdown, metadata, links, and optionally visual evidence.

API Best for Returns
search Finding candidate pages, sources, media, news, videos, or papers Search result collections
fetch Fast, low-cost extraction when useful content is already in HTML Markdown and page fields
render JavaScript-loaded pages that need browser execution Rendered Markdown and page fields

Errors

Status codes are explicit enough for production handling

Use response status plus error_code and request headers to route retries, credit issues, authentication problems, and upstream provider failures.

200 Search completed and normalized.
400 Invalid JSON, missing channel, missing query, or unsupported field value.
401 Missing, invalid, or revoked API key.
402 Account does not have enough credits.
403 Account inactive.
429 Rate limit reached.
502 Upstream search provider connection failed.
504 Search provider or gateway timed out.

Billing

Predictable result-count pricing

Search requests settle by requested result blocks. Cache can improve response consistency, but search cache hits do not switch to crawl cache-hit pricing.

Pricing

Start free, scale with explicit search rules

Search requests use 20 credits per block of 10 requested results. The same AnyCrawler plans apply here.

Free

$0/mo

For early validation with one-time credits, 30 fetch requests per minute, and 1 browser concurrency slot.

  • 10,000 one-time credits
  • $0 / cr
  • Fetch 30 req / min
  • Browser 1 concurrent

Agent Lite

$5/mo

For low-frequency AI agents and occasional automations that need 15,000 monthly credits.

  • 15,000 credits / mo
  • $0.000333 / cr
  • Fetch 60 req / min
  • Browser 2 concurrent
  • Low-frequency AI agent usage
  • Persistent top-ups available separately

Builder

$20/mo

For builders shipping serious crawler workflows with higher limits and better unit economics.

  • 80,000 credits / mo
  • $0.000250 / cr
  • Fetch 180 req / min
  • Browser 3 concurrent
  • Best upgrade after repeated Agent Lite usage

Growth

$80/mo

For teams scaling production scraping with better unit economics.

  • 960,000 credits / mo
  • $0.0000833 / cr
  • Fetch 480 req / min
  • Browser 8 concurrent

Scale

$200/mo

For high-volume pipelines with pricing aligned to large ongoing usage.

  • 3,000,000 credits / mo
  • $0.0000667 / cr
  • Fetch 1,200 req / min
  • Browser 20 concurrent

FAQ

Common news search questions

Short answers for teams using search to choose better crawl targets.

When should I use news search?

Use news search when freshness matters: launches, incidents, market coverage, funding stories, regulatory updates, or fast-moving topics.

Can I use the same endpoint for other search types?

Yes. POST to /v1/search and change the channel field to page, images, news, videos, or scholar. Each channel also has this dedicated theme page for integration planning.

Does search cache reduce the credit cost?

Search responses can be cached for speed and operational consistency, but search billing still follows the result-count formula rather than cache-hit pricing.

How do search results fit with page crawling?

Use search to find candidate URLs, then call fetch or render on the strongest results when you need Markdown, metadata, links, or screenshots.