B2B Lead Signal Scraper

POST

Scrape public job boards for tech stack mentions and discover high-intent B2B leads

POST /api/lead-signals
~250ms avg latency
API Key auth
Free tier

Value Proposition

Scrapes public job boards (RemoteOK, Arbeitnow) for tech stack mentions. Returns matching jobs with company, title, detected tech stack, and keyword matches. Perfect for sales teams looking to identify companies actively hiring for specific technologies.

What This API Does

The B2B Lead Signal Scraper monitors public job boards in real-time to identify companies actively hiring for specific technologies. By analyzing job postings from RemoteOK and Arbeitnow, it extracts valuable buying signals that indicate which companies are investing in new tech stacks.

Key Features

  • Multi-source scraping — Aggregates jobs from RemoteOK and Arbeitnow simultaneously
  • Tech stack detection — Automatically identifies 100+ technologies with alias resolution
  • Smart deduplication — Eliminates duplicate postings across sources
  • Word-boundary matching — Precise regex matching prevents false positives
  • Parallel fetching — Fast concurrent requests to all sources

Code Examples

curl -X POST "https://api.atomicapis.dev/api/lead-signals" \
  -H "X-RapidAPI-Proxy-Secret: YOUR_SECRET" \
  -H "Content-Type: application/json" \
  -d '{
    "keywords": ["React", "TypeScript", "Node.js", "AWS"],
    "sources": ["RemoteOk", "Arbeitnow"],
    "maxResults": 50,
    "location": "remote",
    "fullTime": true
  }'

Request Parameters

Name Type Required Description Example
keywords string[] Yes Tech keywords to search for in job postings. Supports 100+ technologies with alias resolution. ["React", "AWS"]
sources string[] No Job board sources to scrape. Defaults to all available sources. ["RemoteOk", "Arbeitnow"]
maxResults number No Maximum number of results to return (1–200). Default: 50. 50
location string No Filter by location (e.g., "remote", city name). "remote"
fullTime boolean No Filter for full-time positions only. true

Supported Sources

RemoteOk Arbeitnow

Response Format

200 OK - Success Response
{
  "totalResults": 23,
  "sourcesSearched": ["RemoteOk", "Arbeitnow"],
  "signals": [
    {
      "company": "TechFlow Solutions",
      "jobTitle": "Senior Full Stack Developer",
      "location": "Remote (Global)",
      "source": "RemoteOk",
      "matchedKeywords": ["React", "TypeScript", "Node.js", "AWS"],
      "detectedTechStack": ["React", "TypeScript", "Node.js", "PostgreSQL", "AWS", "Docker"],
      "postingUrl": "https://remoteok.com/techflow-solutions-senior-full-stack",
      "postedDate": "2024-01-14"
    },
    {
      "company": "DataSphere Inc",
      "jobTitle": "Backend Engineer - Platform Team",
      "location": "Remote (EU)",
      "source": "Arbeitnow",
      "matchedKeywords": ["AWS"],
      "detectedTechStack": ["Python", "FastAPI", "Kubernetes", "Redis", "AWS Lambda"],
      "postingUrl": "https://arbeitnow.com/datasphere-backend-engineer",
      "postedDate": "2024-01-13"
    }
  ],
  "scanDurationMs": 2340.15
}

Response Fields

Field Type Description
totalResults number Number of matching results returned (capped by maxResults)
sourcesSearched string[] List of job board sources that were searched
signals[].company string Name of the hiring company
signals[].jobTitle string Job title/position
signals[].location string | null Job location (remote, on-site, hybrid). May be null if not specified.
signals[].source string Which job board this posting came from
signals[].matchedKeywords string[] Keywords from your query that matched this job
signals[].detectedTechStack string[] All detected technologies in the job posting
signals[].postingUrl string | null Direct link to the job posting. May be null if unavailable.
signals[].postedDate string | null When the job was posted. May be null if unavailable.
scanDurationMs number Scan duration in milliseconds (rounded to 2 decimal places)

Use Cases

Sales Prospecting

Identify companies actively hiring for technologies your product integrates with. Reach out while they're investing in relevant tech stacks.

Keywords: ["Salesforce", "HubSpot", "CRM"]

Market Research

Track technology adoption trends across industries. Monitor which companies are migrating to new frameworks or cloud platforms.

Keywords: ["Kubernetes", "Terraform", "DevOps"]

Competitive Intelligence

Monitor competitor hiring patterns to understand their product direction and technology investments before they announce them.

Keywords: ["AI/ML", "LLM", "OpenAI"]

Build Constraints

HttpClient-based Scraping

Uses .NET HttpClient for efficient HTTP requests to public job board APIs. Implements connection pooling and request pipelining for optimal performance.

~100 Tech Keywords with Alias Resolution

Built-in dictionary maps common aliases and variations (e.g., "JS" → "JavaScript", "K8s" → "Kubernetes"). Expands search coverage without requiring users to specify every variant.

Word-Boundary Regex Matching

Uses sophisticated regex patterns with word boundaries to prevent false positives. For example, searching for "Go" won't match words like "Google" or "Golang" unless explicitly specified.

Parallel Source Fetching with Deduplication

Concurrently fetches from all configured sources using Task.WhenAll. Deduplicates on case-insensitive company name + job title to eliminate duplicate postings across platforms.

Error Codes

Code Status Description Resolution
400 Bad Request Invalid request parameters Ensure keywords is a non-empty array of non-empty strings and maxResults is between 1 and 200
401 Unauthorized Missing or invalid API key Include a valid X-RapidAPI-Proxy-Secret header
429 Rate Limited Too many requests Wait before retrying or upgrade your plan
500 Server Error Internal server error Retry the request; contact support if persistent
503 Service Unavailable Job board source unavailable Retry with different sources or try again later

Ready to start scraping leads?

Get your API key and start discovering high-intent B2B leads in minutes. Free tier includes 100 requests per month.