B2B Lead Signal Scraper
POSTScrape public job boards for tech stack mentions and discover high-intent B2B leads
/api/lead-signals
Value Proposition
Scrapes public job boards (RemoteOK, Arbeitnow) for tech stack mentions. Returns matching jobs with company, title, detected tech stack, and keyword matches. Perfect for sales teams looking to identify companies actively hiring for specific technologies.
What This API Does
The B2B Lead Signal Scraper monitors public job boards in real-time to identify companies actively hiring for specific technologies. By analyzing job postings from RemoteOK and Arbeitnow, it extracts valuable buying signals that indicate which companies are investing in new tech stacks.
Key Features
- Multi-source scraping — Aggregates jobs from RemoteOK and Arbeitnow simultaneously
- Tech stack detection — Automatically identifies 100+ technologies with alias resolution
- Smart deduplication — Eliminates duplicate postings across sources
- Word-boundary matching — Precise regex matching prevents false positives
- Parallel fetching — Fast concurrent requests to all sources
Code Examples
curl -X POST "https://api.atomicapis.dev/api/lead-signals" \
-H "X-RapidAPI-Proxy-Secret: YOUR_SECRET" \
-H "Content-Type: application/json" \
-d '{
"keywords": ["React", "TypeScript", "Node.js", "AWS"],
"sources": ["RemoteOk", "Arbeitnow"],
"maxResults": 50,
"location": "remote",
"fullTime": true
}'
Request Parameters
| Name | Type | Required | Description | Example |
|---|---|---|---|---|
keywords |
string[] | Yes | Tech keywords to search for in job postings. Supports 100+ technologies with alias resolution. | ["React", "AWS"] |
sources |
string[] | No | Job board sources to scrape. Defaults to all available sources. | ["RemoteOk", "Arbeitnow"] |
maxResults |
number | No | Maximum number of results to return (1–200). Default: 50. | 50 |
location |
string | No | Filter by location (e.g., "remote", city name). | "remote" |
fullTime |
boolean | No | Filter for full-time positions only. | true |
Supported Sources
Response Format
{
"totalResults": 23,
"sourcesSearched": ["RemoteOk", "Arbeitnow"],
"signals": [
{
"company": "TechFlow Solutions",
"jobTitle": "Senior Full Stack Developer",
"location": "Remote (Global)",
"source": "RemoteOk",
"matchedKeywords": ["React", "TypeScript", "Node.js", "AWS"],
"detectedTechStack": ["React", "TypeScript", "Node.js", "PostgreSQL", "AWS", "Docker"],
"postingUrl": "https://remoteok.com/techflow-solutions-senior-full-stack",
"postedDate": "2024-01-14"
},
{
"company": "DataSphere Inc",
"jobTitle": "Backend Engineer - Platform Team",
"location": "Remote (EU)",
"source": "Arbeitnow",
"matchedKeywords": ["AWS"],
"detectedTechStack": ["Python", "FastAPI", "Kubernetes", "Redis", "AWS Lambda"],
"postingUrl": "https://arbeitnow.com/datasphere-backend-engineer",
"postedDate": "2024-01-13"
}
],
"scanDurationMs": 2340.15
}
Response Fields
| Field | Type | Description |
|---|---|---|
totalResults |
number | Number of matching results returned (capped by maxResults) |
sourcesSearched |
string[] | List of job board sources that were searched |
signals[].company |
string | Name of the hiring company |
signals[].jobTitle |
string | Job title/position |
signals[].location |
string | null | Job location (remote, on-site, hybrid). May be null if not specified. |
signals[].source |
string | Which job board this posting came from |
signals[].matchedKeywords |
string[] | Keywords from your query that matched this job |
signals[].detectedTechStack |
string[] | All detected technologies in the job posting |
signals[].postingUrl |
string | null | Direct link to the job posting. May be null if unavailable. |
signals[].postedDate |
string | null | When the job was posted. May be null if unavailable. |
scanDurationMs |
number | Scan duration in milliseconds (rounded to 2 decimal places) |
Use Cases
Sales Prospecting
Identify companies actively hiring for technologies your product integrates with. Reach out while they're investing in relevant tech stacks.
Keywords: ["Salesforce", "HubSpot", "CRM"]
Market Research
Track technology adoption trends across industries. Monitor which companies are migrating to new frameworks or cloud platforms.
Keywords: ["Kubernetes", "Terraform", "DevOps"]
Competitive Intelligence
Monitor competitor hiring patterns to understand their product direction and technology investments before they announce them.
Keywords: ["AI/ML", "LLM", "OpenAI"]
Build Constraints
HttpClient-based Scraping
Uses .NET HttpClient for efficient HTTP requests to public job board APIs. Implements connection pooling and request pipelining for optimal performance.
~100 Tech Keywords with Alias Resolution
Built-in dictionary maps common aliases and variations (e.g., "JS" → "JavaScript", "K8s" → "Kubernetes"). Expands search coverage without requiring users to specify every variant.
Word-Boundary Regex Matching
Uses sophisticated regex patterns with word boundaries to prevent false positives. For example, searching for "Go" won't match words like "Google" or "Golang" unless explicitly specified.
Parallel Source Fetching with Deduplication
Concurrently fetches from all configured sources using Task.WhenAll. Deduplicates on case-insensitive company name + job title to eliminate duplicate postings across platforms.
Error Codes
| Code | Status | Description | Resolution |
|---|---|---|---|
400 |
Bad Request | Invalid request parameters | Ensure keywords is a non-empty array of non-empty strings and maxResults is between 1 and 200 |
401 |
Unauthorized | Missing or invalid API key | Include a valid X-RapidAPI-Proxy-Secret header |
429 |
Rate Limited | Too many requests | Wait before retrying or upgrade your plan |
500 |
Server Error | Internal server error | Retry the request; contact support if persistent |
503 |
Service Unavailable | Job board source unavailable | Retry with different sources or try again later |
Ready to start scraping leads?
Get your API key and start discovering high-intent B2B leads in minutes. Free tier includes 100 requests per month.