Curated by Lead Orchestra β https://leadorchestra.com
A fully-curated, SEO-optimized list of the best tools, frameworks, APIs, workflows, and services for
B2B lead scraping, enrichment, automation, and CRM-ready data pipelines β maintained by Lead Orchestra.
Lead Orchestra is a complete B2B lead scraping & automation platform that orchestrates:
- Web scraping at scale
- Undetectable browser automation
- Data enrichment (email, company, social, intent)
- Lead verification & deduplication
- n8n / Make.com automation workflows
- CRM export (HubSpot, Salesforce, Pipedrive, GoHighLevel, Deal Scale)
Learn more β https://leadorchestra.com
This GitHub repository supports the project by offering the best-in-class curated tools used in modern lead generation pipelines.
- Web Scraping Frameworks
- Headless Browser & Automation Tools
- B2B Lead Enrichment APIs
- Email Verification Services
- Proxy & Anti-Bot Providers
- n8n Workflows & Automation Nodes
- Example B2B Lead Pipeline
- Contributing
- License
- SEO Keywords & Topics
High-performance, scalable frameworks for scraping B2B data:
- Scrapy β https://scrapy.org
Fast, battle-tested crawling framework for large-scale scraping. - BeautifulSoup β https://www.crummy.com/software/BeautifulSoup/
HTML/XML parsing helper for quick extraction.
- Crawlee β https://crawlee.dev
Production-grade scraping framework from Apify. - Cheerio β https://cheerio.js.org
Fast HTML parsing for Node.js scraping tasks.
- Octoparse β https://octoparse.com
Visual scraper for non-developers; supports JS-rendered sites. - ParseHub β https://parsehub.com
Good for static and semi-dynamic websites.
Use these for undetectable scraping, dynamic content, infinite scroll, and JS-heavy websites.
- Playwright β https://playwright.dev
Multi-browser (Chromium, WebKit, Firefox) automation, best anti-bot resistance. - Puppeteer β https://pptr.dev
Chrome-only automation for scraping & testing. - Selenium β https://www.selenium.dev
Classic browser automation, supports multiple languages. - Apify Actors β https://apify.com
Cloud headless browser environment with rotation, retries, storage.
Turn raw scraped data into sales-ready enriched profiles.
- Clearbit β https://clearbit.com
Person + company enrichment, intent data, technographics. - Apollo.io API β https://apollo.io
Huge B2B contact database, enrichment, verified emails. - ZoomInfo β https://zoominfo.com
Enterprise-level B2B enrichment and intent data. - People Data Labs (PDL) β https://peopledatalabs.com
Massive dataset for people + company attributes. - Clay β https://clay.com
50+ enrichment sources in one API (or UI). Great for workflows. - FullContact β https://fullcontact.com
Person-level identity resolution & enrichment.
Ensure deliverability & reduce bounce rates.
- NeverBounce β https://neverbounce.com
- ZeroBounce β https://zerobounce.net
- Kickbox β https://kickbox.com
- Hunter Verify β https://hunter.io/email-verifier
Necessary for large-scale scraping without blocks.
- Bright Data β https://brightdata.com
Industry-leading residential & mobile proxies. - Oxylabs β https://oxylabs.io
Global network with SERP scraping tools. - ScraperAPI β https://scraperapi.com
Solves CAPTCHAs, rotates proxies automatically. - ScrapingBee β https://scrapingbee.com
API for JS rendering + proxies + browser automation. - Zyte Smart Proxy Manager β https://zyte.com
Ready-to-use n8n workflow templates for B2B lead automation, sourced from awesome-n8n-templates.
- Auto-label incoming Gmail messages with AI β Automatically labels incoming Gmail messages using AI. Retrieves message content, suggests labels like Partnership or Inquiry, and assigns them for better organization. Template
- Compose reply draft in Gmail with OpenAI Assistant β Generates draft replies in Gmail using OpenAI. Triggers on new emails, extracts content, and creates a suggested reply draft. Template
- Analyze & Sort Suspicious Email Contents with ChatGPT β Analyzes suspicious emails using ChatGPT, classifies them, and can generate screenshots for review. Template
- A Very Simple "Human in the Loop" Email Response System Using AI and IMAP β Implements a workflow for human-in-the-loop email responses. Uses IMAP to fetch emails, summarizes content with AI, and drafts professional replies for review. Template
- Auto Categorise Outlook Emails with AI β Automatically categorizes Outlook emails using AI models. Moves messages to folders and assigns categories based on content. Template
- Qualify new leads in Google Sheets via OpenAI's GPT-4 β Uses OpenAI's GPT-4 to analyze and qualify new leads entered into a Google Sheet, helping sales teams prioritize their outreach. Template
- Chat with a Google Sheet using AI β Allows users to interact with and query data within a Google Sheet using natural language via an AI model. Template
- Summarize Google Sheets form feedback via OpenAI's GPT-4 β Summarizes feedback collected through Google Forms and stored in Google Sheets using OpenAI's GPT-4. Template
- Summarize the New Documents from Google Drive and Save Summary in Google Sheet β Monitors Google Drive for new documents, summarizes their content using AI, and saves these summaries into a Google Sheet. Template
- Chat with Postgresql Database β Enables an AI assistant to chat with a PostgreSQL database, allowing users to query and retrieve data using natural language. Template
- Generate SQL queries from schema only - AI-powered β Uses AI to generate SQL queries based on a given database schema. Template
- Talk to your SQLite database with a LangChain AI Agent β Allows users to interact with a SQLite database using a LangChain AI agent. Template
- AI-Driven Lead Management and Inquiry Automation with ERPNext & n8n β Lead management automation workflow. Template
- AI Data Extraction with Dynamic Prompts and Airtable β AI-driven data extraction with Airtable integration. Template
- AI-Powered Email Automation for Business: Summarize & Respond with RAG β Email automation with summarization and response capabilities. Template
- AI agent that can scrape webpages β AI agent for web scraping tasks. Template
- AI Agent to chat with Airtable and analyze data β Creates an AI agent that can chat with Airtable, analyze data, and perform queries based on user requests. Template
- Handling Job Application Submissions with AI and n8n Forms β Automates the handling of job application submissions by extracting information from resumes (PDFs) using AI. Template
- Conversational Interviews with AI Agents and n8n Forms β Implements AI-powered conversational interviews using n8n Forms for interactive data collection. Template
- Email Subscription Service with n8n Forms, Airtable and AI β Manages email subscriptions with n8n Forms, stores data in Airtable, and uses AI for processing. Template
- Qualifying Appointment Requests with AI & n8n Forms β Uses AI to qualify and process appointment requests submitted through n8n Forms. Template
- AI-Powered Information Monitoring with OpenAI, Google Sheets, Jina AI and Slack β Monitors RSS feeds, summarizes articles with OpenAI and Jina AI, classifies them, and sends formatted notifications to Slack. Template
- Customer Support Channel and Ticketing System with Slack and Linear β Automates customer support by querying Slack for messages with a ticket emoji, deciding if a new Linear ticket is needed. Template
- Enrich Pipedrive's Organization Data with OpenAI GPT-4o & Notify it in Slack β Enriches Pipedrive organization data by scraping website content, using OpenAI GPT-4o to generate a summary, and notifying a Slack channel. Template
- Ultimate Scraper Workflow for n8n β A comprehensive scraping workflow for n8n to extract data from various sources. Template
- Scrape and summarize webpages with AI β Scrapes and summarizes content from webpages using AI. Template
- Host Your Own AI Deep Research Agent with n8n, Apify and OpenAI o3 β Sets up a self-hosted AI deep research agent using n8n, Apify, and OpenAI. Template
- Automate Competitor Research with Exa.ai, Notion and AI Agents β Builds a competitor research agent using Exa.ai to find similar companies. AI agents then scour the internet for company overviews, product offerings, and customer reviews. Template
Essential community nodes for B2B lead automation, ranked by monthly downloads.
- n8n-nodes-serpapi (#10) β Connects to SerpApi API for search engine results. npm
- n8n-nodes-firecrawl-scraper (#14) β Firecrawl web scraper integration. npm
- @mendable/n8n-nodes-firecrawl (#20) β Firecrawl web crawling integration. npm
- n8n-nodes-browserless (#23) β Integrates with Browserless platform for headless browser automation. npm
- n8n-nodes-playwright (#27) β Integration with Playwright for browser automation. npm
- n8n-nodes-puppeteer (#46) β Automate browser actions using Puppeteer. npm
- n8n-nodes-webpage-content-extractor (#85) β Extract content from URLs, similar to browser 'Reader' mode. npm
- @brightdata/n8n-nodes-brightdata (#80) β Bright Data service for scraping purposes. npm
- n8n-nodes-evolution-api (#1) β WhatsApp channel hub integration. npm
- n8n-nodes-chatwoot (#7) β ChatWoot integration for customer support. npm
- n8n-nodes-imap (#33) β Connect to IMAP server and retrieve emails. npm
- n8n-nodes-waha (#62) β Connects to WAHA (WhatsApp HTTP API). npm
- n8n-nodes-mcp (#2) β Provides MCP (Model Context Protocol) nodes for n8n. npm
- n8n-nodes-elevenlabs (#3) β ElevenLabs AI voice generation integration. npm
- n8n-nodes-deepseek (#24) β DeepSeek AI node similar to OpenAI. npm
- @watzon/n8n-nodes-perplexity (#37) β Interact with the Perplexity AI API. npm
- @apify/n8n-nodes-apify (#11) β Connects to Apify API for web scraping and automation. npm
- n8n-nodes-linked-api (#22) β LinkedIn automation and data retrieval via Linked API. npm
- n8n-nodes-tavily (#30) β Tavily search engine for AI agents with real-time results. npm
- n8n-nodes-qdrant (#32) β Connects to Qdrant vector search engine for RAG workflows. npm
- n8n-nodes-close-crm (#88) β Close CRM integration for automating leads and opportunities. npm
- n8n-nodes-instantly (#83) β Connects to Instantly API v2 for email outreach. npm
- n8n-nodes-cronlytic (#13) β Advanced cron scheduling with Cronlytic integration. npm
- @splainez/n8n-nodes-phonenumber-parser (#18) β Parses phone numbers and returns information. npm
- n8n-nodes-globals (#25) β Create global constants for use across workflows. npm
- n8n-nodes-datastore (#47) β In-memory data storage within n8n workflows. npm
- n8n-nodes-pdfkit (#6) β Transforms images into PDF using PDFKit. npm
- n8n-nodes-pdforge (#38) β Automates PDF generation using AI with customizable templates. npm
- n8n-nodes-pdfco (#49) β Pdf.co integration for PDF processing. npm
- Awesome n8n Templates Repository β https://github.com/enescingoz/awesome-n8n-templates
- Top 100 n8n Community Nodes β https://github.com/pixeljets/awesome-n8n
- Official n8n Documentation β https://docs.n8n.io
- n8n Community Workflows β https://n8n.io/workflows
- Installing Community Nodes β https://docs.n8n.io/integrations/community-nodes/installation/
A real-world, production-ready pipeline:
1. Scrape β Playwright / Crawlee
2. Store Raw Data β n8n / DB / Sheets
3. Enrich Lead β Clearbit, Apollo, Clay
4. Verify Email β NeverBounce
5. Clean & Deduplicate β CRM Query / Hash Matching
6. Export to CRM β HubSpot / Salesforce / Pipedrive
7. Trigger Outreach β Deal Scale / GHL / Apollo
This is the exact architecture Lead Orchestra uses for daily B2B lead generation.
We welcome contributions:
- Fork this repo
- Add your tool/resource
- Submit a PR
- Follow formatting, keep quality high
See CONTRIBUTING.md for details.
MIT License β free to use and distribute.
This README is intentionally optimized for ranking in searches related to:
- b2b lead scraping
- web scraping tools
- lead enrichment APIs
- browser automation scraping
- scraping with Playwright
- n8n lead workflows
- CRM lead automation
- email finding & verification
- proxy rotation & anti-bot systems
- sales prospecting automation
- Lead Orchestra Website: https://leadorchestra.com
- GitHub Organization: https://github.com/Lead-Orchestra
- This Repository: https://github.com/Lead-Orchestra/awesome-b2b-leads
- GitHub Pages: https://lead-orchestra.github.io/awesome-b2b-leads/