
Mostafa Mahmoud

Full-Stack Developer · Data Engineering Specialist · 7 Years Building Things That Work

Most scrapers don't crash — they silently ship bad data for days.
I build the systems that catch them, and teach others how to do the same.

LinkedIn · YouTube · Website · Email


About Me

I'm a full-stack developer who specializes in data engineering and web scraping infrastructure. Over 7 years, I've built everything from real-time scraping pipelines that handle millions of pages to the front-end dashboards that make the data useful. I've reverse-engineered anti-bot systems, designed schema validation engines, and deployed scraping infrastructure across AWS — and I've learned that the hardest part is never the extraction. It's knowing when your data silently went wrong.

That's why I started building tools and writing about scraper reliability — because the ecosystem has plenty of extraction frameworks and almost nothing for correctness.

When I'm not building production systems, I run Scrapem — an educational platform where I break down data engineering and web scraping concepts for developers who want to go beyond tutorials.


🔨 What I'm Building

ScraperGuard: a production-grade observability layer for web scraping pipelines. It detects when scrapers silently break and explains why (schema drift, DOM restructuring, selector failures, CAPTCHA injection) before bad data reaches downstream systems.

Python · Pydantic · FastAPI · lxml · SQLite · Docker

212 tests · 8 failure classifiers · 4-component health scoring
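
To make "schema drift" concrete, here is a minimal sketch of the kind of per-record check such a layer might run. It is an illustration only, not ScraperGuard's actual API: `EXPECTED_SCHEMA`, `DriftReport`, and `check_record` are hypothetical names, and it uses only the standard library rather than Pydantic.

```python
from dataclasses import dataclass, field

# Hypothetical expected schema: field name -> required Python type.
EXPECTED_SCHEMA = {"title": str, "price": float, "url": str}


@dataclass
class DriftReport:
    """Collects the ways one scraped record deviates from the schema."""
    missing: list = field(default_factory=list)
    wrong_type: list = field(default_factory=list)
    unexpected: list = field(default_factory=list)

    @property
    def ok(self) -> bool:
        # A record is healthy only if no deviation was recorded.
        return not (self.missing or self.wrong_type or self.unexpected)


def check_record(record: dict, schema: dict = EXPECTED_SCHEMA) -> DriftReport:
    """Compare one scraped record against the expected schema."""
    report = DriftReport()
    for name, expected_type in schema.items():
        value = record.get(name)
        if value is None or value == "":
            # An absent or empty field often means a selector stopped matching.
            report.missing.append(name)
        elif not isinstance(value, expected_type):
            report.wrong_type.append(name)
    # Fields the schema does not know about can signal a page restructure.
    report.unexpected = sorted(set(record) - set(schema))
    return report


if __name__ == "__main__":
    print(check_record({"title": "", "price": "9.99", "sku": "x"}))
```

The scraper in this example "runs fine" in the sense that it returns a record, yet the report flags an empty title, a price that arrived as a string, a missing URL, and an unknown field. A real pipeline would aggregate such reports across a run before raising an alert.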

Scrapem: an educational platform for data engineering and web scraping. It offers structured learning tracks, real-world project breakdowns, and deep-dive technical resources, built for developers who want to understand the infrastructure, not just the code.

Astro · Technical Writing · SEO · Content Systems

Projects · Resources · Learning Tracks


⚡ Tech Stack

Languages

Python · JavaScript · TypeScript · Go · Rust · PHP

Data & Scraping

Scrapy · Playwright · Selenium · Puppeteer · Pandas · NumPy · scikit-learn

Frontend

React · Vue.js · Svelte · Next.js · Astro · Tailwind CSS

Backend & Databases

Node.js · Django · FastAPI · Flask · Express · PostgreSQL · MongoDB · Redis · MySQL

Infrastructure & Tools

Docker · AWS · GitHub Actions · Nginx · Linux · Electron · Tauri


📝 Latest from Scrapem

➡️ More articles on Scrapem.com


If your scraper runs fine but your data is wrong — ScraperGuard was built for you.
