Vuong Phan - Web Scraping & Automation Guides

https://vuongphan.dev/blog
Practical guides on web scraping, anti-bot bypass, proxies, and automation.
en-US

How to Integrate Rotating Proxies for Web Scraping (Without Getting Blocked)
https://vuongphan.dev/blog/rotating-proxies-for-web-scraping
A practical guide to integrating residential and rotating proxies into a Python scraper: proxy types, rotation strategies, retry logic, and how to avoid IP bans on protected sites.
Fri, 12 Jun 2026 00:00:00 GMT

How to Scrape Cloudflare-Protected Sites in 2026 (A Practical Approach)
https://vuongphan.dev/blog/bypass-cloudflare-web-scraping
What Cloudflare actually checks, why most scrapers fail against it, and the layered approach of stealth browsers, fingerprinting, and residential proxies that reliably gets through.
Wed, 10 Jun 2026 00:00:00 GMT

Solving CAPTCHAs in Your Scraper with 2Captcha and CapSolver
https://vuongphan.dev/blog/solving-captchas-2captcha-capsolver
A practical guide to integrating CAPTCHA solving services into a Python scraper. Covers reCAPTCHA v2 and v3, hCaptcha, Cloudflare Turnstile, token injection, and cost control.
Mon, 08 Jun 2026 00:00:00 GMT

Building a Large-Scale Web Scraper with Scrapy
https://vuongphan.dev/blog/scrapy-large-scale-scraping
How to use Scrapy for production scraping at scale. Covers spiders, item pipelines, concurrency tuning, proxy and retry middleware, and exporting to databases.
Fri, 05 Jun 2026 00:00:00 GMT

Playwright vs Puppeteer vs Selenium for Web Scraping in 2026
https://vuongphan.dev/blog/playwright-vs-puppeteer-vs-selenium
A practical comparison of the three main browser automation tools for scraping. Speed, stealth, language support, and which one to choose for your project.
Tue, 02 Jun 2026 00:00:00 GMT

How to Scrape Sites Protected by DataDome and PerimeterX
https://vuongphan.dev/blog/bypass-datadome-perimeterx
What DataDome and PerimeterX detect, why they are harder than basic WAFs, and the layered approach of stealth browsers, residential proxies, and session management that gets through.
Thu, 28 May 2026 00:00:00 GMT

How to Scrape Amazon Product Data Reliably
https://vuongphan.dev/blog/scrape-amazon-product-data
A practical guide to scraping Amazon product listings, prices, and reviews at scale. Covers selectors, anti-bot handling, the official API alternative, and staying reliable.
Sun, 24 May 2026 00:00:00 GMT

Residential Proxy Services Compared: Bright Data, Oxylabs, Smartproxy
https://vuongphan.dev/blog/residential-proxy-services-compared
A practical comparison of the major residential proxy providers for web scraping. Pricing models, pool quality, geo targeting, and how to choose for your project.
Wed, 20 May 2026 00:00:00 GMT

Automating Scraping Workflows with n8n, Make, and Zapier
https://vuongphan.dev/blog/no-code-scraping-automation-n8n
How to connect a scraper to no-code automation tools so data flows into your business systems automatically. Covers webhooks, scheduling, and when to add custom code.
Fri, 15 May 2026 00:00:00 GMT

Scraping Google Search Results: SerpAPI vs Building Your Own
https://vuongphan.dev/blog/scrape-google-search-results-serpapi
How to extract Google search data for SEO and research. Compares SERP API services with a custom scraper, covering cost, reliability, and when each makes sense.
Sun, 10 May 2026 00:00:00 GMT

Running Scrapers in Production: Scheduling, Queues, and Monitoring
https://vuongphan.dev/blog/scheduling-monitoring-scrapers-production
How to take a scraper from a script to a reliable production system. Covers scheduling, task queues, retries, error alerting, and proxy health monitoring.
Tue, 05 May 2026 00:00:00 GMT

Reverse Engineering Private APIs for Faster, Cleaner Scraping
https://vuongphan.dev/blog/reverse-engineering-private-apis
How to find and use a site's internal API instead of scraping HTML. Covers inspecting network traffic, replicating requests, handling auth, and why it beats browser scraping.
Thu, 30 Apr 2026 00:00:00 GMT