<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Vuong Phan - Web Scraping &amp; Automation Guides</title>
    <link>https://vuongphan.dev/blog</link>
    <description>Practical guides on web scraping, anti-bot bypass, proxies, and automation.</description>
    <language>en-US</language>
    <atom:link href="https://vuongphan.dev/rss.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>How to Integrate Rotating Proxies for Web Scraping (Without Getting Blocked)</title>
      <link>https://vuongphan.dev/blog/rotating-proxies-for-web-scraping</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/rotating-proxies-for-web-scraping</guid>
      <description>A practical guide to integrating residential and rotating proxies into a Python scraper: proxy types, rotation strategies, retry logic, and how to avoid IP bans on protected sites.</description>
      <pubDate>Fri, 12 Jun 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>How to Scrape Cloudflare-Protected Sites in 2026 (A Practical Approach)</title>
      <link>https://vuongphan.dev/blog/bypass-cloudflare-web-scraping</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/bypass-cloudflare-web-scraping</guid>
      <description>What Cloudflare actually checks, why most scrapers fail against it, and the layered approach of stealth browsers, fingerprinting, and residential proxies that reliably gets through.</description>
      <pubDate>Wed, 10 Jun 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Solving CAPTCHAs in Your Scraper with 2Captcha and CapSolver</title>
      <link>https://vuongphan.dev/blog/solving-captchas-2captcha-capsolver</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/solving-captchas-2captcha-capsolver</guid>
      <description>A practical guide to integrating CAPTCHA solving services into a Python scraper. Covers reCAPTCHA v2 and v3, hCaptcha, Cloudflare Turnstile, token injection, and cost control.</description>
      <pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Building a Large-Scale Web Scraper with Scrapy</title>
      <link>https://vuongphan.dev/blog/scrapy-large-scale-scraping</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/scrapy-large-scale-scraping</guid>
      <description>How to use Scrapy for production scraping at scale. Covers spiders, item pipelines, concurrency tuning, proxy and retry middleware, and exporting to databases.</description>
      <pubDate>Fri, 05 Jun 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Playwright vs Puppeteer vs Selenium for Web Scraping in 2026</title>
      <link>https://vuongphan.dev/blog/playwright-vs-puppeteer-vs-selenium</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/playwright-vs-puppeteer-vs-selenium</guid>
      <description>A practical comparison of the three main browser automation tools for scraping. Covers speed, stealth, language support, and which one to choose for your project.</description>
      <pubDate>Tue, 02 Jun 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>How to Scrape Sites Protected by DataDome and PerimeterX</title>
      <link>https://vuongphan.dev/blog/bypass-datadome-perimeterx</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/bypass-datadome-perimeterx</guid>
      <description>What DataDome and PerimeterX detect, why they are harder than basic WAFs, and the layered approach of stealth browsers, residential proxies, and session management that gets through.</description>
      <pubDate>Thu, 28 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>How to Scrape Amazon Product Data Reliably</title>
      <link>https://vuongphan.dev/blog/scrape-amazon-product-data</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/scrape-amazon-product-data</guid>
      <description>A practical guide to scraping Amazon product listings, prices, and reviews at scale. Covers selectors, anti-bot handling, the official API alternative, and staying reliable.</description>
      <pubDate>Sun, 24 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Residential Proxy Services Compared: Bright Data, Oxylabs, Smartproxy</title>
      <link>https://vuongphan.dev/blog/residential-proxy-services-compared</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/residential-proxy-services-compared</guid>
      <description>A practical comparison of the major residential proxy providers for web scraping. Covers pricing models, pool quality, geo-targeting, and how to choose for your project.</description>
      <pubDate>Wed, 20 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Automating Scraping Workflows with n8n, Make, and Zapier</title>
      <link>https://vuongphan.dev/blog/no-code-scraping-automation-n8n</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/no-code-scraping-automation-n8n</guid>
      <description>How to connect a scraper to no-code automation tools so data flows into your business systems automatically. Covers webhooks, scheduling, and when to add custom code.</description>
      <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Scraping Google Search Results: SerpAPI vs Building Your Own</title>
      <link>https://vuongphan.dev/blog/scrape-google-search-results-serpapi</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/scrape-google-search-results-serpapi</guid>
      <description>How to extract Google search data for SEO and research. Compares SERP API services with a custom scraper, covering cost, reliability, and when each makes sense.</description>
      <pubDate>Sun, 10 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Running Scrapers in Production: Scheduling, Queues, and Monitoring</title>
      <link>https://vuongphan.dev/blog/scheduling-monitoring-scrapers-production</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/scheduling-monitoring-scrapers-production</guid>
      <description>How to take a scraper from a script to a reliable production system. Covers scheduling, task queues, retries, error alerting, and proxy health monitoring.</description>
      <pubDate>Tue, 05 May 2026 00:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Reverse Engineering Private APIs for Faster, Cleaner Scraping</title>
      <link>https://vuongphan.dev/blog/reverse-engineering-private-apis</link>
      <guid isPermaLink="true">https://vuongphan.dev/blog/reverse-engineering-private-apis</guid>
      <description>How to find and use a site&apos;s internal API instead of scraping HTML. Covers inspecting network traffic, replicating requests, handling auth, and why it beats browser scraping.</description>
      <pubDate>Thu, 30 Apr 2026 00:00:00 GMT</pubDate>
    </item>
  </channel>
</rss>