Web Scraping Blog
You'll find it easier to scrape any website with our step-by-step tutorials from beginner to pro.
Bypass Cloudflare With Scrapy: Tutorial 2024
Find out how to bypass Cloudflare with Scrapy. Bypass Cloudflare's systems and get the data you need with the right middleware.
July 18, 2024
· 8 min read
Set User Agent in Axios: Step-by-Step Guide
Learn why the User Agent in Axios is getting your scraper blocked, how to change it and the steps to randomize it at scale.
July 17, 2024
· 5 min read
How to Use Selenium in NodeJS (Tutorial 2024)
Learn how to use Selenium with NodeJS for in this step-by-step tutorial. You'll master browser automation for multiple purposes.
July 16, 2024
· 8 min read
How to Patch Puppeteer Stealth to Improve Its Anti-bot Bypass Power
Learn the ins and outs of Puppeteer Stealth evasions and discover how to give your scraper an extra anti-bot boost.
July 15, 2024
· 8 min read
How to Set Up a Proxy in AIOHTTP
Learn how to set up AIOHTTP proxies: add a proxy to your script, build a proxy rotator, and discover how to use premium proxies to avoid blocks and bans.
July 11, 2024
· 7 min read
How to Use Curl_cffi for Web Scraping
Learn how to use curl_cffi for web scraping, from the basics to more advanced techniques such as concurrent requests and session handling.
July 10, 2024
· 7 min read
How to Bypass CAPTCHA With Selenium in Ruby
Learn how to bypass CAPTCHA with Selenium in Ruby by using Undetected ChromeDriver or a web scraping API.
July 9, 2024
· 7 min read
Nokogiri Tutorial: Best HTML Parser for Ruby
Learn how to efficiently parse HTML with Ruby and Nokogiri, from extracting data all the way to CSV export.
July 8, 2024
· 8 min read
How to Use Ferret for Web Scraping: Tutorial [2024]
Learn how to take advantage of Golang's scraping power without writing a Go program and use Ferret for web scraping.
July 4, 2024
· 10 min read
How to Set Up a Proxy With MechanicalSoup
Learn how to add a proxy to MechanicalSoup. Build a proxy rotator and discover how to use premium proxies to avoid all blocks and bans.
July 3, 2024
· 8 min read
Playwright in Golang for Web Scraping [Tutorial 2024]
Learn how to use Playwright with Go to scrape the web and successfully bypass all blocks and bans.
July 2, 2024
· 9 min read
How to Solve Jsoup 403 Forbidden Error
Learn how to avoid jsoup 403 forbidden error with custom User Agent, rotating proxies, and a web scraping API.
July 1, 2024
· 8 min read
Web Scraping With PHP vs. Python: Which to Choose?
Should you choose Python or PHP for web scraping? In this article, we’ll compare the two across factors such as popularity, speed, data analysis capabilities, and more.
June 28, 2024
· 17 min read
How to Use Nodriver for Web Scraping
Learn how to scrape the web with nodriver: extract HTML, deal with dynamic content, and avoid all blocks and bans.
June 26, 2024
· 10 min read
How to Use a Proxy With OkHttp [Tutorial 2024]
Learn how to use a single proxy, rotate proxies from a pool, or take advantage of premium proxies while web scraping with OkHttp.
June 25, 2024
· 8 min read
Playwright in C# for Web Scraping: Step-by-Step Tutorial
Learn to automate web scraping using Playwright in C# with this detailed guide, covering installation, setup, and data extraction.
June 24, 2024
· 12 min read
Axios vs. Got vs. Fetch: Which Should You Choose
Should you use Axios, Got, or Fetch API for your project? This article compares the three tools based on factors such as performance, popularity, and browser support.
June 21, 2024
· 10 min read
How to Bypass Akamai With Playwright
Learn how to bypass Akamai when web scraping with Playwright with three methods: stealth plugin, web scraping API, and premium proxies.
June 20, 2024
· 8 min read