How to Use a Proxy With Httpx

June 3, 2024 · 8 min read

Table of contents

How to set a poxy with httpx in Python?
- Add a proxy
- Proxy authentication
- Rotate proxies
Premium Proxy to Avoid Getting Blocked
Conclusion

Configuring httpx with proxies can make a huge difference in how websites treat your scraping requests. Want to know how that works?

We'll show you how to configure a proxy in httpx to reduce your likelihood of getting blocked. This tutorial covers basic configuration and authentication handling.

How to Set Your Proxy With Httpx in Python

A httpx proxy acts as a link between your scraper and the target server. To set a proxy in httpx, you use the proxies\ parameter when making a request. This parameter lets you include your proxy configurations as URL query parameters in the request.

Let's put it into practice.

First, here's a basic httpx script to which you can add proxy configuration.

                    scraper.py
                
import httpx
 
r = httpx.get("https://httpbin.io/ip")
print(r.text)

Copied!

The above code snippet opens HTTPbin, a test website that returns the client's IP address, and prints its content. Since no proxy settings were added, it will return your machine's IP address.

Step 1: Add a Proxy in Httpx

Note

This tutorial uses a free proxy from the Free Proxy List. So, it may no longer work at the time of reading. Feel free to switch to a new one.

To begin, define your proxy settings with this format: <PROXY_PROTOCOL>://<PROXY_IP_ADDRESS>:<PROXY_PORT>.

                    scraper.py
                
import httpx
 
# define your proxy settings
proxies = {
    "http://": "http://216.137.184.253:80",
    "https://": "http://216.137.184.253:80"
}

Copied!

The code snippet above defines separate proxy URLs for HTTP and HTTPS connections. This is essential to tailor the proxy configuration to each protocol's specific requirements and security considerations, ensuring better performance.

Next, include the proxy configuration in your request using the `proxies` parameter. Then, print the text content to verify your code works.

Putting everything together, you'll have the following complete code.

                    scraper.js
                
import httpx
 
# define your proxy settings
proxies = {
    "http://": "http://216.137.184.253:80",
    "https://": "http://216.137.184.253:80"
}
 
# make a request with the specified proxy
r = httpx.get("https://httpbin.io/ip", proxies=proxies)
 
print(r.text)

Copied!

Run it, and you'll get your proxy's IP address.

                    Output
                
{
  "origin": "216.137.184.253:40335"
}

Copied!

Awesome!

Httpx also provides the httpx.Client constructor that allows you to specify a proxy URL directly as an argument. You only need to create a Client and pass the proxy to the Client, like in the example below.

                    scraper.py
                
import httpx
 
# create a client with the specified proxy
with httpx.Client(proxy="http://216.137.184.253:80") as client:
    # make requests using the client
    r = client.get("https://httpbin.io/ip")
 
print(r.text)

Copied!

This will yield the same result as the one above.

However, you should know that free proxies are only suitable for testing since they're unreliable and easily detected by websites. In real-world use cases, you'll need premium web scraping proxies. These proxies often require additional configuration because you must include the necessary credentials in your request.

Let's see how to authenticate a httpx proxy.

Premium residential proxies to avoid getting blocked.

Access all the data you need with ZenRows' residential proxy network.

Try for Free

Step 2: Proxy Authentication With Httpx: Username and Password

Proxy authentication is necessary when the proxy server requires additional information, such as username and password, to allow access. This is common in corporate environments or when using premium proxy services.

To authenticate your httpx proxy, define your proxy settings using the following format: <PROXY_PROTOCOL>://<YOUR_USERNAME>:<YOUR_PASSWORD>@<PROXY_IP_ADDRESS>:<PROXY_PORT>

Here's how to modify the previous code to authenticate your proxy.

                    scraper.py
                
import httpx
 
# define your proxy settings
proxy_url = "http://<YOUR_USERNAME>:<YOUR_PASSWORD>@216.137.184.253:80"
 
# create a client with the specified proxy and credentials
with httpx.Client(proxy=proxy_url) as client:
    # make requests using the client
    r = client.get("https://httpbin.io/ip")
 
print(r.text)

Copied!

Step 3: Rotate Proxies With Httpx

Scraping at scale often requires rotating between multiple proxies to avoid rate limiting, throttling, or IP bans. Websites often implement restrictions on the number of requests allowed per time frame, and exceeding this limit can result in getting blocked.

By rotating proxies, you distribute your requests across different IP addresses, making it appear as if they originate from various locations or devices.

To rotate proxies with httpx, maintain a pool of proxy URLs and dynamically select a different proxy for each request.

Let's put this into practice.

Import the necessary module (random) and define your proxy pool or list to start. For this exercise, you can grab a few proxies from the Free Proxy List.

                    scraper.py
                
# import the necessary libraries
import httpx
import random
 
# define your proxy list
proxy_urls = [
    "http://20.210.113.32:8123",
    "http://47.56.110.204:8989",
    "http://50.174.214.216:80",
    # add more proxy URLs as needed
]

  
  

  
Copied!

Next, select a proxy at random from the list using the random.choice() method, a function provided by the Python random module. Then, make your request using the selected proxy.

                    scraper.py
                
# select a random proxy URL
random_proxy = random.choice(proxy_urls)
 
# make a request using the selected proxy
with httpx.Client(proxy=random_proxy) as client:
    r = client.get("https://httpbin.io/ip")
    print(r.text)

Copied!

Putting everything together, your complete code should look like this:

                    scraper.py
                
# import the necessary libraries
import httpx
import random
 
# define your proxy list
proxy_urls = [
    "http://20.210.113.32:8123",
    "http://47.56.110.204:8989",
    "http://50.174.214.216:80",
    # add more proxy URLs as needed
]
 
# select a random proxy URL
random_proxy = random.choice(proxy_urls)
 
# make a request using the selected proxy
with httpx.Client(proxy=random_proxy) as client:
    r = client.get("https://httpbin.io/ip")
    print(r.text)

  
  

  
Copied!

To verify it works, make multiple requests. You should get a different IP address each time. Here are the results for two requests:

                    Output
                
{
  "origin": "20.210.113.32:8888"
}
 
{
  "origin": "47.56.110.204:3128"
}

Copied!

Nice job!

Premium Proxy to Avoid Getting Blocked

Free proxies come with significant limitations for web scraping. Their inconsistent performance, security vulnerabilities, and compromised IP reputation make them unreliable for professional applications. Most target websites can easily detect and block these free proxies.

Premium proxies offer a more reliable solution for avoiding detection. By utilizing residential IPs from actual users, premium proxies can seamlessly mimic genuine user traffic. Features like automatic IP rotation and geographic targeting capabilities make them particularly effective for web scraping.

ZenRows' Residential Proxies is one of the best premium proxy services, providing access to more than 55M+ residential IPs distributed across 185+ countries. It includes powerful features like dynamic IP rotation, intelligent proxy selection, and flexible geo-targeting, all backed by 99.9% network uptime. This makes it an excellent choice for reliable scraping with httpx.

Let's integrate ZenRows' Residential Proxies with httpx.

First, sign up, and you'll be redirected to the Proxy Generator dashboard. Your proxy credentials will be generated automatically.

generate residential proxies with zenrows — Click to open the image in full screen

Copy your proxy credentials (username and password) and replace the placeholders in the following code:

                    scraper.py
                
import httpx

# define your proxy settings
proxy_url = "http://<ZENROWS_PROXY_USERNAME>:<ZENROWS_PROXY_PASSWORD>@superproxy.zenrows.com:1337"

# create a client with the specified proxy and credentials
with httpx.Client(proxy=proxy_url) as client:
    # make requests using the client
    r = client.get("https://httpbin.io/ip")

print(r.text)

Copied!

When you run this code multiple times, you'll see output similar to this:

                    Output
                
# request 1
{
  "origin": "178.62.45.183:39754"
}
# request 2
{
    "origin": "178.62.45.183:39754"
}

  
  

  
Copied!

Excellent! The above output confirms that your httpx requests are successfully routed through ZenRows' residential proxy network. Your HTTP client is now using premium proxies that significantly reduce the risk of being blocked during web scraping.

Conclusion

Setting a httpx proxy in Python can help you route your requests through a different IP address and avoid IP bans. However, it's important to remember that proxies aren't foolproof. Even premium proxies can be blocked by advanced anti-bot systems.

For guaranteed web scraping results, give ZenRows a try today.