Proxy locations

Europe

North America

South America

Asia

Africa

Oceania

See all locations

Network statusCareers

Back to blog

Web Scraping: Another Block In The Wall | OxyCast #2

If you’ve ever tried web scraping, you should be aware of the blocking issue. It’s a common challenge, especially if you gather public data on a large scale without a decent knowledge of using resources wisely. This is why we decided to cover this topic and share our knowledge and tips & tricks on how to avoid getting blocked.

Iveta Vistorskyte

2022-02-22

2 min read

Web Scraping for Machine Learning

This tutorial explores a real-life scenario where web scraping and machine learning work in tandem and how the step-by-step process should look like if you decide to do it on your own.

Danielius Radavicius

2022-02-22

6 min read

How to Use Wget With Proxy

This article will walk you through the step-by-step process of installing and downloading files using Wget with or without proxies, covering multiple scenarios and showcasing practical examples.

Augustas Pelakauskas

2022-02-15

6 min read

What is MAP Monitoring?

Minimum Advertised Price (MAP) monitoring is an essential process for any brand, supplier or manufacturer that works with e-commerce marketplaces. Tracking MAP policy compliance plays an important role ensuring fair competition across different channels and protecting your brand’s reputation.

Adelina Kiskyte

2022-02-11

4 min read

The Importance of Having an Ethical Data Collection Policy

Read insights from an expert in the data collection field on implementing the ethical policy in your company.

Cornelius (Con) Conlon

2022-02-03

4 min read

OxyCast: A New Podcast on Everything Web Scraping Related

Web scraping is such a broad topic, and there’s a lot of things to learn in order to collect the required public data efficiently. That’s why our team decided to start a new podcast on everything web scraping related – OxyCast!

Iveta Vistorskyte

2022-01-20

2 min read

Dashboard Update: Maximizing User Experience

Now Oxylabs dashboard is empowered with a new convenient overview and navigation. Make sure to test it yourself!

Maryia Stsiopkina

2022-01-19

2 min read

Best Python Libraries for Web Scraping

This white paper will go through four most popular Python libraries and the basics on how to get started in web scraping.

Yelyzaveta Nechytailo

2022-01-12

1 min read

AI/ML in 2022: More Real-world Deployments, Focus on AI Ethics and Blockchain Hopes

Oxylabs' AI/ML Advisory Board members make their predictions on what's in store for artificial intelligence and machine learning in 2022.

Adomas Sulcas

2022-01-10

3 min read

Oxylabs Sues Bright Data In Patent Infringement Case

Oxylabs filed a lawsuit concerning Bright Data’s (formerly Luminati Networks Ltd.) infringement on Oxylabs’ Smart Proxy Rotator and web script management patents.

Adomas Sulcas

2022-01-10

1 min read

How to Hide My IP Address

This article aims to broaden one’s knowledge of what an IP address can reveal, and how to hide an IP address. Also, we discuss whether an online business hide their IP address.

Lukas Motiejunas

2022-01-02

6 min read

Web Scraping With PHP

This article will guide you through the step-by-step process of writing various PHP web scraping routines that can extract public data from static and dynamic web pages.

Augustas Pelakauskas

2021-12-30

12 min read

Introducing Oxy® Proxy Manager App

It’s a free proxy app that allows you to add, edit and manage your proxies from any proxy provider of your choice. Learn more and try it out now!

Iveta Vistorskyte

2021-12-28

2 min read

Building a Web Scraper in Golang

This article will guide you through the step-by-step process of writing a fast and efficient Golang web scraper that can extract public data from a target website.

Augustas Pelakauskas

2021-12-23

9 min read

How to Automate Competitors' & Benchmark Analysis With Python

The purpose of this article is to help you automate the data extraction processes as much as possible. After learning how to do this, you can dedicate your time to what matters: the analysis itself and coming up with actionable insights to strategize.

Daniel Heredia Mejias

2021-12-22

4 min read

Puppeteer on AWS Lambda

There are a few challenges when it comes to getting Puppeteer to work properly on AWS Lambda, and we’ll address all of them in this post.

Jordan Hansen

2021-12-22

2 min read

Comprehensive Guide on Data Collection

In this extensive white paper, we’ve gathered a variety of technical insights to help you begin web scraping with Python.

Yelyzaveta Nechytailo

2021-12-15

1 min read

E-Commerce Keyword Research: Data Collection Challenges and Solutions

E-commerce keyword research is at the core of every successful e-commerce business. Find out what data collection challenges you may face and how to overcome them.

Iveta Vistorskyte

2021-12-15

7 min read

What Is Data Mining?

Data mining is an advanced analysis of collected datasets. Learn more about data mining techniques, specifics, and benefits

Monika Maslauskaite

2021-12-02

5 min read

What is Browser Fingerprinting?

Browser fingerprinting is being used as a new avenue of tracking. How does it work? Is it possible to reduce the likelihood of being tracked? Read on to find out more.

Adomas Sulcas

2021-12-02

4 min read

Poor Quality Data Might Cost You Too Much

Read a message on the importance of data quality from Allen O’Neill, one of the industry’s leading experts.

Allen O'Neill

2021-12-01

5 min read

Search Engine Scraping: What You Should Know

Want to find out which data sources from search engines are the most beneficial? Did you know that scraping SERPs comes with challenges that can complicate data gathering processes? Read this article and find out everything you need to know about scraping search engines.

Iveta Vistorskyte

2021-11-30

8 min read

What Is a Web Session and How Is It Used in Web Scraping?

This article will give you a general overview of web sessions, their relation to cookies, and their use in web scraping.

Augustas Pelakauskas

2021-11-26

5 min read

What Is Affiliate Fraud and How to Prevent It?

In this blog post, we’ll discuss affiliate fraud and the most common methods fraudsters use. We’ll also explain how to identify fraud and tips for not falling victim to malicious actors.

Maryia Stsiopkina

2021-11-12

7 min read

Proxy Integration With ParseHub

Extracting data might sometimes be troublesome. To make this process easier, learn how to integrate Oxylabs Residential Proxies with ParseHub tool.

Jolita Pundzaite

2021-11-05

3 min read

What Is Sentiment Analysis?

In this article, you will learn what sentiment analysis is, how it can benefit market research and brand monitoring, and how it works.

Maryia Stsiopkina

2021-10-22

10 min read

What Is Data Normalization?

Data normalization is one of the best practices of efficient data management. Find out more about database normalization and how you can benefit from normalizing data in this article.

Monika Maslauskaite

2021-10-18

6 min read

News Scraping: Everything You Need to Know

This article discusses everything you need to know about news scraping, including the benefits and use cases of news scraping as well as how you can use Python to create an article scraper.

Iveta Vistorskyte

2021-10-18

7 min read

Free White Paper: E-commerce Drives ROI Upwards with Alternative Data

Find out how UK ecommerce and retail companies are changing industry practices with alternative data. Dig through the data Oxylabs and Censuswide have collected to see how alternative data has changed ecommerce.

Adomas Sulcas

2021-10-07

1 min read

Next Chapter for Real-Time Crawler: New Products With Dedicated Focus

We have been working on some changes for our Real-Time Crawler product. We are happy to finally announce that starting today, our scraper will be switching its focus and will be split into three distinct Scraper APIs.

Gabija Fatenaite

2021-10-06

3 min read

Free White Paper: Proxies Buying Guide for Enterprises

Learn more about use cases and challenges for which datacenter and residential proxies are best suitable. Find out the key points to consider when choosing a reliable proxy provider.

Monika Maslauskaite

2021-09-27

1 min read

Data Wrangling: What Is It and Why Is It Important?

This article discusses what data wrangling is, the key steps of data wrangling, and why it’s crucial for businesses.

Iveta Vistorskyte

2021-09-23

4 min read

Most Common HTTP Headers

HTTP headers enable to transfer further details within the request or response headers. Find out 5 key HTTP headers that are crucial to use and optimize in web scraping.

Vytautas Kirjazovas

2021-09-20

4 min read

HTTP vs. HTTPS: What Is the Difference?

This article discusses the differences between HTTP and HTTPS protocols, their security parameters, and the steps you should take to switch from HTTP to HTTPS.

Maryia Stsiopkina

2021-09-17

5 min read

Introducing Mobile Proxies: Harness the Power of Mobile IPs

We’re excited to introduce Mobile Proxies with an extensive list of locations, as well as country, state, city, coordinate, and ASN targeting with no extra fees.

Iveta Vistorskyte

2021-09-15

2 min read

What is Firmographic Data? Everything You Need to Know

Firmographic data can provide valuable insight for business-to-business marketers. Read this article to learn what benefits it brings and how it can be acquired.

Maryia Stsiopkina

2021-09-14

6 min read

What Is Parsing of Data?

In this article we’ll dig a little deeper on what is data parsing, and discuss whether building an in-house data parser is more beneficial to a business, or is it better to outsource a data parser.

Gabija Fatenaite

2021-09-13

6 min read

Data-Driven Marketing: How Big Data Helps Make Business Decisions

This article discusses how big data is changing marketing and the process of decision-making. Learn what data-driven marketing is and its main benefits and use cases.

Maryia Stsiopkina

2021-09-02

5 min read

How to Read HTML Tables With Pandas

Read this article to learn everything about pandas library and how useful the pandas read_html function can be, especially when combined with other helpful functions.

Iveta Vistorskyte

2021-09-01

7 min read

Previous
12345
...
11Next

Top News on Everything Data Gathering

Subscribe to our newsletter and get monthly scraping updates delivered right to your email.

No spam whatsoever, just pure data gathering news, trending topics and useful links. Unsubscribe anytime.