Back to blog
ETL stands for extract, transform, and load. It’s an independent three-stage process that moves data from one source(s) to a database and is crucial for modern day anlysis.
Danielius Radavicius
2022-10-14
5 min read
Free White Paper: Alternative Data Defines Competition in the US & UK Ecommerce Sectors
Read our whitepaper that outlines the results of a US & UK decision maker survey on the usage of alternative data and web scraping in their daily operations
Adomas Sulcas
2022-10-11
1 min read
Screen Scraping: What Is It and How Does It Work?
Find out what screen scraping is and how it works. Learn the difference between web scraping and screen scraping.
Vytenis Kaubrė
2022-10-03
5 min read
How to Run Python Script as a Service (Windows & Linux)
From Linux daemons to Windows services, this guide will detail the process of setting up Python script as a service in a few simple steps.
Augustas Pelakauskas
2022-09-26
5 min read
Oxylabs Attends Current 2022: The Next Generation of Kafka Summit
We're excited to sponsor and participate live in the Current 2022: The Next Generation of Kafka Summit. Don't miss out and participate in one of the largest data-focused conferences worldwide!
Danielius Radavicius
2022-09-21
2 min read
Oxylabs Acquires Webshare Software Company
Oxylabs furthers industry leadership by acquiring US-based Webshare Software Company.
Adomas Sulcas
2022-09-20
1 min read
OxyCon 2022: The Top Takeaways From Day Two
See what the second day of the conference was all about and take note of all the important highlights you might have missed.
Yelyzaveta Nechytailo
2022-09-09
4 min read
OxyCon 2022: The Top Takeaways From Day One
Find out the core talking points during the first day of our annual 2022 OxyCon conference and see in more detail what topics were covered.
Danielius Radavicius
2022-09-07
7 min read
6 Web Scraping Project Ideas to Sharpen Your Skills
Planning a project on web scraping and don’t know where to start? Learn how proxies come into play when project planning for web scraping in our most recent article.
Gabija Fatenaite
2022-08-31
8 min read
How to Continuously Yield High Quality Data | Interview with Glen De Cauwsemaecker
As part of the combined efforts of Oxylabs and OTA Insight to shed light on the developments in the web scraping industry, we sat down with Glen De Cauwsemaeker, the Lead Crawler Engineer at OTA Insight, to talk about the challenges when scaling data acquisition solutions.
Glen De Cauwsemaecker
2022-08-25
10 min read
2 Weeks Until OxyCon So Hurry Up and Register!
Only two weeks left up until, OxyCon, the two-day virtual event, starts. Find out more about what awaits within the conference and don't hesitate to register, there isn't much time left!
Danielius Radavicius
2022-08-24
2 min read
In this practical tutorial, you'll learn how to create a web scraper using Rust and collect public product data from an e-commerce website.
Maryia Stsiopkina
2022-08-24
6 min read
Data Pipeline Architecture Explained
Learn what a data pipeline architecture is, why it's important for businesses and how to build one.
Roberta Aukstikalnyte
2022-08-11
5 min read
Automated Web Scraper With Python & Windows Task Scheduler
In this article, you’ll learn how to set up Windows Task Scheduler to schedule a Python web scraping script automatically & periodically.
Augustas Pelakauskas
2022-08-04
5 min read
Automating Web Scraping With Python and Cron
If you're curious about how to automate your web scraping projects with Python and cron, then check out this dedicated tutorial for more information.
Danielius Radavicius
2022-07-29
6 min read
OxyCon 2022 Agenda is Live - How to Get the Most Out of the Event
This year's OxyCon has already garnered attention from every corner of the data gathering industry. Now is the time to reveal what's in store during our two-day event on September 7-8th.
Danielius Radavicius
2022-07-28
2 min read
Marketplace SEO: How to Improve Your SERP Rankings with Data
The success of your SEO strategy will likely depend entirely on how accurately you follow a specific marketplace’s regulations and in what ways you stand out.
Danielius Radavicius
2022-07-26
7 min read
Machine Learning: The Driving Force of Web Scraping | OxyCast #6
A brand new episode of OxyCast is live! This time, our favorite host Augustinas Kalvis (Software Developer), and a special guest Jurijus Gorskovas (Machine Learning Engineer), delve deeper into the world of Machine Learning! Watch it to understand the details of machine learning and how it can make web scraping processes more efficient.
Iveta Vistorskyte
2022-07-13
3 min read
Businesses are complex, and both first and second movers encounter unique difficulties and advantages. If you're interested in how these advantages relate to web scraping, check this out!
Julius Cerniauskas
2022-07-05
4 min read
Data Quality Metrics You Should Track and Measure
Learn more about how to effectively track and measure data quality with the help of this detailed blog post
Yelyzaveta Nechytailo
2022-06-27
5 min read
How to Estimate and Reduce Data Collection Costs
Let's take a look at the key factors influencing data acquisition costs, and discuss ways to reduce these expenses.
Maryia Stsiopkina
2022-06-22
7 min read
Brand New Lessons on Everything Data Gathering
Data gathering is a rather tricky field; therefore, getting advice and lessons from people who have worked at the top of the industry for years is unparalleled.
Danielius Radavicius
2022-06-22
2 min read
Why You Shouldn't Use Free Proxies - Risks & Reasons
Free proxies are an attractive yet often unsafe solution. Its limitations, security issues, and recommendations are all discussed within the article.
Danielius Radavicius
2022-06-22
6 min read
Oxylabs Listed as Global Good Awards Finalist
Oxylabs was listed as Global Good Awards Finalist for its joint pro bono project with the CRA to detect illegal content online with a unique AI-powered web scraping solution.
Gabija Birgile
2022-06-17
1 min read
Oxylabs partnered with the University of Michigan, ranked as the No.1 public university in the United States, and School of Information professor Christopher Brooks to share expertise in the field of ethical web scraping.
Gabija Birgile
2022-06-16
2 min read
Real-Time Online Media Monitoring Infrastructure
This white paper will walk you through the critical stages of the online media monitoring process.
Maryia Stsiopkina
2022-06-15
1 min read
OxyCon 2022: Leading-Edge Conference in All Things Web Scraping
Don’t forget to save your seat at OxyCon 2022, September 7-8, to join discussions on the most relevant and recently encountered topics of public data gathering! Just like last year, the 2 day virtual event will feature discussions from a plethora of industry-leading guests.
Danielius Radavicius
2022-06-09
2 min read
Oxylabs Forms a Pro-bono Partnership With the University of Michigan
Oxylabs partnered with the University of Michigan, ranked as the No.1 public university in the United States, and School of Information professor Christopher Brooks to share expertise in the field of ethical web scraping.
Gabija Birgile
2022-05-31
3 min read
Hard Data vs. Soft Data: The Difference
While being so different, hard and soft data can complement one another in significant ways when it comes to business data analysis and forecasting. Read this blog post to learn more.
Maryia Stsiopkina
2022-05-25
7 min read
Proxies for Web Scraping: a Complete Guide | OxyCast #5
In the 5th episode of our podcast, OxyCast host and Software Engineer Augustinas Kalvis will be talking to Mindaugas Dunderis about proxies and how they go hand-in-hand with public data scraping.
Roberta Aukstikalnyte
2022-05-23
2 min read
New Cost-Effective Proxy Solution: Introducing Shared Datacenter Proxies
As of today, our Datacenter Proxies are represented by two solutions: Shared and Dedicated Datacenter Proxies. As a customer, two distinct products will allow you to increase the effectiveness of public data collection processes by providing more flexibility in making a choice.
Augustas Pelakauskas
2022-05-17
2 min read
Free White Paper: Alternative Data Unlocks Key Decisions in the UK And US Finance Industries
Oxylabs, in cooperation with Censuswide, has surveyed over 1000 senior decision makers in the finance industry. Find out how alternative data and web scraping has changed finance.
Adomas Sulcas
2022-05-04
1 min read
RegEx stands for Regular Expressions, a method to match specific patterns depending on the provided combinations, which can be used as filters to get the desired output.
Augustas Pelakauskas
2022-04-29
3 min read
Scaling: Overcoming Your Limits | OxyCast #4
At one point or another, any scaling or coding project will run into the issue of scaling and how to effectively do it. In the 4th episode of OxyCast, we will explore topics such as horizontal vs. vertical scaling, bottleneck avoidance, and many others!
Danielius Radavicius
2022-04-25
2 min read
Empowering the Lithuanian Public Sector in the Mission for a Cleaner Internet
The internet is full of illegal and harmful content, which can be hard to detect without the right tools. Web scraping is a perfect solution.
Erika Brazaityte
2022-03-31
2 min read
Real-Time Price Monitoring System Architecture
This white paper provides an action chain for price monitoring, from collecting target URLs to data parsing, along with tips and explanations on the most important elements and processes.
Augustas Pelakauskas
2022-03-24
1 min read
How to Build a Price Tracker With Python
This article explains how to build a scalable price tracker for immediate deployment to various eCommerce sites.
Augustas Pelakauskas
2022-03-23
5 min read
Data Parsing: The Basic, the Easy, and the Difficult | OxyCast #3
Parsing is an integral part of any web scraping activity that helps businesses get the data they need in the right format. In the third episode of OxyCast, we will dig deeper into data parsing and discuss such topics as easy vs. hard parsing, selectors, parser failures, and the future of parsing.
Yelyzaveta Nechytailo
2022-03-23
2 min read
Retail Competitive Pricing Strategies
Competitive pricing strategies and analysis are essential in both determining the correct price of your product/service and maximization of profits.
Danielius Radavicius
2022-03-22
6 min read
Scraping Alternative Data: Technological Challenges to Keep in Mind
The role of alternative data becomes more and more prominent. However, its collection is tied to multiple technological challenges that can disturb your business's operations. This white paper provides a detailed explanation of these challenges and proposes solutions to deal with them.
Yelyzaveta Nechytailo
2022-03-10
1 min read
Puppeteer Tutorial: Scraping With a Headless Browser
Web scraping and automation with JavaScript has evolved a lot in recent years. There are a few methods to accessing and parsing web pages, but in this tutorial we will be covering how to do it with Puppeteer.
Gabija Fatenaite
2022-03-09
7 min read
Using Data for Competitive Advantage
Learn how companies outperform their competitors through the different uses of big data.
Roberta Aukstikalnyte
2022-03-07
5 min read
Web Scraping for Machine Learning
This tutorial explores a real-life scenario where web scraping and machine learning work in tandem and how the step-by-step process should look like if you decide to do it on your own.
Danielius Radavicius
2022-02-22
6 min read
Web Scraping: Another Block In The Wall | OxyCast #2
If you’ve ever tried web scraping, you should be aware of the blocking issue. It’s a common challenge, especially if you gather public data on a large scale without a decent knowledge of using resources wisely. This is why we decided to cover this topic and share our knowledge and tips & tricks on how to avoid getting blocked.
Iveta Vistorskyte
2022-02-22
2 min read
This article will walk you through the step-by-step process of installing and downloading files using Wget with or without proxies, covering multiple scenarios and showcasing practical examples.
Augustas Pelakauskas
2022-02-15
6 min read
Minimum Advertised Price (MAP) monitoring is an essential process for any brand, supplier or manufacturer that works with e-commerce marketplaces. Tracking MAP policy compliance plays an important role ensuring fair competition across different channels and protecting your brand’s reputation.
Adelina Kiskyte
2022-02-11
4 min read
Scale up your business with Oxylabs®