Back to blog

Free Whitepaper: Acquiring High-Quality Web Data for LLM Fine-Tuning

Get a free, in-depth guide on data acquisition processes for LLM fine-tuning. Discover data categories, large-scale scraping strategies, and cost optimization tips for fine-tuning your AI models.

Roberta Aukstikalnyte

2024-11-19

1 min read

Most popular articles
How to Use cURL With Proxy
How to Use cURL With Proxy?

Iveta Vistorskyte

2024-03-18

7 min read

10 Best Proxy Providers in 2024
10 Best Proxy Providers in 2024

Yelyzaveta Nechytailo

2024-09-27

9 min read

Guide to Handling Python Requests Timeout

Learn how to manage Python requests timeout errors for smooth network operations. See comprehensive code samples and discover best practices.

Vytenis Kaubrė

2024-11-19

6 min read

What is cURL Command and How to Use It?

Discover the power of cURL. Learn how this versatile command-line tool simplifies data transfers, API testing, and more for developers and system admins.

Yelyzaveta Nechytailo

2024-11-18

3 min read

How to Scrape Google Hotels: Python Tutorial

Learn how to scrape Google Hotels data using Python. This step-by-step tutorial covers Selenium, BeautifulSoup, and CSV output to extract prices, ratings, and locations.

Maryia Stsiopkina

2024-11-15

5 min read

Building a Competitor Intelligence System for E-Commerce

The primary focus of the white paper is to provide an action chain for a competitor intelligence system, from data collection to parsing, with various tips, guidelines, and explanations of the most critical processes.

Danielius Radavicius

2024-11-15

1 min read

How to Rotate Proxies in Python Using Requests and AIOHTTP

How to Rotate Proxies in Python Using Requests and AIOHTTP

Learn to rotate proxies in Python using sync and async methods. See how to use Requests and AIOHTTP combined with Asyncio libraries to rotate a proxy list.

Roberta Aukstikalnyte

2024-11-06

6 min read

Web Scraping in JavaScript With Node.js & Puppeteer

Learn practical web scraping techniques using JavaScript and Node.js. Discover popular libraries, best practices, and effective methods using Cheerio, Axios, & Puppeteer.

Adelina Kiskyte

2024-10-29

10 min read

How To Scrape Amazon ASIN with Python

How To Scrape Amazon ASIN with Python

Learn what Amazon ASIN is and how to build a fast and scalable Amazon ASIN scraper. See code samples for creating a custom ASIN scraper and using a maintenance-free API.

Vytenis Kaubrė

2024-10-29

6 min read

How to Scrape Google People Also Ask: Python Tutorial

Learn how to scrape Google's People Also Ask (PAA) with Python. Gather valuable SEO data using BeautifulSoup and store insights for content strategy optimization.

Maryia Stsiopkina

2024-10-23

3 min read

How To Scrape Amazon Best Sellers: Python tutorial

The Amazon Best Sellers sites are product pages that can help retailers perform market research by providing information on what's selling the best and can act like a guide for what products to stock in your own store. However, Amazon, among other big sites, have mechanisms in place that make it a challenge to access this publicly available data.

Akvilė Lūžaitė

2024-10-14

8 min read

How to Bypass CAPTCHA With Playwright

How to Bypass CAPTCHA With Playwright

Bypass CAPTCHAs with Playwright and Oxylabs’ Web Unblocker in Python. Crush web barriers, scrape freely, and automate successfully. Step-by-step tutorial.

Yelyzaveta Nechytailo

2024-10-11

5 min read

What is web scraping?

What is Web Scraping & How to Scrape Data from a Website?

The concept of web scraping is becoming familiar to every modern company aiming to base its decisions on data. This article will explain web scraping and how to effectively incorporate it into your business.

Iveta Vistorskyte

2024-10-09

8 min read

How to Bypass CAPTCHA in Web Scraping Using Python

How to Bypass CAPTCHA in Web Scraping Using Python

If CAPTCHAs keep on interrupting your day-to-day scraping tasks, read this article presenting solutions that can help you go around them successfully.

Yelyzaveta Nechytailo

2024-10-03

7 min read

How to Scrape Google Maps Using Python

See this extensive guide on how to scrape Google Maps with an Oxylabs solution.

Danielius Radavicius

2024-09-25

6 min read

Web Scraper API Quick Start Guide

Web Scraper API Quick Start Guide

In this quick start guide, we introduce Web Scraper API and explain how it works. Explore integration methods, queries, their parameters, and response codes.

Augustas Pelakauskas

2024-09-25

4 min read

Advanced Web Scraping With Python Tactics in 2024

Learn advanced web scraping tactics in Python to improve your skills. Overcome CAPTCHAs, emulate Ajax requests, fine-tune your async processes, and much more.

Vytenis Kaubrė

2024-09-12

8 min read

Pagination In Web Scraping: How Challenging It May Be

Dealing with pagination in web scraping might be challenging and result in missing data. Learn about different approaches when scraping multiple pages.

Vejune Tamuliunaite

2024-09-11

7 min read

How to Scrape Google Lens Results: Python Tutorial

Learn how to scrape Google Lens Results in Python with Oxylabs' Google Lens API. Find out how to set up the API, get structured data, and save it into a JSON file.

Vytenis Kaubrė

2024-09-10

2 min read

Playwright Web Scraping Tutorial for 2024

Playwright Web Scraping Tutorial for 2024

This article explains everything about Playwright and how it can be used for automation and even web scraping.

Iveta Vistorskyte

2024-09-05

9 min read

How to Scrape Google Images With Python

See how you can scrape Google Images with this brief, step-by-step tutorial.

Danielius Radavicius

2024-09-03

4 min read

How to Web Scrape HTML Tables With Python: Step-by-Step

Learn to scrape and parse HTML tables in Python using three real table examples. This article covers the basics and the more advanced concepts.

Vytenis Kaubrė

2024-08-23

6 min read

Top News on Everything Data Gathering

Subscribe to our newsletter and get monthly scraping updates delivered right to your email.

No spam whatsoever, just pure data gathering news, trending topics and useful links. Unsubscribe anytime.