Back to blog

Free White Paper: Acquiring High-Quality Web Data for LLM Fine-Tuning

Get a free, in-depth guide on data acquisition processes for LLM fine-tuning. Discover data categories, large-scale scraping strategies, and cost optimization tips for fine-tuning your AI models.

roberta avatar

Roberta Aukstikalnyte

2024-11-19

1 min read

Most popular articles

AI in 2025: Experts Predict a Bursting Bubble, More Regulation, and New Developments

Adi Andrei and Ali Chaudhry, esteemed AI experts and members of Oxylabs' AI/ML advisory board, joined Julius Černiauskas, CEO at Oxylabs, to share their insights on the potential future of AI in 2025.

Vytautas Kirjazovas

2025-01-09

2 min read

Browsing, Labor Division, and Data Management: How AI Will Change Life in 2025?

Oxylabs experts discuss predictions for major AI and machine learning (ML) developments in 2025 in their industry and other spheres.

Vytautas Kirjazovas

2025-01-07

2 min read

How to Navigate AI, Legal, and Web Scraping: Asking a Professional

In this interview, we sit down with a legal professional to shed light on the ever-changing legal framework surrounding web scraping.

roberta avatar

Roberta Aukstikalnyte

2025-01-07

6 min read

Oxylabs+Censuswide

Free White Paper: Addressing the Main Challenges of Public Web Data Gathering

New research conducted by Oxylabs in partnership with Censuswide has surveyed scraping professionals to uncover the key issues they face when gathering public web data and the role of artificial intelligence (AI) in addressing them.

Gabija Birgile

2025-01-06

2 min read

4beta and Stanford University

Project 4β Partners with Stanford University Researcher

Oxylabs' pro bono initiative, Project 4β, has partnered with Olivia Martin, a PhD student in Economics at Stanford University's School of Humanities and Sciences to enhance research with data collection solutions.

Gabija Birgile

2024-12-12

2 min read

Oxylabs fake discounts research

Oxylabs’ Research on Fake Discounts: Are Shopping Events Worth the Deal?

Discover how Oxylabs’ advanced data collection infrastructure was leveraged to analyze the prevalence of fake discounts in major US marketplaces during Black Friday 2024. Explore key insights from this in-depth case study.

Vytautas Kirjazovas

2024-12-10

7 min read

4beta and Carnegie Mellon University researcher

Carnegie Mellon University Researcher Joins Project 4β

Oxylabs' pro bono initiative, Project 4β, has partnered with Liying Qiu, a PhD student in Business Technologies at Carnegie Mellon University's Tepper School of Business to advance research with data collection solutions.

Gabija Birgile

2024-12-10

1 min read

4beta and The Pulitzer Center

Project 4β Partners with The Pulitzer Center

Oxylabs' pro bono initiative "Project 4β" partners with The Pulitzer Center, a renowned non-profit organization known for advancing in-depth, high-impact journalism on underreported global issues.

Gabija Birgile

2024-12-05

1 min read

How Scraped Data Can Help Train LLMs and AI Tools

Watch our new webinar to learn how high-quality scraped data fuels LLMs and AI tools. Expert insights, challenges, and live demos await!

authors avatar

Maryia Stsiopkina

2024-11-04

1 min read

Introducing Oxy Parser, an Open-Source Data Parsing Tool

Introducing Oxy Parser, an Open-Source Data Parsing Tool

Oxy Parser is an open-source data parsing tool that automates HTML structurization using Pydantic models and automated XPaths.

author avatar

Augustas Pelakauskas

2024-10-15

2 min read

Enhance Your Data Workflow: All-In-One Web Scraper API and OxyCopilot

Today marks a major milestone for both Oxylabs and the web scraping industry: the launch of our unified Web Scraper API, now enhanced with the OxyCopilot feature.

authors avatar

Maryia Stsiopkina

2024-09-25

3 min read

Project 4beta Partners with Northwestern University Researchers to Explore Digital Redlining

"Project 4β" partners with researchers from Northwestern University to provides free access to Oxylabs' web intelligence collection solutions to investigate the phenomenon of digital redlining.

Gabija Birgile

2024-08-13

2 min read

Leveraging Web Scraping for Economic Research: A Collaboration with Researcher from Aston University

Oxylabs' pro bono initiative "Project 4β" partners with Dr. Ngoc Dieu Linh Vi, a lecturer in Economics at Aston University to provide free access to Oxylabs web scraping solutions.

Gabija Birgile

2024-08-12

2 min read

Fingerprinting Tactics with Pyppeteer and Playwright

Make sure to secure a slot in your calendar on July 10, 2024, to join us for a value-packed webinar hosted by Paulius Stundžia, Senior Developer at Oxylabs.

authors avatar

Maryia Stsiopkina

2024-06-26

1 min read

APIs Decoded: Simplify Data Extraction

Oxylabs hosted a webinar to discover how API endpoints can help uplift your scraping operations.

Enrika avatar

Enrika Pavlovskytė

2024-06-19

1 min read

OxyCon 2024

OxyCon 2024: Unlock the Power of Data With Web Scraping

Join us on September 25, 2024 to dive into the ever-changing landscape of web scraping.

author avatar

Yelyzaveta Nechytailo

2024-06-19

2 min read

The Ferret Joins "Project 4β" to Boost Scottish Investigative Journalism

Oxylabs' pro bono initiative "Project 4β" partners with an award-winning Scottish investigative journalism platform "The Ferret".

Gabija Birgile

2024-05-17

1 min read

Driving Social Impact Through Web Intelligence: "Project 4β" Welcomes Global Witness

Oxylabs' pro bono initiative "Project 4β" partners with an independent, non-profit climate change organisation "Global Witness" to elevate environmental monitoring and research.

Gabija Birgile

2024-05-16

2 min read

Mastering SEO Monitoring Systems: Challenges & Solutions

Watch Oxylabs' webinar to learn the principles of building an SEO monitoring system for data-driven decision-making.

author avatar

Augustas Pelakauskas

2024-05-14

2 min read

How to Overcome Anti-Bot Systems: Insights & Tactics

Join Fabien Vauchelles and Denis Zyk as they immerse in the world of anti-bot systems and share bypassing techniques you can implement in your scraping tasks.

author avatar

Yelyzaveta Nechytailo

2024-04-22

1 min read

Top News on Everything Data Gathering

Subscribe to our newsletter and get monthly scraping updates delivered right to your email.

No spam whatsoever, just pure data gathering news, trending topics and useful links. Unsubscribe anytime.