Most popular data sources

GitHub

Get data about GitHub repositories, developer profiles, contributions, issues, social interactions, and more.

StackShare

Receive information about companies and their technology, reviews, tools and services, trends, and more.

DockerHub

Access data from container images, repositories, developer profiles, contributions, usage statistics, and more.

Developer community and code data explained

It is a collection of public data points about tech companies, developers, and code repositories found all over online dev communities. With Oxylabs datasets, you will receive:

  • Usernames

  • Companies

  • Locations

  • Job Titles

  • Follower counts

  • Contact details

  • Employability statuses

  • and more

Ready-to-use community and code datasets

Flexible dataset delivery

To better suit your individual use case, you can choose different output formats, storage options, and delivery frequency:

  • Get datasets in CSV, JSON, or other formats

  • Receive data via SFTP or to your cloud storage like AWS S3

  • Acquire datasets once or upon an agreed frequency

Why partner with Oxylabs?

Fresh and accurate data

Fresh and accurate data

Get complete, clean, and structured data from scraping professionals.

Saved time and resources

We will handle data extraction and processing for you at a cost-efficient price.

Customized solution 

Customized solution 

Share your data needs, and we will tailor our data harvesting approach to a perfect fit.

Legal compliance 

Legal compliance 

Fortune 500 companies trust Oxylabs for leading ethical data collection in line with GDPR and CCPA.

Pricing

Standard Community and Code Datasets

Choose from a variety of our ready-to-use datasets.

  • Standardized data schema

  • Fresh, clean, and parsed data

  • Data points from the most difficult data sources

Delivery frequency:

• Monthly
• Quarterly
• One-time purchase

From $1000/month

Recommended

Custom Datasets

Get data from any public web domain fully tailored to your business needs.

  • Customized data schema

  • Flexible and scalable solutions

  • Dedicated Slack channel for seamless communication

Delivery frequency:

• Daily
• Weekly
• Monthly
• Quarterly
• Custom

Tailored pricing

With no additional fees & included in both plans:

Legal compliance

Dedicated Account Manager

Top-quality data extracted by leading scraping experts

Lloyd’s insurance

Custom datasets for all data needs

Understanding your data needs

Understanding your data needs

We will work closely with you to understand your business intricacies and define your data requirements.

Developing customized solution

Developing customized solution

Then, we will develop a personalized public data extraction process using our in-house web scraping infrastructure.

Delivering data sample

Delivering data sample

You will receive a sample dataset to evaluate the quality of data and the overall data delivery process.

Continuous data delivery

Continuous data delivery

Once we settle on the most suitable approach, we will consistently deliver data based on the agreed frequency.

Trusted by businesses worldwide

Since 2015, we have helped over 4000 clients improve their operations with Oxylabs solutions. On top of offering top-tier products, we strive to provide the best customer experience anytime you need it. But do not take our word for it. See it for yourself.

Experience our award-winning web intelligence solutions

Frequently asked questions

With a Standard Datasets plan, you may choose to receive fresh community and code data one time, every month, or every quarter. When it comes to a Custom Datasets plan for enterprises, you can define the frequency yourself.

If, instead, you'd like to gather data yourself from pages like GitHub, we have a scraper API solution available as well.

You can choose to receive datasets as Excel or CSV files or in JSON, JSONL, and NDJSON formats.

Developer community and code datasets allow companies to make data-driven decisions by leveraging them for:

  • Market research

  • Trends analysis

  • Talent acquisition

  • Competitive analysis

  • Investment research

scraping digest

Get the latest news from data gathering world

I'm interested