avatar

Augustas Pelakauskas

Oct 22, 2021 3 min read

The internet is bustling with layers upon layers of publicly available data. How to make more sense of it all? How to soak up such quantities in a timely manner? Simply put, scan and collect the data with automated tools for further storage and analysis on your digital turf. Web scraping extracts the data promptly, leaving more time for research. This article will guide you through the integration process of Oxylabs’ Residential Proxies with WebHarvy’s web scraper.

What is WebHarvy?

WebHarvy is an intuitive visual scraper that easily scrapes text, HTML, images, URLs, and emails from websites. The Inbuilt browser allows you to click on specific content for scraping. The cursor detects data patterns that occur on a webpage. If the data repeats, the tool scrapes automatically without any additional user input. The entire lists on multiple pages are extracted in just a few clicks. Lastly, WebHarvy saves scraped data in Excel, XML, CSV, JSON, and TSV formats.

The tool offers easy-to-use third-party proxy support. WebHarvy ensures and prevents scraping procedures from being blocked. Either a single proxy or a list of proxy servers could be used for public web scraping. Make sure to avoid using free/open proxy services as the probability of being shut off in the middle of an operation is high.

How to integrate Oxylabs Proxies with WebHarvy?

  1. Firstly, download and install the WebHarvy app via webharvy.com
  2. Once set up, navigate to Settings.

3. Click on Proxy Settings. Select to mark Enable network connection via Proxy Server and choose HTTP as your Type.

4. Fill in the required credentials. Under Address, enter pr.oxylabs.io and under Port type in 7777. You can also use country-specific entries. For example, if you fill in us-pr.oxylabs.io under Address and 10001 under Port, you’ll acquire a US exit node with a sticky session (for a complete list of country-specific entry notes, please refer to our documentation).

5. Click to mark Requires authentication to enter your Oxylabs sub-user’s Username and Password. Click on the + button to add your newly input proxy to the list. Lastly, press Apply to finish your WebHarvy proxy integration.

That’s all. Now you can browse the internet and mark the specific rows to scrape. By clicking Start you can begin selecting your target data.

Wrapping up

Implementation of web scraping is a crucial part of up-to-date data mining solutions. WebHarvy is a straightforward and capable tool able to scale your daily data processing swiftly. As the tool accepts various third-party proxies, be sure to employ a reliable proxy services provider.

If you have any questions configuring our proxies or contemplating starting using our public web scraping solutions, don’t hesitate to get in touch with us for more information.

avatar

About Augustas Pelakauskas

Augustas Pelakauskas is a Copywriter at Oxylabs. Coming from an artistic background, he is deeply invested in various creative ventures - the most recent one being writing. After testing his abilities in the field of freelance journalism, he transitioned to tech content creation. When at ease, he enjoys sunny outdoors and active recreation. As it turns out, his bicycle is his third best friend.

All information on Oxylabs Blog is provided on an "as is" basis and for informational purposes only. We make no representation and disclaim all liability with respect to your use of any information contained on Oxylabs Blog or any third-party websites that may be linked therein. Before engaging in scraping activities of any kind you should consult your legal advisors and carefully read the particular website's terms of service or receive a scraping license.

Related articles

Aezakmi Proxy Integration With Oxylabs
Aezakmi Proxy Integration With Oxylabs

Nov 16, 2021

3 min read

What Is Affiliate Fraud and How to Prevent It?
What Is Affiliate Fraud and How to Prevent It?

Nov 12, 2021

9 min read

Proxy Integration With ParseHub
Proxy Integration With ParseHub

Nov 05, 2021

3 min read