Paul Morgan

Data Collections Team Lead at Datasembly

Paul Morgan began his career building websites and mobile applications and has now been developing software for almost 15 years. Over the last 3 years, he has moved from building apps to dissecting, analyzing, and acquiring data from them. At Datasembly, his team has developed a data collection architecture that collects billions of product listings weekly, supplying data to some of the biggest players in the field. Paul describes himself as a problem solver and explorer of complex scenarios, which has led him to hold a Guinness world record and become a chess champion in Colorado this year.

Learn more about Paul's presentation:

Data Collection: Orchestration, Observability and Introspection

It's no secret that data collection constantly confronts developers with unexpected situations and challenging moments. Deploying, orchestrating, and monitoring a web scraping architecture with tools like Airflow, Kubernetes, and Prometheus isn't an easy task, especially for newcomers. Paul will give a general presentation about scraping and share some funny examples of product listings his team has come across.

His presentation will touch on a variety of different data collection topics, including:

  • Strange and challenging moments of data collection;

  • Managing data collection job deployments;

  • Orchestrating and scheduling data collection jobs;

  • Monitoring running collection jobs and detecting issues early.

Meet OxyCon 2022 speakers

Coming from different industries and backgrounds but united by a common passion and shared aspirations, these web scraping experts will share their experiences and answer your questions.

Glen De Cauwsemaecker

Lead Crawler Engineer @ OTA Insight

More details

Denas Grybauskas

Head of Legal @ Oxylabs

More details

Allen O'Neill

CEO/CTO @ The DataWorks

More details

Martynas Saulius

Python Developer @ Oxylabs

More details

Gabija Fatėnaitė

Moderator @ OxyCon

More details

Eivydas Vilčinskas

Technical Team Lead @ Oxylabs

More details

Ovidijus Balkauskas

Linux Systems Engineer @ Oxylabs

More details

Sarah McKenna

CEO @ Sequentum

More details

Sanaea Daruwalla

General Counsel @ Zyte

More details

Tadas Malinauskas

Python Developer @ Oxylabs

More details

Ondra Urban

COO @ Apify

More details

Vaidotas Šedys

Moderator @ OxyCon

More details

Alex Reese

Partner @ Farella Braun + Martel

More details

Julius Zaleskis

CEO @ Dataistic

More details

Karsten Madsen

CEO @ Morningscore

More details
