Exploratory Data Science at Scale
May 25, 2017
Free
Session Time and Date
Thursday, May 25th, 11am – 12pm CDT
How To Attend A Live Conference Session
Steps to Follow
- Click the link below to attend:
- Register with your name and email
- Join the conference session by clicking the link in the GoToWebinar confirmation email
Running Python, R, and Scala at scale in the cloud lets you work on the full dataset, but it brings its own challenges. Because cloud resources are elastic, you can scale the engine in proportion to the dataset so that query running times stay at comfortable levels, which makes you more efficient as a data scientist.
We’ll show which architecture changes are required to run in the cloud rather than on-premises: we separate compute from storage and use Docker for easy scaling. We’ll also cover data import and export, cost optimization, common performance bottlenecks, and data protection.