What Happened at Data Platforms 2017?

Wednesday May 24, 2017

11:00 am – 6:00 pm

Registration & Networking

1:00 pm – 5:00 pm

On-Site office hours with a Solutions Architect (Aztec C)

1:00 pm – 6:00 pm

QuMbaya: Networking Lounge

1:00 pm – 3:00 pm

Using Qubole Data Service 101

2:00 pm – 4:00 pm

Qubole Data Service Workshop – From Ingestion to Insights in 120

3:30 pm – 5:30 pm

Using Qubole Data Service 201 – Choosing your Big Data SQL Engine

6:30 pm – 8:30 pm

Welcome Reception

Qubole Co-Founders Ashish Thusoo and Joydeep Sen Sarma welcome you to Data Platforms 2017 to kick off this inaugural event. Pick up a copy of their new book, “Creating a Data Driven Enterprise with DataOps: Insights from Facebook, Uber, LinkedIn, Twitter and eBay,” published by O’Reilly Media and released May 24.

Enjoy cocktails & conversation.

Thursday May 25, 2017

7:00 am – 8:30 am

Breakfast

8:30 am – 9:30 am

DataOps & The Modern (Big) Data Platform

Ashish Thusoo will discuss his role in creating the first modern big data platform at Facebook, as well as insights from the new book. Joining Ashish will be book contributors and big data pioneers Shrikanth Shankar of LinkedIn and Karthik Ramasamy, Cofounder of Streamlio and formerly at Twitter, who will share how they led their organizations through similar transformations to become data-driven businesses: what they did, how they did it, and the lessons learned on their journeys.

9:30 am – 10:15 am

Building The Modern Data Platform

When we were growing up…

From startups to enterprises, industry leaders will discuss the growth and the challenges they faced at each stage along the way. Which challenges led to technological innovation within the organization, or to the adoption of newer tools and technologies, and why?

Panelists

Karthik Subramaniam, Data Platform Lead, Data Science & Engineering, Under Armour Connected Fitness

Oskar Austegard, Director, Data Solutions, Gannett

Colin Riddell, Senior Data Architect, Epic Games

Wade Warren, VP Engineering, Wikia

Tripp Smith, Clarity Insights

Rakesh Soni, Intersys Consulting

Moderated by: Andy Sautins, Technical Manager, Google

10:15 am – 10:30 am

Break

10:30 am – 11:00 am

The Intersection of Cloud and Big Data… What’s Next?

Experience the future! Join Joydeep Sen Sarma and team for some exciting announcements and cool demos!

11:00 am – 12:00 pm

Big Data in the Clouds

Industry visionaries from Amazon, Microsoft and Oracle share their views on the future and promise of the next wave of cloud computing. Hear from:

John Alioto, CTO DX Technology Evangelism & Startups, Microsoft
Jeff Barr, Chief Evangelist, Amazon Web Services
Vinay Kumar, Senior Director of Product Management, Oracle

12:00 pm – 1:15 pm

Lunch

1:30 pm – 4:30 pm

Tech Talks

Practitioners share best practices, techniques, challenges and solutions in these deep-dive sessions on the technical, organizational and cultural aspects of building modern big data platforms.

1:30 pm – 2:15 pm Session I

Playing Offense with The Data Platform — From On-Premise to The Cloud

Speaker: Santanu Dey, Director of Data Science and Engineering, Fanatics Inc.

Over the last two years, Fanatics Inc., the global leader in licensed sports merchandise, went through major transformations in technology, and especially in data. We not only moved from on-premises to the cloud but also changed how data is strategically used to power the e-commerce and backend supply chain systems. From the very start, we expected the data platform to be exposed via a set of data and analytical web services that can act as a brain to provide a delightful customer experience, whether it’s ranking the relevant products or recommending the most interesting ones.

As web visitors browse the web pages and transact, all the events they generate flow through Fanflow, a Kafka messaging system. These events are then aggregated by Flink and Spark Streaming consumers, and machine-learned models adapt and react to the changes in behavior evident in these events. Setting up data as services and blurring the line with the application stack to provide business metrics and metadata, in addition to storing the data in a traditional warehouse or data lake, made all the difference. In this session, we will dive deep into a couple of these data services and discuss how you may benefit from implementing similar patterns.

1:30 pm – 2:15 pm Session I

Industrializing Data Science Workflows

Speaker: Sean Downes, Senior Data Scientist, Expedia

Discover the evolution of data science workflows implemented at Expedia with a special emphasis on Learning to Rank problems. This session will explore the process of industrializing the data science workflow and best practices on how to keep your data productive, or even pull your organization out of the data swamp.

1:30 pm – 2:15 pm Session I

Virtualizing Big Data in the Cloud

Speaker: Kellyn Pot’Vin-Gorman, Technical Intelligence Manager of CTO, Delphix

Big Data encompasses a large landscape, and building in the cloud can introduce additional, unique challenges. The two primary ones are cost and storage. Join Kellyn as she discusses how virtualizing multiple tiers of the big data landscape can cut costs, drawing on a review of real use cases. She will also cover methods of discovery that lead to success, and the technical specifications behind different big data platforms to consider when virtualizing large data sets across vast platforms.

1:30 pm – 2:15 pm Session I

How We Built a Scalable, Real-time User Targeting System

Speaker: Sriranjan Manjunath, CTO and Head of Engineering, Saavn

Saavn is India’s leading music streaming service. Since context is key to music, we have built a system called Sniper that lets us identify cohorts of users in real time and target them for marketing, advertising and recommendation purposes. This system allows us to understand user behavior by quantifying engagement characteristics such as stream consumption, affinities or ads. Speed and scalability are critical to its design. This talk will cover our motivations for building such a system and how big data technologies have helped us architect it.

1:30 pm – 2:15 pm Session I

Power your Big Data Infrastructure with Data Intelligence for Analytics and Data Operations

Speaker: Balaji Mohanam, Product Manager, Qubole

Discover the newly launched features in Qubole, powered by Data Intelligence, that automate mundane Data Model performance appraisal and simplify Data Ops. This session will provide a detailed walkthrough of Qubole’s latest offering in Data Intelligence, which includes Data Model insights and recommendations on Partitioning, Formatting and Sorting that help optimize data models for improved performance and use of computing resources. In addition, learn about Qubole’s latest offering in self-service analytics and how it can improve analysts’ productivity by making data discovery easy through column and table name auto-suggestion and completion, and insights preview.