Apache Spark is written in Scala. Hence, many if not most data developers adopting Spark are also adopting Scala, while Python and R remain popular with data scientists. I think that Spark shows Scala at its best and largely hides the more difficult aspects of the language. This tutorial introduces you to the core features of Scala you need to be productive with Spark quickly, using hands-on exercises with the Spark APIs. It's designed for developers, data scientists interested in using Scala for Spark. Using hands-on examples, you'll learn the most important Scala syntax, idioms, and APIs for Spark development. Prerequisites
Topics covered include:
Trainer: Chaoran Yu, Software Engineer, Fast Data, Lightbend Inc.
What are the essential components of a data platform? This tutorial will explain how the various parts of the Hadoop, Spark and big data ecosystem fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads.
By tracing the flow of data from source to output, we'll explore the options and considerations for components, including:
We'll also give advice on:
Instructors: John Akred, Stephen O'Sullivan, and Andrew Ray
Want to get ramped up on how to use Amazon's big data web services and launch your first big data application on the cloud? Join us for this hands-on workshop as we build a big data application in real time. We will use a combination of open source technologies such as Apache Spark and Zeppelin; as well as AWS managed services such as Amazon Kinesis, a platform for streaming data on AWS, Amazon S3, and more. We review best practices, architecture design patterns, and serverless concepts for big data applications in the Cloud.
Architectural patterns for big dataStreaming data & real-time analyticsExtract, Transform, Load (ETL)Ad-hoc querying & data visualizationServerless data processingPrerequisites:
Attendees should have some basic familiarity with Amazon Web Services as well as data warehousing and analytic data flows.This workshop is completely hosted on-line. Attendees need to bring their own laptops to complete the hands-on activities.