Talk to an instructor:
+41 61 551 00 82
jonas@letsboot.ch

Apache Spark 

Course & Training

Introduction to Apache Spark, the most powerful computing engine for big data.

An introduction to Apache Spark, currently the most powerful open source platform for running data engineering, data science, machine learning and AI pipelines. This is a two-day intensive dive into Spark core concepts, features and best practices. Using Scala and Python, we make sure you build the skills you need to start your journey with Spark.

In-House Course:

We are happy to conduct tailored courses for your team - on-site, remotely or in our course rooms.

Request In-House Course

 

Content:


Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning tasks. Thousands of companies, including 80% of the Fortune 500, use Apache Spark™. Spark can process data in batches and in streams, and it can run on a single node or on clusters. The Spark open source project has over 2,000 contributors from industry and academia.

We will cover all the fundamentals so that you are ready to run your next big data processing pipelines.

Among many other topics, we will cover Spark architecture, working with Spark from Scala, Python, R and SQL, Data Sources, DataFrames, Datasets, Joins, Aggregations, Spark Types, the Spark UI and MLlib.


Disclaimer: The actual course content may vary from the above, depending on the trainer, implementation, duration and constellation of participants.

Whether we call it a training, course, workshop or seminar, we want to meet participants at their current level and equip them with the practical knowledge they need to apply the technology directly after the training and deepen it on their own.

Goal:

At the end of the course, participants will understand Spark core concepts and features to run their next big data engineering and analysis project.


Form:

Most of the time will be spent on hands-on coding.


Target Audience:

Software engineers, data engineers and data scientists who work with big data and want to add Spark to their toolbox.


Requirements:

Each participant will receive a questionnaire with installation instructions after registration. Based on the answers, we will send individual feedback.


Preparation:

Basic understanding of (big) data, statistics and programming in a language such as Java, Scala, Python or R.

Request In-House Course:

Request In-House Course

Waiting list for public course:

Sign up for the waiting list for more public course dates. Once we have enough people on the waiting list, we will determine a date that suits everyone as much as possible and schedule a new session. If you want to participate directly with two colleagues, we can even plan a public course specifically for you.

Waiting List Request

(If you already have 3 or more participants, we will discuss your preferred date directly with you and announce the course.)
