fb

Spark with Scala - Hands On with Big Data!

Spark gives us a comprehensive, unified framework to manage big data processing requirements with a variety of data sets that are diverse in nature (text data, graph data, etc) as well as the source of data (batch vs. real-time streaming data). In this course, we will understand the key difference between Spark and other Hadoop frameworks. We will also learn about programming and streaming data using Spark.

365 days course access

Live instructor-led online classes

Industry-based projects

Build your big data skills to the higher level by analying large data sets with Apache Spark.

E-box Job Assistant

Get noticed by the top hiring companies

Guide from “Amphi”

The Super teacher

Includes:

  • 1 hour of lecture Videos
  • 38 hands-on practice exercises
  • 2 Assessment exercises
  • 125 knowledge based questions
  • 1 Live connect session
             (Master classes)
  • Lifetime access
Contact Us
+91 95669 33778

Spark with Scala - Hands On with Big Data!

ABOUT THE COURSE

This course helps you to understand the Spark program flow, basic Scala constructs, RDD operations, querying data using Spark SQL and Spark Streaming to initialize, transform, deploy and monitor applications.

COURSE OBJECTIVES

Upon successful completion of the course, the learner will be able to :
  • Learn about Spark and Scala, and the program flow of Spark.
  • Learn about loading and storing data in various formats using RDD operations
  • Learn about querying using Spark SQL
  • Learn about Spark Streaming to initialize, transform, deploy and monitor applications

Course Content

Spark and Scala Fundamentals

In this module, you will be able to understand to the difference between Spark and Hadoop framework, key components of Spark ecosystem, Spark program flow, how to work with basic Scala constructs and building programs in Spark.

  • 1 Video
  • 5 Hours
  • 45 Problems

Spark Programming

In this module, you will be able to understand the creation and performance of RDD operations, how to pass functions to Spark, perform transformations and actions on RDD, how to work with key/value pairs and how to load and save data in various formats.

  • 1 Video
  • 5 Hours
  • 45 Problems

Spark SQL

In this module, you will be able to understand the use of SchemaRDD in Spark programs, how to learn and query data with Apache Hive and JSON support, how to use Spark SQL JDBC server to run Spark SQL, how to use Spark SQL UDFs and Hive UDFs and Fine-Tune Spark SQL Performance.

  • 1 Video
  • 2 Hours
  • 25 Problems

Spark Streaming

In this module, you will be able to understand spark Streaming architecture and the concept of linking, how to initialize StreamingContext, input DStreams and receivers, various transformations on DStreams, how to deploy Spark streaming applications and Monitor streaming applications.

  • 1 Video
  • 2 Hours
  • 25 Problems

Spark MLlib and GraphX

Module description In this module, you will be aware of Graphs and its computational features, GraphX and its use-cases, Machine Learning Tools and its Algorithms.

  • 1 Video
  • 2 Hours
  • 25 Problems

Recommended Courses

You can opt for the following courses once you complete your ongoing course

About E-Box

E-Box is a Technology Enabled Active Learning and
Assessment platform for technology and engineering
domains apart from the basic LMS components like
quizzes, assignments, lesson components.

Connect with us

E-Box Google Playstore