- Croma Campus' Apache Spark training course has been thoroughly reviewed by industry professionals to ensure that it meets industry standards. This article will show you how to use Apache Spark and the Spark Ecosystem, which includes Spark RDD, Spark SQL, and Spark MLlib.
- Things you will learn:
- The Apache Spark certification training course will teach you everything you need to know, starting with a basic Spark vs. Hadoop comparison.
You'll gain a thorough understanding of key Apache Spark principles.
Interact with and learn from your instructor as well as your classmates.
How to build Spark apps using Scala programming.
How to increase application performance and enable high-speed processing using Spark RDDs.
You'll learn how to customise Spark using Scala in this course (a short illustrative sketch follows this list).
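To give a flavour of what such a Spark application looks like, here is a minimal, illustrative sketch in Scala; the object name and sample data are made up for this example, and a local master is used purely for testing. It distributes a collection as an RDD, then transforms and aggregates it in parallel:

```scala
// Hypothetical, minimal sketch of a Spark RDD application in Scala.
// Assumes Spark is on the classpath; "local[*]" is used only for a quick local run.
import org.apache.spark.{SparkConf, SparkContext}

object RddQuickStart {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("RddQuickStart").setMaster("local[*]")
    val sc = new SparkContext(conf)

    // Distribute a small collection as an RDD, then transform and aggregate it in parallel.
    val numbers = sc.parallelize(1 to 1000000)
    val sumOfEvenSquares = numbers
      .filter(_ % 2 == 0)        // keep even numbers
      .map(n => n.toLong * n)    // square each value
      .reduce(_ + _)             // aggregate across partitions

    println(s"Sum of even squares: $sumOfEvenSquares")
    sc.stop()
  }
}
```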
- Whether you're a beginner or a seasoned veteran, our Apache Spark training course will help you grasp all of the principles and put them into practice at work. Let's talk about the course goals:
Be familiar with Big Data, its components and frameworks, as well as Hadoop Cluster design and modes.
Know how to program in Scala, how to implement it, and how to use the core Apache Spark constructs.
Gain an understanding of Apache Spark concepts and learn how to build Spark apps.
Understand the Apache Spark framework's concepts and deployment procedures.
Learn how to use Spark RDD internals, as well as the Spark API and Scala functions, to create and modify RDDs.
Be an expert in Spark SQL, SparkContext, Spark Streaming, MLlib, and GraphX, as well as RDDs and combiners (a brief illustrative example follows this list).
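As a small taste of the Spark SQL portion of these goals, the following hedged sketch builds a DataFrame in Scala and queries it both through the DataFrame API and with plain SQL; the column names and data here are made up for illustration only:

```scala
// Hedged sketch: a tiny Spark SQL / DataFrame job in Scala.
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.avg

object SparkSqlQuickStart {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("SparkSqlQuickStart")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Build a DataFrame from an in-memory sequence (illustrative data only).
    val people = Seq(("Asha", 29), ("Ravi", 35), ("Meena", 41)).toDF("name", "age")

    // Query it with the DataFrame API...
    people.groupBy().agg(avg($"age").as("avg_age")).show()

    // ...or with plain SQL through a temporary view.
    people.createOrReplaceTempView("people")
    spark.sql("SELECT name FROM people WHERE age > 30").show()

    spark.stop()
  }
}
```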
- The company where you work will also determine your income. Pay at companies such as Cognizant, Accenture, and Infosys is normally good, but you may make a lot more working for Amazon, Microsoft, or Yahoo.
A Spark developer at the beginning level can expect to earn between Rs 6,00,000 and Rs 10,00,000 per year.
A skilled developer might make anywhere between Rs 25,00,000 and Rs 40,00,000.
A Data Engineer with Apache Spark expertise may expect to earn more than Rs 10,00,000 per year on average.
You will receive a greater wage package after completing your Apache Spark certification course.
A Data Scientist with Apache Spark skills in Hyderabad may earn more than Rs 8,00,000 per year on average.
- If you want to break into the big data industry and succeed, Apache Spark is the way to go, as it offers a wide range of options for big data analysis. It is the most popular Big Data technology because its different approaches are effective against a variety of data difficulties.
- That's the reason there's a huge demand for Apache Spark training courses among students.
Because Spark can run alongside Hadoop components such as YARN and HDFS, while processing data far faster than MapReduce, it outperforms Hadoop.
Because of its strong Hadoop compatibility, companies are looking for a large number of Spark Developers.
Many companies are turning to Spark as a complementary big data platform since it processes data much quicker than Hadoop.
As technology progresses and new businesses turn to big data management to fulfil their needs, a plethora of new options emerge.
- Today, there are opportunities all around the world, including in India, resulting in a growth in professional opportunities for skilled persons.
Companies all across the world are using Spark as their primary big data processing platform.
You'll have the opportunity to work in a range of fields, including retail, software, media and entertainment, consulting, healthcare, and more.
To gain a competitive advantage, every industry is employing big data analytics and machine learning techniques.
- You will be required to undertake a variety of work duties and responsibilities after completing your Apache Spark training course.
Ability to define problems, collect data, establish facts, and draw valid conclusions using software code.
Using Spark, produce ready-to-use data by cleaning, processing, and analysing raw data from different mediation sources.
Refactor code to ensure that joins are executed quickly.
Assist with the Spark platform's technical architecture.
Use partitioning strategies to address specific use cases (a hedged sketch of one such technique follows this list).
Hold deep-dive working sessions to resolve Spark platform issues quickly.
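As an example of the kind of join and partitioning work listed above, here is an illustrative sketch only; the table names, column names, and file paths ("orders", "customers", "customer_id", "/data/...") are hypothetical. It co-partitions both sides of a join on the join key so that matching rows land in the same partitions:

```scala
// Illustrative sketch: repartitioning two DataFrames on the join key before joining,
// one common way to make large joins run faster. All names and paths are hypothetical.
import org.apache.spark.sql.SparkSession

object JoinTuningSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("JoinTuningSketch").master("local[*]").getOrCreate()

    val orders = spark.read.parquet("/data/orders")       // hypothetical input path
    val customers = spark.read.parquet("/data/customers") // hypothetical input path

    // Co-partition both sides on the join key so matching rows land in the same partitions.
    val joined = orders.repartition(200, orders("customer_id"))
      .join(customers.repartition(200, customers("customer_id")), "customer_id")

    joined.write.mode("overwrite").parquet("/data/enriched_orders")
    spark.stop()
  }
}
```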
- Top Hiring Companies for Apache Spark:
Well-known employers include Google, Cognizant Technology Solutions, TCS, IBM, and Accenture.
In addition, we have a team of experts who can assist you with resume writing and interview preparation.
You will have the opportunity to participate in interviews and be hired in a variety of industries.
There are several work opportunities available all over the world.
- Training Certificate:
With our certification, you may be able to work from anywhere.
In today's tech-driven environment, you'll be more valuable.
Our certification could help you outperform the competition.
Make a name for yourself as a sought-after expert.
Obtain a sizable remuneration package.
Why Should You Take Apache Spark Training?
Course Duration: 24 Hrs.

Flexible Batches For You
14-Dec-2024*: Weekend (SAT - SUN), Mor | Aft | Eve Slots
16-Dec-2024*: Weekday (MON - FRI), Mor | Aft | Eve Slots
18-Dec-2024*: Weekday (MON - FRI), Mor | Aft | Eve Slots
Want To Know More About This Course?
Program fees are indicative only.*
Timings don't suit you? We can set up a batch at your convenient time.
Program Core Credentials
- Trainer Profiles: Industry Experts
- Trained Students: 10000+
- Success Ratio: 100%
- Corporate Training: For India & Abroad
- Job Assistance: 100%
For queries, feedback, or assistance, contact Croma Campus Learner Support.
CURRICULUM & PROJECTS
Apache Spark and Scala Training
- This Spark certification training helps you master the essential skills of the Apache Spark open-source framework and the Scala programming language, including Spark Streaming, Spark SQL, machine learning programming, GraphX programming, and Spark Shell scripting.
- In this program you will learn:
Introduction
Scala
Using Resilient Distributed Datasets
Spark SQL, Data Frames, and Data Sets
Running Spark on a cluster
Machine Learning with Spark ML
Spark Streaming
GraphX
Overview
Kafka Producer
Kafka Consumers
Kafka Internals
Cluster Architecture and Administering Kafka
Kafka Monitoring & Kafka Connect
Kafka Stream Processing
Kafka Integration with Hadoop, Storm, and Spark
Kafka Integration with Flume, Talend and Cassandra
Career Guidance and Roadmap
Introduction
PIG Architecture
Data Models, Operators, and Streaming in PIG
Functions in PIG
Advanced Concepts in PIG
Hadoop Overview
NoSQL Databases: HBase
Administration in HBase
Troubleshooting in HBase
Performance Tuning in HBase
Apache HBase Operations Continuity
Apache HBase Ecosystem
- Introduction
Big Data Overview
Apache Hadoop Overview
Hadoop Distributed File System
Hadoop MapReduce Overview
Introduction to IntelliJ and Scala
Installing IntelliJ and Scala
Apache Spark Overview
What’s new in Apache Spark 3
- Scala
Flow control in Scala
Functions in Scala
Data Structures in Scala
- Using Resilient Distributed Datasets
The Resilient Distributed Dataset
Ratings Histogram Example
Key/Value RDDs and the Average Friends by Age Example
Filtering RDDs and the Minimum Temperature by Location Example
Check Your Results and Implementation Against Mine (a small pair-RDD sketch follows this module)
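As a flavour of the hands-on exercises in this module, the sketch below averages values per key with a pair RDD, in the spirit of the Average Friends by Age example; the input data is made up for illustration:

```scala
// Minimal sketch: averaging values per key with a pair RDD (illustrative data only).
import org.apache.spark.{SparkConf, SparkContext}

object FriendsByAgeSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("FriendsByAge").setMaster("local[*]"))

    // (age, numberOfFriends) pairs -- made-up sample data.
    val ageAndFriends = sc.parallelize(Seq((33, 385), (33, 2), (55, 221), (40, 465)))

    // Map each count to (count, 1), add pairs per key, then divide to get the average.
    val averages = ageAndFriends
      .mapValues(friends => (friends, 1))
      .reduceByKey((a, b) => (a._1 + b._1, a._2 + b._2))
      .mapValues { case (total, count) => total.toDouble / count }

    averages.collect().foreach(println)
    sc.stop()
  }
}
```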
- Spark SQL, Data Frames, and Data Sets
Introduction to Spark SQL
What are Data Frames
What are Data Sets
Item-Based Collaborative Filtering in Spark, cache(), and persist() (a short caching sketch follows this module)
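The following minimal sketch illustrates cache() from this module: a small DataFrame, built from made-up data, is cached so it is not recomputed across several actions:

```scala
// Hedged illustration of cache(): persist a DataFrame that is reused by multiple actions.
import org.apache.spark.sql.SparkSession

object CacheSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("CacheSketch").master("local[*]").getOrCreate()
    import spark.implicits._

    val ratings = Seq(("u1", "m1", 5.0), ("u1", "m2", 3.0), ("u2", "m1", 4.0))
      .toDF("user", "movie", "rating")
      .cache()                                        // keep in memory for the actions below

    println(ratings.count())                          // first action materialises the cache
    ratings.groupBy($"movie").avg("rating").show()    // reuses the cached data

    ratings.unpersist()
    spark.stop()
  }
}
```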
- Running Spark on a cluster
What is a Cluster
Cluster management in Hadoop
Introducing Amazon Elastic MapReduce
Partitioning Concepts
Troubleshooting and managing dependencies
- Machine Learning with Spark ML
Introduction to MLlib
Using MLlib
Linear Regression with MLlib (a minimal sketch follows this module)
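A minimal Spark ML sketch in the spirit of this module is shown below; it fits a linear regression on a tiny, made-up dataset:

```scala
// Sketch only: fitting a linear regression with Spark ML on illustrative data.
import org.apache.spark.ml.linalg.Vectors
import org.apache.spark.ml.regression.LinearRegression
import org.apache.spark.sql.SparkSession

object LinearRegressionSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("LinearRegressionSketch").master("local[*]").getOrCreate()
    import spark.implicits._

    // (label, features) rows -- made-up values only.
    val training = Seq(
      (1.0, Vectors.dense(0.0)),
      (2.0, Vectors.dense(1.0)),
      (3.0, Vectors.dense(2.0))
    ).toDF("label", "features")

    val model = new LinearRegression().setMaxIter(10).fit(training)
    println(s"Coefficients: ${model.coefficients}, Intercept: ${model.intercept}")

    spark.stop()
  }
}
```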
- Spark Streaming
Spark Streaming
The DStream API for Spark Streaming (a minimal streaming sketch follows this module)
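The sketch below shows the DStream API in its simplest form: word counts over a socket stream in 5-second micro-batches. The host and port are placeholders; for a quick local test you can feed it text with `nc -lk 9999`:

```scala
// Minimal DStream sketch: word counts over a socket stream, printed each batch.
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object StreamingSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("StreamingSketch").setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(5))   // 5-second micro-batches

    val lines = ssc.socketTextStream("localhost", 9999) // placeholder host and port
    val counts = lines.flatMap(_.split("\\s+")).map((_, 1)).reduceByKey(_ + _)
    counts.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```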
- GraphX
What is GraphX
About Pregel
Breadth-First-Search with Pregel
Using the Pregel API with the Spark API (a small GraphX sketch follows this module)
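As a tiny illustration of GraphX itself (Pregel is left to the course material), the sketch below builds a three-vertex graph from made-up data and prints each vertex's in-degree:

```scala
// Illustrative GraphX sketch: build a tiny graph and count each vertex's in-degree.
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.graphx.{Edge, Graph}

object GraphXSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("GraphXSketch").setMaster("local[*]"))

    // Vertices and edges are made up for this example.
    val vertices = sc.parallelize(Seq((1L, "A"), (2L, "B"), (3L, "C")))
    val edges = sc.parallelize(Seq(Edge(1L, 2L, "follows"), Edge(2L, 3L, "follows"), Edge(1L, 3L, "follows")))

    val graph = Graph(vertices, edges)
    graph.inDegrees.collect().foreach { case (id, deg) => println(s"vertex $id has in-degree $deg") }

    sc.stop()
  }
}
```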
- Overview
Introduction to Big Data
Big Data Analytics
Need for Kafka
What is Kafka
Kafka Features
Kafka Concepts
Kafka Architecture
Kafka Components
Zookeeper
Where is Kafka Used
Kafka Installation
Kafka Cluster
Type of Kafka Clusters
Configuring Single Node Single Broker Cluster
- Kafka Producer
Configuring Single Node Multi Broker Cluster
Sending a Message to Kafka
Producing Keyed and Non-Keyed Messages
Sending a Message Synchronously & Asynchronously
Configuring Producers
Serializers
Serializing Using Apache Avro
Partitions (an illustrative producer sketch follows this module)
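The hedged sketch below shows a Kafka producer written in Scala using the standard Java client; the broker address and topic name ("localhost:9092", "demo-topic") are placeholders. Sending a keyed record, as here, routes records with the same key to the same partition:

```scala
// Hedged sketch of a Kafka producer in Scala using the standard Java client.
import java.util.Properties
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

object ProducerSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "localhost:9092") // placeholder broker address
    props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer")

    val producer = new KafkaProducer[String, String](props)

    // Keyed message: records with the same key go to the same partition.
    producer.send(new ProducerRecord[String, String]("demo-topic", "user-42", "hello, kafka"))

    producer.flush()
    producer.close()
  }
}
```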
- Kafka Consumers
Consumers and Consumer Groups
Standalone Consumer
Consumer Groups and Partition Rebalance
Creating a Kafka Consumer
Subscribing to Topics
The Poll Loop
Configuring Consumers
Commits and Offsets
Rebalance Listeners
Consuming Records With Specific Offsets
Deserializers (a minimal consumer poll-loop sketch follows this module)
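To round off this module, here is a minimal consumer poll-loop sketch in Scala (Java client API, Scala 2.13 collection converters); the group id, topic, and broker address are placeholders, and offsets are committed automatically under the default configuration:

```scala
// Hedged sketch of a Kafka consumer poll loop in Scala.
import java.time.Duration
import java.util.{Collections, Properties}
import scala.jdk.CollectionConverters._
import org.apache.kafka.clients.consumer.KafkaConsumer

object ConsumerSketch {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "localhost:9092") // placeholder broker address
    props.put("group.id", "demo-group")              // placeholder consumer group
    props.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer")
    props.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer")

    val consumer = new KafkaConsumer[String, String](props)
    consumer.subscribe(Collections.singletonList("demo-topic"))

    try {
      while (true) {
        // Poll fetches a batch of records; offsets are auto-committed by default.
        val records = consumer.poll(Duration.ofMillis(500))
        records.asScala.foreach(r => println(s"${r.key()} -> ${r.value()} @ offset ${r.offset()}"))
      }
    } finally {
      consumer.close()
    }
  }
}
```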
- Kafka Internals
Cluster Membership
The Controller
Replication
Request Processing
Physical Storage
Reliability
Broker Configuration
Using Producers in a Reliable System
Using Consumers in a Reliable System
Validating System Reliability
Performance Tuning in Kafka
- Cluster Architecture and Administering Kafka
Use Cases - Cross-Cluster Mirroring
Multi-Cluster Architectures
Apache Kafka’s Mirror Maker
Other Cross-Cluster Mirroring Solutions
Topic Operations
Consumer Groups
Dynamic Configuration Changes
Partition Management
Consuming and Producing
Unsafe Operations
- Kafka Monitoring & Kafka Connect
Considerations When Building Data Pipelines
Metric Basics
Kafka Broker Metrics
Client Monitoring
Lag Monitoring
End-to-End Monitoring
Kafka Connect
When to Use Kafka Connect
Kafka Connect Properties
- Kafka Stream Processing
Stream Processing
Stream-Processing Concepts
Stream-Processing Design Patterns
Kafka Streams by Example
Kafka Streams: Architecture Overview
- Kafka Integration with Hadoop, Storm, and Spark
Apache Hadoop Basics
Hadoop Configuration
Kafka Integration with Hadoop
Apache Storm Basics
Configuration of Storm
Integration of Kafka with Storm
Apache Spark Basics
Spark Configuration
Kafka Integration with Spark
- Kafka Integration with Flume, Talend and Cassandra
Flume Basics
Integration of Kafka with Flume
Cassandra Basics, Such as Keyspace and Table Creation
Integration of Kafka with Cassandra
Talend Basics
Integration of Kafka with Talend
- Career Guidance and Roadmap
Apache Hadoop Overview
Hadoop Distributed File System
Hadoop MapReduce Overview
Introduction to PIG
Prerequisites for Apache PIG
Exploring use cases for PIG
History of Apache PIG
Why you need PIG
Significance of PIG
PIG over MapReduce
When PIG suits the most
When to avoid PIG
- PIG Architecture
PIG Latin Language
Running PIG in Different Modes
PIG Architecture
GRUNT Shell
PIG Latin Statements
Running Pig Scripts
Utility Commands
- Data Models, Operators, and Streaming in PIG
PIG Data Model - Scalar Data Types
PIG Data Model - Complex Data Type
Arithmetic Operators
Comparison Operators
Cast Operators
Type Construction Operators
Relation Operators
Loading and Storing Operators
Filtering Operators
Filtering Operators: Pig Streaming with Python
Grouping and Joining Operators
Sorting Operator
Combining and Splitting Operators
Diagnostic Operators
- Functions in PIG
Eval Functions
Load and Store Functions
Tuple and Bag Functions
String Functions
Math Function
- Advanced Concepts in PIG
File compression in PIG
Intermediate Compression
Pig Unit Testing
Embedded PIG in JAVA
Pig Macros
Import Macros
Parameter Substitutions
- Hadoop Overview
Course overview
Big Data Overview
Hadoop Overview
HDFS
Hadoop Ecosystem
What is a Hadoop Framework
Type of Hadoop Frameworks
- NoSQL Databases: HBase
NoSQL Databases Hbase
NoSQL Introduction
HBase Overview
HBase Architecture
Data Model
Connecting to HBase
HBase Shell
- Administration in HBase
Introduction
Learn and Understand Hbase Fault Tolerance
Hardware Recommendations
Software Recommendations
Hbase Deployment at scale
Installation with Cloudera Manager
Basic Static Configuration
Rolling Restarts and Upgrades
Interacting with HBase
- Troubleshooting in HBase
Introduction
Troubleshooting Distributed Clusters
Learn How To Use the Hbase UI
Learn How To Use the Metrics
Learn How To Use the Logs
- Performance Tuning in HBase
Introduction
Generating Load & Load Test Tool
Generating With YCSB
Region Tuning
Table Storage Tuning
Memory Tuning
Tuning with Failures
Tuning for Modern Hardware
- Apache HBase Operations Continuity
Introduction
Corruption: hbck
Corruption: Other Tools
Security
Security Demo
Snapshots
Import, Export, and CopyTable
Cluster Replication
- Apache HBase Ecosystem
Introduction
Hue
HBase with Apache Phoenix
Mock Interviews
Projects
Phone (For Voice Call): +91-971 152 6942
WhatsApp (For Call & Chat): +918287060032

SELF ASSESSMENT
Learn, Grow & Test your skill with Online Assessment Exam to achieve your Certification Goals
FAQ's
Pre-course reading will be provided so that you are familiar with the content before the class begins.
Yes. Visit our payment plans website to discover more about the payment choices available at Croma Campus.
You will be able to register for our next training session, as we only offer two to three each year.
Yes, there are a variety of payment options.
- Build an Impressive Resume
- Get Tips from the Trainer to Clear Interviews
- Attend Mock-Up Interviews with Experts
- Get Interviews & Get Hired
If yes, register today and get impeccable learning solutions!
Training Features
Instructor-led Sessions
The most traditional way to learn, with increased visibility, monitoring, and control over learners, plus the ease of learning at any time from internet-connected devices.
Real-life Case Studies
Case studies based on top industry frameworks help you relate your learning to real-world industry solutions.
Assignment
Assignments add scope for improvement and foster analytical abilities and skills through focused academic work.
Lifetime Access
Get unlimited lifetime access to the course, giving you the freedom to learn at your own pace.
24 x 7 Expert Support
Round-the-clock expert support is available to resolve all your queries related to the course.
Certification
Each certification associated with the program is affiliated with top universities, giving you an edge in the subject.
Showcase your Course Completion Certificate to Recruiters
- Training Certificate is Governed By 12 Global Associations.
- Training Certificate is Powered by “Wipro DICE ID”
- Training Certificate is Powered by "Verifiable Skill Credentials"