Apache Spark Hands on Specialization for Big Data Analytics
In-depth course to master Apache Spark Development using Scala for Big Data (with 30+ real-world & hands-on examples)
3.96 (544 reviews)

12 454
students
12 hours
content
Aug 2019
last update
$54.99
regular price
What you will learn
Understand the relationship between Apache Spark and Hadoop Ecosystem
Understand Apache Spark use-cases and advanced characteristics
Understand Apache Spark Architecture and how it works
Understand how Apache Spark on YARN (Hadoop) works in multiple modes
Understand development life-cycle of Apache Spark Applications in Python and Scala
Learn the foundations of Scala programming language
Understand Apache Spark's primary data abstraction (RDDs)
Understand and use RDDs advanced characteristics (e.g. partitioning)
Learn nuances in loading files in Hadoop Distributed File system in Apache Spark
Learn implications of delimiters in text files and its processing in Spark
Create and use RDDs by parallelizing Scala's collection objects and implications
Learn the usage of Spark and YARN Web UI to gain in-depth operational insights
Understand Spark's Direct Acyclic Graph (DAG) based execution model and implications
Learn Transformations and their lazy execution semantics
Learn Map transformation and master its applications in real-world challenges
Learn Filter transformation and master its usage in real-world challenges
Learn Apache Spark's advanced Transformations and Actions
Learn and use RDDs of different JVM objects including collections and understanding critical nuances
Learn and use Apache Spark for statistical analysis
Learn and master Key Value Pair RDDs and their applications in complex Big Data problems
Learn and master Join Operations on complex Key Value Pair RDDs in Apache Spark
Learn how RDDs caching works and use it for advanced performance optimization
Learn how to use Apache Spark for Data Ranking problems
Learn how to use Apache Spark for handling and processing structured and unstructured data
Learn how to use Apache Spark for advanced Business Analytics
Learn how to use Apache Spark for advanced data integrity and quality checks
Learn how to use Scala's advanced features like functional programming and pattern matching
Learn how to use Apache Spark for logs processing
Course Gallery




Charts
Students
Price
Rating & Reviews
Enrollment Distribution
Comidoc Review
Our Verdict
While the course title is somewhat misleading in suggesting a broader scope than Spark Core alone, this specialization offers in-depth explorations of key RDD concepts through hands-on exercises. This laser focus on Spark Core programming makes it an ideal resource for beginners and mid-level professionals seeking to strengthen their foundation of RDD transformations and actions. Unfortunately, those hoping to learn about other Apache Spark components like SQL and Streaming may need to seek additional learning resources as these topics are not covered in this specialization.
What We Liked
- Comprehensive coverage of Spark Core with in-depth explanations and hands-on examples
- Focus on practical programming scenarios that build confidence in using RDDs, various transformations, and actions
- Rich course content updated frequently to ensure relevance and accuracy
- Highly responsive instructor praised for knowledge and accessibility
Potential Drawbacks
- Misleading title implies broader focus on the Apache Spark ecosystem than just Spark Core
- Lack of integration with other spark components like SQL, Streaming, MLlib, or GraphX
- Tutorial design relies on Cloudera environment which can be impractical for users without access to it
- Instructional pace may feel rushed and challenging at times, impeding grasp of complex topics
Related Topics
1281488
udemy ID
06/07/2017
course created date
24/11/2019
course indexed date
Bot
course submited by