Apache Spark Hands on Specialization for Big Data Analytics

In-depth course to master Apache Spark Development using Scala for Big Data (with 30+ real-world & hands-on examples)
3.96 (544 reviews)
Udemy
platform
English
language
Data Science
category
instructor
Apache Spark Hands on Specialization for Big Data Analytics
12 454
students
12 hours
content
Aug 2019
last update
$54.99
regular price

What you will learn

Understand the relationship between Apache Spark and Hadoop Ecosystem

Understand Apache Spark use-cases and advanced characteristics

Understand Apache Spark Architecture and how it works

Understand how Apache Spark on YARN (Hadoop) works in multiple modes

Understand development life-cycle of Apache Spark Applications in Python and Scala

Learn the foundations of Scala programming language

Understand Apache Spark's primary data abstraction (RDDs)

Understand and use RDDs advanced characteristics (e.g. partitioning)

Learn nuances in loading files in Hadoop Distributed File system in Apache Spark

Learn implications of delimiters in text files and its processing in Spark

Create and use RDDs by parallelizing Scala's collection objects and implications

Learn the usage of Spark and YARN Web UI to gain in-depth operational insights

Understand Spark's Direct Acyclic Graph (DAG) based execution model and implications

Learn Transformations and their lazy execution semantics

Learn Map transformation and master its applications in real-world challenges

Learn Filter transformation and master its usage in real-world challenges

Learn Apache Spark's advanced Transformations and Actions

Learn and use RDDs of different JVM objects including collections and understanding critical nuances

Learn and use Apache Spark for statistical analysis

Learn and master Key Value Pair RDDs and their applications in complex Big Data problems

Learn and master Join Operations on complex Key Value Pair RDDs in Apache Spark

Learn how RDDs caching works and use it for advanced performance optimization

Learn how to use Apache Spark for Data Ranking problems

Learn how to use Apache Spark for handling and processing structured and unstructured data

Learn how to use Apache Spark for advanced Business Analytics

Learn how to use Apache Spark for advanced data integrity and quality checks

Learn how to use Scala's advanced features like functional programming and pattern matching

Learn how to use Apache Spark for logs processing

Course Gallery

Apache Spark Hands on Specialization for Big Data Analytics – Screenshot 1
Screenshot 1Apache Spark Hands on Specialization for Big Data Analytics
Apache Spark Hands on Specialization for Big Data Analytics – Screenshot 2
Screenshot 2Apache Spark Hands on Specialization for Big Data Analytics
Apache Spark Hands on Specialization for Big Data Analytics – Screenshot 3
Screenshot 3Apache Spark Hands on Specialization for Big Data Analytics
Apache Spark Hands on Specialization for Big Data Analytics – Screenshot 4
Screenshot 4Apache Spark Hands on Specialization for Big Data Analytics

Charts

Students
12/1902/2004/2006/2008/2010/2012/2002/2104/2106/2108/2110/2112/2102/2204/2207/2209/2211/2201/2304/2306/2309/2312/2302/2405/2407/2410/2401/2505/2503 5007 00010 50014 000
Price
Rating & Reviews
Enrollment Distribution

Comidoc Review

Our Verdict

While the course title is somewhat misleading in suggesting a broader scope than Spark Core alone, this specialization offers in-depth explorations of key RDD concepts through hands-on exercises. This laser focus on Spark Core programming makes it an ideal resource for beginners and mid-level professionals seeking to strengthen their foundation of RDD transformations and actions. Unfortunately, those hoping to learn about other Apache Spark components like SQL and Streaming may need to seek additional learning resources as these topics are not covered in this specialization.

What We Liked

  • Comprehensive coverage of Spark Core with in-depth explanations and hands-on examples
  • Focus on practical programming scenarios that build confidence in using RDDs, various transformations, and actions
  • Rich course content updated frequently to ensure relevance and accuracy
  • Highly responsive instructor praised for knowledge and accessibility

Potential Drawbacks

  • Misleading title implies broader focus on the Apache Spark ecosystem than just Spark Core
  • Lack of integration with other spark components like SQL, Streaming, MLlib, or GraphX
  • Tutorial design relies on Cloudera environment which can be impractical for users without access to it
  • Instructional pace may feel rushed and challenging at times, impeding grasp of complex topics
1281488
udemy ID
06/07/2017
course created date
24/11/2019
course indexed date
Bot
course submited by
Apache Spark Hands on Specialization for Big Data Analytics - | Comidoc