PySpark Essentials for Data Scientists (Big Data + Python)

Learn how to wrangle Big Data for Machine Learning using Python in PySpark taught by an industry expert!
4.42 (820 reviews)
Udemy
platform
English
language
Data Science
category
instructor
PySpark Essentials for Data Scientists (Big Data + Python)
5 648
students
17.5 hours
content
May 2022
last update
$54.99
regular price

What you will learn

Use Python with Big Data on a distributed framework (Apache Spark)

Work with REAL datasets on realistic consulting projects

How to streaming LIVE data from Twitter using Spark Structured Streaming

Learn how to create a "Pandora Like" app that classifies songs into genres using machine learning

Flag suspicious job postings using Natural Language Processing

Use machine learning to predict optimal cement strength and the factors that affect it

Classify Christmas cooking recipes using Topic Modeling (LDA)

Customer Segmentation using Gaussian Mixture Modeling (Clustering)

Use cluster analysis to develop a strategy designed to increase college graduation rates for under-priveleged populations

How to use the k-means clustering algorithm to define a marketing outreach strategy

Integrate a UI to monitor your model training and development process with MLflow

Theory and application of cutting edge data science algorithms

Manipulate, Join and Aggregate Dataframes in Spark with Python

Learn how to apply Spark's machine learning techniques on distributed Dataframes

Cross Validation & Hyperparameter Tuning

Frequent Pattern Mining Techniques

Classification & Regression Techniques

Data Wrangling for Natural Language Processing

How to write SQL Queries in Spark

Course Gallery

PySpark Essentials for Data Scientists (Big Data + Python) – Screenshot 1
Screenshot 1PySpark Essentials for Data Scientists (Big Data + Python)
PySpark Essentials for Data Scientists (Big Data + Python) – Screenshot 2
Screenshot 2PySpark Essentials for Data Scientists (Big Data + Python)
PySpark Essentials for Data Scientists (Big Data + Python) – Screenshot 3
Screenshot 3PySpark Essentials for Data Scientists (Big Data + Python)
PySpark Essentials for Data Scientists (Big Data + Python) – Screenshot 4
Screenshot 4PySpark Essentials for Data Scientists (Big Data + Python)

Charts

Students
Price
Rating & Reviews
Enrollment Distribution

Comidoc Review

Our Verdict

This course offers comprehensive coverage of PySpark, particularly within the context of machine learning applications. However, improvements can be made regarding instructor interaction, lecture quality throughout the entire course duration, and a more consistent alignment of presented content with corresponding coding examples.

What We Liked

  • Exhaustive coverage of PySpark for machine learning, from basics to advanced concepts
  • Well-designed projects that provide practical experience
  • Helpful custom functions provided for further use
  • Instructor's elegant and graceful approach to programming

Potential Drawbacks

  • Insufficient responses from the instructor, leading to unresolved questions
  • Rapid decline in lecture quality during machine learning portion
  • Lack of detailed explanations for typos and unintuitive syntax
  • Occasional mismatch between presented content and coding content
2839728
udemy ID
27/02/2020
course created date
04/10/2020
course indexed date
Bot
course submited by
PySpark Essentials for Data Scientists (Big Data + Python) - | Comidoc