Writing production-ready ETL pipelines in Python / Pandas

Learn how to write professional ETL pipelines using best practices in Python and Data Engineering.
4.36 (854 reviews)
Udemy
platform
English
language
Software Engineering
category
instructor
Writing production-ready ETL pipelines in Python / Pandas
6 519
students
7 hours
content
Jul 2022
last update
$59.99
regular price

What you will learn

How to write professional ETL pipelines in Python.

Steps to write production level Python code.

How to apply functional programming in Data Engineering.

How to do a proper object oriented code design.

How to use a meta file for job control.

Coding best practices for Python in ETL/Data Engineering.

How to implement a pipeline in Python extracting data from an AWS S3 source, transforming and loading the data to another AWS S3 target.

Course Gallery

Writing production-ready ETL pipelines in Python / Pandas – Screenshot 1
Screenshot 1Writing production-ready ETL pipelines in Python / Pandas
Writing production-ready ETL pipelines in Python / Pandas – Screenshot 2
Screenshot 2Writing production-ready ETL pipelines in Python / Pandas
Writing production-ready ETL pipelines in Python / Pandas – Screenshot 3
Screenshot 3Writing production-ready ETL pipelines in Python / Pandas
Writing production-ready ETL pipelines in Python / Pandas – Screenshot 4
Screenshot 4Writing production-ready ETL pipelines in Python / Pandas

Charts

Students
Price
Rating & Reviews
Enrollment Distribution

Comidoc Review

Our Verdict

This course offers valuable insights into writing ETL pipelines using Python, pandas, and Data Engineering best practices. Though it lacks detailed explanations for a few concepts and has some areas with minimal coverage, the opportunity to learn from a real-world project outweighs these weaknesses. Considering its recent updates in July 2022, students can benefit from a production-ready pipeline perspective and apply this knowledge to their projects.

What We Liked

  • Covers end-to-end ETL pipeline development using best practices in Python and Data Engineering.
  • Provides an opportunity to learn from a real-world project and see the instructor's thinking process.
  • Incorporates functional programming, object-oriented code design, and a meta file for job control.
  • Exposes students to pandas and Windowed SQL functions, allowing them to implement ETL pipelines in Python.

Potential Drawbacks

  • Lacks detailed explanations for some concepts, causing the need for self-driven research.
  • Expectations regarding testing and productionizing the pipeline may not be fully met.
  • Object-oriented approach in certain parts might introduce unnecessary complexity to basic tasks.
  • Minimal coverage provided on deploying pipelines and AWS/Kubernetes setup, leaving some students disappointed.
4175968
udemy ID
10/07/2021
course created date
16/07/2021
course indexed date
Bot
course submited by
Writing production-ready ETL pipelines in Python / Pandas - | Comidoc