Apache Airflow 2.0 : Complete Distributed Configuration

Setup HA Airflow using multiple Schedulers and Celery Workers
4.29 (31 reviews)
Udemy
platform
English
language
Other
category
Apache Airflow 2.0 : Complete Distributed Configuration
209
students
2.5 hours
content
Apr 2021
last update
$19.99
regular price

Why take this course?


Course Title: Apache Airflow 2.🚀: Complete Distributed Configuration

**Course Instructor:** Ganesh Dhareshwar


Unlock the Full Potential of Airflow with Distributed Configuration! 🚀

Airflow, the open-source workflow automation platform, is your go-to solution for orchestrating complex data pipelines. With its emphasis on simplicity, scalability, and extensibility, it's no wonder that it's the choice for data engineers worldwide. 🌟

Why Take This Course?

  • Master Distributed Airflow: Learn to manage over 100 jobs or DAGs in parallel with a distributed Apache Airflow setup.
  • Expertise in Celery Executor: Understand the inner workings of Sequential, Local, and Celery Executor, and how to configure them effectively.
  • Embrace Airflow 2.0: Get hands-on experience with the latest features and performance enhancements in Airflow 2.0.
  • High Availability Setup: Discover how to set up a HA Airflow using multiple schedulers and Celery workers for resilience and redundancy.
  • Real-World Application: Learn best practices that can be applied directly to your organisation's environment, making it ready for multi-team collaboration.

Course Highlights:

  • Dive into Airflow 2.0: Explore the new enhancements and HA architecture of Airflow 2.0, which brings a massive performance improvement. 📈
  • Install Essential Components: Step-by-step guidance on installing Webserver, Scheduler, Celery workers, and Flower components.
  • Configure Multiple Schedulers: Learn how to configure and optimize multiple schedulers for peak performance.
  • Key Features Explained: Understand the crucial aspects of login management, email alerting, and effective logs management. 💌

Course Structure: 📚

Module 1: Introduction to Airflow

  • Understanding Airflow architecture and concepts
  • Setting up a basic Airflow instance

Module 2: Distributed Airflow with Celery Executor

  • Configuring Celery Executor on AWS EC2 instances
  • Best practices for parallel processing with Celery workers

Module 3: Upgrading to Apache Airflow 2.0

  • Installing and configuring the latest Airflow 2.0
  • Exploring new features and enhancements in Airflow 2.0

Module 4: High Availability Setup with Multiple Schedulers

  • Setting up HA architecture with multiple schedulers
  • Performance tuning for distributed configurations

Module 5: Advanced Features and Optimization

  • Configure login, email alerts, and logs management
  • Final touches on a scalable and resilient Airflow setup

What You Will Achieve:

By completing this course, you will have a robust, distributed Apache Airflow 2.0 configuration capable of handling high volumes of jobs in parallel. You'll understand the intricacies of HA configurations, the performance implications of using multiple schedulers, and how to manage your pipeline with ease and efficiency. This knowledge is not just theoretical—it's directly applicable to real-world scenarios where data processing needs to be both efficient and reliable. 🛠️💻

Are you ready to take control of your data pipelines? Enroll in Apache Airflow 2.0: Complete Distributed Configuration today and transform the way you automate workflows! 🎉

Loading charts...

Related Topics

3447220
udemy ID
25/08/2020
course created date
25/10/2020
course indexed date
Bot
course submited by