ETL and Data Pipelines with Shell, Airflow and Kafka
Using industry-standard technologies like Shell (Bash) scripting, Apache Airflow, and Apache Kafka, this IBM course explains how to design, create, automate, and manage ETL (Extract, Transform, Load) processes and data pipelines. Students have practical experience building batch and streaming data pipelines that are utilized in contemporary data engineering settings.
- Provider/Creator: IBM
- Platform: Coursera
- Category: ETL & Pipelines
- Level: Intermediate
- Duration: ~13 Hours
- Certificate: Optional
- Rating: 4.6+/5
- Direct Course Link: ETL and Data Pipelines with Shell, Airflow and Kafka
- Recommended For: Aspiring Data Engineers Data Analysts transitioning to Data Engineering Database Administrators ETL Developers Cloud & Big Data Professionals Anyone preparing for the IBM Data Engineering Professional Certificate
- Prerequisites
- Basic knowledge of SQL, relational databases, datasets, data analysis, and Linux/Bash Commands
- Having prior Python knowledge is helpful but not mandatory.
