Tools

Apache Airflow

Platform to programmatically author, schedule, and monitor complex data workflows and their task dependencies.

Apache Kafka

Event streaming platform for high‑throughput pipelines and real‑time analytics.

Apache Spark

Unified engine for large-scale data processing, SQL queries, and reproducible pipelines.

Apache Superset

Data exploration and visualization platform for building interactive dashboards and surfacing business intelligence insights.

SQL Server

Local SQL Server stack running in Docker that creates a database, creates a table, and loads sample data.