Tools
Apache Airflow
Platform to programmatically author, schedule, and monitor complex data workflows and their task dependencies.
Apache Kafka
Event streaming platform for high‑throughput pipelines and real‑time analytics.
Apache Spark
Unified engine for large-scale data processing, SQL analytics, and reproducible pipelines.
Apache Superset
Data exploration and visualization platform for building interactive dashboards and surfacing business intelligence insights.
SQL Server
Local SQL Server Docker stack to create a database, create a table, and load sample data.