Tools

Apache Airflow

Platform to programmatically author, schedule, and monitor complex data workflows and dependencies.

Apache Kafka

Distributed event streaming platform for high-throughput data pipelines and real-time analytics.

Apache Spark

Unified engine for scalable data processing, SQL analytics, and machine learning across large datasets.

Apache Superset

Data exploration and visualization platform for creating interactive dashboards and business intelligence insights.

SQL Server

Relational database platform for enterprise data storage, querying, and business intelligence workloads.