This project demonstrates how raw Instacart order data can be transformed into reliable, analysis-ready datasets using a lakehouse architecture. The pipeline ingests CSV files into Databricks, ...
🚀 Instacart Medallion Data Engineering Pipeline using PySpark & Airflow

📌 Project Overview

Built an end-to-end Data Engineering pipeline processing over 32.4 million order-product records and 3.4 ...
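
To make the medallion idea concrete, here is a minimal sketch of the three layers (bronze, silver, gold) in plain standard-library Python. It is not the project's code: the project uses PySpark on Databricks, and the sample rows, column names (`order_id`, `product_id`, `add_to_cart_order`, `reordered`), and function names below are all assumptions chosen for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for an order-products CSV file.
# Column names are assumed for illustration, not taken from the project.
RAW_CSV = """order_id,product_id,add_to_cart_order,reordered
1,101,1,1
1,102,2,0
2,101,1,1
2,101,1,1
3,,1,0
"""


def bronze(raw_text):
    """Bronze: load rows as-is, keeping every value as a string."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def silver(rows):
    """Silver: drop rows with missing keys, cast types, remove exact duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        if not row["order_id"] or not row["product_id"]:
            continue  # incomplete record, excluded from the cleaned layer
        record = (
            int(row["order_id"]),
            int(row["product_id"]),
            int(row["add_to_cart_order"]),
            row["reordered"] == "1",
        )
        if record in seen:
            continue  # exact duplicate of an earlier row
        seen.add(record)
        cleaned.append(record)
    return cleaned


def gold(records):
    """Gold: aggregate to one entry per product with order and reorder counts."""
    orders = Counter()
    reorders = Counter()
    for _order_id, product_id, _position, reordered in records:
        orders[product_id] += 1
        if reordered:
            reorders[product_id] += 1
    return {
        pid: {"orders": orders[pid], "reorders": reorders[pid]}
        for pid in orders
    }


if __name__ == "__main__":
    print(gold(silver(bronze(RAW_CSV))))
```

Each function consumes only the output of the layer before it, which is the property that lets the raw layer be replayed when cleaning rules change. In a Spark setting the same steps would typically be expressed as DataFrame reads, filters, casts, and group-by aggregations rather than Python loops.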