PinnedUnderstanding the Basics of Delta Lake ArchitectureDelta Lake is a powerful open-source data storage layer that brings a new level of reliability and performance to big data processing. It…Apr 8, 2023Apr 8, 2023
Scalable Cumulative Sum in PySpark: From Window Functions to Partition-Aware AggregationWhile working on a large-scale search analytics pipeline, I encountered a classic but deceptively complex challenge — calculating a…17h ago17h ago
Why is Microsoft Fabric a Game changer in the world of Analytics?Discover the transformative power of Microsoft Fabric in data analytics based on my personal experienceSep 29, 2023A response icon1Sep 29, 2023A response icon1
Airflow End-To-End Project: ETL Pipeline using Airflow for Wiki Page ViewsAirflow End-To-End ETL project using Wiki Page ViewsSep 19, 2023A response icon1Sep 19, 2023A response icon1
How Can Airflow’s Branch Operator Solve Your Workflow Branching Problems?Branching Tasks in Airflow DAGs using BranchPython OperatorSep 18, 2023A response icon2Sep 18, 2023A response icon2
Two Vital Concepts To Build Efficient Airflow DAGsEvery Data Engineer must understand these two concepts to build efficient Airflow DAGs.Sep 17, 2023Sep 17, 2023
One Way to Execute Airflow DAGs Back in TimeExecute Airflow DAGs Back in time by changing just one parameter….Sep 16, 2023Sep 16, 2023
How to easily install Apache Airflow on Windows?Install Apache Airflow on Windows easily without Docker or VirtualBoxSep 13, 2023A response icon3Sep 13, 2023A response icon3
Why Data Engineers Should Learn Apache Airflow: Mastering Efficient Workflow OrchestrationLearning Airflow for Data Engineers opens door for new oppertunitiesSep 12, 2023Sep 12, 2023
Unlocking Azure’s Power: How to Create a Stellar Storage Account Using ARM Templates?Azure Resource Manager (ARM) template for the deployment of Azure Storage AccountAug 24, 2023Aug 24, 2023