PinnedDay 1: AWS Certified Data Engineer Associate (DEA-C01) — Your Complete 30-Day Roadmap to…Over the next 30 days, I’ll be sharing a comprehensive series of articles to prepare for AWS Certified Data Engineer Associate…Oct 27Oct 27
PinnedUnderstanding the Basics of Delta Lake ArchitectureDelta Lake is a powerful open-source data storage layer that brings a new level of reliability and performance to big data processing. It…Apr 8, 2023Apr 8, 2023
Day 4: AWS Glue — In-Depth ETL Service for AWS Certified Data Engineer Associate ExamAWS Glue, the ETL tool on AWSOct 30Oct 30
Day 3: Amazon Kinesis Deep Dive — Essential Concepts for AWS Data Engineer Certification SuccessAmazon Kinesis Deep Dive — Essential Concepts for AWS Data Engineer Certification SuccessOct 29Oct 29
Day 2: Understanding Data Ingestion Patterns — Batch vs Streaming on AWSWelcome to Day 2 of the AWS Certified Data Engineer Associate 30-Day Series! Check out Day 1: AWS Certified Data Engineer Associate…Oct 28Oct 28
Scalable Cumulative Sum in PySpark: From Window Functions to Partition-Aware AggregationWhile working on a large-scale search analytics pipeline, I encountered a classic but deceptively complex challenge — calculating a…Jun 17Jun 17
Why is Microsoft Fabric a Game changer in the world of Analytics?Discover the transformative power of Microsoft Fabric in data analytics based on my personal experienceSep 29, 2023A response icon1Sep 29, 2023A response icon1
Airflow End-To-End Project: ETL Pipeline using Airflow for Wiki Page ViewsAirflow End-To-End ETL project using Wiki Page ViewsSep 19, 2023A response icon1Sep 19, 2023A response icon1
How Can Airflow’s Branch Operator Solve Your Workflow Branching Problems?Branching Tasks in Airflow DAGs using BranchPython OperatorSep 18, 2023A response icon2Sep 18, 2023A response icon2
Two Vital Concepts To Build Efficient Airflow DAGsEvery Data Engineer must understand these two concepts to build efficient Airflow DAGs.Sep 17, 2023Sep 17, 2023