Blog

Welcome to my personal blog. Here I share tutorials and updates on my research.

Building a Clean Data Loading Stack: Kestra, Terraform, and the Art of Not Repeating Yourself

Integrate a data loading pipeline with Terraform and Kestra using Docker and a Makefile. This setup removes the need for manual configuration by generating environment variables dynamically from Terraform outputs, so infrastructure and orchestration share a 'Single Source of Truth'.

The Architect’s Blueprint: Navigating Data Hierarchies in GCP and AWS

A comparative guide for data professionals navigating the structural differences between Google Cloud Platform (GCP) and Amazon Web Services (AWS). It focuses specifically on how these platforms organize data within Data Lakes and Data Warehouses, emphasizing the philosophical differences between GCP’s 'Project-centric' model and AWS’s 'Account-centric' model.

Introduction to Data Warehousing

Explore the fundamentals of data architecture, comparing high-speed OLTP systems used for daily transactions with analytical OLAP environments like Data Warehouses, Data Marts, and Data Lakes. The guide explains the mechanical differences between row-oriented and column-oriented storage and evaluates ETL versus ELT plumbing strategies for moving and cleaning data.