Archives
- 27 Apr CloudWatch Observability Checklist for AWS Data Pipelines (Beginner-Friendly)
- 13 Apr AWS IAM Basics for Data Engineers: Least-Privilege Access Without the Confusion
- 30 Mar AWS Data Lake Folder Structure for Beginners: A Simple S3 Layout That Scales
- 17 Mar How to Choose the Right AWS Data Pipeline Architecture for a New Data Product
- 02 Mar Lakehouse Patterns for Retrieval and Semantic Search
- 16 Feb Prompt Version Governance for Data Teams
- 02 Feb Building Evaluation Datasets from Warehouse Data
- 19 Jan RAG Data Pipelines: Chunking, Metadata, and Freshness
- 05 Jan Feature-Ready Tables: Preparing Data for ML and GenAI Workloads
- 15 Dec Data Engineer to AI Engineer: A Practical Roadmap That Actually Works
- 08 Sep Step Functions + Glue: A Practical Reference Architecture for Reliable Pipelines
- 16 Jun Data Quality Checks That Actually Catch Production Issues
- 02 Jun How to Design Idempotent ETL Jobs in AWS
- 19 May Step Functions Orchestration Patterns That Reduce Data Incidents
- 05 May Data Contracts for Analytics Pipelines: A Practical Guide for Small Teams
- 21 Apr Designing Your First Medallion Lakehouse on AWS (Without Overengineering)
- 18 Apr BigQuery Data Transfer Service: What, why and how?
- 07 Apr Glue vs Athena vs dbt: Where Each Tool Fits in a Real AWS Data Stack
- 08 Mar Productionizing dbt as a Cloud Run Job: Infrastructure Management with Terraform and CI/CD with GitHub Actions - Part 3
- 04 Mar Productionizing dbt as a Cloud Run Job: Infrastructure Management with Terraform and CI/CD with GitHub Actions - Part 2
- 01 Mar Productionizing dbt as a Cloud Run Job: Infrastructure Management with Terraform and CI/CD with GitHub Actions - Part 1