Portfolio

Real world
implementations

Each project built to solve actual data processing problems.

4

Live Projects

3

In Progress

10+

Tools Used

Featured Work

All repos

Event-Driven ETL Pipeline

Automated pipeline triggered by S3 events. Orchestrates validation, transformation, and loading with proper error handling, retry logic, and comprehensive monitoring.

AWS Step FunctionsLambdaGlueS3AthenaTerraform
View on GitHub

Kafka E-commerce Analytics

Real-time analytics engine generating mock e-commerce data, processing through Kafka and Spark, storing in S3, and visualizing with interactive dashboards. Full event streaming pipeline with schema registry.

Confluent KafkaSparkStreamlitSchema RegistryDocker
View on GitHub

Databricks dbt Airflow

Complete ETL workflow with Airflow DAGs orchestrating Databricks jobs, dbt models for transformations, and automated data quality checks. Modern data stack implementation.

DatabricksAirflowdbtDelta LakeTerraform
View on GitHub

Snowflake dbt Airflow

Automated data warehouse setup with dbt models, tests, and documentation. Airflow manages transformation schedules and dependencies with comprehensive monitoring.

SnowflakedbtAirflowTerraformSQL
View on GitHub

In Progress

Coinbase API Step Function Pipeline

Building

Cryptocurrency data ingestion and processing pipeline using AWS Step Functions.

AWS CDC Pipeline

Building

Change Data Capture pipeline with DMS and Lambda on AWS.

Batch Databricks ETL

Building

Scalable batch processing pipeline using Databricks and Airflow.