I work at the intersection of Data Engineering, Cloud Platforms, and Real-Time Analytics.
Most days you'll find me designing scalable data pipelines, optimizing Spark workloads, building cloud-native solutions, and improving the reliability of data systems.
I enjoy building systems that are:
- Predictable
- Observable
- Easy to Maintain
- Scalable
Clean logs, calm infrastructure, and well-documented workflows make me happy.
- Building Streamlit applications integrated with AWS S3
- Developing batch and near real-time data pipelines using Python and SQL
- Deepening expertise in Azure Databricks, PySpark, and Kafka
- Creating cleaner, observable, and maintainable data workflows
- Advanced Data Modeling for analytics and warehouse workloads
- Spark Internals, Partitioning, Broadcast Joins, and Performance Tuning
- Event-Driven Architectures and Reliable Messaging Systems
- Documentation and Architecture Design using Mermaid and Excalidraw

