aksingh4545/README.md

Data Engineer & Analytics


👋 About Me

I work at the intersection of Data Engineering, Cloud Platforms, and Real-Time Analytics.

Most days you'll find me designing scalable data pipelines, optimizing Spark workloads, building cloud-native solutions, and improving the reliability of data systems.

I enjoy building systems that are:

  • Predictable
  • Observable
  • Easy to Maintain
  • Scalable

Clean logs, calm infrastructure, and well-documented workflows make me happy.


🧩 Tech Stack

Core Technologies



Data Engineering & Analytics


🔭 What I'm Working On

  • Building Streamlit applications integrated with AWS S3
  • Developing batch and near real-time data pipelines using Python and SQL
  • Deepening expertise in Azure Databricks, PySpark, and Kafka
  • Creating cleaner, observable, and maintainable data workflows

🌱 Currently Learning

  • Advanced Data Modeling for analytics and warehouse workloads
  • Spark Internals, Partitioning, Broadcast Joins, and Performance Tuning
  • Event-Driven Architectures and Reliable Messaging Systems
  • Documentation and Architecture Design using Mermaid and Excalidraw

📊 GitHub Stats

GitHub Streak

🤝 Connect With Me


💡 Philosophy

Building calm, reliable data systems — one pipeline at a time.

Pinned

  1. image_resize Public

    This project implements an event-driven, serverless image processing pipeline on AWS. Images uploaded to Amazon S3 are automatically resized using AWS Lambda and Pillow, stored in a destination buc…

    Python 3 1

  2. streamlit_s3_pipeline Public

    The system supports real-world resumes (PDF, DOCX, TXT), handles noisy formats, and follows industry-grade data engineering practices.

    Python 2 1

  3. Recommendation-system-ml Public

    A machine learning–based product recommendation system that suggests similar products using NLP embeddings and vector similarity search. It leverages Sentence Transformers for generating product em…

    Python 1

  4. Coordinate Public

    Coordinate is a website that shows the live location of connected devices around a location. Its main feature is that it will also show the connection bet…

    JavaScript 8 1

  5. Login_Cognito Public

    This repo shows how to use AWS Cognito, a fully managed service, with a Streamlit application.

    Python 2 2

  6. airflow-kafka-s3 Public

    An Apache Airflow ETL pipeline that extracts CSV data from S3, streams it through Kafka for processing, and loads the transformed data back to S3 and PostgreSQL. Orchestrates data flow using XCom f…

    Python
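
For readers curious about the vector similarity search mentioned in Recommendation-system-ml, here is a minimal sketch of the general idea, not the repository's actual code. It assumes each product already has an embedding vector; the repo generates those with Sentence Transformers, while this example uses small hand-written toy vectors so it runs with the standard library alone. The names `cosine_similarity`, `most_similar`, and `EMBEDDINGS` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # A zero vector has no direction, so treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query_id, embeddings, top_k=2):
    """Rank every other product by cosine similarity to the query product."""
    query = embeddings[query_id]
    scored = [
        (other_id, cosine_similarity(query, vector))
        for other_id, vector in embeddings.items()
        if other_id != query_id
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy embeddings (assumption): real ones would come from an embedding model
# and have hundreds of dimensions.
EMBEDDINGS = {
    "running_shoes": [0.9, 0.1, 0.0],
    "trail_shoes": [0.8, 0.2, 0.1],
    "coffee_mug": [0.0, 0.1, 0.9],
}

if __name__ == "__main__":
    for product_id, score in most_similar("running_shoes", EMBEDDINGS):
        print(f"{product_id}: {score:.3f}")
```

A brute-force scan like this is fine for a handful of products; larger catalogs usually move to a dedicated vector index, which is a design choice outside this sketch.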