Skip to content
View DHANA5982's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report DHANA5982

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DHANA5982/README.md

πŸš€ DHANASEKAR GOVINDARAJ β€” Big Data Engineer | Cloud Data Engineer | Azure | AWS | Spark | SQL | Databricks | Snowflake

πŸŽ“ MSc in Data Science (University of Essex, UK) (Distinction)

πŸš€ Transitioning into Data Engineering | Skilled in Python, PySpark, SQL, Databricks, Azure, AWS

βš™οΈ Building scalable ETL pipelines and cloud data architectures

πŸ“Š Passionate about automation, orchestration, and real-time streaming


πŸ‘€ What I’m All About

I’m driven by curiosity and a love for:

  • πŸ” Research & Implementation
  • πŸ”— Integration & Best Practices
  • 🧠 Decision-making through data
  • 🧹 Clean, organized workflows that just work

Whether it’s deploying real-time pipelines or experimenting with GenAI agents, I bring precision, creativity, and a growth mindset to every project.


🌱 Currently Leveling Up In

  • 🧱 Databricks (Delta-Lake), Snowflake, Microsoft Fabric & Terraform data engineering, orchestration & infrastructure-as-code
  • πŸ”„ Kafka & Airflow real-time streaming, event-driven pipelines & DAG-based workflow scheduling
  • ☁️ Azure, GCP, AWS cloud deployment & computing
  • 🧠 GenAI & AgenticAI solutions (Gemini, LLaMA, and beyond)
  • 🧠 Image Processing & Decision Making

πŸ’žοΈ Let’s Collaborate On

  • πŸ€– End-to-end Data Pipeline & Data Modeling & Lakehouse Architecture & Data Governance & Security
  • βš™οΈ Real-time Streaming & Batch Processing & Workflow Orchestration
  • πŸ“¦ Automation & Cloud Integration
  • πŸ§ͺ Testing & Validation & CI/CD & Containerization & Monitoring & Deployment

πŸ“‚ Featured Projects

πŸ—οΈ Big Data Engineering & πŸ“‘ Cloud Deployment

Snowflake-PowerBI Data Platform: Healthcare Data Solution (in progress)

Azure-Powered Data Lakehouse & ETL Pipeline: E-Commerce Data Solution

Streaming Data Integration with Kafka & Airflow: Healthcare - IoT Data Solution

Scalable Config-Driven Multi-Source ETL Pipeline: Supply Chain & Logistics Data Solution

πŸ”¬ Data Science

Customer Behavior Analysis, Churn Prediction & Customer Segmentation: E-Commerce Data Solution

Analysis & Prediction of Climate Events: El NiΓ±o & La NiΓ±a

Portfolio Optimization using Capital Asset Pricing Model & Quadratic Programming: Stock Market Solution

Analysis & Prediction of Renewable Energy: Solar & Wind Energy

🧠 AI, Reinforcement Learning, & Generative AI

YouTube Video Summarizer: Generative AI Solution (Google's Gemini)

Deep Q-Network Solution: Lunar Lander (Remote Sensing & Landing)

Reinforcement Agent using Bellman's Equation: Frozen Lake Solution

Reinforcement Agent using SARSA & Q-Learning: Connect X Solution

πŸ“Š Data Analyst

Automated Job Application Tracker: Power Automate & BI Solution (Private)

Adventure Works Sales Analysis: Interactive Power BI Dashboard & Report

Analysis of Ecological Data in the UK: Endangered Species


πŸŽ“ Certifications


πŸ“« Let’s Connect


πŸ˜„ Fun Facts

I love keeping things clean, organized, and joyful. Outside of tech, you’ll find me:

  • 🍳 Cooking
  • πŸ’ƒ Dancing
  • 🎢 Vibing to music
  • 🧴 Practicing personal care & mindfulness

If you find my work helpful, feel free to ⭐ star the repositories and 🍴 fork them to explore or build your own versions!

Pinned Loading

  1. Azure-Powered-Data-Lakehouse-and-ETL-Pipeline Azure-Powered-Data-Lakehouse-and-ETL-Pipeline Public

    End-to-end data pipeline transforming Olist e-commerce data through Azure cloud services. Implements medallion architecture (Bronze-Silver-Gold) with multi-source ingestion, Spark-based processing,…

    Jupyter Notebook 4

  2. Portfolio-Optimisation-CAPM-QP-Stock-Market-Solution Portfolio-Optimisation-CAPM-QP-Stock-Market-Solution Public

    This project constructs an optimized portfolio for trading stocks using Capital Assets Pricing Model. Build by Quadratic Programming Techniques and apply constraints using Shape Ratio.

    Jupyter Notebook 1

  3. Scalable-Config-Driven-Multi-Source-ETL-Pipeline Scalable-Config-Driven-Multi-Source-ETL-Pipeline Public

    End-to-end data pipeline showcasing file ingestion, transformation, testing, and CI/CD deployment with Python, PostgreSQL, Docker, and GitHub Actions.

    Python 1

  4. Behavior-Analysis-Churn-Prediction-and-Customer-Segmentation Behavior-Analysis-Churn-Prediction-and-Customer-Segmentation Public

    πŸš€ End-to-end ML project: 93% accurate churn prediction + customer segmentation with Random Forest & K-Means. Features automated pipeline, Streamlit dashboard, comprehensive testing, and cross-platf…

    Jupyter Notebook 1

  5. YouTube-Video-Summarizer-Generative-AI-Solution YouTube-Video-Summarizer-Generative-AI-Solution Public

    A Streamlit web app that extracts transcripts from YouTube videos and generates concise summaries using Google Gemini Pro. Enter a YouTube link to get key points and detailed notes in seconds. Powe…

    Python 1

  6. Adventure-Works-Sales-Analysis-Power-BI Adventure-Works-Sales-Analysis-Power-BI Public

    Power BI interactive dashboards using AdventureWorks datasets. Includes raw data, PBIX files, and visually compelling dashboards for business intelligence scenarios.

    1