Skip to content
View scriptdruid's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report scriptdruid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
scriptdruid/README.md

Vipul Rai

Senior Data Engineer | Distributed Computing
📍 Amsterdam, Netherlands 🔗 LinkedIn


Summary

Data Engineer with 12+ years of experience in distributed computing, big data architecture, and DevOps automation.
Currently at Knab, I architect and deliver scalable data solutions leveraging AWS, Airflow, dbt, and Spark, optimizing ETL workflows to drive data-driven insights.
Expertise in cloud platforms, data pipeline development, and automation to ensure efficiency, security, and high performance in financial data operations.


Technical Skills

  • Big Data & Cloud: Apache Spark, Databricks, Airflow, Kafka, Azure, AWS, dbt
  • Programming & Frameworks: Python, PySpark, SQL, Django
  • DevOps & Automation: CI/CD, Terraform, Docker, Kubernetes, Pytest
  • Machine Learning & Analytics: MLOps, Feature Engineering, Predictive Analytics
  • Streaming & IoT: IoT Hub, Event Hubs, Stream Processing
  • Data Storage & Modeling: Redshift, MongoDB
  • AI & LLM Integrations: Hugging Face, OpenAI API, AI Agents

Pinned Loading

  1. pandas-dev/pandas pandas-dev/pandas Public

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

    Python 45.3k 18.5k