Organizations

@NTL-DE
longNguyen010203/README.md

Hey, I'm Long Nguyen 👋

I am an Artificial Intelligence major in Vietnam with a passion for data engineering, and I am actively seeking job opportunities in this field.

📦 Technologies

Languages: Python, SQL, PySpark, Shell, C++

Processing: Polars, Apache Spark, Apache Hadoop, dbt, Seaborn, Matplotlib, Selenium, BeautifulSoup, Pandas, ETL, ELT

Storage: PostgreSQL, MySQL, SQL Server, Redshift, MongoDB, Snowflake, MinIO, S3, SQLite

Orchestration: Apache Airflow, Dagster

Cloud: S3, EC2, IAM, VPC, Redshift, RDS, EMR, Glue

DevOps: Docker, Terraform, Git, GitLab

Testing & Logging: Unittest, Pytest, Logging

⚑ Fun fact

  • One-Punch Man is my favorite anime.
  • I enjoy listening to gentle songs, but sometimes I also like remixes.

📫 Contact

Connect with me on LinkedIn

Pinned

  1. Spark-Processing-AWS Public

    👷🌇 Set up and build a big data processing pipeline with Apache Spark and 📦 AWS services (S3, EMR, EC2, IAM, VPC, Redshift), using Terraform to set up the infrastructure 🥊

    Python 1

  2. Youtube-ETL-Pipeline Public

    💜🌈📊 A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API 🌺

    Jupyter Notebook 8 1

  3. ECommerce-ELT-Pipeline Public

    🌄📈📉 A data engineering project 🌈 that implements an ELT data pipeline using Dagster, Docker, dbt, Polars, Snowflake, and PostgreSQL. Data comes from Kaggle 🔥

    Python 1

  4. Bank-DataWarehouse Public

    📊🌈🏛 This project develops a data warehouse for a bank using Amazon Redshift, VPC, Glue, S3, and dbt, following a ⭐ Star Schema architecture. The goal is to store, manage, and optimize data to suppo…

    1

  5. Zillow-Home-Value-Prediction Public

    🌈📊📈 The Zillow Home Value Prediction project employs linear regression models on Kaggle datasets to forecast house prices. 📉💰 Using Apache Spark (PySpark) within a Docker setup enables efficient dat…

    Jupyter Notebook 2

  6. InspireAI-Web-2024 Public

    🤖💎📺 This project creates an AI chatbot with OpenAI models (ChatGPT, DALL-E, Codex) and uses Django to build the web application 🍁

    Python 2 1