Skip to content
View anopsy's full-sized avatar

Organizations

@narwhals-dev
Block or Report

Block or report anopsy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
anopsy/README.md

Hi πŸ‘‹, I'm Magdalena Kowalczuk

data & ml fan with a soft spot for OSS

Driven by curiosity, eager self-learner, Kaggle Notebook Expert, Datathons enjoyer, PyData volunteer, currently engaged in Women in AI Mentorship, exploring and contributing to Open Source Land and building my Machine Learning portfolio.

  • I’m currently working on:
    πŸ“Š PyData Amsterdam 2024 Talk I will be talking about my experience of stepping into the rabbit hole of contributing to open-source software, highlighting key learnings and practical steps for beginners. It covers overcoming self-doubt, learning through collaboration, and the unexpected joys of community engagement. What you can learn from contributing to Open Source and what you probably will not as an aspiring Data Scientist.

    πŸƒPyData Amsterdam 2024 Open Source Sprint: Narwhals Narwhals is an extremely lightweight and extensible compatibility layer between dataframe libraries, and it needs your help! An open source sprint is the perfect opportunity to make your first contribution to open source. The core maintainers of the Narwhals package will prepare a list of easy and accessible first issues to get started with, and will be present in this session to guide you to make your first commit to the package. This is the perfect opportunity to give back to the Python ecosystem, while having some fun.

    🐳Contributing to Dask/PyArrow backend in Narwhals At Narwhals, we’re committed to helping you build dataframe-agnostic tools. Whether your users prefer pandas, polars dataframes, or even pyarrow tables, Narwhals has you covered. There’s still plenty of work to do, so if you’d like to contribute and enhance Narwhals, feel free to check out our Contributing Guide and join us on Discord.

    πŸ€– Did ChatGPT replace Juniors? Inspired by personal curiosity and a 2023 Hackathon challenge (won in the β€˜Most Polished’ category). This project investigates the impact of large language models like ChatGPT on entry-level roles in tech. Demonstrated skills include data cleaning, data wrangling, data analysis, and modeling, using tools such as Python, APIs, Polars, and Hvplot.

  • 🌱 I’m currently learning πŸ»β€β„οΈ Polars and that's Ritchie Vink - creator of Polars with my graffiti:

    Ritchie Vink
  • πŸ‘¨β€πŸ’» All of my projects are available at https://github.com/anopsy

  • πŸ“‘If you'd like to hire me, check my CV

  • πŸ“ I write about my learning journey on https://medium.com/@anopsy28

  • πŸ“« How to reach me [email protected]

  • ⚑ Fun fact 🎨 I paint graffiti portraits

🎨 Selected Portfolio Projects
┣━━ contributing to OSS at:
┃   ┣━━ 🧱scikit-lego
┃   ┃   ┣━━ contributed to docs  
┃   ┃   ┗━━ made ColumnSelector dataframe agnostic using Narwhals 
┃   ┗━━ πŸ³πŸ¦„narwhals 
┃   ┃   ┣━━ worked on pyarrow/dask backend implementation  
┃   ┃   ┗━━ contributed to docs and tests   
┃   ┗━━ πŸ’‘embetter
┃       ┣━━ deprecated a method  
┃       ┗━━ added pre-commit hooks  
┃ 
┣━━ Juniors_vs_ChatGPT 
┃   - Did ChatGPT replaced Juniors and Interns? 
┃   ┣━━ data cleaning
┃   ┣━━ data wrangling
┃   ┣━━ data analysis
┃   ┣━━ modeling
┃   ┗━━ python🐍/API/polarsπŸ»β€β„οΈ/hvplotπŸ“Š
┃ 
┣━━ Compensation Prediction 
┃   - How much do Engineers earn? 
┃   ┣━━ data modeling
┃   ┣━━ model evaluation
┃   ┣━━ containerization using docker
┃   ┣━━ building streamlit app
┃   ┗━━ python🐍/scikit-learn/streamlitπŸ“ˆ/dockerπŸ“¦
┃  
┣━━ MaskMap: Decoding the Hidden Spectrum  
┃   - Prototype of a diagnosis support tool using the power of NLP to identify symptoms of Autistic Masking
┃   ┣━━ data scraping
┃   ┣━━ data cleaning
┃   ┣━━ modeling
┃   ┣━━ deploying
┃   ┗━━ python🐍/pandas🐼/FastAPI
┃  
┣━━ Equity in Healthcare: Women in Data Science Datathon 2024 
┃   - WIDS Datathon Project predicting a timely diagnosis of Metastatic Cancer
┃   ┣━━ data cleaning
┃   ┣━━ data wrangling
┃   ┣━━ data analysis
┃   ┣━━ modeling
┃   ┗━━ python🐍/pandas🐼/ensemble🌳/keras🧠
┃  
┣━━ Relative Search Volumes Analysis  
┃   - Search Volumes for Autism vs Autism Spectrum Disorder around the world
┃   ┣━━ data scraping
┃   ┣━━ data cleaning
┃   ┣━━ modeling WIP
┃   ┗━━ python🐍/pandas🐼
┃  
┣━━ Steelplate Defect Visual EDA  
┃   - Colorful joyplots for Visual EDA
┃   ┣━━ data visualization
┃   ┣━━ ensemble
┃   ┗━━ python🐍/pandas🐼/xgb🌳/seaborn🎨
┃  
┣━━ hossenfelder - 🦺WIP  
┃ - Data Analysis and Prediction of views on Sabine Hossenfelder YT channel
┃   ┣━━ data scraping
┃   ┣━━ data cleaning
┃   ┣━━ modeling WIP
┃   ┗━━ python🐍/pandas🐼
┃  
┗━━ MyFalaClassifier - 🦺WIP  
- Detector of surfable waves
    ┣━━ live-stream scraping
    ┣━━ image processing
    ┣━━ transfer learning
    ┣━━ deploying
    ┗━━ python🐍/keras🧠

Languages and Tools:

pandas polars scikit_learn python seaborn bash git postgresql tensorflow go gcp

Connect with me:

anopsy madkowalczuk anopsy anopsy_amsterdam @anopsy28

anopsy

Β anopsy

anopsy

anopsy

anopsy

Pinned Loading

  1. Juniors_vs_ChatGPT Juniors_vs_ChatGPT Public

    Inspired by personal curiosity and a 2023 Hackathon challenge (won in the β€˜Most Polished’ category). This project investigates the impact of large language models like ChatGPT on entry-level roles …

    Jupyter Notebook 1

  2. Compensation-prediction Compensation-prediction Public

    An integrated data modeling and model experimentation project, packaged as a Streamlit app for predicting estimated compensation in engineering jobs

    Jupyter Notebook 1

  3. MaskMap MaskMap Public

    Prototype of a diagnosis support tool using the power of NLP to identify symptoms of Autistic Masking (AM) and help medical staff and patients differentiate between anxiety, depression, and the lon…

    Jupyter Notebook

  4. Equity_in_Healthcare Equity_in_Healthcare Public

    Predicitng a timely diagnosis in metastatic cancer patients. Data cleaning, feature engineering and hyperparams tuning of classification model ensemble

    Jupyter Notebook 1

  5. koaning/scikit-lego koaning/scikit-lego Public

    Extra blocks for scikit-learn pipelines.

    Python 1.2k 117

  6. narwhals-dev/narwhals narwhals-dev/narwhals Public

    Lightweight and extensible compatibility layer between dataframe libraries!

    Python 303 43