Skip to content
View audreydottech's full-sized avatar

Block or report audreydottech

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
audreydottech/README.md

Hi there ๐Ÿ‘‹

My name is Audrey, and I am a data engineer and ontologist with Python, React.JS, Spark and SQL experience. I made the transition from a career in metadata management because I am an impatient person by nature and would look for ways at work to automate tasks. I tend to get very excited about the process of gathering requirements and designing data pipelines, with the goal of empowering users with analytics.

Projects (ordered by recency)

Building an eCommerce App in React repo

A simple e-commerce app that displays products, a navigation bar and cart for a fictitious brand. Part of a Girls Develop It project.

Stack: React | HTML | CSS | Redux

The ESG Stock Performance Dashboard repo

A data pipeline that extracts data from various finance APIs daily and automatically updates the database tables after checking for duplicate data. Originally coded in Python and Scala to stream data using Kafka and sink it into Delta Lake tables stored in Azure with Databricks as the analytical engine. Original repo I rewrote the notebook to make it more readily available to the public, using my AWS credits to save on storage and Github Actions to run the data pipeline on schedule. This data pipeline is part of a six-month project to build a dashboard to understand the relationship between ESG ratings and a stock's performance based on the sector, region and investor confidence in company news.

Stack: Github Actions | Databricks | AWS | Azure

Access Map repo

A web application that allows users to view a location's accessibility rating prior to their visit. Ratings are given on a 5-point scale (where 0 is least accessible and 5 is most accessible). There are multiple categories such as: sensory rating, mobility rating, service dog relief, wheelchair accessibility rating, common allergen rating. This app was built in a team of women developers and designers, with a technical lead representing the very demographics to whom the app would be marketed. The app is currently under construction. The backend extracts the crowdsourced ratings from a Google form and ingests it into a Postgres database. I wrote the stored procedures as Python functions with embedded SQL queries to update the database and display the ratings onto a custom Google Maps.

Stack: Google Cloud | Google Maps | Flask | Javascript | HTML | CSS

IMDB Vector Database repo

This is a Jupyter notebook that showcases my ability to read nested json data. The movie titles were transformed into vectors and stored in a Pinecone vector database to run the k-nearest neighbor algorithm and generate movie recommendations based on a user's keyword search.

Stack: Vector database | Similarity search

The Wikipedia Property Graph repo

A notebook that includes all the tools needed for a taxonomist to query a knowledge graph via metadata or property without the need to pay for taxonomy software to slice and dice it. I built a parser to transform RDF triples into separate vertex and edge Spark RDDs, whereby the edge RDDs store the ids to the vertices directly with the relationship data. Storing the indices so to speak allows Spark workers to retrieve the distributed data faster because it is partitioned by relationship and the integer IDs point to an address in memory, therefore allowing the running time of graph algorithms that retrieve a node's neighbors to be closer to O(V+E). For taxonomists, it's also a clear win because it allows them to explore connections in a graph (and to determine areas of a taxonomy that are bloated and those that are missing metadata) without the assistance of a data scientist.

Stack: Spark RDDs | Spark GraphX

Predicting ICU Deaths repo

A machine learning pipeline to predict the strongest factor in ICU deaths across hospitals around the world. Made with a team of medical experts, a data scientist, a data engineer and a software engineer. We experimented with various ensemble learning methods such as AdaBoost and XGBoost to build our predictive model out of multiple decision tree algorithms (to improve the model's predictive performance from a basic logistic regression), using GridSearchCV to fine-tune the hyperparameters (i.e. learning rate) and SMOTE to balance our dataset (to avoid overfitting). Then, we compared the ROC/AUC scores of each model and concluded that XGBoost with stratified random sampling would yield the most accurate predictions.

Stack: Ensemble Learning | AdaBoost | XGBoost | GridSearchCV

Neural Art Style Transfer repo

Neural style transfer takes two images and uses convolutional neural networks (CNNs) to transfer the style of one image to the other.

Stack: Tensorflow | Convolutional Neural Networks

More About Me

  • ๐Ÿ”ญ Iโ€™m currently working on an auto-classifier for a federal government agency.
  • ๐ŸŒฑ Iโ€™m currently learning React, MongoDB and Snowflake.
  • ๐Ÿ‘ฏ Iโ€™m looking to collaborate on data analytics and data governance.
  • ๐Ÿ’ฌ Ask me about how taxonomy can help your organization ensure the highest level of integrity and acesss for your data.
  • ๐Ÿ“ซ How to reach me: DM me on LinkedIn, I have an avatar.
  • ๐Ÿ˜„ Pronouns: she/her/hers
  • โšก Fun fact: I'm a Pilates fan and wine connoisseur.

Popular repositories Loading

  1. AccessMap AccessMap Public

    HTML 3 6

  2. esg-data-pipeline esg-data-pipeline Public

    Python

  3. audreydottech audreydottech Public

  4. neural-art-style-transfer neural-art-style-transfer Public

    Jupyter Notebook

  5. ecommerce-react-js-app ecommerce-react-js-app Public

    Forked from CodaBae/React2GDI

    Building an e-commerce app in React.js

    JavaScript