Skip to content

Commit

Permalink
Merge pull request #41 from UofT-DSI/team-project-p2
Browse files Browse the repository at this point in the history
Team Project pt.2
  • Loading branch information
danielrazavi authored Aug 6, 2024
2 parents 18c5d8a + 53d87e1 commit c1d2823
Show file tree
Hide file tree
Showing 8 changed files with 376 additions and 1 deletion.
2 changes: 1 addition & 1 deletion instructional_team/generate_slides.sh
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# CONFIGURATION
folder_md="./markdown_slides"
folder_output="../slides/" # This will be used for both PDF and HTML
folder_output="../slides" # This will be used for both PDF and HTML

# Clear the screen for the splash screen
clear
Expand Down
186 changes: 186 additions & 0 deletions instructional_team/markdown_slides/04_project_showcase.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,186 @@
---
marp: true
theme: dsi-certificates-theme
_class: invert
paginate: true
---

# Introduction to Data Science Portfolios

```
$ echo "Data Sciences Institute"
```


<!-- Why a data science portfolio is crucial for your career. Overview of portfolio components. -->

<!-- Speaker notes: A portfolio is essential for showcasing your real-world skills and experiences, making you stand out to potential employers. It provides concrete examples of your ability to solve problems and generate insights from data, which is crucial in the competitive field of data science. By effectively presenting your projects, you demonstrate not just technical proficiency but also your capability to communicate complex ideas clearly. -->

---

# Module Overview

The content in this module is a response to the observations made during Team Project 1. There is no specific structure to the slides, as they present a collection of important points based on the feedback and common issues identified.

---


# What is a Good Portfolio?

A good portfolio showcases your best work and highlights skills that are highly valued by employers in data science and machine learning. It demonstrates your ability to handle real-world tasks, making you an attractive candidate for employment.

**Personalize Your Portfolio:**
- **Remove generic content:** For instance, delete the default README we provided. It's designed to describe the module, not your unique project.
- **Highlight your unique contributions and skills:** This personalization shows employers that you're not just completing assignments but are engaged and innovating on your projects.

---


# Types of Projects to Include in Your Portfolio

Including a diverse range of projects in your portfolio can significantly enhance your appeal to potential employers. Consider including a variety of project types to demonstrate the breadth and depth of your data science skills:

1. **Data Cleaning Project:** Show your ability to prepare data for analysis.
2. **Data Storytelling and Visualization Project:** Highlight your skills in interpreting and presenting data in compelling ways.
3. **ML Modeling:** Demonstrate your proficiency in building and tuning models.
4. **Group Project:** Showcase your teamwork and collaboration skills. (already doing this! 🥳)

For more, read "[How to Create a Project Portfolio for Data Science Job Applications](https://www.dataquest.io/blog/career-guide-data-science-projects-portfolio/)"


---

# Selecting Projects That Showcase Your Skills

Choosing the right projects for your portfolio is crucial. Each project should:

- **Solve Real Problems**: Use actual datasets to address genuine issues in your field. (*examples in the next slide*)
- **Demonstrate Industry Relevance**: Select projects that are pertinent to your specific area, such as marketing analytics for marketers or predictive maintenance for engineers.
- **Provide Actionable Insights**: Focus on projects that deliver clear, practical outcomes that demonstrate your ability to impact real-world scenarios.

These criteria ensure that your projects not only highlight your technical skills but also your understanding of and adaptability to industry-specific challenges, making you a valuable candidate to potential employers.

---

# Some Open Source Real Datasets

- **University of Toronto Libraries Data**: [onesearch.library.utoronto.ca/researchdata](https://onesearch.library.utoronto.ca/researchdata/accessing-data-and-tools)
- **Open Government Data Portal (Canada)**: [open.canada.ca/data](https://open.canada.ca/data)
- **Ontario Data Catalogue**: [data.ontario.ca](https://data.ontario.ca)
- **Kaggle Datasets**: [kaggle.com/datasets](https://www.kaggle.com/datasets)
- **UCI Machine Learning Repository**: [archive.ics.uci.edu/ml](https://archive.ics.uci.edu/ml/index.php)
- **World Bank Open Data**: [data.worldbank.org](https://data.worldbankzorg)

Utilize these resources to find relevant datasets that can help you build projects showcasing your ability to solve real-world problems, demonstrate industry relevance, and provide actionable insights.

---

Let me know if this works for you or if you need any further adjustments!

# Showcase Projects That Use Diverse Data Types

Enhance your projects by effectively using a mix of structured, unstructured, and time series data:

- **Structured Data**: Employ database data for clear, quantifiable insights.
- **Unstructured Data**: Add depth with text, images, or videos.
- **Time Series Data**: Utilize data in sequential order for trend analysis and forecasting.

Select data types strategically to align with your project's objectives. Ensure each type contributes to a clear and coherent narrative, avoiding unnecessary complexity to maintain focus and utility.

---

# Demonstrating Technical and Business Skills

- **Demonstrate Coding Proficiency**: All projects should showcase your ability to write clean, efficient, and well-documented code.
- **Showcase Communication Skills**: All projects should effectively communicated complex data-driven insights to Non-technical team members.

Your ability to document code clearly and communicate effectively can drastically reduce onboarding times for new team members and help non-technical stakeholders make informed decisions, thereby enhancing overall project success and team efficiency.

---


# Crafting a Comprehensive Main README File

- **Purpose & Overview:** Introduce the project with essential details, concise description and a project objective.
- **Goals & Objectives:** Articulate what the project aims to achieve.
- **Techniques & Technologies:** Highlight the tools and methods used.
- **Key Findings & Instructions:** Summarize outcomes and provide setup instructions.
- **Visuals & Credits:** Enhance with visuals; acknowledge contributors.

<!--
Focus on the essence of what makes a README effective: clarity and conciseness. A well-organized README provides a snapshot of the project, helping others quickly understand its value and how to engage with it. Ensure that each element is presented in a way that contributes to an overall understanding of the project.
-->

---


# Better Documentation - README

While **you can have multiple README files** in your repository, include only one README file per folder to keep things clean and simple. This structure showcases your ability to manage and present complex information effectively, increasing your appeal to potential employers.


---

# Better Documentation - Comments

**Effective commenting enhances code readability and maintainability, crucial for collaborative environments.**

A lot of participants had code that wasn't properly commented. The idea here is that if someone with no tech background reads your code, they should get the essence of what you're doing and how the code flows. This practice not only aids in understanding but also facilitates smoother transitions and updates within team projects.

📰 [Best practices for writing code comments](https://stackoverflow.blog/2021/12/23/best-practices-for-writing-code-comments/)

---
## Example

```
function calculateTotal(price, quantity) {
return price * quantity;
}
let totalPrice = calculateTotal(25, 5);
console.log(totalPrice); // Output: 125
```

```
// Calculates the total cost by multiplying the price per item with the quantity
function calculateTotal(price, quantity) {
return price * quantity;
}
// Example usage: Calculates the total price for 5 items at $25 each by multiplying the price
// per item ($25) with the quantity (5), and stores the result in the totalPrice variable.
let totalPrice = calculateTotal(25, 5);
console.log(totalPrice); // Output: 125
```

---


# Better Documentation - STAR Method

- The STAR method is typically used for answering interview questions, applying this structure to your project overviews can be highly effective.
- By organizing your projects using the Situation, Task, Action, Result format, your portfolio essentially speaks for you, conducting a virtual interview with potential employers even in your absence.
- This approach ensures that employers can clearly understand the value and impact of your work, even without direct interaction.
- Remember, the results don't have to be ground breaking, anything that showcases your abilities in data science/machine learning, even "**learning**" counts as an important result.

---

# Effective GitHub Repository Organization

- Ensure your GitHub repository is neatly organized; avoid unused or empty folders and ensure each folder has a clear purpose.
- Use READMEs in key folders (like `src`, `iac`, `backend`, `frontend`, etc.) to detail their contents and purpose, aiding clarity for complex sections.
- Avoid READMEs in simple folders (like `images`, `docs`, etc.), unless there's specific information that needs to be explained.

---

# Effective Presentation of Findings

- Focus on crafting your project presentations to be clear and to the point. Utilize storytelling techniques to make your data findings compelling and employ visualizations to simplify and clarify complex information.

- Your ability to communicate insights effectively is **crucial** — well-presented findings can make your work stand out to potential employers and demonstrate your capability to aid decision-makers like managers or CEOs in understanding critical data quickly. This approach not only showcases your skills but significantly enhances your employability.

---

# Questions?
Loading

0 comments on commit c1d2823

Please sign in to comment.