This repository provides a comprehensive suite of tools for visualizing application data through various stages and processes. Whether tracking job and internship applications, analyzing recruitment trends, or visualizing similar workflows, this project offers powerful and flexible visualization capabilities tailored to your data.
- Sankey Diagrams: Visualize the flow of applications through different stages, identifying key transitions and drop-off points.
- Interactive Dashboards: Leverage Plotly to create dashboards that allow dynamic filtering and data exploration.
- Python-Powered Visualizations: Utilize popular libraries, including Plotly, Matplotlib, and Pandas, for static and interactive data visualizations.
- Conda Environment: A ready-to-use Conda environment ensures easy installation and compatibility.
Follow these steps to set up the environment and get started with the ApplicationVisualizer project:
Begin by cloning the repository to your local machine:
git clone https://github.com/willfliaw/ApplicationVisualizer.git
cd ApplicationVisualizer
To ensure all required dependencies are installed, use the provided environment.yml
file to create a Conda environment:
conda env create -f environment.yml
Alternatively, you can manually install the required packages by creating an environment and installing the dependencies:
conda create -n appviz python ipykernel jupyter matplotlib numpy pandas plotly python-kaleido seaborn -c conda-forge
Activate the environment before running any scripts:
conda activate appviz
To explore the capabilities of this repository, a Jupyter Notebook is provided, allowing for an interactive and user-friendly experience. Follow the steps below to get started:
After activating the Conda environment, launch Jupyter Notebook from the command line:
jupyter notebook
This will open the Jupyter Notebook interface in your default web browser.
Navigate to the repository's directory and open the provided notebook file, e.g., ApplicationVisualizer.ipynb
.
The notebook contains:
- Guidance on Data Preparation: Learn how to format your JSON or CSV data for use in the visualizations.
- Code Examples: Run cells to generate Sankey diagrams, bar charts, line charts, and more using your data.
- Interactive Exploration: Adjust parameters and experiment with different visualization settings directly in the notebook.
- Prepare your data file (e.g.,
applications.json
). - Upload the file to the
data
directory. - Follow the steps in the notebook to visualize your data.
A sample dataset is provided in the data
directory to help you get started quickly. Load the dataset and run the provided cells to generate example visualizations.
- Save your notebook progress regularly to preserve your visualizations and notes.
- You can export the generated visualizations from the notebook to use in reports or presentations.
Showcase the progression of applications through different stages.
Visualize the timeline of application updates for each position. Each line represents a specific position, with color coding indicating the associated company. The chart helps track the progression of application stages over time.
Visualize the distribution of application statuses (e.g., Rejected, No Answer, Other) across different application sources.
Explore the contribution of each application source to different statuses in a stacked bar chart.
Contributions and suggestions are welcome! Feel free to:
- Open an issue for bug reports or feature requests.
- Submit a pull request with improvements or additional visualization scripts.
- Expand support for new visualization types, such as heatmaps and scatter plots.
- Incorporate advanced database tools like MongoDB.