Skip to content

Commit

Permalink
Merge pull request #332 from jeanbez/main
Browse files Browse the repository at this point in the history
add new projects for OSPO24
  • Loading branch information
slieggi authored Jan 30, 2024
2 parents 98add17 + 280d7de commit 2fb7e95
Show file tree
Hide file tree
Showing 6 changed files with 121 additions and 0 deletions.
59 changes: 59 additions & 0 deletions content/authors/jeanlucabez/_index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,59 @@
---
# Display name
title: Jean Luca Bez

# Username (this should match the folder name)
authors:
- jeanlucabez

# Is this the primary user of the site?
superuser: false

# Role/position
role: "Research Scientist, Lawrence Berkeley National Laboratory"


# Organizations/Affiliations
organizations:
- name: Scientific Data Management Research
url: "https://crd.lbl.gov/divisions/scidata/sdm"
- name: Computing Sciences Research Division
url: "https://crd.lbl.gov"
- name: Lawrence Berkeley National Laboratory
url: "https://www.lbl.gov"


# Short bio (displayed in user profile at end of posts)
bio: Jean Luca's research interests are in high-performance computing + I/O + storage.



# Social/Academic Networking
# For available icons, see: https://sourcethemes.com/academic/docs/widgets/#icons
# For an email link, use "fas" icon pack, "envelope" icon, and a link in the
# form "mailto:[email protected]" or "#contact" for contact widget.
social:
- icon: home
icon_pack: fas
link: https://crd.lbl.gov/divisions/scidata/sdm/staff/jean-luca-bez/
- icon: github
icon_pack: fas
link: https://github.com/jeanbez
- icon: envelope
icon_pack: fas
link: mailto:[email protected]
- icon: linkedin
icon_pack: fab
link: https://www.linkedin.com/in/jeanbez


# Enter email to display Gravatar (if Gravatar enabled in Config)
email: "[email protected]"


# Organizational groups that you belong to (for People widget)
# Set this to `[]` or comment out if you are not using People widget.
user_groups:
- University of California Mentors
---
Jean Luca is a Career-Track Research Scientist at Lawrence Berkeley National Laboratory (LBNL), USA. He is passionate about High-Performance I/O, Parallel I/O, Education, and Competitive Programming. His research focuses on optimizing the I/O performance of scientific applications and data management.
Binary file added content/authors/jeanlucabez/avatar.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added content/project/osre24/lbl/drishti/featured.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
30 changes: 30 additions & 0 deletions content/project/osre24/lbl/drishti/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
---
title: "Dristhi"
authors: [jeanlucabez, "Suren Byna"]
author_notes: ["Research Scientist, Lawrence Berkeley Lab", " The Ohio State University (OSU)"]
tags: ["osre24", "uc", "LBNL", "data science", "visualization", "performance analysis"]
date: 2024-01-30T10:15:00-07:00
lastmod: 2024-01-30T10:15:00-07:00
---

[Drishti](https://github.com/hpc-io/drishti) is a novel interactive web-based analysis framework to visualize I/O traces, highlight bottlenecks, and help understand the I/O behavior of scientific applications. Drishti aims to fill the gap between the trace collection, analysis, and tuning phases. The framework contains an interactive I/O trace analysis component for end-users to visually inspect their applications' I/O behavior, focusing on areas of interest and getting a clear picture of common root causes of I/O performance bottlenecks. Based on the automatic detection of I/O performance bottlenecks, our framework maps numerous common and well-known bottlenecks and their solution recommendations that can be implemented by users.

### Dristhi / Server-side Visualization Service

The proposed work will include investigating and building server-side solutions to support the visualization of larger I/O traces and logs, while integrating with the existing analysis, reports, and recommendations.

- **Topics:** `I/O` `HPC` `visualization`, `performance analysis`
- **Skills:** Python, HTML/CSS, JavaScript
- **Difficulty:** Moderate
- **Size:** Large (350 hours)
- **Mentors:** {{% mention jeanlucabez %}} and [Suren Byna](mailto:[email protected])

### Dristhi / Visualization and Analysis of AI-based Applications

Drishti to handle metrics from non-MPI applications, specifically, AI/ML codes and applications. This work entails adapting the existing framework, heuristics, and recommendations to support metrics collected from AI/ML workloads.

- **Topics:** `I/O` `HPC` `AI` `visualization`, `performance analysis`
- **Skills:** Python, AI, performance profiling
- **Difficulty:** Moderate
- **Size:** Large (350 hours)
- **Mentors:** {{% mention jeanlucabez %}} and [Suren Byna](mailto:[email protected])
Binary file added content/project/osre24/lbl/h5bench/featured.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
32 changes: 32 additions & 0 deletions content/project/osre24/lbl/h5bench/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
---
title: "h5bench"
authors: [jeanlucabez, "Suren Byna"]
author_notes: ["Research Scientist, Lawrence Berkeley Lab", " The Ohio State University (OSU)"]
tags: ["osre24", "uc", "LBNL", "data science", "benchmarking", "compression"]
date: 2024-01-30T10:15:00-07:00
lastmod: 2024-01-30T10:15:00-07:00
---

[h5bench](https://github.com/hpc-io/h5bench) is a suite of parallel I/O benchmarks or kernels representing I/O patterns that are commonly used in HDF5 applications on high performance computing systems. h5bench measures I/O performance from various aspects, including the I/O overhead, and observed I/O rate.

Parallel I/O is a critical technique for moving data between compute and storage subsystems of supercomputers. With massive amounts of data produced or consumed by compute nodes, high-performant parallel I/O is essential. I/O benchmarks play an important role in this process; however, there is a scarcity of I/O benchmarks representative of current workloads on HPC systems. Toward creating representative I/O kernels from real-world applications, we have created h5bench, a set of I/O kernels that exercise HDF5 I/O on parallel file systems in numerous dimensions. Our focus on HDF5 is due to the parallel I/O library's heavy usage in various scientific applications running on supercomputing systems. The various tests benchmarked in the h5bench suite include I/O operations (read and write), data locality (arrays of basic data types and arrays of structures), array dimensionality (1D arrays, 2D meshes, 3D cubes), I/O modes (synchronous and asynchronous). h5bench measurements can be used to identify performance bottlenecks and their root causes and evaluate I/O optimizations. As the I/O patterns of h5bench are diverse and capture the I/O behaviors of various HPC applications, this study will be helpful to the broader supercomputing and I/O community.

### h5bench / Reporting and Enhancing

The proposed work will include standardizing and enhancing the reports generated by the suite, and integrate additional I/O kernels (e.g., HACC-IO).

- **Topics:** `I/O` `HPC` `benchmarking`
- **Skills:** Python, C/C++, good communicator
- **Difficulty:** Moderate
- **Size:** Large (350 hours)
- **Mentors:** {{% mention jeanlucabez %}} and [Suren Byna](mailto:[email protected])

### h5bench / Compression

The proposed work will focus on including compression capabilities into the h5bench core access patterns through HDF5 filters.

- **Topics:** `I/O` `HPC` `benchmarking`, `compression`
- **Skills:** C/C++, Python, HDF5
- **Difficulty:** Moderate
- **Size:** Large (350 hours)
- **Mentors:** {{% mention jeanlucabez %}} and [Suren Byna](mailto:[email protected])

0 comments on commit 2fb7e95

Please sign in to comment.