From 83444a95960a60ab9c3ce98ef8139c552e71b9ee Mon Sep 17 00:00:00 2001 From: "Christina K." Date: Fri, 20 Sep 2024 13:04:03 -0500 Subject: [PATCH 1/2] Create YSU_Yu.yaml --- projects/YSU_Yu.yaml | 27 +++++++++++++++++++++++++++ 1 file changed, 27 insertions(+) create mode 100644 projects/YSU_Yu.yaml diff --git a/projects/YSU_Yu.yaml b/projects/YSU_Yu.yaml new file mode 100644 index 000000000..9dee6a1c3 --- /dev/null +++ b/projects/YSU_Yu.yaml @@ -0,0 +1,27 @@ +Department: Computer Science and Information Systems +Description: "Approximate query processing (or AQP) is an emerging research topic\ + \ in big data analytics. AQP focuses on deriving fast and accurate estimations for\ + \ complex queries that are usually time-consuming and expensive to run on large\ + \ datasets. Traditional methods, such as histogram and sketch, are insufficient\ + \ when applied to big data because of the processing limits. An essential question\ + \ lacking research is how to assess the errors of AQP estimations.\nThis project\ + \ focuses on assessing the errors of AQP query estimations, especially for common\ + \ selection queries. Traditional methods can generate confidence intervals for query\ + \ estimations based on strict assumptions such as the normal distribution assumption.\ + \ Therefore, they are not applicable to massive datasets. In this project, the PI\ + \ will employ a novel non-parametric statistical method called bootstrap sampling\ + \ which requires less strict assumptions and brings many statistical advantages.\n\ + A prototype system will be developed employing bootstrap sampling to efficiently\ + \ compute standard errors and confidence intervals for AQP systems, especially those\ + \ answering selection queries, namely \u03C3-AQP. Selection queries comprise a large\ + \ portion of daily data queries. For broader applications, this framework will allow\ + \ selection queries to include common aggregation operators such as average, sum,\ + \ and count. The PI will investigate the computing and storage costs when bootstrap\ + \ replicas are computed. A framework will be developed to automate both the AQP\ + \ estimation and error estimation operations. Extensive benchmarks will be performed\ + \ on large datasets such as the TPC-H benchmark." +FieldOfScience: Computer Science +FieldOfScienceID: '11.0701' +InstitutionID: Unknown +Organization: Youngstown State University +PIName: Feng Yu From 4176c6211dbc902c6fc5730a6474910343b7b3b0 Mon Sep 17 00:00:00 2001 From: Matyas Selmeci Date: Tue, 24 Sep 2024 13:32:18 -0500 Subject: [PATCH 2/2] Add InstitutionID --- projects/YSU_Yu.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/projects/YSU_Yu.yaml b/projects/YSU_Yu.yaml index 9dee6a1c3..a8a5687c7 100644 --- a/projects/YSU_Yu.yaml +++ b/projects/YSU_Yu.yaml @@ -22,6 +22,6 @@ Description: "Approximate query processing (or AQP) is an emerging research topi \ on large datasets such as the TPC-H benchmark." FieldOfScience: Computer Science FieldOfScienceID: '11.0701' -InstitutionID: Unknown +InstitutionID: "https://osg-htc.org/iid/afu2chyu4qlj" Organization: Youngstown State University PIName: Feng Yu