-
Notifications
You must be signed in to change notification settings - Fork 3
/
index.html--
228 lines (228 loc) · 12.2 KB
/
index.html--
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN"
"http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-108628208-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-108628208-1');
</script>
<meta name="generator" content="jemdoc, see http://jemdoc.jaboc.net/" />
<meta http-equiv="Content-Type" content="text/html;charset=utf-8" />
<link rel="stylesheet" href="jemdoc.css" type="text/css" />
<title>Arun's Data Analytics (ADA) Lab @ UCSD</title>
</head>
<body>
<table summary="Table for page layout." id="tlayout">
<tr valign="top">
<td id="layout-menu">
<div class="menu-item"><a href="index.html" class="current">Home</a></div>
<div class="menu-item"><a href="index.html#members">Members</a></div>
<div class="menu-item"><a href="publications.html">Publications</a></div>
<div class="menu-item"><a href="news.html">News</a></div>
<div class="menu-item"><a href="impact.html">Impact</a></div>
<div class="menu-item"><a href="blog.html">Blog/Misc.</a></div>
<div class="menu-item"><a href="projects.html"><br /> Projects</a></div>
<div class="menu-item"><a href="cerebro.html">Cerebro</a></div>
<div class="menu-item"><a href="sortinghat.html">SortingHat</a></div>
<div class="menu-item"><a href="speakql.html">SpeakQL</a></div>
<div class="menu-category"><br /> Past Projects</div>
<div class="menu-item"><a href="morpheus.html">Morpheus</a></div>
<div class="menu-item"><a href="krypton.html">Krypton</a></div>
<div class="menu-item"><a href="vista.html">Vista</a></div>
<div class="menu-item"><a href="panorama.html">Panorama</a></div>
<div class="menu-item"><a href="hamlet.html">Hamlet</a></div>
<div class="menu-item"><a href="nimbus.html">Nimbus</a></div>
<div class="menu-item"><a href="slab.html">SLAB</a></div>
<div class="menu-item"><a href="orion.html">Orion</a></div>
<div class="menu-item"><a href="http://i.stanford.edu/hazy/victor/columbus/">Columbus</a></div>
<div class="menu-item"><a href="http://i.stanford.edu/hazy/victor/bismarck/">Bismarck</a></div>
<div class="menu-item"><a href="http://i.stanford.edu/hazy/staccato/">Staccato</a></div>
</td>
<td id="layout-content">
<div id="toptitle">
<h1>Arun's Data Analytics (ADA) Lab @ UCSD</h1>
</div>
<h2>Introduction</h2>
<p>As the scale, complexity, and variety of data grows (aka <i>Big Data</i>), the
use of machine learning (ML) and artificial intelligence (AI) techniques
to make sense of, and interact with, such data — collectively called predictive
data analytics, statistical data analytics, ML-based data analytics, or simply
<i>advanced data analytics</i> (also ADA!) — is increasingly critical for data-driven
applications in the enterprise, Web, science, and other domains.
Alas, building and deploying ML/AI-powered data analytics applications
still involves far too many bottlenecks that slow down the lifecycle of
such applications, raise costs, frustrate many application users, and in
some cases, make high-quality data-driven decision making almost impossible.</p>
<div class="infoblock">
<div class="blockcontent">
<p>The mission of the ADALab is to <i>democratize advanced data analytics</i>
by making it dramatically easier, faster, and cheaper to build and deploy
ML/AI-powered data analytics applications throughout their lifecycle.</p>
</div></div>
<p>We are an academic research group headed by <a href="http://cseweb.ucsd.edu/~arunkk" target=_blank>Dr. Arun Kumar</a>, and we are a
part of the <a href="http://cse.ucsd.edu/" target=_blank>Department of Computer Science and Engineering (CSE)</a> and
the <a href="https://datascience.ucsd.edu/" target=_blank>Halicioglu Data Science Institute</a> at
the <a href="https://ucsd.edu/" target=_blank>University of California, San Diego (UCSD)</a>.
We are members of CSE's <a href="https://dbucsd.github.io/" target=_blank>Database Lab</a> and affiliate
members of the <a href="http://ai.ucsd.edu/" target=_blank>Artificial Intelligence Group</a> and
<a href="http://cns.ucsd.edu/" target=_blank>Center for Networked Systems</a>.</p>
<h2>Overview of Our Research</h2>
<p>The ADA lifecycle typically revolves around data scientists or ML engineers.
Based on conversations with dozens of such data-related professionals, we
abstract the ADA lifecyle as follows.
After identifying the tasks where ML/AI might benefit their application in
terms of business impact or scientific insights, the data scientist steers
three main processes, as illustrated below:</p>
<ul>
<li><p><b>Data Sourcing</b>: Identify, collect, clean, and organize data in to a form that can be used to train ML models.</p>
</li>
<li><p><b>Model Building</b>: Perform <i>model selection</i> with the data to obtain desired prediction functions.</p>
</li>
<li><p><b>Model Deployment</b>: Integrate trained prediction functions with the application and oversee lifecycle.</p>
</li>
</ul>
<table class="imgtable"><tr><td>
<img src="images/adalab.jpg" alt="" width="600px" /> </td>
<td align="left"></td></tr></table>
<div class="infoblock">
<div class="blockcontent">
<p>The ADALab's approach to democratizing advanced data analytics involves
<i>accelerating the ADA lifecyle</i> by removing bottlenecks for both the <i>efficiency
of the systems and algorithms</i> involved and the <i>productivity of the data
practitioners</i> involved.</p>
</div></div>
<p>Towards this grand goal, we synthesize and innovate upon the fields of data
management, ML/AI, systems, and human-computer interaction.
Our projects target all parts of the ADA lifecycle, and our work spans the whole
gamut of building new data systems, algorithms, empirical analysis, and theoretical
analysis. All of our systems are released as open source software.</p>
<p>We also enjoy interacting with, and learning from, practitioners — data
scientists, ML/software engineers, and domain scientists — and working with them to
help them adopt our systems and ideas.</p>
<p>The list of current ADALab projects is here: <a href="projects.html" target=_blank><b>Projects</b></a>.</p>
<p>The list of ADALab publications is here: <a href="publications.html" target=_blank><b>Publications</b></a>.</p>
<h2>Recent ADALab News</h2>
<ul>
<li><p>9/21: Arun is tenured and promoted to Associate Professor at UC San Diego.</p>
</li>
<li><p>8/21: A new <a href="https://adalabucsd.github.io/research-blog/research/2021/08/13/vldb2021.html" target=_blank><b>blog post</b></a> summarizing the papers, talks, demo, and panel discussions at VLDB and KDD by ADALab members.</p>
</li>
<li><p>7/21: The Kingpin paper on optimized execution of mass ML model building over data sub-groups,
part of the <a href="https://adalabucsd.github.io/cerebro.html" target=_blank><b>Cerebro</b></a> project, is accepted to VLDB 2021.</p>
</li>
<li><p>6/21: Huge congrats to Kabir on winning 1st place at the undergraduate level and to
Side on winning 2nd place at the graduate level of the SIGMOD 2021 Student Research Competition!
This is the strongest showing ever at the SIGMOD SRC by Database Lab students.</p>
</li>
</ul>
<p>Full list of lab news items here: <a href="news.html" target=_blank><b>News</b></a>.<br /></p>
<a name="members"><h2>Members</h2></a>
<h3>Faculty</h3>
<table class="imgtable"><tr><td>
<img src="images/arun.jpg" alt="" width="100px" /> </td>
<td align="left"><p><a href="http://cseweb.ucsd.edu/~arunkk" target=_blank><b>Arun Kumar</b></a><br />
Associate Professor, CSE and HDSI<br />
Email: arunkk [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3218</p>
</td></tr></table>
<h3>Graduate Students</h3>
<table class="imgtable"><tr><td>
<img src="images/kabir.png" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://www.linkedin.com/in/kabir-nagrecha-952591152/" target=_blank><b>Kabir Nagrecha</b></a><br />
PhD, CSE, UCSD<br />
Email: knagrech [at] ucsd [dot] edu</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="https://kyleluoma.github.io/assets/img/profile.jpg" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://kyleluoma.github.io/" target=_blank><b>Kyle Luoma</b></a><br />
PhD, CSE, UCSD<br />
Email: kluoma [at] ucsd [dot] edu<br />
Office: CSE 3232</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="https://scnakandala.github.io/assets/images/profile_pic.jpeg" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://scnakandala.github.io/" target=_blank><b>Supun Nakandala</b></a><br />
PhD, CSE, UCSD<br />
Email: snakanda [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3232</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="images/tara.png" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://www.linkedin.com/in/tara-mirmira/" target=_blank><b>Tara Mirmira</b></a><br />
PhD, CSE, UCSD<br />
Email: tmirmira [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3232</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="https://pvn25.github.io/images/vraj.jpeg" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://pvn25.github.io/" target=_blank><b>Vraj Shah</b></a><br />
PhD, CSE, UCSD<br />
Email: vps002 [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3230</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="images/xiuwen.jpeg" alt="" width="100px" /> </td>
<td align="left"><p><a href="https://xiz675.github.io/" target=_blank><b>Xiuwen Zheng</b></a><br />
PhD, CSE, UCSD<br />
Email: xiz675 [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3232</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="https://yhzhang.info/images/bio-photo.jpg" alt="" width="100px" /> </td>
<td align="left"><p><a href="http://yhzhang.info/" target=_blank><b>Yuhao Zhang</b></a><br />
PhD, CSE, UCSD<br />
Email: yuz870 [at] eng [dot] ucsd [dot] edu<br />
Office: CSE 3230</p>
</td></tr></table>
<table class="imgtable"><tr><td>
<img src="images/liangde.jpeg" alt="" width="100px" /> </td>
<td align="left"><p><b>Liangde Li</b><br />
MS, CSE, UCSD<br />
Email: lil009 [at] ucsd [dot] edu<br />
Office: CSE 3232</p>
</td></tr></table>
<h3>Alumni</h3>
<ul>
<li><p>Advitya Gemawat, BS, HDSI, UCSD, 2021. First employment: Microsoft NERD AI.</p>
</li>
<li><p>Side Li, MS, CSE, UCSD, 2021. First employment: Google.</p>
</li>
<li><p>Kabir Nagrecha, BS, CSE, UCSD, 2021. First employment: PhD at UCSD.</p>
</li>
<li><p>Shaoqing Yi, BS, HDSI and Math, UCSD, 2021. First employment: PhD at UC Berkeley.</p>
</li>
<li><p>Kevin Yang, BS, CSE, UCSD, 2020. First employment: MS at UPenn.</p>
</li>
<li><p>David Justo, MS, CSE, UCSD, 2019 (Co-advisor: Nadia Polikarpova). First employment: Microsoft.</p>
</li>
<li><p>Lingjiao Chen. MS, CS, UW-Madison, 2018 (Co-advisor: Paraschos Koutris). First employment: PhD at Stanford.</p>
</li>
<li><p>Side Li. BS, CSE, UCSD, 2018. First employment: Amazon.</p>
</li>
<li><p>Anthony Thomas. MS, CSE, UCSD, 2018. First employment: PhD at UCSD.</p>
</li>
<li><p>Mingyang Wang. MS, CSE, UCSD, 2017. First employment: Amazon.</p>
</li>
</ul>
<h2>Sponsors</h2>
<p>We thank the following organizations for their generous support of our research. Any findings or opinions expressed in our research publications or articles are our own and do not necessarily reflect the views of any of these organizations.</p>
<table class="imgtable"><tr><td>
<img src="images/sponsors.jpg" alt="" height="100px" /> </td>
<td align="left"></td></tr></table>
<p>Past sponsors: Hellman Fellows Fund, NVIDIA, and Opera Solutions.</p>
<h2>About our Lab's Name</h2>
<p>Apart from being a convenient acronym, it is also a tribute to <a href="https://en.wikipedia.org/wiki/Ada_Lovelace" target=_blank>Ada Lovelace</a>, widely regarded as the first computer programmer. This tribute is part of our lab's commitment to help foster a diverse and inclusive community in computing, as enshrined in the <a href="https://ucsd.edu/about/principles.html" target=_blank>UCSD Principles of Community</a>, for people from all backgrounds, including women, LGBTQ+ people, people of color, and people with disabilities.</p>
<div id="footer">
<div id="footer-text">
HTML generated by <a href="http://jemdoc.jaboc.net/" target=_blank>jemdoc</a>.
</div>
</div>
</td>
</tr>
</table>
</body>
</html>