<!DOCTYPE html>
<html class="h-100" lang="en">
    <head>
        <title>Jason Vega - CS PhD Student @ UIUC</title>
        <meta charset="utf-8">
        <meta name="viewport" content="width=device-width, initial-scale=1">
        <link rel="stylesheet" href="style.css">
        <link rel="icon" type="image/png" href="favicon.png">
        <link href="https://cdn.jsdelivr.net/npm/bootstrap@5.3.2/dist/css/bootstrap.min.css" rel="stylesheet" integrity="sha384-T3c6CoIi6uLrA9TneNEoa7RxnatzjcDSCmG1MXxSR1GAsXEV/Dwwykc2MPK8M2HN" crossorigin="anonymous">
        <script src="https://cdn.jsdelivr.net/npm/bootstrap@5.3.2/dist/js/bootstrap.bundle.min.js" integrity="sha384-C6RzsynM9kWDrMNeT87bh95OGNyZPhcTNXj1NW7RuBCsyN/o0jlpcV8Qyq46cDfL" crossorigin="anonymous"></script>
    </head>
    <body class="h-100 bg-light text-dark">
        <div class="d-flex flex-column h-100">
            <nav class="navbar navbar-expand-sm navbar-dark bg-dark text-light px-3">
                <a class="navbar-brand" href="index.html">Jason Vega</a>
            </nav>
            <div class="container-fluid flex-grow-1">
                <div class="row p-4 bg-light bg-gradient text-dark justify-content-center">
                    <div class="col" id="bioCol">
                        <div class="row align-items-center">
                            <div class="col-lg-3 mb-4 mb-lg-0 text-center">
                                <img src="headshot.jpeg" class="img-fluid img-thumbnail rounded-circle" alt="Headshot of Jason Vega" />
                            </div>
                            <div class="col-lg-9">
                                <div class="row">
                                    <div class="col">
                                        <p>
                                            Hi there! I'm a third-year computer science Ph.D. student at the <a href="https://cs.illinois.edu">University of Illinois Urbana-Champaign</a> working on artificial intelligence research, particularly on topics in trustworthy machine learning. I'm a member of the <a href="https://ggndpsngh.github.io">FOrmally
                                            Certified Automation and Learning (FOCAL) Lab</a>, where I'm advised by Prof. Gagandeep Singh. I graduated from the <a href="https://cse.ucsd.edu">University of California San Diego</a> in June 2022 with a B.S. in Computer Science. My research vision is to enable efficient, ethical development of intelligent systems that are 
                                            highly performant yet safe, transparent and ultimately beneficial to humanity.
                                        </p>
                                    </div>
                                </div>
                                <div class="row align-items-center justify-content-center">
                                    <div class="col-auto py-2 text-center">
                                        <a class="btn btn-outline-primary" href="CV.pdf">CV</a>
                                    </div>
                                    <div class="col-auto py-2 text-center">
                                        <a class="btn btn-outline-primary" href="https://www.threads.net/@_jasonvega">Threads</a>
                                    </div>
                                    <div class="col-auto py-2 text-center">
                                        <a class="btn btn-outline-primary" href="https://medium.com/@jasonvega14">Medium</a>
                                    </div>
                                    <div class="col-auto py-2 text-center">
                                        <a class="btn btn-outline-primary" href="https://www.linkedin.com/in/jason-vega/">LinkedIn</a>
                                    </div>
                                    <div class="col-auto py-2 text-center">
                                        <a class="btn btn-outline-primary" href="mailto:javega3@illinois.edu">Email</a>
                                    </div>
                                </div>
                            </div>
                        </div>
                    </div>
                </div>
                <div class="row pb-4 px-4 justify-content-center">
                    <div class="col" id="mainContentCol">
                        <h2>
                            Research Interests
                        </h2>
                        <ul class="mb-0">
                            <li>
                                <b>Safety of Large Language Models (LLMs)</b>
                                <ul>
                                    <li>
                                        Efficient attacks for bypassing the safety alignment of LLMs
                                    </li>
                                </ul>
                            </li>
                        </ul>
                        <br>
                        <h2>
                            Papers
                        </h2>
                        <p>
                            (* denotes equal contribution)
                        </p>
                        <div class="card bg-light text-dark">
                            <div class="card-body">
                                <h5 class="card-title">
                                    Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
                                </h5>
                                <h6 class="card-subtitle mb-2 text-muted">
                                    <b>Jason Vega</b>, Junsheng Huang*, Gaokai Zhang*, Hangoo Kang*, Minjia Zhang, Gagandeep Singh
                                </h6>
                                <h6 class="card-subtitle mb-2 text-muted">
                                    arXiv, 2024; under peer review
                                </h6>
                                <p class="card-text">
                                    We show that low-resource and unsophisticated attackers, i.e., <i>stochastic monkeys</i>, can significantly improve their chances of bypassing the safety alignment of SOTA LLMs with just 25 random augmentations per prompt.
                                </p>
                                <a href="https://arxiv.org/abs/2411.02785" class="card-link">Paper</a>
                            </div>
                        </div>
                        <br>
                        <div class="card bg-light text-dark">
                            <div class="card-body">
                                <h5 class="card-title">
                                    Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
                                </h5>
                                <h6 class="card-subtitle mb-2 text-muted">
                                    <b>Jason Vega*</b>, Isha Chaudhary*, Changming Xu*, Gagandeep Singh
                                </h6>
                                <h6 class="card-subtitle mb-2 text-muted">
                                    ICLR 2024, Tiny Papers
                                </h6>
                                <p class="card-text">
                                    We investigate the fragility of SOTA open-source LLMs under simple, optimization-free attacks we refer to as priming attacks (now known as <i>prefilling attacks</i>), which are easy to execute and effectively bypass alignment from safety training.
                                </p>
                                <a href="https://arxiv.org/abs/2312.12321" class="card-link">Paper</a>
                                <a href="https://github.com/uiuc-focal-lab/llm-priming-attacks" class="card-link">Code</a>
                                <a href="https://llmpriming.focallab.org" class="card-link">Website</a>
                            </div>
                        </div>
                        <br>
                        <h2>
                            Other
                        </h2>
                        <ul>
                            <li>
                                I grew up in the Bay Area &#x1F309; and will always be a Californian &#x1F43B; at heart.
                            </li>
                            <li>
                                Outside of research, I enjoy:
                                <ul>
                                    <li>
                                        Playing the violin &#x1F3BB; in the <a href="https://music.illinois.edu/perform/orchestras/philharmonia-orchestra/">UIUC Philharmonia Orchestra</a>
                                    </li>
                                    <li>
                                        Going for a run &#x1F3C3;
                                    </li>
                                    <li>
                                        Watching films and shows &#x1F3A5;
                                    </li>
                                </ul>
                            </li>
                        </ul>
                    </div>
                </div>
            </div>
        </div>
    </body>
</html>