
Commit

Update articles and CV
spaidataiga committed Jul 30, 2024
1 parent 73df296 commit e015692
Showing 3 changed files with 24 additions and 0 deletions.
Binary file modified Marjanovic_Sara_CV_PUBLIC.pdf
Binary file not shown.
12 changes: 12 additions & 0 deletions _posts/2024-07-24-ConfirmationBias.md
@@ -0,0 +1,12 @@
---
layout: post
title: From Internal Conflict to Contextual Adaptation of Language Models.
subtitle: SV Marjanović, H Yu, P Atanasova, M Maistro, C Lioma, I Augenstein
# cover-img: /assets/img/path.jpg
# thumbnail-img: /assets/img/thumb.png
# share-img: /assets/img/path.jpg
tags: [knowledge conflict, intra-memory conflict, dataset, uncertainty]
# author: Sharon Smith and Barry Simpson
---

Knowledge-intensive language understanding tasks require Language Models (LMs) to integrate relevant context, mitigating inherent weaknesses such as incomplete or outdated knowledge. Nevertheless, studies indicate that LMs often ignore the provided context because it can conflict with the memory the LM acquired during pre-training (context-memory conflict). Moreover, conflicting knowledge can already be present within the LM's parameters, termed intra-memory conflict. Existing work has studied these two types of knowledge conflict only in isolation. We conjecture that the degree of intra-memory conflict can in turn affect an LM's handling of context-memory conflict. To study this, we introduce the DYNAMICQA dataset, which contains temporally dynamic facts, which change with varying frequency over time, and disputable facts, which change depending on the viewpoint. DYNAMICQA is the first dataset to include real-world knowledge conflicts together with context for studying the link between the different types of knowledge conflict. With the proposed dataset, we assess the use of uncertainty for measuring intra-memory conflict and introduce a novel Coherent Persuasion (CP) score to evaluate the context's ability to sway the LM's semantic output. In this [preprint](https://arxiv.org/abs/2407.17023), our extensive experiments reveal that static facts, which are unlikely to change, are more easily updated with additional context than temporal and disputable facts.
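
To make the setup a little more concrete, below is a minimal, hypothetical sketch of how one might probe context-memory conflict: sample answers to the same question with and without an injected context and compare the resulting answer distributions. This is not the DYNAMICQA evaluation code and not the CP score; the model name, prompts, and sampling settings are illustrative assumptions.

```python
from collections import Counter

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # illustrative stand-in, not one of the LMs evaluated in the preprint
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()


def sample_answers(prompt: str, n: int = 10, max_new_tokens: int = 8) -> Counter:
    """Sample short continuations and count their surface forms."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        outputs = model.generate(
            **inputs,
            do_sample=True,
            top_p=0.9,
            num_return_sequences=n,
            max_new_tokens=max_new_tokens,
            pad_token_id=tokenizer.eos_token_id,
        )
    prompt_len = inputs["input_ids"].shape[1]
    answers = [
        tokenizer.decode(seq[prompt_len:], skip_special_tokens=True).strip()
        for seq in outputs
    ]
    return Counter(answers)


# Hypothetical example fact; not taken from DYNAMICQA.
question = "Q: Who is the monarch of the United Kingdom?\nA:"
context = "Context: Charles III became King of the United Kingdom in September 2022.\n"

closed_book = sample_answers(question)              # parametric memory only
with_context = sample_answers(context + question)   # after injecting the context

print("closed-book :", closed_book.most_common(3))
print("with context:", with_context.most_common(3))
```

In this toy setting, a flatter closed-book answer distribution can be read as a crude proxy for intra-memory conflict, while a shift toward the context-supported answer indicates how far the context sways the model.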
12 changes: 12 additions & 0 deletions _posts/2024-08-11-Instability.md
@@ -0,0 +1,12 @@
---
layout: post
title: Investigating the Impact of Model Instability on Explanations and Uncertainty.
subtitle: SV Marjanović, I Augenstein, C Lioma
# cover-img: /assets/img/path.jpg
# thumbnail-img: /assets/img/thumb.png
# share-img: /assets/img/path.jpg
tags: [uncertainty quantification, xai, model instability, adversarial perturbation]
# author: Sharon Smith and Barry Simpson
---

Explainable AI methods facilitate the understanding of model behaviour, yet small, imperceptible perturbations to inputs can vastly distort explanations. Because explanations are typically evaluated holistically, before model deployment, it is difficult to assess when a particular explanation is trustworthy. Some studies have tried to create confidence estimators for explanations, but none have investigated the link between uncertainty and explanation quality. We artificially simulate epistemic uncertainty in text input by introducing noise at inference time. In this large-scale empirical [study](https://arxiv.org/abs/2402.13006), accepted to ACL Findings 2024, we insert different levels of noise perturbations and measure their effect on the output of pre-trained language models and on different uncertainty metrics. Realistic perturbations have minimal effect on performance and explanations, whereas masking has a drastic effect. We find that high uncertainty does not necessarily imply low explanation plausibility; the correlation between the two can be moderately positive when the model is exposed to noise during training. This suggests that noise-augmented models may be better at identifying salient tokens when uncertain. Furthermore, when predictive and epistemic uncertainty measures are over-confident, the robustness of a saliency map to perturbation can indicate model stability issues. Integrated Gradients shows the greatest overall robustness to perturbation while still exhibiting model-specific patterns in performance; however, this phenomenon is limited to smaller Transformer-based language models.
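
For a concrete picture of the perturbation setup described above, here is a hedged sketch that injects character-level noise into a text input at inference time and tracks a simple predictive-uncertainty proxy (softmax entropy). It is not the paper's experimental code; the model, the noise scheme, and the uncertainty measure are illustrative assumptions.

```python
import random

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Illustrative off-the-shelf sentiment classifier, not a model from the study.
MODEL_NAME = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME)
model.eval()


def add_char_noise(text: str, rate: float) -> str:
    """Replace a fraction of alphabetic characters with random letters (toy noise scheme)."""
    chars = list(text)
    for i, c in enumerate(chars):
        if c.isalpha() and random.random() < rate:
            chars[i] = random.choice("abcdefghijklmnopqrstuvwxyz")
    return "".join(chars)


def predictive_entropy(text: str) -> float:
    """Entropy of the softmax output: one simple predictive-uncertainty proxy."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        probs = torch.softmax(model(**inputs).logits, dim=-1).squeeze(0)
    return float(-(probs * probs.log()).sum())


sentence = "The film was a quiet, moving triumph."
for rate in (0.0, 0.05, 0.15):
    noisy = add_char_noise(sentence, rate)
    print(f"noise rate {rate:.2f} | entropy {predictive_entropy(noisy):.3f} | {noisy}")
```

Comparing how the entropy (and, with an attribution method added on top, the saliency map) shifts across noise levels mirrors, in miniature, the kind of perturbation-versus-uncertainty comparison the study performs at scale.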
