Commit

Update clustering-visualizing-word-embeddings.md
Remove duplicate title.
anisa-hawes authored Sep 29, 2023
1 parent 34d4916 commit 5a68995
Showing 1 changed file with 0 additions and 2 deletions.
2 changes: 0 additions & 2 deletions en/lessons/clustering-visualizing-word-embeddings.md
@@ -26,8 +26,6 @@ doi: 10.46430/phen0111

{% include toc.html %}

# Clustering and Visualising Documents using Word Embeddings

## Introduction

As corpora are increasingly 'born digital' on hard drives as well as web and email servers, we are moving from being able to select or group documents using keyword or manual searches to needing to be able to automate this task at scale. Moreover, large-ish, unlabelled corpora of thousands or tens-of-thousands of documents are not particularly well-suited to topic modelling or TF/IDF analysis either. Since we don't have a sense of what kinds of groups might exist, what kinds of topics might be covered, or what level of distinctiveness in vocabulary might matter, we need different, more flexible ways to visualise and extract structure from texts.
