Commit

Merge pull request #3034 from programminghistorian/Issue-3033
Update clustering-visualizing-word-embeddings.md
anisa-hawes authored Oct 4, 2023
2 parents f550603 + 5a68995 commit df1b5e8
Showing 1 changed file with 0 additions and 2 deletions.
2 changes: 0 additions & 2 deletions en/lessons/clustering-visualizing-word-embeddings.md
@@ -26,8 +26,6 @@ doi: 10.46430/phen0111

{% include toc.html %}

# Clustering and Visualising Documents using Word Embeddings

## Introduction

As corpora are increasingly 'born digital' on hard drives as well as web and email servers, we are moving from being able to select or group documents using keyword or manual searches to needing to be able to automate this task at scale. Moreover, large-ish, unlabelled corpora of thousands or tens-of-thousands of documents are not particularly well-suited to topic modelling or TF/IDF analysis either. Since we don't have a sense of what kinds of groups might exist, what kinds of topics might be covered, or what level of distinctiveness in vocabulary might matter, we need different, more flexible ways to visualise and extract structure from texts.