Skip to content

Commit

Permalink
Changed labels variable to content labels
Browse files Browse the repository at this point in the history
  • Loading branch information
justinbt1 committed Apr 24, 2022
1 parent 942acd1 commit 2b29513
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,13 +25,13 @@ content = [
]

# Labels for each text in content.
labels = [1, 2, 3, 4, 5, 6, 7, 8, 9]
content_labels = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# Create MinHash object.
minhash = MinHash(content, n_gram=9, permutations=100, hash_bits=64, seed=3)

# Create LSH model.
lsh = LSH(minhash, labels, no_of_bands=50)
lsh = LSH(minhash, content_labels, no_of_bands=50)

# Query to find near duplicates for text 1.
print(lsh.query(1, min_jaccard=0.5))
Expand Down

0 comments on commit 2b29513

Please sign in to comment.