Skip to content

Commit

Permalink
Update polydedupe.py
Browse files Browse the repository at this point in the history
  • Loading branch information
gagan3012 committed Dec 4, 2023
1 parent f3f92c2 commit 9ff6b15
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions PolyDeDupe/polydedupe.py
Original file line number Diff line number Diff line change
Expand Up @@ -101,9 +101,9 @@ def save(self, filepath) -> None:
json.dump(duplicate_clusters, f)


def _compute_min_hash(element):
def _compute_min_hash(element, column_name="text"):
index, data = element
min_hash = get_min_hash(get_data(data["text"]))
min_hash = get_min_hash(get_data(data[column_name]))
if min_hash is not None:
return index, min_hash

Expand Down

0 comments on commit 9ff6b15

Please sign in to comment.