End-to-end earthquake detection pipeline via efficient time series similarity search
-
Updated
Jul 6, 2023 - Jupyter Notebook
End-to-end earthquake detection pipeline via efficient time series similarity search
There are Python 2.7 codes and learning notes for Spark 2.1.1
A text similarity computation using minhashing and Jaccard distance on reuters dataset
SetSketch: Filling the Gap between MinHash and HyperLogLog
A Clojure library for querying large data-sets on similarity
A simple audio fingerprinting system
insight data engineering fellow project
Project 1: Similar document searching via MinHash and Locality Sensitive Hashing
MinHash and LSH index written in Rust for Node.js
An easy-to-use script for fast similarity search in the textual data (and embedding space) with GPU & Multi-core support.
Minhash text analyzer developed during Algorithmics subject.
Minhash clustering of text documents
SpellChecker: an application to check for spell errors.
📃Document similarity detection using hashing
An improved method of locality-sensitive hashing for scalable instance matching. In this study, we propose a scalable approach for automatically identifying similar candidate instance pairs in very large datasets utilizing minhash-lsh-algorithm in C#.
Textual data manipulation projects with applications of advanced data mining techniques: recommendation systems, information retrieval systems, search engines, latent sentiment analysis, pagerank, PCA.
documents my master's level thesis work on building continous, topical web crawler based on mercator 1999
Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality Sensitve Hashing
Add a description, image, and links to the minhash-lsh-algorithm topic page so that developers can more easily learn about it.
To associate your repository with the minhash-lsh-algorithm topic, visit your repo's landing page and select "manage topics."