A simple script to get sentence embeddings from Google's pre-trained models on TensorFlow Hub. The script includes six such models, covering a range of embedding dimensions (20-512) and architectures.
Returns one embedding vector per input sentence.
EMBEDDING_SIZE = n
Input = ["Colorless green ideas sleep furiously.",
         "Noam Chomsky offered this as an example of a grammatically valid, semantically nonsensical sentence."]
Output = array of shape (m, n)  # m = number of sentences (= 2), n = EMBEDDING_SIZE
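A minimal sketch of the shape contract above, using a hypothetical stand-in for the real TF Hub encoder (names and the hashing-based `embed` function are illustrative, not the script's actual implementation):

```python
import numpy as np

EMBEDDING_SIZE = 20  # n; the bundled models range from 20 to 512 dimensions

def embed(sentences):
    # Stand-in encoder: one fixed-size pseudo-random row per sentence,
    # seeded from the sentence text. A real TF Hub model would return
    # learned vectors, but the output shape is the same: (m, n).
    rows = [
        np.random.default_rng(abs(hash(s)) % (2**32)).standard_normal(EMBEDDING_SIZE)
        for s in sentences
    ]
    return np.stack(rows)

sentences = ["Colorless green ideas sleep furiously.",
             "Noam Chomsky offered this as an example of a grammatically valid, "
             "semantically nonsensical sentence."]
vectors = embed(sentences)
print(vectors.shape)  # (2, 20): m sentences by n dimensions
```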
Run pip3 install -r requirements.txt
Run python3 get_embeddings.py
Add the required model's URL from TensorFlow Hub to the model URL dictionary in the script.
- Create a key for the new dictionary value (the URL) in the format "embed_size/modelName_model_url".
- Pass "embed_size/modelName" as EMBEDDING_SIZE.
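The key convention above might look like the following sketch (the dictionary name and the example TF Hub URLs are assumptions for illustration):

```python
# Hypothetical model URL dictionary; each key follows "embed_size/modelName_model_url".
model_urls = {
    "128/nnlm_model_url": "https://tfhub.dev/google/nnlm-en-dim128/2",
    "512/use_model_url": "https://tfhub.dev/google/universal-sentence-encoder/4",
}

# EMBEDDING_SIZE is the "embed_size/modelName" part of the key.
EMBEDDING_SIZE = "512/use"
url = model_urls[EMBEDDING_SIZE + "_model_url"]
print(url)
```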
All the models used (and more) are available on TensorFlow Hub.