Optimized Faiss

This is an optimized version of Faiss by Intel. It is based on open-sourced Faiss 1.6.3.

To build original Faiss with no optimization, just follow the original build way, like:

./configure --without-cuda
make clean
make

Up to now, features included are as followings:

- IVFPQ Relayout

This feature changes the layout of PQ code in InvertedLists in IndexIVFPQ. The new layout not only improves the cache hit rate, but also enables compiler-level SIMD optimization.

To enable this feature, you should append --enable-ivfpq-relayout to ./configure, such as:

./configure --without-cuda --enable-ivfpq-relayout
make clean
make

Then, an IndexIVFPQ instance can be set to use this feature by such code:

# in Python
ps = faiss.ParameterSpace ()
ps.set_index_parameter (index, "ivfpq_relayout", 4)

or

# in C++
faiss::ParameterSpace ps;
ps.set_index_parameter (index, "ivfpq_relayout", 4);

where ivfpq_relayout means the group size of relayout. Although ivfpq_relayout can be any non-negative integer, but 4 is usually the best based on experience.

You can set ivfpq_relayout to any non-negative integer at any time, no matter before or after trainging or adding vectors. The only drawback is that, if you set ivfpq_relayout when IndexIVFPQ already has some base vectors, it will take some time to convert the memory layout.

The feature doesn't change the format of index writen to disk, so is compatible with old index files.

- DType (Diverse Type)

This is a family of extension, including new metric types, new data types and new implementation types.

-- Metirc Type

Two new metric types, METRIC_L2_EXPAND and METRIC_PROJECTION are added to MetricType.

METRIC_L2_EXPAND is similar to METRIC_L2, but it calculates y*y - 2*x*y. It has same effect if you only case the recalling and ranking, but ignore the calculated distances. METRIC_L2_EXPAND is better than METRIC_L2 in that the calculation is more efficient to take advantage of BLAS.

METRIC_PROJECTION is to calculate the length of projection of x in the direction of y. Ff you only case the recalling and ranking, it has same effect as Cosine Distance.

-- Data Type

This feature also enhances Index with optional data types (of internal storage). Currently, FP32 and BFP16 are supported. This feature makes it possible to get a balance between accuarcy and performance. Take BFP16 for an example, BFP16 can be seen as a short version of traditional single float-point, FP32, skipping the least significant 16 bits of mantissa. Compared with FP16 which is supported by GPU, it provides as large range as FP32 (while FP16 has much smaller range) with the sacrifice of precision. The benifit is that the memory size and bandwidth is half of that of FP32, which offers much lower latency and much higher throughput.

-- Implementation Type

The original Faiss doesn't use BLAS when scanning InvertedList. This feature brings a serial of InvertedListScanner using BLAS.

To enable these features, you should append --enable-flat-dtype and --enable-ivfflat-dtype to ./configure, such as:

./configure --without-cuda --enable-flat-dtype --enable-ivfflat-dtype
make clean
make

An instance of IndexFlat can be built by such code:

# in Python

# an original 128-dimension IndexFlat
index1 = faiss.index_factory (128, "Flat")

# a 128-dimension IndexFlat using FP32 to store vectors
index2 = faiss.index_factory (128, "FP32,Flat")

# a 128-dimension IndexFlat using BFP16 to store vectors
index3 = faiss.index_factory (128, "BFP16,Flat")

or

# in C++

# an original 128-dimension IndexFlat
Index* index1 = faiss::index_factory (128, "Flat")

# a 128-dimension IndexFlat of L2_EXPAND using FP32 to store vectors
Index* index2 = faiss::index_factory (128, "FP32,Flat", faiss::METRIC_L2_EXPAND)

# a 128-dimension IndexFlat using BFP16 to store vectors
Index* index3 = faiss::index_factory (128, "BFP16,Flat")

An instance of IndexIVFFlat can be built by such code:

# in Python

# an original 128-dimension Index1024,Flat
index1 = faiss.index_factory (128, "IVF1024,Flat")

# a 128-dimension Index1024,Flat using FP32 to store vectors
index2 = faiss.index_factory (128, "FP32,IVF1024,Flat")

# a 128-dimension Index1024,Flat using BFP16 to store vectors
index3 = faiss.index_factory (128, "BFP16,IVF1024,Flat")

or

# in C++

# an original 128-dimension Index1024,Flat
Index* index1 = faiss::index_factory (128, "IVF1024,Flat")

# a 128-dimension Index1024,Flat using FP32 to store vectors
Index* index2 = faiss::index_factory (128, "FP32,IVF1024,Flat")

# a 128-dimension Index1024,Flat using BFP16 to store vectors
Index* index3 = faiss::index_factory (128, "BFP16,IVF1024,Flat")

When in conjunction with Flat DType, it is possible to build a mixed IVFFlat. For example, "FP32,IVF1024,BFP16,Flat" builds an IVFFlat where the first layer is a FP32 Flat, while the second layer is BFP16 Flat.

The new IndexFlat and IndexIVFFlat with DType is NOT compatible with old index files. So you need to train and save a new one.

Name		Name	Last commit message	Last commit date
Latest commit History 300 Commits
.github		.github
.travis		.travis
acinclude		acinclude
benchs		benchs
build-aux		build-aux
c_api		c_api
conda		conda
demos		demos
docs		docs
example_makefiles		example_makefiles
gpu		gpu
images		images
impl		impl
misc		misc
python		python
tests		tests
tutorial		tutorial
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
.travis.yml		.travis.yml
AutoTune.cpp		AutoTune.cpp
AutoTune.h		AutoTune.h
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Clustering.cpp		Clustering.cpp
Clustering.h		Clustering.h
DirectMap.cpp		DirectMap.cpp
DirectMap.h		DirectMap.h
Dockerfile		Dockerfile
INSTALL.md		INSTALL.md
IVFlib.cpp		IVFlib.cpp
IVFlib.h		IVFlib.h
Index.cpp		Index.cpp
Index.h		Index.h
Index2Layer.cpp		Index2Layer.cpp
Index2Layer.h		Index2Layer.h
IndexBinary.cpp		IndexBinary.cpp
IndexBinary.h		IndexBinary.h
IndexBinaryFlat.cpp		IndexBinaryFlat.cpp
IndexBinaryFlat.h		IndexBinaryFlat.h
IndexBinaryFromFloat.cpp		IndexBinaryFromFloat.cpp
IndexBinaryFromFloat.h		IndexBinaryFromFloat.h
IndexBinaryHNSW.cpp		IndexBinaryHNSW.cpp
IndexBinaryHNSW.h		IndexBinaryHNSW.h
IndexBinaryHash.cpp		IndexBinaryHash.cpp
IndexBinaryHash.h		IndexBinaryHash.h
IndexBinaryIVF.cpp		IndexBinaryIVF.cpp
IndexBinaryIVF.h		IndexBinaryIVF.h
IndexFlat.cpp		IndexFlat.cpp
IndexFlat.h		IndexFlat.h
IndexFlat_T.h		IndexFlat_T.h
IndexHNSW.cpp		IndexHNSW.cpp
IndexHNSW.h		IndexHNSW.h
IndexIVF.cpp		IndexIVF.cpp
IndexIVF.h		IndexIVF.h
IndexIVFFlat.cpp		IndexIVFFlat.cpp
IndexIVFFlat.h		IndexIVFFlat.h
IndexIVFFlat_T.h		IndexIVFFlat_T.h
IndexIVFPQ.cpp		IndexIVFPQ.cpp
IndexIVFPQ.h		IndexIVFPQ.h
IndexIVFPQR.cpp		IndexIVFPQR.cpp
IndexIVFPQR.h		IndexIVFPQR.h
IndexIVFSpectralHash.cpp		IndexIVFSpectralHash.cpp
IndexIVFSpectralHash.h		IndexIVFSpectralHash.h
IndexLSH.cpp		IndexLSH.cpp
IndexLSH.h		IndexLSH.h
IndexLattice.cpp		IndexLattice.cpp
IndexLattice.h		IndexLattice.h
IndexPQ.cpp		IndexPQ.cpp
IndexPQ.h		IndexPQ.h
IndexPreTransform.cpp		IndexPreTransform.cpp
IndexPreTransform.h		IndexPreTransform.h
IndexReplicas.cpp		IndexReplicas.cpp
IndexReplicas.h		IndexReplicas.h
IndexScalarQuantizer.cpp		IndexScalarQuantizer.cpp
IndexScalarQuantizer.h		IndexScalarQuantizer.h
IndexShards.cpp		IndexShards.cpp
IndexShards.h		IndexShards.h
InvertedLists.cpp		InvertedLists.cpp
InvertedLists.h		InvertedLists.h
LICENSE		LICENSE
Makefile		Makefile
MatrixStats.cpp		MatrixStats.cpp
MatrixStats.h		MatrixStats.h
MetaIndexes.cpp		MetaIndexes.cpp
MetaIndexes.h		MetaIndexes.h
MetricType.h		MetricType.h
OnDiskInvertedLists.cpp		OnDiskInvertedLists.cpp
OnDiskInvertedLists.h		OnDiskInvertedLists.h
README.md		README.md
VectorTransform.cpp		VectorTransform.cpp
VectorTransform.h		VectorTransform.h
clone_index.cpp		clone_index.cpp
clone_index.h		clone_index.h
configure		configure
configure.ac		configure.ac
faiss		faiss
index_factory.cpp		index_factory.cpp
index_factory.h		index_factory.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optimized Faiss

- IVFPQ Relayout

- DType (Diverse Type)

-- Metirc Type

-- Data Type

-- Implementation Type

About

Releases

Packages

Languages

License

zhou-yuxin/optimized-faiss

Folders and files

Latest commit

History

Repository files navigation

Optimized Faiss

- IVFPQ Relayout

- DType (Diverse Type)

-- Metirc Type

-- Data Type

-- Implementation Type

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages