
Commit df12352

Merge pull request #1954 from redis/DOC-5533-go-vec-set-embed-example
DOC-5533 Go vector set embeddings example
2 parents 23f44ed + d32ddd7 commit df12352

File tree

4 files changed (+539, −56 lines)


content/develop/clients/go/vecsearch.md

Lines changed: 22 additions & 39 deletions
@@ -29,9 +29,8 @@ or JSON fields, Redis can retrieve documents that closely match the query in ter
 of their meaning.
 
 In the example below, we use the
-[`huggingfaceembedder`](https://pkg.go.dev/github.com/henomis/[email protected]/embedder/huggingface)
-package from the [`LinGoose`](https://pkg.go.dev/github.com/henomis/[email protected])
-framework to generate vector embeddings to store and index with
+[`Hugot`](https://pkg.go.dev/github.com/knights-analytics/hugot)
+library to generate vector embeddings to store and index with
 Redis Query Engine. The code is first demonstrated for hash documents with a
 separate section to explain the
 [differences with JSON documents](#differences-with-json-documents).
@@ -47,38 +46,23 @@ for more information.
 
 ## Initialize
 
-Start a new Go module with the following command:
-
-```bash
-go mod init vecexample
-```
-
-Then, in your module folder, install
-[`go-redis`]({{< relref "/develop/clients/go" >}})
-and the
-[`huggingfaceembedder`](https://pkg.go.dev/github.com/henomis/[email protected]/embedder/huggingface)
-package:
+First, install [`go-redis`]({{< relref "/develop/clients/go" >}})
+if you haven't already done so. Then, install
+[`Hugot`](https://pkg.go.dev/github.com/knights-analytics/hugot)
+using the following command:
 
 ```bash
-go get github.com/redis/go-redis/v9
-go get github.com/henomis/lingoose/embedder/huggingface
+go get github.com/knights-analytics/hugot
 ```
 
 Add the following imports to your module's main program file:
 
 {{< clients-example set="home_query_vec" step="import" lang_filter="Go" >}}
 {{< /clients-example >}}
 
-You must also create a [HuggingFace account](https://huggingface.co/join)
-and add a new access token to use the embedding model. See the
-[HuggingFace](https://huggingface.co/docs/hub/en/security-tokens)
-docs to learn how to create and manage access tokens. Note that the
-account and the `all-MiniLM-L6-v2` model that we will use to produce
-the embeddings for this example are both available for free.
-
 ## Add a helper function
 
-The `huggingfaceembedder` model outputs the embeddings as a
+The `Hugot` model outputs the embeddings as a
 `[]float32` array. If you are storing your documents as
 [hash]({{< relref "/develop/data-types/hashes" >}}) objects, then you
 must convert this array to a `byte` string before adding it as a hash field.
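The helper itself appears in a snippet that isn't rendered in this diff. As a rough illustration, a `[]float32`-to-bytes conversion like the one described might look like the sketch below, which assumes little-endian packing (the layout Redis vector fields expect for `FLOAT32` data); the actual `floatsToBytes()` in the example may differ in detail:

```go
package main

import (
	"encoding/binary"
	"fmt"
	"math"
)

// floatsToBytes packs a []float32 embedding into the little-endian
// byte string that a Redis hash vector field expects.
func floatsToBytes(fs []float32) []byte {
	buf := make([]byte, len(fs)*4)
	for i, f := range fs {
		binary.LittleEndian.PutUint32(buf[i*4:], math.Float32bits(f))
	}
	return buf
}

func main() {
	b := floatsToBytes([]float32{0.25, -1.0})
	fmt.Println(len(b)) // prints 8: 4 bytes per float32
}
```

The reverse conversion (bytes back to floats) follows the same pattern with `binary.LittleEndian.Uint32` and `math.Float32frombits`.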
@@ -119,11 +103,10 @@ and 384 dimensions, as required by the `all-MiniLM-L6-v2` embedding model.
 
 ## Create an embedder instance
 
-You need an instance of the `huggingfaceembedder` class to
+You need an instance of the `FeatureExtractionPipeline` class to
 generate the embeddings. Use the code below to create an
 instance that uses the `sentence-transformers/all-MiniLM-L6-v2`
-model, passing your HuggingFace access token to the `WithToken()`
-method.
+model:
 
 {{< clients-example set="home_query_vec" step="embedder" lang_filter="Go" >}}
 {{< /clients-example >}}
@@ -134,12 +117,12 @@ You can now supply the data objects, which will be indexed automatically
 when you add them with [`HSet()`]({{< relref "/commands/hset" >}}), as long as
 you use the `doc:` prefix specified in the index definition.
 
-Use the `Embed()` method of `huggingfacetransformer`
+Use the `RunPipeline()` method of `FeatureExtractionPipeline`
 as shown below to create the embeddings that represent the `content` fields.
 This method takes an array of strings and outputs a corresponding
-array of `Embedding` objects.
-Use the `ToFloat32()` method of `Embedding` to produce the array of float
-values that we need, and use the `floatsToBytes()` function we defined
+array of `FeatureExtractionOutput` objects.
+The `Embeddings` field of `FeatureExtractionOutput` contains the array of float
+values that you need for the index. Use the `floatsToBytes()` function defined
 above to convert this array to a `byte` string.
 
 {{< clients-example set="home_query_vec" step="add_data" lang_filter="Go" >}}
@@ -153,7 +136,7 @@ text. Redis calculates the similarity between the query vector and each
 embedding vector in the index as it runs the query. It then ranks the
 results in order of this numeric similarity value.
 
-The code below creates the query embedding using `Embed()`, as with
+The code below creates the query embedding using `RunPipeline()`, as with
 the indexing, and passes it as a parameter when the query executes
 (see
 [Vector search]({{< relref "/develop/ai/search-and-query/query/vector-search" >}})
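To picture what "ranks the results in order of this numeric similarity value" means, here is a plain-Go illustration that scores candidate vectors against a query vector with squared Euclidean (L2) distance and sorts ascending. The metric is an assumption chosen for illustration only; the one Redis actually applies depends on the `DISTANCE_METRIC` declared in the index schema:

```go
package main

import (
	"fmt"
	"sort"
)

// l2Squared returns the squared Euclidean distance between two vectors.
// Illustration only: the metric Redis uses is set when the index is created.
func l2Squared(a, b []float32) float64 {
	var sum float64
	for i := range a {
		d := float64(a[i] - b[i])
		sum += d * d
	}
	return sum
}

func main() {
	query := []float32{1, 0}
	docs := map[string][]float32{
		"doc:0": {0.9, 0.1},
		"doc:1": {0.0, 1.0},
	}
	ids := make([]string, 0, len(docs))
	for id := range docs {
		ids = append(ids, id)
	}
	// Rank document IDs by ascending distance to the query vector,
	// mirroring how the query results are ordered.
	sort.Slice(ids, func(i, j int) bool {
		return l2Squared(query, docs[ids[i]]) < l2Squared(query, docs[ids[j]])
	})
	for _, id := range ids {
		fmt.Printf("ID: %s, Distance: %.4f\n", id, l2Squared(query, docs[id]))
	}
}
```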
@@ -163,14 +146,14 @@ for more information about using query parameters with embeddings).
 {{< /clients-example >}}
 
 The code is now ready to run, but note that it may take a while to complete when
-you run it for the first time (which happens because `huggingfacetransformer`
+you run it for the first time (which happens because `Hugot`
 must download the `all-MiniLM-L6-v2` model data before it can
 generate the embeddings). When you run the code, it outputs the following text:
 
 ```
-ID: doc:0, Distance:0.114169843495, Content:'That is a very happy person'
-ID: doc:1, Distance:0.610845327377, Content:'That is a happy dog'
-ID: doc:2, Distance:1.48624765873, Content:'Today is a sunny day'
+ID: doc:0, Distance:2.96992516518, Content:'That is a very happy person'
+ID: doc:1, Distance:17.3678302765, Content:'That is a happy dog'
+ID: doc:2, Distance:43.7771987915, Content:'Today is a sunny day'
 ```
 
 The results are ordered according to the value of the `vector_distance`
@@ -220,9 +203,9 @@ Apart from the `jdoc:` prefixes for the keys, the result from the JSON
 query is the same as for hash:
 
 ```
-ID: jdoc:0, Distance:0.114169843495, Content:'That is a very happy person'
-ID: jdoc:1, Distance:0.610845327377, Content:'That is a happy dog'
-ID: jdoc:2, Distance:1.48624765873, Content:'Today is a sunny day'
+ID: jdoc:0, Distance:2.96992516518, Content:'That is a very happy person'
+ID: jdoc:1, Distance:17.3678302765, Content:'That is a happy dog'
+ID: jdoc:2, Distance:43.7771987915, Content:'Today is a sunny day'
 ```
 
 ## Learn more

content/develop/clients/go/vecsets.md

Lines changed: 174 additions & 0 deletions
@@ -0,0 +1,174 @@
+---
+categories:
+- docs
+- develop
+- stack
+- oss
+- rs
+- rc
+- oss
+- kubernetes
+- clients
+description: Index and query embeddings with Redis vector sets
+linkTitle: Vector set embeddings
+title: Vector set embeddings
+weight: 35
+bannerText: Vector set is a new data type that is currently in preview and may be subject to change.
+bannerChildren: true
+---
+
+A Redis [vector set]({{< relref "/develop/data-types/vector-sets" >}}) lets
+you store a set of unique keys, each with its own associated vector.
+You can then retrieve keys from the set according to the similarity between
+their stored vectors and a query vector that you specify.
+
+You can use vector sets to store any type of numeric vector, but they are
+particularly optimized to work with text embedding vectors (see
+[Redis for AI]({{< relref "/develop/ai" >}}) to learn more about text
+embeddings). The example below shows how to use the
+[`Hugot`](https://github.com/knights-analytics/hugot)
+library to generate vector embeddings and then
+store and retrieve them using a vector set with `go-redis`.
+
+## Initialize
+
+Start by [installing]({{< relref "/develop/clients/go#install" >}}) `go-redis` if you haven't already done so. Note that you need `go-redis`
+[v9.10.0](https://github.com/redis/go-redis/releases/tag/v9.10.0)
+or later to use vector sets.
+
+Also, install `hugot`:
+
+```bash
+go get github.com/knights-analytics/hugot
+```
+
+In a new Go file, add the required imports:
+
+{{< clients-example set="home_vecsets" step="import" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+These include the `Hugot` library, which generates an embedding from a section of text.
+This example uses an instance of the `SentenceTransformer` model
+[`all-MiniLM-L6-v2`](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
+for the embeddings. This model generates vectors with 384 dimensions, regardless
+of the length of the input text, but note that the input is truncated to 256
+tokens (see
+[Word piece tokenization](https://huggingface.co/learn/nlp-course/en/chapter6/6)
+in the [Hugging Face](https://huggingface.co/) docs to learn more about the way tokens
+are related to the original text).
+
+{{< clients-example set="home_vecsets" step="model" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+## Create the data
+
+The example data is contained in a map with some brief
+descriptions of famous people:
+
+{{< clients-example set="home_vecsets" step="data" lang_filter="Go" >}}
+{{< /clients-example >}}
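The map itself lives in a clients-example snippet that isn't rendered in this diff. As a rough illustration of the shape described (a description plus the `born` and `died` values used later as attributes), something like the sketch below would work; the `person` struct, field names, and entries here are hypothetical, not copied from the example:

```go
package main

import "fmt"

// person is a hypothetical shape for the example data; the actual map in
// the docs example may be structured differently.
type person struct {
	Description string
	Born, Died  int
}

// Two sample entries; the real example uses several more people.
var peopleData = map[string]person{
	"Marie Curie": {
		Description: "Physicist and chemist who pioneered research on radioactivity.",
		Born:        1867,
		Died:        1934,
	},
	"Freddie Mercury": {
		Description: "Singer and songwriter, lead vocalist of the band Queen.",
		Born:        1946,
		Died:        1991,
	},
}

func main() {
	for name, p := range peopleData {
		fmt.Printf("%s (%d-%d): %s\n", name, p.Born, p.Died, p.Description)
	}
}
```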
+
+## Add the data to a vector set
+
+The next step is to connect to Redis and add the data to a new vector set.
+
+The code below iterates through the `peopleData` map and adds corresponding
+elements to a vector set called `famousPeople`.
+
+Use the
+`RunPipeline()` method of `embeddingPipeline` to generate the
+embedding as an array of `float32` values, then use a loop like the one
+shown below to convert the `float32` array to a `float64` array.
+You can then pass this array to the
+[`VAdd()`]({{< relref "/commands/vadd" >}}) command to set the embedding.
+
+The call to `VAdd()` also adds the `born` and `died` values from the
+original map as attribute data. You can access this data during a query
+or by using the [`VGetAttr()`]({{< relref "/commands/vgetattr" >}}) method.
+
+{{< clients-example set="home_vecsets" step="add_data" lang_filter="Go" >}}
+{{< /clients-example >}}
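The `float32`-to-`float64` conversion loop described above is straightforward. A minimal sketch (the helper name `toFloat64` is illustrative, not taken from the example):

```go
package main

import "fmt"

// toFloat64 converts a float32 embedding to the []float64 slice that the
// vector set command accepts. The helper name is illustrative only.
func toFloat64(fs []float32) []float64 {
	out := make([]float64, len(fs))
	for i, f := range fs {
		out[i] = float64(f)
	}
	return out
}

func main() {
	emb := []float32{0.5, -0.25, 1.0}
	fmt.Println(toFloat64(emb)) // prints [0.5 -0.25 1]
}
```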
+
+## Query the vector set
+
+You can now query the data in the set. The basic approach is to use the
+`RunPipeline()` method to generate another embedding vector for the query text.
+(This is the same method used to add the elements to the set.) Then, pass
+the query vector to [`VSim()`]({{< relref "/commands/vsim" >}}) to return elements
+of the set, ranked in order of similarity to the query.
+
+Start with a simple query for "actors":
+
+{{< clients-example set="home_vecsets" step="basic_query" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+This returns the following list of elements (formatted slightly for clarity):
+
+```
+'actors': Masako Natsume, Chaim Topol, Linus Pauling, Marie Fredriksson,
+Maryam Mirzakhani, Marie Curie, Freddie Mercury, Paul Erdos
+```
+
+The first two people in the list are the two actors, as expected, but none of the
+people from Linus Pauling onward was especially well-known for acting (and there certainly
+isn't any information about that in the short description text).
+As it stands, the search attempts to rank all the elements in the set, based
+on the information contained in the embedding model.
+You can use the `Count` parameter of `VSimWithArgs()` to limit the list of elements
+to just the most relevant few items:
+
+{{< clients-example set="home_vecsets" step="limited_query" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+The reason for using text embeddings rather than simple text search
+is that the embeddings represent semantic information. This allows a query
+to find elements with a similar meaning even if the text is
+different. For example, the word "entertainer" doesn't appear in any of the
+descriptions, but if you use it as a query, the actors and musicians are ranked
+highest in the results list:
+
+{{< clients-example set="home_vecsets" step="entertainer_query" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+Similarly, if you use "science" as a query, you get the following results:
+
+```
+'science': Marie Curie, Linus Pauling, Maryam Mirzakhani, Paul Erdos,
+Marie Fredriksson, Freddie Mercury, Masako Natsume, Chaim Topol
+```
+
+The scientists are ranked highest, but they are then followed by the
+mathematicians. This seems reasonable given the connection between mathematics
+and science.
+
+You can also use
+[filter expressions]({{< relref "/develop/data-types/vector-sets/filtered-search" >}})
+with `VSimWithArgs()` to restrict the search further. For example,
+repeat the "science" query, but this time limit the results to people
+who died before the year 2000:
+
+{{< clients-example set="home_vecsets" step="filtered_query" lang_filter="Go" >}}
+{{< /clients-example >}}
+
+Note that the boolean filter expression is applied to items in the list
+before the vector distance calculation is performed. Items that don't
+pass the filter test are removed from the results completely, rather
+than just reduced in rank. This can help to improve the performance of the
+search because there is no need to calculate the vector distance for
+elements that have already been filtered out of the search.
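The filter-then-rank order described above can be sketched in plain Go. This is an illustration of the concept only, not of Redis internals, and the `filterThenRank` helper is hypothetical:

```go
package main

import "fmt"

type element struct {
	Name string
	Died int
}

// filterThenRank drops elements that fail the predicate before any
// distance computation would run, mirroring how vector set filters
// avoid scoring excluded elements. (Concept illustration only.)
func filterThenRank(elems []element, keep func(element) bool) []element {
	var out []element
	for _, e := range elems {
		if keep(e) {
			out = append(out, e)
		}
	}
	// ...the vector distance calculation and ranking would happen here,
	// over the filtered slice only.
	return out
}

func main() {
	elems := []element{
		{"Marie Curie", 1934},
		{"Freddie Mercury", 1991},
		{"Maryam Mirzakhani", 2017},
	}
	// Keep only people who died before 2000, as in the filtered query above.
	survivors := filterThenRank(elems, func(e element) bool { return e.Died < 2000 })
	for _, e := range survivors {
		fmt.Println(e.Name)
	}
}
```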
+
+## More information
+
+See the [vector sets]({{< relref "/develop/data-types/vector-sets" >}})
+docs for more information and code examples. See the
+[Redis for AI]({{< relref "/develop/ai" >}}) section for more details
+about text embeddings and other AI techniques you can use with Redis.
+
+You may also be interested in
+[vector search]({{< relref "/develop/clients/go/vecsearch" >}}).
+This is a feature of the
+[Redis query engine]({{< relref "/develop/ai/search-and-query" >}})
+that lets you retrieve
+[JSON]({{< relref "/develop/data-types/json" >}}) and
+[hash]({{< relref "/develop/data-types/hashes" >}}) documents based on
+vector data stored in their fields.
