Skip to content

Commit

Permalink
Indexing language improvements and "Add Index" endpoint
Browse files Browse the repository at this point in the history
The spec has the concept of encrypted indexing which essentially is the use of blinded attributes/properties/tags in Encrypted Documents stored within a vault. Some of the language used in the spec on this subject is somewhat confusing. For instance, the term "index" is used in some places to refer to what are really just key-value attribute pairs. In database/storage terminology, in general an "index" refers to the underlying mechanism by which a database speeds up data queries. It is usually optional - queries can still be done without them.

I've updated some of the terminology to be more consistent and hopefully easier to follow by using the term "attribute" when referring to the key-value pairs that may exist in Encrypted Documents and the term "index" to refer to the mechanism by which querying can be sped up.

The current spec does not state how/when indexes are created. Usually with databases, indexes are manually created based on the user's/admin's needs. Along with the language changes, there's a new "index" endpoint that can be used for adding indexes based on attribute keys. For now, I've only specified how to add indexes, but in the future the endpoint could be used to remove/modify existing indexes.
  • Loading branch information
Derek Trider committed Apr 26, 2022
1 parent ed66c93 commit 8c67acf
Showing 1 changed file with 56 additions and 26 deletions.
82 changes: 56 additions & 26 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -2420,27 +2420,25 @@ <h4>

<section class="normative">
<h3>
Creating Encrypted Indexes
Blinded Document Attributes
</h3>

<p>
It is often useful to search a data vault for structured documents that contain
specific metadata. Efficient searching requires the use of search indexes
and local access to data. This poses an interesting challenge as the
specific metadata. This poses an interesting challenge as the
search has to be performed on the <a>storage provider</a> without leaking
information that could violate the privacy of the entities that are storing
information in the data vault. This section details how encrypted indexes
can be created and used to perform efficient searching while protecting
information in the data vault. This section details how blinded attributes
can be created and used to perform searches while protecting
the privacy of entities that are storing information in the data vault.
</p>

<p>
When creating an <a>EncryptedDocument</a>, blinded index properties MAY
be used to perform efficient searches. An example of the use of these
properties is shown below:
When creating an <a>EncryptedDocument</a>, blinded attributes MAY
be used to perform searches. An example of the use of these properties is shown below:
</p>

<pre class="example highlight" title="Example encrypted document with encrypted indexes">
<pre class="example highlight" title="Example encrypted document with blinded attributes">
{
"id": "urn:uuid:698f3fb6-592f-4d22-9e04-462cc4606a23",
"sequence": 0,
Expand Down Expand Up @@ -2479,15 +2477,15 @@ <h3>
</pre>

<p class="note">
The example above demonstrates the use of unique index values as well as
non-unique indexes.
The example above demonstrates the use of unique attributes as well as
non-unique attributes.
</p>

<p>
The example above enables the <a>storage provider</a> to build
efficient indexes on encrypted properties while enabling
efficient indexes on blinded properties while enabling
<a>storage agents</a> to search the information without leaking information
that would create privacy concerns.
that would create privacy concerns. See <a href="#blinded-document-attributes"></a>
</p>

<div class="issue">
Expand Down Expand Up @@ -2611,7 +2609,7 @@ <h3>
the combination of the blinded entries for "content.type" and "content.country",
and finally, the combination of the blinded entries for "content.type",
"content.country", and "country.region". The combinations are built by HMACing
the concatenated the blinded attribute names using a colon (`:`) and HMACing the
the concatenated the blinded attribute keys using a colon (`:`) and HMACing the
concatenated blinded attribute values using a colon (`:`). Note that a colon
(`:`) was selected because it is not a character in the base64url alphabet. In
pseudo code, blinded compound indexes entries look like:
Expand Down Expand Up @@ -2676,16 +2674,16 @@ <h3>
</p>

<p>
The contents of a data vault can be searched using encrypted indexes created
using the processes described in <a href="#creating-encrypted-indexes"></a>.
The contents of a data vault can be searched using blinded document attributes created
using the processes described in <a href="#blinded-document-attributes"></a>.
There are two primary ways of searching for encrypted documents. The first
is to search for a specific value associated with a specific index. The
second is to search to see if a specific index exists on a document.
is to search for a specific attribute value associated with a specific attribute key. The
second is to search to see if a specific attribute key exists on a document.
</p>


<p>
When doing a search for a specific value associated with a specific index, the syntax for the <code>equals</code> filter is as follows:
When doing a search for a specific attribute value associated with a specific attribute key, the syntax for the <code>equals</code> filter is as follows:
<code>equals</code> is an array of one or more subfilters.
Each subfilter consists of one or more key-value attribute pairs.
For a document to be matched, it MUST match at least one of those subfilters.
Expand Down Expand Up @@ -2914,11 +2912,11 @@ <h3>
</pre>

<p>
The contents of a data vault can also be searched to see if a certain attribute
name is indexed by using the <code>has</code> keyword.
The contents of a data vault can also be searched to see which documents have a certain attribute
key (regardless of attribute value) by using the <code>has</code> keyword.
</p>

<pre class="example highlight" title="data vault query for a particular attribute name">
<pre class="example highlight" title="data vault query for a particular attribute key">
POST https://example.com/edvs/z4sRgBJJLnYy/query HTTP/1.1
Host: example.com
Content-Type: application/json
Expand All @@ -2933,10 +2931,10 @@ <h3>

<p>
If <code>returnFullDocuments</code> was set to false, a successful query will result in a standard HTTP 200 response with a list
of <a>EncryptedDocument</a> identifiers that contain the value:
of <a>EncryptedDocument</a> identifiers that match the query:
</p>

<pre class="example highlight" title="Successful query response containing only document identifiers for a specific attribute name">
<pre class="example highlight" title="Successful query response containing only document identifiers for a specific attribute key">
HTTP/1.1 200 OK
Cache-Control: no-cache, no-store, must-revalidate
Date: Fri, 14 Jun 2019 18:45:18 GMT
Expand All @@ -2947,10 +2945,10 @@ <h3>

<p>
If <code>returnFullDocuments</code> was set to true, a successful query will result in a standard HTTP 200 response with a list
of <a>EncryptedDocuments</a> that contain the value:
of <a>EncryptedDocuments</a> that match the query:
</p>

<pre class="example highlight" title="Successful query response containing full documents for a specific attribute name">
<pre class="example highlight" title="Successful query response containing full documents for a specific attribute key">
HTTP/1.1 200 OK
Cache-Control: no-cache, no-store, must-revalidate
Date: Fri, 14 Jun 2019 18:45:18 GMT
Expand Down Expand Up @@ -3001,6 +2999,38 @@ <h3>

</section>

<section class="normative">
<h3>
Adding an index
</h3>

<p>
Encrypted Documents can have embedded attribute key-value pairs, as seen in <a href="#blinded-document-attributes"></a>.
To speed up searches in vaults containing many Encrypted Documents, indexes can be created based on attribute keys.
</p>


<p>
The example below shows how to create indexes in a vault.
</p>

<pre class="example highlight" title="add indexes to data vault">
POST https://example.com/edvs/z4sRgBJJLnYy/index HTTP/1.1
Host: example.com
Content-Type: application/json
Accept: application/json, text/plain, */*
Accept-Encoding: gzip, deflate

{
"operation": "add",
"attributeKeys":["DUQaxPtSLtd8L3WBAIkJ4DiVJeqoF6bdnhR7lSaPloZ", "AarngVIZLl0kIp2xEHUH5o5uVc-470roQaOIbqMUD7DFQQypWQ=="]
}

</pre>

</section>


</section>

<section class="informative">
Expand Down

0 comments on commit 8c67acf

Please sign in to comment.