catalogd `metas` https endpoint proposal #1749

grokspawn · 2025-01-31T22:25:09Z

No description provided.

openshift-ci · 2025-01-31T22:25:13Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

openshift-ci · 2025-01-31T22:25:21Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign mandre for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Signed-off-by: Jordan Keister <[email protected]>

joelanford · 2025-02-03T19:16:31Z

enhancements/olm/catalogd-query-endpoint.md

+redhat-marketplace-index:  9MB
+-->
+
+Requiring clients to retrieve the full catalog can also result in a 4-10 second delay _per catalog_ even under optimal network conditions. 


On the surface, 4-10s seems like a suspiciously long time to process 21MB of data. Are we talking about in-cluster clients here or clients outside the cluster that may have slower connections?

Also, does this include client processing of the data, or simply clients downloading raw bytes as quickly as possible?

In that timeframe the POC performs:

- download of the full FBC
- decomposition of JSONL objects to JSON objects (which I've been told is a pain)
- loading essential objects to the content delivery framework to fulfill catalog package listing
- rendering the catalog listing in the web UI

I'd have to get a detailed breakdown from @TheRealJon to be more specific. For now, I've backed off the language so it doesn't read as a specific condemnation of download speeds.

joelanford · 2025-02-03T19:18:49Z

enhancements/olm/catalogd-query-endpoint.md

+
+## Proposal
+
+This proposal introduces an additional HTTPS endpoint to an existing catalogd API.  The existing HTTPS "all" endpoint will remain as a default option; the user will be able to enable this new capability via a feature gate.


Should we talk about how we plan to deprecate "all" once the new endpoint is GA?

joelanford · 2025-02-03T19:26:19Z

enhancements/olm/catalogd-query-endpoint.md

+}
+```
+Query parameters will be logically ANDed and used to restrict response scope.   
+This API will be conditionally enabled by an upstream `APIV1QueryEndpoint` feature gate as part of a downstream `NewOLM{suffix}` style OCP TP feature gate, and will be disabled by default.


Suggested change

-This API will be conditionally enabled by an upstream `APIV1QueryEndpoint` feature gate as part of a downstream `NewOLM{suffix}` style OCP TP feature gate, and will be disabled by default.
+This API will be conditionally enabled by an upstream `APIV1QueryEndpoint` feature gate as part of a downstream `NewOLM{suffix}` style OCP feature gate.

We can talk more about TP vs GA in the graduation criteria section.

joelanford · 2025-02-03T19:54:03Z

enhancements/olm/catalogd-query-endpoint.md

+
+This option would require clients to query the entirety of the data (~21 MB for operatorhubio catalog) and parse the response to retrieve relevant information every time the client needs the data. Even if clients’ implement some form of caching, the first query the client does to catalogd server is still the dealbreaker. In a highly resource constrained environment (e.g. clusters in Edge devices), this basically translates to a chokepoint for the clients to get started.
+
+- A “path hierarchy” based construction of API endpoints to expose filtered FBC metadata


If we are worried about the fact that query endpoint responses will almost always be incomplete, a middle ground might be an endpoint that returns all of the FBC metadata for a specific package, but I'm not sure that endpoint would provide the necessary latency requirements we're shooting for.

I think this comes down to whether we require the new endpoint to always provide valid FBC.
When we start revising FBC schemas I think we're going to have to juggle this.
For example, if we revise olm.package.v2 which uses its package field self-referentially, then we also get package-scoped valid FBC without a change to this endpoint.
But how does a client request this? Does it have to request the v2 schema specifically?

Signed-off-by: Jordan Keister <[email protected]>

openshift-ci · 2025-02-11T21:03:42Z

@grokspawn: all tests passed!

joelanford · 2025-02-12T19:25:28Z