project-zot / zot

zot - A scale-out production-ready vendor-neutral OCI-native container image/artifact registry (purely based on OCI Distribution Specification)
https://zotregistry.dev
Apache License 2.0
902 stars 94 forks source link

[Feat]: Scale-out cluster support for independent per-instance storage deployments #2434

Open vrajashkr opened 4 months ago

vrajashkr commented 4 months ago

Is your feature request related to a problem? Please describe.

The current implementation of scale-out cluster is supported on shared-storage deployments - many instances and one reliable shared storage such as S3.

Currently, some REST APIs will not work for deployments where the image storage for each instance is isolated (for example, one zot instance per VM with independent disk storage).

Describe the solution you'd like

In a scale-out cluster deployment using isolated per-instance storage, APIs such as /v2/_catalog as well as GraphQL endpoints will need to fan-out requests to all the members and aggregate the received data before returning results.

Describe alternatives you've considered

N/A

Additional context

No response

vrajashkr commented 4 months ago

/v2/_catalog is easier to handle as the logic is entirely implemented using data structs. For this, we can simply detect a scale out cluster in non-shared storage mode and then fan-out + aggregate.

However, the GQL APIs are handled via a GQL server handling auto generated GQL - the implementation of which cannot be changed. Additionally, there is a need to support pagination.

extRouter.Methods(allowedMethods...).
        Handler(gqlHandler.NewDefaultServer(gql_generated.NewExecutableSchema(resConfig)))

One approach might be to look into the Resolver code (resolver.go) and modify the data read to include proxied results before the response is taken for pagination.

For example:

func (cveinfo BaseCveInfo) GetImageListForCVE(ctx context.Context, repo, cveID string) ([]cvemodel.TagInfo, error)

This function must ask all other nodes for images having this CVE before returning the TagInfo objects.

That logic could be introduced as a wrapper on top of these functions that will do the following:

  1. Call the function locally
  2. Call the function for all other members
  3. Return the result to client

To facilitate this, there needs to be a new set of API endpoints that return this data without pagination. i.e the same GQL query except without any pagination.

Since the request data does not flow to the resolver, there would be a need to bring that logic to the resolver so GetImageListForCVE knows to call GetImageListForCVE on all other hosts via the API. Each zot instance would potentially become a GQL client as well.

rchincha commented 4 months ago

@vrajashkr do you want to start with paginated result (aggregate) of paginated results (fanout)? The second part is what needs to redesigned? Maybe we simply add a pagination=off param.

rchincha commented 4 months ago

Separately, keep an eye on sync as a means to "heal" after a cluster resize.

vrajashkr commented 4 months ago

There are 2 major items to handle:

/v2/_catalog

The approach for this would be fairly straightforward as there is no pagination for this. zot-local-scale-out-catalog

/v2/_zot/ext/search

Here's a diagram representing the approach we discussed: zot-local-scale-out-search-gql

Challenges with this approach:

vrajashkr commented 4 months ago

@andaaron @rchincha It would be great to get your thoughts on the approaches detailed above! Thanks!

vrajashkr commented 3 months ago

GQL Query behaviours: reference: pkg/extensions/search/schema.graphql

Proxy-once to target member or handle locally

CVEListForImage ImageList - only if there is a repo specified in the arguments ExpandedRepoInfo GlobalSearch - when searching for a tag inside a repo Image Referrers

Fan-out proxy to all other members + local

ImageListForCVE ImageListWithCVEFixed ImageListForDigest RepoListWithNewestImage ImageList - when the repoName is sent as "" GlobalSearch - when searching for repos DerivedImageList BaseImageList

Depending on meta storage strategy

If the metadata DB is shared, then they can be processed locally, but if the metadata DB is not shared, this request needs to be fanned out: StarredRepos BookmarkedRepos

Handler logic change to dynamically query other zot instances

CVEDiffListForImages - the data for each image (subtrahend and minuend) are both available from their respective single zot instances. Depending on where the request lands, there may be 2 requests made to other zot instances (if both repos are not on the current instance), one request (if either of the images are not available locally), or no requests (if both images are available locally). Note: this requires a change in the handler itself as there is no logical way to proxy the incoming request to get the required data.