FgForrest / evitaDB

evitaDB is a specialized database with an easy-to-use API for e-commerce systems. It is a low-latency NoSQL in-memory engine that handles all the complex tasks that e-commerce systems have to deal with on a daily basis. evitaDB is expected to act as a fast secondary lookup/search index used by front stores.
https://evitadb.io
Other
55 stars 6 forks source link

GraphQL API breaking change detection #219

Open novoj opened 10 months ago

novoj commented 10 months ago

We have a backward compatibility problems with the team already working on Next.JS problem. I'd like to introduce some process that will warn us before switching to a new version that the GraphQL API schema has been changed by a new evitaDB version / compare schemas on demand when catalog has been reindexed and the publication process built different schema.

Initial idea:

The endpoint will compare the schemas and prints out differences in a human readable way (the raw block segments would be enough).

How it'll be used ...

Scenario 1.

I'll update the installation script so that the installation of the new version will look like this:

  1. I download and start new docker image of the evitaDB under different container name and start it up with the same data location in read only mode
  2. I call endpoints that can generate graphql / rest API schemas and store output to the file
  3. I wait for the start and then call verification endpoint on a new container and upload ther the zip with files exported in step 2
  4. if no change is detected I'll stop and delete the new container
  5. I'll continue with the replacing the old version with the new one

When changes are detected, they'll be printed to the console (or a file) and the new container is deleted, but old is not replaced. The frontend team is safe - still works with the previous version.

Scenario 2.

I'll install a new version of the EdeeCMS server which acts as an evitaDB client and redefines the schemas according to a Java logic. When full reindexation occurs the new contents are indexed into the new catalog and at the end the current catalog is replaced with the newly built one. But before this happens I'll call the verification endpoint to compare previous and new APIs (both catalogs still exists) and the difference will be printed to the indexing job log (if no change occurs the job will result with OK status, if change occurs the job will result with WARNING status).

This will be handy in production as well.

novoj commented 10 months ago

During the discussion with @lukashornych we found out the graphql-java has already basic tools in place:

If we invest some time into the investigation the compatibility check for GraphQL might be doable. If it works and brings a benefit we might provide the same functionality for REST API as well, but it's not on the table for now.

novoj commented 10 months ago

Similar tool exists also for Open API: https://github.com/OpenAPITools/openapi-diff

novoj commented 8 months ago

GraphQL schema in text form will be provided on GET method for url: https://server:port/gql/catalogName/ (and the same for the GraphQL schema of evita schema and system endpoint)

Open API schema in text form will be provided on GET method for url: https://server:port/rest/catalogName/ (and the same for the REST schema of evita schema and system endpoint) ... this is already done.

novoj commented 8 months ago

New repository created: https://github.com/FgForrest/evitaDB-API-DiffDive

This tool should

The tool will be distributed as a JAR file via MavenCentral and as a Docker image via DockerHub using evitaDB credentials and sharing the same package.

The tool will be installed on evitadb.io - for example as: https://diffdive.evitadb.io as a docker container. This URL will be used by evitaLab to visualise the report for two schemas listed in it. The tool will also be used in the CI/CD pipeline job in the future (not part of this task).

novoj commented 8 months ago

This is a good example of checking the GraphQL backward compatibility. We can find some inspiration there @lukashornych: https://www.apollographql.com/docs/graphos/delivery/schema-checks/#operation-checks

lukashornych commented 7 months ago

@novoj the GraphQL schema pretty printing endpoint is released.

The DSL schema can be fetched by simply making GET HTTP request to URL where the GraphQL instance is running (the POST method is reserved for queries and mutations). Although this is not how GET method should be used on GraphQL instance, we currently don't want to support GET request for queries, so we used it for DSL schema like we do it with REST API were GET request on root URL of certain catalog API returns OpenAPI specification.

So the DSL schema can be fetched from:

lukashornych commented 4 months ago

So after some research of already available tools for diffing of GQL schemas and OpenAPI schemas, I would split the implementation into 2 sections: CLI checking for CI/CD, and evitaLab visualization for developers to AD-HOC check.

CLI

For CLI (CI/CD) env. there are already awesome tools that can be run in Docker, so I wouldn't invent anything ours.

GraphQL

Example output

Detected the following changes (2) between schemas:

[log] ✖  Field allLocales was removed from object type AdjustedPricePolicy
[log] ✔  Field allLocaless was added to object type AdjustedPricePolicy
[error] Detected 1 breaking change

OpenAPI

Example output

==========================================================================
==                            API CHANGE LOG                            ==
==========================================================================
                    Web services for catalog `evita`.                     
--------------------------------------------------------------------------
--                            What's Changed                            --
--------------------------------------------------------------------------
- GET    /Category/get
  Parameter:
    - Delete primaryKey in query
--------------------------------------------------------------------------
--                                Result                                --
--------------------------------------------------------------------------
                 API changes broke backward compatibility                 
--------------------------------------------------------------------------

evitaLab visualization

Here it's not so good. The already mentioned https://github.com/kamilkisiela/graphql-inspector can be used easily in browser JS, unfortunately for OpenAPI there is not such library. There is only https://www.npmjs.com/package/openapi-diff which works only in Node.js.

So I suggest that we create only a custom API in lab API for the OpenAPI diffing using the https://github.com/OpenAPITools/openapi-diff. The GraphQL diffing could be also done in the API or we could just use the JS library. Maybe I would try to create uniform lab API for both GQL and OA that would return JSON array with changes so that I don't need to create two separate UIs in evitaLab GUI.

Would do you think? @novoj

novoj commented 4 months ago

I completely agree with your conclusions. Is there any CL tool that would also protect incompatible changes in gRPC API for evitaDB API evolution?!

lukashornych commented 4 months ago

There are some CLI tools for comparing protobuf files (protobuf-format-diff or buf). But it could be difficult to obtain the proto files for comparison from running evitaDB instances. There is Java lib for introspection:

<dependency>
    <groupId>io.grpc</groupId>
    <artifactId>grpc-services</artifactId>
    <version>${grpc.version}</version>
</dependency>

with ProtoReflectionService.newInstance() service. But this service returns some kind of JSON from which we would have to reconstruct the proto files. Unfortunately, I couldn't find any existing tool for this.

Or maybe create our own service that would return the actual proto file from classpath? But there could still be problem with SSL certificate for mTLS I would guess.

novoj commented 4 months ago

I think, we just need something for our internal evitaDB CI/CD pipeline to avoid non-intentional backward incompatible changes in gRPC API. We probably don't need a tool for evitaLab / or a tooling that would our users use.

lukashornych commented 4 months ago

In such case, I think we could come up with some custom GH Action to retrieve proto files from previous release and compare it against the new ones with the mention tools.

novoj commented 4 months ago

This would be great - could we do that as a part of this issue?

lukashornych commented 4 months ago

I'll try to create some prototype then.

novoj commented 3 days ago

Can you @lukashornych summarize what is done and what needs to be done yet? We have to interrupt work on this issue due lack of time, so that when we return back, we have clear instructions where to continue.

lukashornych commented 3 days ago

@novoj I believe the lab API for comparing GQL and OpenAPI schemas is somewhat functional, it just needs some tidying. The CLI tools need to implemented.

Regarding evitaLab GUI, there is rough prototype for rendering GQL changes, but it is missing paging, i18n, file upload functionaility, todo tools, and other small things. GUI for OpenAPI schema is non-existent, but it will hopefully share most of the code with the GQL changes viewer. Also, it will need to be ported the the new internal API of evitaLab after refactoring, but that shouldn't be too time consuming.

The code for the API is in branch 219-api-compatibility-api (probably, broken, had to merge it onto a new dev). The code for evitaLab is in branch evitaLab#117-api-compatibility.

The question is, do we keep it in the custom lab REST API or do we either merge it into the gRPC driver (doesn't seem right) or do we create extension to the main gRPC API i.e. lab gRPC API instead of the lab REST API now that the evitaLab will work with the gRPC-web?