api: stricter field validation

jmorganca commented 2 years ago

Quite a few API fields don't trim or validate against leading & trailing whitespace:

name in all resources
url, clientID and clientSecret in /v1/providers

This results in api objects being created with fields with leading whitespace. We should trim whitespace for these fields in our API (or at least in the client)

dnephin commented 2 years ago

I've looked into this a bit. Today we use go-playground/validator for api request validation. It uses struct tags to define the validation. This works well enough in some cases, but doesn't work well for applying a consistent set of validators to similar fields (ex: we want the same validation for all name fields).

The options it provides are:

RegisterAlias - not viable because the errors are not detailed enough (ex: Error:Field validation for 'Name' failed on the 'name' tag). It doesn't say which of the validations was a problem. There is no way to customize the message to say why the validation failed (ex: too long, bad characters)
RegisterValidation - same error message as above, no way to customize the message
RegisterStructValidation - allows for custom validation and custom errors, but requires registering every request struct on the static validator. The validator would have to inspect and validate every field itself, and we have no way to reading the validation from our open api document generation.
RegisterCustomTypeFunc - is interesting. It allows us to register a function that will be used to translate types into other types. It seems like if this function returns an error, that error will also be used as a validation error. This would require us to use a bunch of custom types (Ex: type Name string) as the api request fields.This could also work, but has the same problem as RegisterStructValidation. There's no way to interpret the validation in our openapi doc generation. We could make assumptions about the type and ignore the validation itself, but this seems like a lot of hoops to jump through, and also relatively error prone because the openapi docs generation has to hard code assumptions about the type separate from where the validation is implemented.

It doesn't seem like there is a good way to do this with our existing library.

dnephin commented 2 years ago

I believe our requirements here are:

we need the validation to be declarative, so that we can inspect the validation from openapi doc generation (it doesn't necessarily need to be struct tags though)
we need the error messages produced by validation to be informative, so the user can correct the problem. Ideally the error is also structured, so that we can identify the problems with each field separately
we should be able to group validation rules together in some way, so that we can apply the same validation to similar fields in different structs, without duplicating the list

dnephin commented 2 years ago

One option (suggested by @ssoroka ) is go-ozzo/ozzo-validation/v4.

That library appears to produce nice errors, and would allow us to group validation together in a few ways. Either by implementing a Validate method, or by putting the rules into a function.

The only challenge I can see with this library would be inspecting the validation rules in the openapi doc generation. The validation is declarative, but the fields on the rules are not exported. We can pass the list of validation.FieldRules to the openapi doc generation, and it can match the rules to the fields on the struct, but we need some way of translating a Rule into the openapi Schema objects.

There hasn't been any updates to the library in over 2 years, so I'm not sure if it's still actively maintained, or willing to accept contributions. If we are ok with forking it, we would have 2 options:

Export the fields on all the rules so that we can inspect them
Add a method to the library that accepts the rules, builds a representation of the validation requirements, and returns that representation.

dnephin commented 2 years ago

I really like the idea of not using struct tags for this validation. There's nothing particularly wrong with using struct tags, but it does force us to use reflection to receive them. Using struct tags for something that already uses reflection (encoding/json or mapstructure) is great because reflection is necessary anyway. But validation doesn't require reflection, so we could avoid it.

Avoiding reflection likely makes the code more obvious (easier to trace, doesn't require any understanding of reflection), and also more performant.

Today we use the validate struct tag in 104 places. Using git grep -E -oh 'validate:"[^"]*"' | sort | uniq to track down all the uses, it seems like:

mostly we use the required validation
there are a few uses of required_without, and excluded_with, and dive
there is one instance of min=8 and email

I wonder how much value we really get from using a validation library. Writing a validator for these cases is pretty easy, and we may even be able to avoid reflection by using generics. The libraries do give us more advanced validators like email, but I'm not sure how much we should trust a random library to validate those properly. Likely we'll want to audit those sufficiently that copying them into our repo isn't much more work.

I think we have a couple options for approaches we could take. Both could leverage a library like ozzo-validation, without relying on it as our primary interface.

Option 1 would be to continue to use the struct tags we have, but replace go-playground/validator with a small library that uses mitchellh/reflectwalk to read all the tags. We already use reflectwalk in cliopts. The validation itself could still be done with a library, if we wanted to.

Option 2, which isn't necessarily mutually exclusive, would be to define our own interface for validation. Something like this:

type Validator interface {
    Validate() error
    Describe(schema *openapi3.Schema) error
}

This interface should make it easy to keep the validation rules next to the logic that translates those rules into the openapi schema objects. Each request struct would implement this interface. The implementation could rely on the struct tags, or it could use ozzo-validation. The important part is that we define the interface, and that it keeps the validation and description of the validation together.

Each request type shouldn't need to implement much. Using a library like ozzo-validation all they would need to do is define the list of rules. Pass that list of rules to Validate(rules) to implement Validate, and pass it to another function to implement Describe.

ssoroka commented 2 years ago

mostly our lack of use of validation today is because it hides in tags and it's not obvious how to use tags for validation (eg What functions are even supported?). I'm not a fan of tag-based validation either.

reflectwalk looks interesting, but it's not obvious how to use it from the tests, and there are no docs. looking at cliopts, I don't find that code easy to read or understand.

What I've done in the past is have the validation package build the rules per field up into an internal structure that you could ask for after running the validation. This is nice in that the developer only needs to write the validation and not worry about how that translates into openapi, and the translation code can be written once in a single place to translate those rules into openapi. This sounds a bit like your option to expose the rules from the ozzo package, so I'd be interested in investigating that.

dnephin commented 2 years ago

What I've done in the past is have the validation package build the rules per field up into an internal structure that you could ask for after running the validation.

Ya, this sounds like what I was thinking for option 2. This would be a nice approach.

I realized the one nice property of using reflection is that we can look up the field name for errors without having to repeat it. I'm not sure if the cost of reflection is worth it for just looking up the field name, but something to consider.

ssoroka commented 2 years ago

Hmm. Ideally the validation solution should know the real field name specifically for the errors.

infrahq / infra

api: stricter field validation #1763