golang / go

The Go programming language
https://go.dev
BSD 3-Clause "New" or "Revised" License
119.84k stars 17.21k forks source link

proposal: Go 2: enums as an extension to types #28987

Open deanveloper opened 5 years ago

deanveloper commented 5 years ago

Yet another enum proposal

Related: #19814, #28438

First of all, what is the issue with const? Why can't we use that instead?

Well first of all, iota of course only works with anything that works with an untyped integer. Also, the namespace for the constants are at the package level, meaning that if your package provides multiple utilities, there is no distinction between them other than their type, which may not be immediately obvious.

For instance if I had my own mat (material) package, I'd want to define mat.Metal, mat.Plastic, and mat.Wood. Then maybe classify my materials as mat.Soft, mat.Neutral, and mat.Hard. Currently, all of these would be in the same namespace. What would be good is to have something along the lines of mat.Material.Metal, mat.Material.Plastic, mat.Material.Wood, and then mat.Hardness.Soft, mat.Hardness.Neutral, and mat.Hardness.Hard.

Another issue with using constants is that they may have a lot of runtime issues. Consider the following:

var ErrInvalidWeekday = errors.New("invalid weekday")

type Weekday byte

const (
    Sunday Weekday = iota
    Monday
    Tuesday
    // ...
)
func (f Weekday) Valid() bool {
    return f <= Saturday
}

func (d Weekday) Tomorrow() Weekday {
    if !d.Valid() {
        panic(ErrInvalidWeekday)
    }

    if d == Sunday {
        return Saturday
    }
    return d + 1
}

Not only is there a lot of boilerplate code where we define the "enum", but there is also a lot of boilerplate whenever we use the "enum", not to mention that it means that we need to do runtime error checking, as there are bitflags that are not valid.

I thought to myself. What even are enums? Let's take a look at some other languages:

C

typedef enum week{Sun,Mon,Tue,Wed,Thu,Fri,Sat} Weekday;

Weekday day = Sun;

This ends up being similar to Go's iota. But it suffers the same pitfalls that we have with iota, of course. But since it has a dedicated type, there is some compile-time checking to make sure that you don't mess up too easily. I had assumed there was compile-time checking to make sure that things like Weekday day = 20 were at least compile-time warnings, but at least with gcc -Wextra -Wall there are no warnings for it.

C++

This section was added in an edit, originally C and C++ were grouped together, but C++11 has added enum class and enum struct which are very similar to Java's (next section). They do have compile-time checking to make sure that you don't compare two different types, or do something like Weekday day = 20. Weeday day = static_cast<Weekday>(20) still works, however. We should not allow something like this. https://github.com/golang/go/issues/28987#issuecomment-445296770

Syntax:

enum class Weekday { sun, mon, tues, ... };

Weekday day = Weekday::sun;
Weekday day2 = static_cast<Weekday>(2); // tuesday

Java

An enum is a kind of class. This class has several static members, named after the enum values you define. The type of the enum value is of the class itself, so each enum value is an object.

enum Weekday {
    SUNDAY(), // parentheses optional, if we define a constructor, we can add arguments here
    MONDAY,
    TUESDAY,
    // ...
    SATURDAY;

    // define methods here

    public String toString() {
        // ...
    }
}

I personally like this implementation, although I would appreciate if the objects were immutable.

The good thing about this implementation is that you are able to define methods on your enum types, which can be extremely useful. We can do this in Go today, but with Go you need to validate the value at runtime which adds quite a bit of boilerplate and a small efficiency cost. This is not a problem in Java because there are no possible enum values other than the ones you define.

Kotlin

Kotlin, being heavily inspired by Java, has the same implementation. They are even more clearly objects, as they are called enum class instead of simply enum.

Swift

Proposal #28438 was inspired by these. I personally don't think they're a good fit for Go, but it's a different one, so let's take a look:

enum Weekday {
    case Sunday
    case Monday
    case Tuesday
    // ...
}

The idea becomes more powerful, as you can define "case functions" (syntax is case SomeCase(args...), which allow something like EnumType.number(5) being separate from EnumType.number(6). I personally think it is more fitting to just use a function instead, although it does seem like a powerful idea.

I barely have any Swift experience though, so I don't know the advantages of a lot of the features that come with Swift's implementation.

JavaScript

const Weekday = Object.freeze({
    Sunday:  Symbol("Sunday"),
    Monday:  Symbol("Monday"),
    Tuesday: Symbol("Tuesday"),
    // ...
});

This is probably the best you can do in JavaScript without a static type system. I find this to be a good implementation for JavaScript, though. It also allows the values to actually have behavior.

Okay, so enough with other languages. What about Go?

We need to ask ourselves, what would we want out of enums?

  1. Named, Immutable values.
  2. Compile-time validation. (We don't want to have to manually check at runtime to see if enum values are valid)
  3. A consise way to define the values (the only thing that iota really provides)

And what is an enum? The way that I have always seen it, enums are an exhaustive list of immutable values for a given type.

Proposal

Enums limit what values a type can hold. So really, enums are just an extension on what a type can do. "Extension" perhaps isn't the right word, but the syntax should hopefully make my point.

The enum syntax should reflect this. The proposed syntax would be type TypeName <base type> enum { <values> }

package mat // import "github.com/user/mat"

// iota can be used in enums
type Hardness int enum {
    Soft = iota
    Neutral
    Hard
}

// Enums should be able to be objects similar to Java, but
// they should be required to be immutable. A readonly types
// proposal may help this out. Until then, it may be good just to either
// have it as a special case that enum values' fields cannot be edited,
// or have a `go vet` warning if you try to assign to an enum value's field.
type Material struct {
    Name string
    Strength Hardness
} enum {
    Metal = Material{Name: "Metal", Strength: values(Hardness).Hard } // these would greatly benefit from issue #12854
    Plastic = Material{Name: "Plastic", Strength: values(Hardness).Neutral }
    Foam = Material{Name: "Foam", Strength: values(Hardness).Soft }
}

// We can define functions on `Material` like we can on any type.

// Strong returns true if this is a strong material
func (m Material) Strong() bool {
    return m.Strength >= Hardness.Neutral
}

The following would be true with enums:

Syntax ideas for reading syntax values:

  1. Type.Name
    • It's a common syntax people are familiar with, but it makes Type look like a value.
  2. Type#Name, Type@Name, etc
    • Something like these would make the distinction that Type is not a value, but it doesn't feel familiar or intuitive.
  3. Type().Name
    • This one doesn't make too much sense to me but it popped in my head.
  4. values(Type).Name, enum(Type).Name, etc
    • values would be a builtin function that takes a type, and returns its enumeration values as a struct value. Passing a type that has no enum part would of trivially return struct{}{}. It seems extremely verbose though. It would also clash as values is a pretty common name. Many go vet errors may result from this name. A different name such as enum may be good.

I personally believe values(Type).Name (or something similar) is the best option, although I can see Type.Name being used because of it's familiarity.

I would like more critique on the enum definitions rather than reading the values, as that is mainly what the proposal mainly focuses on. Reading values from an enum is trivial once you have a syntax, so it doesn't really shouldn't need to be critiqued too much. What needs to be critiqued is what the goal of an enum is, how well this solution accomplishes that goal, and if the solution is feasible.

Points of discussion

There has been some discussion in the comments about how we can improve the design, mainly the syntax. I'll take the highlights and put them here. If new things come up and I forget to add them, please remind me.

Value list for the enum should use parentheses instead of curly braces, to match var/const declaration syntax.

ianlancetaylor commented 5 years ago

Is there a section missing after "Let's take a look at some other languages"?

ianlancetaylor commented 5 years ago

A commonly requested feature for enums is a way to iterate through the valid enum values. That doesn't seem to be supported here.

You discuss converting an integer value to Hardness, but is it also permitted to convert a value to Material? If not, what is the essential difference?

I'm not sure but I suspect that this syntax is going to introduce parsing ambiguities. There are many places where a type can appear. You also have to consider cases like

for _, x := range []struct { f int } enum { A = /* what do I write here? */
deanveloper commented 5 years ago

Is there a section missing after "Let's take a look at some other languages"?

Yes, that was my bad. I've updated to include other languages.

A commonly requested feature for enums is a way to iterate through the valid enum values. That doesn't seem to be supported here.

Oops... That's my bad. Either way, once there is a syntax for it, it should be easily supported. In your upcoming example I will use values(Type).slice, which evaluates to a []Type.

I'm not sure but I suspect that this syntax is going to introduce parsing ambiguities. There are many places where a type can appear. You also have to consider cases like...

That's true. It's confusing if []struct { f int } enum { ... } is an enum of []struct { f int } or a slice of struct { f int } enum { ... }, making the contents of the enum very confusing. This also isn't even really that contrived of a case either, as type TSlice []T isn't uncommon. I'd personally assume that the enum is the last thing that is "applied" to the type, since the syntax is type Type <underlying type> enum <values>, but this isn't immediately obvious.

Under that assumption, the ambiguous code would work as:

for i, x := range values([]struct { f int } enum { A = []struct{f int}{struct{f int}{5},struct{f int}{2} }).slice {
    fmt.Println(i, x)
}

which would have the same output as:

for i, x := range [][]struct{f int}{{struct{f int}{5}, struct{f int}{2}}} {
    fmt.Println(i, x)
}

https://play.golang.org/p/d3HtDbyFZVp

networkimprov commented 5 years ago

Non-primitive enums (i.e. struct, array) is an interesting concept!

We should also consider a) the var/const (...) declaration syntax b) defining a namespace from a parent type name

enum Name ( global = "Atlas" ) // defines namespace & typename; underlying type inferred
type Name enum ( global = "Atlas" ) // alternatively

var word enum ( stop = "red"; go = "green" ) // inline enum applies to one variable
                                             // no namespace or typename defined

type Person struct {
   name Name
   status enum ( follows byte = iota; leads ) // namespace is Person
}

func f() {
   v := Person{ status: follows, name: Name.global } // Person namespace is implicit
   v.status = Person.leads

   word = stop // global namespace
   word = Name.global // compile error

   for n := range Name { ... } // iterate over enum with typename
}

type Material struct {
   kind enum ( Metal = 1 ... )
   hardness enum ( Malleable = 1 ... )
}
enum Composite ( gold = Material{Metal, Malleable} )

(My up-thumb is a qualified one :-)

deanveloper commented 5 years ago

a) the var/const (...) declaration syntax

The reason behind picking { ... } over ( ... ) was that it just looked more visually appealing when defining enums of struct types.

example:

type Example struct {
    i int
} enum (
    A = Example{1}
    B = Example{2}
)

versus

type Example struct {
    i int
} enum {
    A = Example{1}
    B = Example{2}
}

The symmetry of } enum { looks much nicer. I would say using parentheses does make more sense, since we are declaring a list of variables. I was pretty tied on which one to use.

b) defining a namespace from a parent type name

I addressed this, I didn't like it because Type.Value makes Type look like a value, even though it isn't. It does feel much more familiar to other languages however.

Something that bothers me about the example code is that I don't like the type inference. I think that since we don't typically need to declare enums too often, the extra verbosity makes the code much more readable. For instance:

type SomeNumbers enum (
    A = 15
    B = 92
    C = 29993
    D = 29.3
    E = 1939
)

What is the underlying type of SomeNumbers? Well you'd look at the first few numbers and think it's an int type, but because the D constant is 29.3, it would instead have to be float64. This is not immediately obvious and would be an especially large problem for long enums. Just using type SomeNumbers float64 enum ( ... ) would mitigate this issue

networkimprov commented 5 years ago

Type.Value makes Type look like a value, even though it isn't

Type.Method is how you reference a method.

I don't like the type inference

Leveraging the const/var (...) pattern, let the first item clarify the type:

type SomeNumbers enum ( A float64 = 15; B = 92 ... )

Which makes type T struct { ... } enum { ... } unnecessary. (It isn't that readable to my eye.)

deanveloper commented 5 years ago

Type.Method is how you reference a method.

Fair point, completely forgot about that. One of those features that don't get used much, haha

Leveraging the const/var (...) pattern, let the first item clarify the type:

I personally think that

type Status int enum (
    Success = iota
    TimedOut
    Interrupted
    // ...
)

is more readable than

type Status enum (
    Success int = iota
    TimedOut
    Interrupted
    // ...
)

Although that's just be a matter of opinion. I think that the extra verbosity helps the readability in this case, and since people shouldn't be declaring enums all the time, it comes at a relatively low cost.

Which makes type T struct { ... } enum { ... } unnecessary. (It isn't that readable to my eye.)

I actually thought about forcing the underlying struct and enum definition to be separate (which this would effectively do). It was actually the initial design, but as I made examples, it raised a few issues.

Now, you have defined two types (an internal one for the structure, and the type with the enum to actually be used). So now you have this useless internal type floating around, which is only used to define an enum. Not a huge deal per se, but it's pretty inconvenient and seems like a waste of typing. It would also clutter up autocompleters for those who use them.

Another issue is documentation. Presumably you wouldn't want that struct to be exported, because it's only use is to be used by the enum. The issue with not exporting the struct is that now it doesn't appear on godoc.org, so people don't know what fields your enum has. So your two choices are to either export your struct, which is bad design since it's not supposed to be used outside of the enum, or to keep the struct unexported, which makes it invisible to godoc. The type T struct { ... } enum { ... } fixes this issue, since the underlying type is defined along with the enum rather than separate.

Also, defining the enum all at once illustrates that enums are an extension on types, and not types on their own. Doing type SomeNumbers enum ( ... ) makes it look like that enum ( ... ) is the underlying type, even though it's actually an int where enum just limits which int values that SomeNumbers can hold. The proposed syntax is a bit more verbose, but I think that a syntax where the underlying type is defined along with the type illustrates how it works a bit better.

Also, if you want to define the struct and enum separately, you still can:

type internalT struct { ... }
type T internalT enum ( ... )

type SomeNumbers enum ( A float64 = 15; B = 92 ... )

Even in the const/var pattern, that doesn't work. B would still be an int since you assign an untyped integer to it: https://play.golang.org/p/NnYCrYIsENm

Either way, this is all just syntax. I'm glad that there haven't been any problems with the actual concept yet!

jonas-schulze commented 5 years ago

I think you are using the wrong associativity: type T enum int {} should resolve the ambiguity.

I like the overall concept, but how would you cover "multi-value" enums like this one: https://play.golang.org/p/V7DAZ1HWkN4? From the top of my head one could use

but this would add yet another keyword to the language. How would one extract the bit information? Using a & b != 0 feels clumsy, but a ~= b is even more syntax to consider.

Update: I already mixed up the base values. 🙈

deanveloper commented 5 years ago

I think you are using the wrong associativity: type T enum int {} should resolve the ambiguity.

That's true that it would solve ambiguity with existing data structures since "extensions" on types are usually prefixes (ie slices/arrays, channels, etc).

Apply this to structs and it gets a bit more messy in my eyes, especially if we adopt the (...) syntax rather than {...}

type Example enum struct {
    i int
} (
    A = Example{1}
    B = Example{2}
)

versus

type Example struct {
    i int
} enum (
    A = Example{1}
    B = Example{2}
)

I think the second example illustrates that Example is just a struct { i int } and that the enum is simply an extension to that type, rather than enum struct { ... } being some entirely new concept.

jonas-schulze commented 5 years ago

But Example isn't just a struct { int }, is it? I think it really is a struct { int } wrapped as an enum. Putting enum first would also be compatible with future type additions to the language. However, that's all syntax. What do you think about the "bit flag" use case for enums?

networkimprov commented 5 years ago

You only need to specify the value type if its literal form isn't unique, i.e. numeric types other than int & float64.

enum int8 (a=iota; b) is indeed a good way to achieve that, and enum (a int8 = iota; b) should also work for consistency with var/const.

Anonymous value types have problems...

type E struct { // looks like a struct type
  ...
} enum (        // surprise, an enum type
  a = E{...}    // surprise, it works like a struct type here
)
var x = E{...}  // surprise, error

Anonymous enum types are valuable (as I described above)...

type S struct {
   v enum (a = 1)
}
deanveloper commented 5 years ago

But Example isn't just a struct { int }, is it? I think it really is a struct { int } wrapped as an enum. Putting enum first would also be compatible with future type additions to the language. However, that's all syntax.

Example is a struct. "Enum type" (while I have used this term) is a misnomer. The enum keyword simply limits what values type Example is able to hold.

What do you think about the "bit flag" use case for enums?

I personally think that bit-flags should not be handled by enums. They are safe to just use as a raw numeric type in my eyes, since with bitflags you can simply ignore any bits which do not have meaning. I do not see a use-case for bitflags to need a construct, what we have for them now is more than enough.

Example is not wrapped as an enum. The enum specifier is an "attribute" to a type which limits what values it can hold.

Anonymous enum types are valuable (as I described above)...

You could equally achieve that with v int enum ( a = 1 ) which could follow the same namespacing rules that you described earlier. I didn't think of this in my original design, thanks for bringing it up!

type E struct { // looks like a struct type
  ...
} enum (        // surprise, an enum type
  a = E{...}    // surprise, it works like a struct type here
)
var x = E{...}  // surprise, error

I will accept that it may be bad for enum to be after the struct since type E struct ... looks like a plain struct type, but you don't see that it's enumerated until further down. But I could not think of a better syntax for defining the new type's underlying type AND value within the same statement.

networkimprov commented 5 years ago
type E enum struct {
  a, b int
} (
  x = { 1, 2 }      // type inferred, as with []struct{...} { {...}, {...} }
  y = { a:2, b:3 }
)

type E enum [2]int (
  x = { 1, 2 }
  y = { 2, 3 }
)

Voila!

deanveloper commented 5 years ago

I really don't like the } ( on the third line, I had mentioned it before. It feels wrong, but that's probably just me being picky haha. I'd be fine with it if it were implemented that way

networkimprov commented 5 years ago

Maybe you could rev the proposal to incorporate the above insights and alternatives?

deanveloper commented 5 years ago

I've added a list of points at the end which summarize what has been talked about so far.

networkimprov commented 5 years ago

Thanks! Also you could cover...

a) anonymous enums

var a enum (x=1; y=2)
a = x
type T struct { a enum (x=1; y=2) }
t := T{a:x}
t.a = T.x

b) type inference examples for struct and array, essential for anonymous value types: enum struct { ... } ( x = {...}; y = {...} )

c) the default value; is it the type's zero value or the first enum value?

d) enum (x int8 = iota; y) for consistency with the Go 1 enumeration method. Granted that doesn't work well for an anonymous struct.

UFOXD commented 5 years ago

good

deanveloper commented 5 years ago

a) anonymous enums

I haven't directly addressed them, but it is implied they are allowed because enum int ( ... ) is just as much of a type as int is.

b) type inference examples for struct and array, essential for anonymous value types:

Adding after I'm done with this reply

Actually - I'd rather not. Not because I don't like type inference or anything, but these things are mentioned in a bulletted list. The list is meant to be a TL;DR of the discussion, and I don't want to be polluting it with long examples. I personally don't think it's "essential for anonymous value types" or anything, struct and array enums are just as much enums as ints are and really doesn't take much to wrap your head around how they work.

c) the default value; is it the type's zero value or the first enum value?

Thanks for the reminder, I'll add that in

d) enum (x int8 = iota; y) for consistency with the Go 1 enumeration method. Granted that doesn't work well for an anonymous struct.

I've already used iota in enum definitions within my proposal.

deanveloper commented 5 years ago

@networkimprov I've updated the points of discussion

@clareyy Please read https://github.com/golang/go/wiki/NoPlusOne

networkimprov commented 5 years ago

Re type inference, you have this example, which weakens your case

type Material struct {
   ...
} enum {
   Metal = Material{...}, // Material is an enum; the struct is anonymous

Re zero value, I often do this: const ( _ int = iota; eFirst; eSecond ) Here the zero-value is in the enum, but anonymous. That probably shouldn't produce a compile error.

deanveloper commented 5 years ago

Re type inference, you have this example, which weakens your case

Material is not an enum. It is a type just like any other, but the enum keyword limits what values a type may hold. Doing Material{ ... } outside of the enum section of the type is still valid as long as the value comes out to a value that is within the enum section. I'd imagine tools like golint should discourage this behavior though to make it more clear that an enum is being used.

Re zero value, I often do this: const ( _ int = iota; eFirst; eSecond ) Here the zero-value is in the enum, but anonymous. That probably shouldn't produce a compile error.

I'd argue it should. iota is always zero for the zero'th index const. If you do _ MyEnum = 0 on an enum that does not contain a 0 value, it should produce a compile error as the second bullet in the "The following would be true with enums:" part states.

A work-around would be:

type Foo enum int ( x = iota+1; y; z) // note: no zero value is allowed for Foo

const (
    _ int = iota
    a Foo = iota
    b
    c
)
jonas-schulze commented 5 years ago

As a side note: C++11 introduced enum classes that entail more rigorous type checking:

deanveloper commented 5 years ago

As a side note: C++11 introduced enum classes that entail more rigorous type checking:

Actually that's very useful, thank you. My C++ knowledge is a bit outdated since my university professor didn't like C++11 haha.

I'll separate C and C++ in the list of languages.

networkimprov commented 5 years ago

If type Material struct { ... } enum ( ... ) defines type Material, then the following defines type Intish, which is not an int[1], so you must convert constants to it:

type Intish int enum (
   a = Intish(1)
   b = Intish(2)
)

[1] https://golang.org/ref/spec#Type_identity

deanveloper commented 5 years ago

1, 2, 3, etc are untyped. So you can assign them to any value that has an underlying numeric type.

networkimprov commented 5 years ago

You spotted my syntax error and ignored my point. Intish is type clutter.

const CA int = 1
type E int enum ( B = 1 )

type Intish int enum (
   a = Intish(CA)
   b = Intish(E.B)

EDIT: what happens when you alias an enum?

type Intish = int enum (...)
deanveloper commented 5 years ago

Aliasing an enum would work the same way as if you just typed int enum ( ... ) everywhere you type Intish, as with normal type aliasing. This comes at the consequence of the types values not being namespaced. The problems caused by type aliases should (almost) be ignored because type aliasing is a feature of Go that is rarely meant to be used, and to only be used if you know the consequences.

Intish type variable declarations are verbose, I wouldn't quite call it clutter. Intish is NOT an int, so we can not set it to an int value. That's Go, and that's how we do things. Not exactly sure even on what a "good" solution to setting a non-int variable to an int would be, if it's not to convert it.

networkimprov commented 5 years ago

Popping up a few thousand feet... There are three separate issues here: a) sub-package namespacing b) limiting a variable to predefined values c) struct constants

Addressing these separately would grant greater flexibility.

deanveloper commented 5 years ago

IIRC we have proposals for struct constants already, although I'm not exactly sure how they're doing right now since I am on my phone.

You are correct that addressing them as three separate problems is a better idea, though. Packing them in to one idea rather than three separate ones wasn't a good idea. I might get around to making a new proposal which better addresses this, and then close this one once I've done that.

This seems like it's a great start though - thanks for all the criticism.

networkimprov commented 5 years ago

One way to specify a limited set of values for a variable is with a namespace in the declaration, so that may be a section of the namespace proposal:

var a int Namespace = value1
a = Namespace.value2
a = 1 // error

b := a                // also limited
c := Namespace.value1 // not limited
deanveloper commented 5 years ago

Currently writing the new proposal. One problem with eliminating the Namespace before the dot is when using other packages namespaces. Example:

var a int mat.Hardness
a = mat.Soft     // could be ambiguous, especially if mat.Soft is already defined

I think it's best to just keep the namespace always in the value to reduce ambiguities. I'm really liking this proposal so far though.

deanveloper commented 5 years ago

As I continue writing the namespace proposal, it keeps getting messier and messier. Integrating namespaces and types together to make an enum-like structure keeps making things more complex and hard to make a concise syntax for. I think namespaces would need to be an entirely separate concept from enums, unfortunately.

networkimprov commented 5 years ago

Post what you have on a wiki and I'll take a look?

deanveloper commented 5 years ago

I've put it in a gist for now, feel free to comment on it - https://gist.github.com/deanveloper/0a1af8edd4a670e7c6912b284abb313f

networkimprov commented 5 years ago

See also #20467 - support nested types or name spaces

ianlancetaylor commented 5 years ago

Per this proposal, an enum is a list of immutable values for a given type. Although this proposal supports any type, we could reduce the proposal to only support integer enum types, if we use the integer as an index into an array of permitted values. That is, permitting any non-integer type as an enum saves a small amount of boilerplate code--calling a function or doing an array index to retrieve the corresponding value (a function could be used to make the values immutable outside the package). That would eliminate the concerns about the zero value (assuming the valid indexes included 0) and would make iteration very simple.

(Integer enum types are already being discussed at #19814.)

UFOXD commented 5 years ago

it is great, and I think native thread is necessary !

deanveloper commented 5 years ago

@ianlancetaylor The pitfall to this is that it suffers from the same issues as defining the enum type and enum values separately as described in this comment: https://github.com/golang/go/issues/28987#issuecomment-442996502

As an example for clarity, let's use the following code:

type Material enum int ( Plastic; Metal )
type matValue struct { Name string }
var materials = []matValue { matValue{"Plastic"}, matValue{"Metal"} }

func (m Material) Value() matValue {
    return materials[m]
}

// in some other package using this api...

func DoSomething(mat Material) {
    mat.Value().??? // how do I get the name of the material again? I can't see with `go doc` since matValue is not exported!
}

The issue here is documentation. Because matValue is unexported, the value that the Material enum represents is now hidden in go doc and any tools that rely on it (which is a lot, including things like IDE tools). Returning an unexported value from an exported function (as done in (Material) Value()) is not allowed by golint.

So let's try exporting matValue:

type Material enum int ( Plastic; Metal )
type MatValue struct { Name string }
var materials = []MatValue { MatValue{"Plastic"}, MatValue{"Metal"} }
func (m Material) Value() MatValue {
    return materials[m]
}

// in some other package using this api...

func DoSomething(mat MatValue) { ... } // wrong! shouldn't take a MatValue

Now we've got a whole new issue. Since MatValue is exported, people can make their own instances of it, defeating the purpose (under this proposal) of an enum. You may document that values of type MatValue shouldn't be created, and that you shouldn't have any functions that take a MatValue, only ones that take a Material. But at that point you shouldn't export MatValue in order to prevent people from using it, and we've already gone over the issue behind not exporting MatValue.

ianlancetaylor commented 5 years ago

Thanks, but I have to say that I am less concerned about changing the documentation and linting tools than I am about changing the language.

deanveloper commented 5 years ago

There's going to be a pretty significant language change either way. Although I think my main gripe with returning an unexported type is that an unexported type logically should only be used by the package that uses it. In fact, var x pack.matValue is invalid, which is a similar issue to what this proposal encounters with the whole "what's the zero value of an enum" problem.

networkimprov commented 5 years ago

@deanveloper let's keep thinking on ways to limit a defined or anonymous type to predefined values...

type t struct {...}
var ( Allow1 = t{...}; Allow2 = t{...} ) // common today

enum t ( Allow1, Allow2 ) // plugs into existing code!

... struct {
   a int enum (1, 2) // defines anonymous type
}
deanveloper commented 5 years ago

That example has the issue that you explained earlier, where we don't see that type t is an enum until much later in the code. Also seems a bit clumsy since you need to restate yourself. Ideally, you should only need to type out the list of variables once

networkimprov commented 5 years ago

The point is to enhance existing code with minimal effort. Take two:

type t struct {...}
var ( Allow1 = t{}; ... AllowN = t{} ) // existing code

var A t enum                    // any var t in scope
var B t enum ( Allow1, Allow2 ) // just these
var C t enum ( Permit = t{} )   // new one in current scope?
deanveloper commented 5 years ago

It's an interesting idea to store the enum values in a variable of type t enum .... But now declaring an enum in a new project becomes clumsy, and I'd like to avoid having multiple ways to do write the same code.

Also, I'd like to point out that under this proposal, we cannot create an enum from a list of var because enum values should be required to be immutable. Take the following example:

// a person is defined by their name.
type person string
var ( Me = "Dean" )

func main() {
    var a string enum(Me) = Me
    Me = "Deanveloper"
    fmt.Printf("T:%[0]T a:%v Me:%v\n", a, a, Me) // T:string enum(Me) a:Dean Me:Deanveloper
    // problem: `Me`=="Deanveloper" but `a`=="Dean", even though `a` should only hold what `Me` does.
}
networkimprov commented 5 years ago

It's no more clumsy than in Go1; you simply add enum to a variable declaration; supplying a list of values would be uncommon. Values would be immutable when const, and there are open items proposing new const types.

My proposal actually preserves the Go1 method as the only way, and is a relatively small language change. An embedded definition was just a thought; more below.

func f() {
   var a t enum ( Permit = t{} ) // new definition only in func scope?
}
type S struct {
   a t enum ( Permit = t{} ) // new definition only in type def, which namespaces it?
}
var s S
s.a = S.Permit
deanveloper commented 5 years ago

It's not just as clumsy as Go 1, because in Go 1 we don't have to redeclare our values (because we don't have enums), which is where I feel that the clumsiness comes from.

Also I really think that we should be primarily focusing on named enums which get reused in many places, not enums that are inlined (and therefore only used in one place). That's the primary goal of this proposal, other enum proposals, and enums in other languages.

networkimprov commented 5 years ago

Re "we don't have to redeclare our values" you missed the first of my three enum options above. https://github.com/golang/go/issues/28987#issuecomment-448741518

The Go team considered and omitted enums. If we need to limit vars to certain values, I imagine they'd prefer to address that within the current framework.

deanveloper commented 5 years ago

Sorry, when I say "redeclare" I mean you declare the enum values (in the var declaration) and then a second time in the enum declaration (ie with var B ...)

The var A t enum actually works pretty okay, however.

Enums should work on the type-level instead of the var level. An ideal use case for an enum would be something like time.Weekday, where we would be using time.Weekday in multiple areas. Things like that are the primary use-case that I am going for. In these cases, enums need to be on the type level, so that when I am writing a function that takes a weekday, it doesn't look like

func Tomorrow(today time.Weekday enum(time.Sunday, time.Monday, time.Tuesday, ...)) time.Weekday enum(time.Sunday, time.Monday, time.Tuesday, ...) { ... }

That is what I mean when I say that this proposal focuses on named enums, rather than enums that only operate on a single variable.

networkimprov commented 5 years ago

You would write

func Tomorrow(day time.Weekday enum) time.Weekday { ... }