Open crawshaw opened 8 years ago
This turns out to require more invasive changes than I originally realized.
To be useful, it needs to be possible to add methods to types created at run time. That means taking some equivalent to func(receiver reflect.Value, args []reflect.Value) (results []reflect.Value)
as the user's implementation of a method.
But methods are called directly from pointers to the text segment (stored in the itab). The unboxed receiver value is on the stack for the method body to use, but no type is available. So there's no way for the reflect package to take the receiver value and box it up into a reflect.Value when the method is called.
Assuming we want to avoid code generation, some change to dynamic method calling would have to be made. The smallest is probably dropping the underlying type and method number onto the stack. A more significant change would be making the function pointers in the itab indirect like func() pointers (https://golang.org/s/go11func). A third option would be a note on the itab saying these are func()-style pointers. Any of these actions incurs some cost on dynamic method calls.
Still pondering.
The unboxed receiver value is on the stack for the method body to use, but no type is available.
I thought the precise GC work back in Go1.4 meant all types on the stack are known to the runtime. Is that no longer true?
We only have pointer/nonpointer bits for each word on the stack. We do not have full type information.
A discussion in the original thread starting at https://github.com/golang/go/issues/4146#issuecomment-318200547 eventually honed in on really wanting something like described in this issue.
Referencing here because there is some discussion about how NamedOf
/NewTypeOf
would ideally behave to satisfy the goal of extending known interfaces without hiding unknown optional methods.
Since NamedOf() knows the new type T that it is creating, couldn't it build a closure for each method, and store a pointer to the runtime._type for T in each closure? I'm assuming we can build a reflect.Value given an unboxed value and runtime._type.
Since there is no subtyping, we don't have to worry about the receiver's actual type being different from T (or *T), right?
@nerdatmath The language doesn't permit methods to be closures. So, for simplicity, the same mechanism is used to pass the receiver value as is used to pass the pointer to the closure. For a method, we pass the receiver, and for a closure, we pass the closure pointer. We don't have a way to pass both.
OK another shot. Suppose T is a concrete type, s is of type T, x is of type A, and m is a method of T accepting a single parameter of type A. From the programmer's perspective, calling s.m(x) is essentially the same as calling T.m(s, x). Does that correspondence extend down to the calling convention? When calling T.m the compiler knows the concrete type of s (and knows that T.m knows it too), so it doesn't need to pass the type info.
If so, we already have a way to create functions of type func(T, A): runtime.FuncOf. Could this or a similar function create method pointers that could be put in itabs? Please forgive me if I'm being hopelessly naive.
@nerdatmath, I think the issue that you'll run into that @ianlancetaylor is referring to is that once you do create the struct/type of T
, the code that compiler will generate for s.m(x)
will behave such that if you try to interact with s
as the expected receivever, you will encounter a nil pointer exception do to the nature of creating a closure and the calling conventions described in https://golang.org/s/go11func. The word passed along the stack in the call to the function will be the closure data pointer, not the receiver (s
).
I believe the long standing calling conventions would need to modified or extended in order to support this. There's probably a safe way to extend the compiler to support a new calling convention. I went down a very deep rabbit hole trying to implement NamedOf
using StructOf
as a reference and hit this roadblock.
But methods are called directly from pointers to the text segment (stored in the itab). The unboxed receiver value is on the stack for the method body to use, but no type is available.
I thought the concrete type was pointed at by the other half of the interface value (https://research.swtch.com/interfaces). If one is calling through the interface value, would the type not be available? What am I missing?
I dropped the idea of creating a closure. What does reflect.MakeFunc create? Not a closure but an actual function like TopLevel in https://golang.org/s/go11func, right? From that document:
Direct call of method. In order to use the same generated code for both an indirect call of a func value and for a direct call, the code generated for a method (both value and pointer receivers) is chosen to have the same calling convention as a top-level function with the receiver as a leading argument.
So essentially, creating a method should be the same as creating a top-level function, which is done by reflect.MakeFunc. My apologies for referring to it as runtime.FuncOf earlier.
OK so I guess reflect.MakeFunc actually creates something like a func literal / closure. So I think I see the difficulty.
@glycerine The interface value has the type, yes, but in the current calling convention we don't pass the entire interface value in a method call. We only pass the value pointer inside the interface value. We don't need to pass the type, since by definition a method knows which type it has been compiled for.
Thanks Ian.
Related discussion (for others catching up) from Carl Chatfield's proposal a couple years back, calling convention adaptation to provide the entire interface value.
https://groups.google.com/d/msg/golang-dev/coyYwxU3dfM/dZzOleaeFu4J
In my opinion, reflect.NamedOf would be useful even without support for adding methods to the newly created type - just like reflect.StructOf is useful even without (full) support for wrapper methods of embedded fields.
In any case, I think that reflect.NamedOf() should have, as proposed, a function parameter to pass the list of methods and, until support for adding methods is implemented, it could simply panic if the methods list is non-empty.
I also have an idea on how to support adding methods to types created with reflect.NamedOf without changing the calling convention for methods, but it's quite tricky - better if I write a prototype first.
P.S. it is worth noting that the signature func NamedOf(t Type, name string, methods []Method) Type
proposed above would not be able to create recursive types: they need a multi-step procedure, analogous to what go/types.Named has:
func NewNamed(obj *TypeName, underlying Type, methods []*Func) *Named
// then create an underlying type that contains the result of NewNamed
// and pass it to SetUnderlying:
func (t *Named) SetUnderlying(underlying Type)
In my opinion, reflect.NamedOf would be useful even without support for adding methods to the newly created type - just like reflect.StructOf is useful even without (full) support for wrapper methods of embedded fields.
In any case, I think that reflect.NamedOf() should have, as proposed, a function parameter to pass the list of methods and, until support for adding methods is implemented, it could simply panic if the methods list is non-empty.
I also have an idea on how to support adding methods to types created with reflect.NamedOf without changing the calling convention for methods, but it's quite tricky - better if I write a prototype first.
P.S. it is worth noting that the signature
func NamedOf(t Type, name string, methods []Method) Type
proposed above would not be able to create recursive types: they need a multi-step procedure, analogous to what go/types.Named has:func NewNamed(obj *TypeName, underlying Type, methods []*Func) *Named // then create an underlying type that contains the result of NewNamed // and pass it to SetUnderlying: func (t *Named) SetUnderlying(underlying Type)
Yes you are right, and it will introduce to the idea of Proxy Object.
I have a vague idea which I wanted to run past you before I spend time digging deeper.
So methods have no access to a closure pointer (assuming we don't want to change calling conventions), we don't want to incur extra costs for dynamic method calls unrelated to reflect
, and we don't want dynamic code generation.
The idea is that the method code of a NamedOf
generated method is just the equivalent of
TEXT Forwarder
CALL DispatchMethod
DispatchMethod
then inspects the stack to identify the Forwarder
's virtual address and uses that to look up the actual user function (as an interface{}
, so we also have its type) and (un-)wrap receiver, args and results (plus some stack manipulation to return directly to the original caller).
Obviously, this requires multiple copies of Forwarder
so that we have a unique virtual address to look up the function. How do we do this without runtime code generation? We can pre-seed a virtual memory page with copies of Forwarder
at compile time. For example, if the page size is 4096 bytes and Forwarder
requires 16 bytes, we can make 256 individual methods from that one page.
What happens if we run out of free slots? We create a new virtual memory mapping to the same physical memory (or possibly pre-seed the binary with "enough" of such mappings at compile time). So the same physical code gets new virtual addresses and can serve more methods. No generation of new code necessary.
In total, the cost would be:
Forwarder
copies in the binary image.reflect
package.NamedOf
-generated methods would be slower than regular method calls, but unrelated method or function calls would not be affected at all.PAGE_SIZE/ForwarderSize
new methods.@ianlancetaylor is this the kind of issue the team would entertain a community member working on/proposing or is it so deep in probable compiler changes that it'd be easier for it to be done internally?
@edaniels It's fine for a community member to work on this, but it's really hard to get right.
Consider adding a function to the reflect package for creating a new named type with a method set:
(Broken out from #4146.)