Ideas about possible extensions to heap snapshot format

mkustermann commented 1 year ago

One can view the heap snapshot the VM produces as a snapshot of the application at a specific point in time. There's benefits to analyzing such a heapsnapshot outside VM over doing many RPCs to vm-service (which also offers some functionality - e.g. retaining path, inspecting objects, ...).

In some sense it would be nice to have similar capabilities in the tools analyzing heap snapshot as we have with a life debugger (except for actually running code - i.e. no expression evaluation).

Right now the heap snapshot format is quite minimal and pure in what it contains. That's a nice thing, but there's a few things we could do to make it more amenable to analysis and getting it closer to what one can do with live application and debugger / observatory / ...:

[ ] Include class hierarchy information: Currently an analysis tool cannot compute "all objects of class X or subclasses/subtypes of X". e.g. We currently end up having many seemingly unrelated classes containing a field of same name + offset - the fact that this field comes from a common base class is lost in the heapsnapshot format
[ ] More fine grained representation of:
- Statics (currently one sees Root -> Isolate -> StaticObject - there is no mention of the class of the static field)
- Globals (currently one sees Root -> Isolate -> GlobalObject - there is no mention of which library the global is in)
- Stack + Frames + Variable Names? (currently one sees Root -> Isolate -> StackObject - there is no mention of the fact that the object is held on to by the stack, or which frame in the stack)
[ ] Flag to exclude VM-internal objects: It includes many things that end users may not understand. One could put the burden of hiding those VM-internal things on the analysis tools, but that makes those tools highly coupled with VM-specifics. It may be nice to make heapsnapshot generation configurable to exclude VM-specifics (library/class/function/field objects + dictionaries, instructions, code objects, object pools, type feedback, subtype caches, ...).
[ ] VM version / Heapsnapshot format version / ...: Analysis tools could detect when the format is incompatible. In addition to the format itself, it would allow detecting if the structure of the graph or certain classes relied upon by analysis tools change.

/cc @rmacnak-google

mkustermann commented 1 year ago

/cc @polina-c

polina-c commented 1 year ago

Thank you, Martin, for initiating this.

Some thoughts:

How about just flagging VM-internal objects instead of hiding them? They still take memory so it make sense to have them visible. Should not it be analyser's task to decide what to show, unless it saves performance?
Other missing information:
- [ ] For object x referencing object y, it is missing which field(s) of x reference y
- [ ] Line numbers for class and field definition in the library will make it possible to open the source code from the analysis tool
- [ ] Documentation on heap snapshot is hard to understand https://api.flutter.dev/flutter/vm_service/HeapSnapshotGraph-class.html https://api.flutter.dev/flutter/vm_service/HeapSnapshotObject-class.html
  
  Examples of unclear questions: a. Is there difference between references and successors or it is just different format of the same information? b. Which object is root?

mkustermann commented 1 year ago

How about just flagging VM-internal objects instead of hiding them?

Sure, that would work as well.

Though various individual algorithms would then need to keep this in mind and possibly ignore edges (e.g. dominators calculation, retaining paths, successors/predecessors, ...)

For object x referencing object y, it is missing which field(s) of x reference y

This information has been there for a long time. The field name of object.references[i] is graph.classes[object.classId].fields[i].name. The exception is mainly variable-sized objects such as arrays, which will only have field information for the array header. (There were some bugs in this information, but they were fixed, e.g. recently in d68ca2cc57302c64d535993bfc0e4cad4c6e51dc, 3669086a40814ba0cbc92436bd6c39dc4bf7b357)

a. Is there difference between references and successors or it is just different format of the same information?

That's not a question of the heap format (which this issue is about), but rather the API that package:vm_service's HeapSnapshotGraph exposes. The main difference I believe is that one of them is a compact Uint32List view while the other is a rather inefficient sync* function yielding actual HeapSnapshotObject.

polina-c commented 1 year ago

For object x referencing object y, it is missing which field(s) of x reference y

This information has been there for a long time. The field name of object.references[i] is graph.classes[object.classId].fields[i].name.

If two objects are of the same class, do they have the same classId? If yes, it will mean that all instances of the same class will have the same reference field for the same index, e.g. all instances of class X will have the same field name for reference #1. Is it how it structured?

mkustermann commented 1 year ago

If two objects are of the same class, do they have the same classId?

Yes, all classes are assigned a number which we call class-id. An object is an instance of a class. The objects don't point to their class, but they store the id of the class they are an instance of.

If yes, it will mean that all instances of the same class will have the same reference field for the same index, e.g. all instances of class X will have the same field name for reference https://github.com/dart-lang/sdk/issues/1. Is it how it structured?

I'm not quite sure where the misunderstanding is. Let me try to rephrase it:

Each class has a list of fields. The fields are densely 0-indexed. The index of the field says where in an object's outgoing references the fields value is. i.e. if you want to know the value of field class.fields[i] in object x you can get that via x.references[class.fields[i].index] - which I believe can be simplified to x.references[i] (because class.fields[i].index == i. (As mentioned the only exception are variable-sized objects such as arrays, where the <obj>.references can be larger than the number of fields).

polina-c commented 1 year ago

It helps. Thank you.

dart-lang / sdk

Ideas about possible extensions to heap snapshot format #50546