Open marcenacp opened 1 year ago
Will this need to be assigned to an individual, or can I take a stab at it and make a PR?
@Milksheyke Please, feel free to send a PR :) Thanks!
Installed repo and dev tools, but I'm unable to generate the database with monkeytype. At least not fully.
croissant/python/mlcroissant/mlcroissant/_src/operation_graph/execute.py", line 98, in build_record_set
len(result) == 1
AssertionError: "GroupRecordSet(default)" should have one and only one predecessor. Got: 0.
execute_operations_in_streaming in mlcroissant/_src/operation_graph/execute.py is passing in a DiGraph while build_record_set is expecting a MultiDiGraph. I'm guessing that's where the assertion error is coming from.
Am I missing something here, or should we introduce some sort of default behavior to handle if a node lacks a predecessor?
Why?
This feature can have several benefits:
How?
pip install monkeytype
, but do not add it to pyproject.toml. We'll use monkeytype to just generate types.pip install flake8-annotations
and add it to pyproject.toml in the[dev]
section.pyflakes . --disallow-untyped-defs
.monkeytype apply ml_croissant._src.core.json_ld
.