Open falexwolf opened 1 day ago
Now I've fixed everything and calling it another time:
! indexing datasets with gene symbols can be problematic: https://docs.lamin.ai/faq/symbol-mapping
β 'var_index' is validated against Gene.symbol
β 'perturbation' is validated against ULabel.name
β 'cell_type_by_expert' is validated against CellType.name
β 'cell_type_by_model' is validated against CellType.name
! no run & transform got linked, call `ln.track()` & re-run
... storing 'perturbation' as categorical
... storing 'cell_type_by_expert' as categorical
... storing 'cell_type_by_model' as categorical
β returning existing artifact with same hash: Artifact(uid='7pgG6hxGTyNUbcOW0000', is_latest=True, key='datasets/dataset1.h5ad', suffix='.h5ad', type='dataset', size=23352, hash='NYni1vTRM7pqfle8ufPZwQ', n_observations=3, _hash_type='md5', _accessor='AnnData', visibility=1, _key_is_virtual=True, storage_id=1, created_by_id=1, created_at=2024-11-25 06:54:11 UTC)
! run input wasn't tracked, call `ln.track()` and re-run
The Curator
seems to do something strange because it logs this warning twice:
! run input wasn't tracked, call `ln.track()` and re-run
@sunnyosun, can you look?
I'm working with this example:
It logs the following
I made the first line of the logging consistent with our convention of lower-case logging messages, @Zethson; also added a link:
Ok, now, upon re-running, I get this. Because some bionty-validated things have been validated already, the logging looks much less verbose:
However, the 3-lines just for
perturbation
throw me off:Can we get this onto one line? What does "remove non-existent values"? That seems pretty confusing. I'd simply remove this (I get that you mean 'remove values from your AnnData that aren't in the registry', but I can't see that this is a practical case; it's mostly a confusing case.)
Here is a suggestion for compressing 3 lines onto 1 line:
Can you implement this, @sunnyosun?