iterative / studio-support

❓ DVC Studio Issues, Question, and Discussions
https://studio.iterative.ai
16 stars 1 forks source link

5000 column limit #87

Closed slevang closed 1 year ago

slevang commented 1 year ago

I'm running into an issue with a repo where recently added metrics and params files are not being detected in the columns view of Studio.

The issue seems to be that at one point in the commit history, we had a huge number of metrics dumped (~10000) since we are operating in a large multi-dimensional space. I noticed that the counter on the columns settings page seems to top out at 5000. Testing this behavior with a toy repo, I confirmed that studio stops parsing columns >5000, and that if I make a new squashed version of our repo, Studio picks up the desired columns just fine.

We've since pared the metrics back greatly in the main repo, but Studio still won't pick up the desired columns because it is holding on to the old ones.

Any thoughts on how to recover from this, short of squashing the entire commit history?

shcheklein commented 1 year ago

hey, let us confirm this. I think it should be prioritizing the latest metrics above the oldest ones.

Could you for now try to reparse it:

Screenshot 2023-05-23 at 11 43 52 AM

The Force Import option.

slevang commented 1 year ago

Yeah, have tried Force Import and even delete and recreate the project a couple times. Is there an expected ordering of branches in the parsing, i.e. latest vs default?

I did try this in a test repo and was able to get the latest metrics/params to show again after exceeding the 5000 limit in a prior commit. And as mentioned I did get everything showing by squashing commits and making a new copied repo. So something unexpected in the branch order parsing is all I can think of.

Related question, is there any particular reason to limit metrics in Studio?

slevang commented 1 year ago

Any further suggestions for this issue?

shcheklein commented 1 year ago

@slevang sorry for the delay. I think we've expanded the limit and it should be deployed now. Could you try to force reparse the project? (force import). Also, separately, we are looking into why the new metrics were not prioritized. We'll keep you updated here.

slevang commented 1 year ago

Yes it works now, thank you!

This can be closed from my side, but feel free to leave open if you're still investigating the prioritization order.

tapadipti commented 1 year ago

@slevang Thanks for confirming. I'm closing this issue. But we have an internal issue to investigate the prioritization order and we will comment here once there is an update.