Closed brianleect closed 1 year ago
Suspected cause would be that element extraction to get address no longer works, perhaps UI change?
UI change should not have impacted since code extracts for all mentions of href . weird. need to research deeper.
Basic scraping fixed by https://github.com/brianleect/etherscan-labels/commit/fa2ca24da39cab7961169a89076da6047c5d201c
New problem identified is that subcatid does not seem to be reliably retrieved. E.g. 1inch retrieval only retrieves main, subcatid is empty for some reason.
However, in certain cases as augmented-finance
we see successful subcat_values retrieval
augmented-finance subcat_values: ['1', '0']
Suspected cause, maybe not enough time to load and scrape subcat id?
retried again and it failed on the first try but worked on the follow ups? Not sure what is the cause.
Steps
However, unable to replicate. Suddenly it started working.
May be able to determine faulty labels by a decrease in label count from the previous scrape.
Length of values (0) does not match length of index (3)
Code of interest