Closed Isabel-Gan closed 4 years ago
will not catch the case where there is a constant length of markdown cells (see nb id 165313, incorrectly says "true")
define a "longer" markdown cell to have a length of at least 5 lines
issues:
possible solution: count md cells by character instead???
the above issues are resolved, but new issues (false positives):
the issue is that it's counting stuff like html tags in the character count, when those don't actually show up in the markdown cell
maybe going by line was the better way to do it? or filter the actual text out from those lines somehow?
experimenting in https://github.com/Isabel-Gan/quantifying-notebook-features/tree/quantifiying-markdown-length
"fixed" by https://github.com/Isabel-Gan/quantifying-notebook-features/pull/11, but still results in some of the false positives above. upon re-inspection of the notebook, the first markdown cell was actually one line longer than the rest, and the script is correct
found a lot of false positives in the actual run, need to change how this is measured
results from discussion with Shurui:
last round of fixes:
will not catch the case where there may only be a few markdown cells, all placed at the beginning or the end
this would fit in the definition of the feature, but the code will not catch it as it calculates an average length across all markdown cells