Closed andymeneely closed 9 years ago
@andymeneely
After doing some research it looks like all comments are repeated in the messages. When adding inline comments you have to commit them at the end like git with a commit message. The message becomes a reitveld message with the line numbers attached to each inline comment. I think this means we can stop parsing comments as well as messages and just parse the messages. Unless you can think of a reason we would need the duplicate data?
Just need to verify this works in production build
I thought we had found some counterexamples where it was just a comment and not a message? I'll take a look now.
Nope, you're right. I just went through a bunch of random issues with comments and they all were double-counted. Is it possible to still parse messages properly and then just not parse the comments?
I removed the parsing of comments from the vocab generation. Do you want to kill the comment table entirely?
Keep the table. We might use it for something else and it's not really slowing us down.
The word "rightli" is very common because of lines like this:
https://codereview.chromium.org/2413
Filter out the entire line of
File blahblah (right):
Also, look at the
Line
lines as another to filter out.