logpai / bughub

A collection of free-text bug reports for duplicate issue identification

What is the format of bug's description in the duplicate dataset? #4

Open ghost opened 4 years ago

ghost commented 4 years ago

In the duplicate dataset, I have noticed many markers in the bug descriptions, such as ";;", ";;;;", "\tat", etc. Some ";" markers even appear in the middle of a sentence. What do these markers mean?

SabaRG commented 3 years ago

These usually come from source code, e.g., ";" marks the end of a statement in C++. "\t" denotes a tab character in C++, so "\tat" is roughly equivalent to " at". They are not specific to the dataset. To interpret them, you need some familiarity with the applications that the bug reports are about.
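If the markers get in the way of text processing, one option is to normalize them before use. Below is a minimal sketch in Python. It assumes that runs of two or more semicolons act as separators and that "\t" appears as a literal backslash followed by "t"; neither assumption is confirmed by the dataset maintainers. The function name `clean_description` and the sample text are hypothetical.

```python
import re


def clean_description(text: str) -> str:
    """Normalize marker sequences in a bug description (illustrative only)."""
    # Assumption: "\t" is stored as a literal backslash followed by "t".
    text = text.replace("\\t", " ")
    # Also handle real tab characters, in case the loader already unescaped them.
    text = text.replace("\t", " ")
    # Assumption: runs of two or more semicolons are separators, not code.
    text = re.sub(r";{2,}", " ", text)
    # Collapse the whitespace left behind by the replacements above.
    return re.sub(r"\s+", " ", text).strip()


# Hypothetical description, not taken from the dataset.
sample = "Crash on save;;java.lang.NullPointerException;;;;\\tat org.example.Foo.bar(Foo.java:12)"
print(clean_description(sample))
```

A single ";" is left untouched on purpose, since it may be part of a code statement quoted in the report.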