src-d / style-analyzer

Lookout Style Analyzer: fixing code formatting and typos during code reviews
GNU Affero General Public License v3.0

Add get_datasets function with tests #659

Closed irinakhismatullina closed 5 years ago

irinakhismatullina commented 5 years ago

Small functions for generating train and test datasets with artificial typos.

Currently the train and test sets can overlap: the chance is very small for examples with typos, but not small for examples without typos. The overlap can be removed if needed.
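To make the overlap concrete, here is a minimal sketch of how such datasets could be generated and deduplicated. It is not the code from this PR: `corrupt`, `make_examples`, `sketch_datasets`, and all their parameters are hypothetical names chosen for illustration, and the typo model (one random insert, delete, replace, or swap per token) is an assumption.

```python
import random
from typing import List, Tuple

ALPHABET = "abcdefghijklmnopqrstuvwxyz"
Example = Tuple[str, str]  # (possibly typoed token, correct token)


def corrupt(token: str, rng: random.Random) -> str:
    """Apply one random character edit to a non-empty token (hypothetical typo model)."""
    pos = rng.randrange(len(token))
    op = rng.choice(["insert", "delete", "replace", "swap"])
    if op == "insert":
        return token[:pos] + rng.choice(ALPHABET) + token[pos:]
    if op == "delete" and len(token) > 1:
        return token[:pos] + token[pos + 1:]
    if op == "swap" and pos + 1 < len(token) and token[pos] != token[pos + 1]:
        return token[:pos] + token[pos + 1] + token[pos] + token[pos + 2:]
    # Fall back to replacing the character with a different one,
    # so the result always differs from the input.
    other = rng.choice([c for c in ALPHABET if c != token[pos]])
    return token[:pos] + other + token[pos + 1:]


def make_examples(vocab: List[str], size: int, typo_prob: float,
                  rng: random.Random) -> List[Example]:
    """Sample tokens from vocab and corrupt each one with probability typo_prob."""
    examples = []
    for _ in range(size):
        correct = rng.choice(vocab)
        typoed = corrupt(correct, rng) if rng.random() < typo_prob else correct
        examples.append((typoed, correct))
    return examples


def sketch_datasets(vocab: List[str], train_size: int, test_size: int,
                    typo_prob: float = 0.5, seed: int = 0,
                    drop_overlap: bool = False) -> Tuple[List[Example], List[Example]]:
    """Build train and test sets; optionally drop test examples also seen in train."""
    rng = random.Random(seed)
    train = make_examples(vocab, train_size, typo_prob, rng)
    test = make_examples(vocab, test_size, typo_prob, rng)
    if drop_overlap:
        seen = set(train)
        # Untouched tokens collide easily: (token, token) repeats whenever
        # the same vocabulary entry is sampled twice without a typo.
        test = [example for example in test if example not in seen]
    return train, test
```

With `drop_overlap=True` the test set can come out smaller than `test_size`, mostly because untouched `(token, token)` pairs repeat whenever the same vocabulary entry is drawn for both sets. Drawing train and test tokens from disjoint vocabularies would be another way to avoid the intersection.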