rpytel1 closed 5 years ago
Abstract Syntax Tree Parser and parts of a preprocessing pipeline: https://github.com/jan-gerling/mmsr_repo_sim
Idea 1: Transfer learning for code2vec
Idea 2: As a comparison, an SVM (perhaps other traditional ML methods)
Idea 3: Reduce the feature space (data ablation study)
Preprocessing TODOs:
Training
Validation
Open questions
Components to be implemented