WM-SEMERU / ds4se

Data Science for Software Engineering (ds4se) is an academic initiative to perform exploratory and causal inference analysis on software engineering artifacts and metadata. Data Management, Analysis, and Benchmarking for DL and Traceability.
https://wm-csci-435-f19.github.io/ds4se/
Apache License 2.0

Preprocessing of Canonical Datasets #86

Closed: danaderp closed this issue 3 years ago

danaderp commented 3 years ago

Preprocess and format canonical datasets into pandas dataframes.
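
As a rough illustration of the task, here is a minimal sketch of collecting artifacts into a pandas dataframe and writing it out as csv. The column names (`id`, `type`, `text`), the sample rows, and the `to_dataframe` helper are assumptions made for this example, not the actual layout of the canonical datasets or the code used in this repository.

```python
import io

import pandas as pd

# Hypothetical raw artifacts as (artifact id, artifact type, raw text).
# The real testbeds have their own layouts; these values are illustrative only.
raw_artifacts = [
    ("REQ-1", "requirement", "The system shall  store\nbooking data."),
    ("SRC-1", "source", "class Booking { }"),
]


def to_dataframe(artifacts):
    """Collect (id, type, text) tuples into a DataFrame with normalized whitespace."""
    df = pd.DataFrame(artifacts, columns=["id", "type", "text"])
    # Collapse runs of whitespace (including newlines) into single spaces.
    df["text"] = df["text"].str.replace(r"\s+", " ", regex=True).str.strip()
    return df


df = to_dataframe(raw_artifacts)

# Round-trip through csv using an in-memory buffer instead of a real file path.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
buffer.seek(0)
restored = pd.read_csv(buffer)
```

Normalizing whitespace before export keeps each artifact on a single csv row, which makes the file easier to inspect by hand; whether the real preprocessing does this is not stated in this thread.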

danaderp commented 3 years ago

Albergate preprocessed. You can find the file at: 'tree/main/dvc-ds4se/se-benchmarking/traceability/testbeds/processed'

danaderp commented 3 years ago

E-tour dataset generated! Find the file in the same path!

danaderp commented 3 years ago

I-trust generated! Find the csv file in the same path!

danaderp commented 3 years ago

Smos generated!

danaderp commented 3 years ago

EBT was generated! All canonical datasets are preprocessed in the required .csv format.