DeveloperLiberationFront / Spreadsheet-Corpus-Paper

1 stars 0 forks source link

Run parallel VCL jobs of CheckCell #4

Open CaptainEmerson opened 10 years ago

CaptainEmerson commented 10 years ago

Assigning Titus, but this should really be John.

slankas commented 10 years ago

Still having issues just being able to run CheckCell. Process was using over 7gb of RAM for a 128k sized file. also not getting any results to appear in Excel.

Have emailed the paper authors requesting a sample file.

slankas commented 10 years ago

Was finally able to install component properly. Needed F# runtime which was not in their docs.

Am now using their OOPSLA version which can run in batch mode from Excel. Still takes a substantial period of time to execute (hours). Will need to investigate AutoIT

slankas commented 10 years ago

Executed their batch process against the sample set in their paper. (We'll close it it. They had a distribution set of 61 files in their artifact set while the paper list 64)

average processing time per spreadsheet = 47 sec. Paper = 49.45

I'd like to get a sample set from Enron that we can try to run against to compare numbers.

CaptainEmerson commented 10 years ago

Cool, sound it's working like it should.