Performance problems - Githubissues

alteryx / Automated-Manual-Comparison

Automated vs Manual Feature Engineering Comparison. Implemented using Featuretools.

BSD 3-Clause "New" or "Revised" License

327 stars 150 forks

Hi,

Thanks for your article. Automated feature engineering looks very promising. I am currently running the Loan Repayment script to compare its output with my own hand-engineered features, and I am curious about the results.

What hardware do you recommend to compute the result in one day (as mentioned in the article)? This is my current progress output:

Elapsed: 18:50:30 | Remaining: 22358:53:57 | Progress: 0%| | Calculated: 3/3563 chunks

ft.py uses one job by default, and any value other than 1 crashes the script. I am using an r4.2xlarge AWS EC2 instance, but with one job the script cannot use more than one core. Even with all eight cores, it would still take weeks.
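
For reference, here is a rough back-of-the-envelope check of the progress output quoted above. It is only a sketch: it assumes every chunk takes as long as the first three did, and that eight cores would give a perfect 8x speedup, which real workloads rarely achieve. The helper name `hms_to_seconds` and the variable names are made up for this illustration.

```python
def hms_to_seconds(text):
    """Convert an 'H:MM:SS' string (hours may exceed 24) to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


# Values taken from the progress line in this issue.
elapsed_s = hms_to_seconds("18:50:30")
chunks_done = 3
chunks_total = 3563

# Assumption: remaining chunks cost the same as the ones already finished.
seconds_per_chunk = elapsed_s / chunks_done
remaining_s = seconds_per_chunk * (chunks_total - chunks_done)
remaining_days_one_core = remaining_s / 86400

# Assumption: ideal linear scaling across all eight cores.
cores = 8
remaining_days_eight_cores = remaining_days_one_core / cores

print(f"One core:    about {remaining_days_one_core:.0f} days remaining")
print(f"Eight cores: about {remaining_days_eight_cores:.0f} days remaining")
```

Under these assumptions the estimate comes out at roughly 930 days on one core and roughly 116 days on eight, which is consistent with the "Remaining" figure in the log and suggests months rather than weeks on this instance.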

Can you recommend some specs to speed this up?

Best regards

Performance problems #1