NoahCarnahan / plagcomps


Populating evolved features is slow #21

Open zachwooddoughty opened 10 years ago

zachwooddoughty commented 10 years ago

We talked about this today, but populating evolved features seems to take a disproportionate amount of time: about an hour for 50 docs.

evolved_feature_three (as an example) has an _init method that calls self.get_feature_vectors() on the necessary features and then saves the result to self.features["evolved_feature_three"]. Each query of evolved_feature_three grabs the necessary values from the self.features hash and combines them with simple addition and subtraction. The code is here: https://github.com/NoahCarnahan/plagcomps/blob/master/intrinsic/featureextraction.py#L614
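For anyone who doesn't want to open the file, here is a rough sketch of the pattern described above. It is not the actual plagcomps code: the class name, the base feature names (feature_a, feature_b, feature_c), the exact arithmetic, and the calls counter are all made up for illustration. The counter is only there to show how often the expensive lookup runs.

```python
class FeatureExtractorSketch:
    """Hypothetical stand-in for the extractor; not the plagcomps class."""

    def __init__(self, base_vectors):
        # base_vectors: dict mapping feature name -> list of per-passage values.
        self._base_vectors = base_vectors
        self.features = {}
        self.calls = 0  # how many times get_feature_vectors has run

    def get_feature_vectors(self, names):
        # Stand-in for the real (possibly expensive) feature computation.
        self.calls += 1
        length = len(self._base_vectors[names[0]])
        return [tuple(self._base_vectors[n][i] for n in names)
                for i in range(length)]

    def _init_evolved_feature_three(self):
        # Compute the underlying feature vectors once and cache the result.
        names = ["feature_a", "feature_b", "feature_c"]  # assumed names
        self.features["evolved_feature_three"] = self.get_feature_vectors(names)

    def evolved_feature_three(self, index):
        # Each query only reads cached values and combines them.
        a, b, c = self.features["evolved_feature_three"][index]
        return a + b - c  # assumed combination, for illustration only


extractor = FeatureExtractorSketch({
    "feature_a": [1.0, 2.0],
    "feature_b": [0.5, 0.5],
    "feature_c": [0.25, 1.0],
})
extractor._init_evolved_feature_three()
values = [extractor.evolved_feature_three(i) for i in range(2)]
```

In this sketch calls stays at 1 no matter how many queries are made. If a similar counter in the real code grows with the number of queries, the cached values are not being reused, which would be one place to look.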

Why does this take so long?