Lab41 / soft-boiled

Library for Geo-Inferencing in Twitter Data

Updating to Take in SQL RDD #8

Closed: ymt123 closed 9 years ago

ymt123 commented 9 years ago

Updates the algorithm train/test functions to take a Spark SQL RDD as input instead of a path to the data. This should make it easier to integrate datasets in different formats, and it should speed up analyses that run the algorithm over the same dataset multiple times.