DATAVIEW is a big data workflow management system. It uses Dropbox as the data cloud and Amazon EC2 as the compute cloud. Current research focuses on the security and privacy aspects of DATAVIEW as well as performance and cost optimization for running workflows in clouds.
11
stars
5
forks
source link
How to list your contributions explicitly in a paper? #12
we propose a novel sampling method for GBDT that can achieve a good balance between reducing the number of data instances and keeping the accuracy for learned decision trees
we propose an innovative method called EFB to bundle mutually exclusive features (i.e., they rarely take nonzerovalues simultaneously) to reduce the number of features.