ddf-project / DDF

Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine
http://ddf.io
Apache License 2.0
167 stars 42 forks source link

Can get a DDF by sampling a Spark DDF by size #327

Closed nhanitvn closed 8 years ago

nhanitvn commented 8 years ago

Description and related tickets, documents

Also:

Reviewers: @Huandao0812 @phvu @hai-adatao

Breaking changes & backward compatible issues

No breaking changes in API. However, we need to take care about performance.

How to test

Describe how this PR is tested. In case manual testing is required, describe how to do so.

PR Progress

Make sure all checkboxes below are checked before merged

Change-Id: I853d48575247d9179936d2a8c67ae79035ccb959

nhanitvn commented 8 years ago

The change on API names leads to a lot of changes in ddf-jdbc and PE. So I will close this ticket and open another one focusing on just the logic, not name changes.

nhanitvn commented 8 years ago

Here it is https://github.com/ddf-project/DDF/pull/328