ddf-project / DDF

Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine
http://ddf.io
Apache License 2.0
167 stars 42 forks source link

Feature/gc ddf #351

Closed Huandao0812 closed 8 years ago

Huandao0812 commented 8 years ago

Description and related tickets, documents

DDFCache is backed by a guava's Loading cache, https://github.com/google/guava/wiki/CachesExplained The eviction policy is LRU For DDF cache we use both size-based eviction and time eviction:

Reviewers: @zkidkid @nhanitvn @phvu @lebinh @ubolonton

PR Progress

Make sure all checkboxes below are checked before merged

phvu commented 8 years ago

lgtm

nhanitvn commented 8 years ago

lgtm

ubolonton commented 8 years ago
Huandao0812 commented 8 years ago

@ubolonton we could extends and override the class, one simple solution is adding a cache flag to signal don't remove this DDF

ducleminh commented 8 years ago

@ubolonton @Huandao0812 or don't cache the temporary DDFs if we can work out whether they are temporary.

nhanitvn commented 8 years ago

retest this please

hai-adatao commented 8 years ago

Hold on, don't merge this one yet

Huandao0812 commented 8 years ago

I change the DDFCache to keep DDF with name on a separate cache, let's go with this first, a separate flag to signal long live versus ephemeral can be add later. Please review @nhanitvn @phvu @ubolonton @zkidkid @lebinh

nhanitvn commented 8 years ago

That would be a better approach. Anyhow, we have no-named DDFs and named DDFs.