ddf-project/DDF
Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine
http://ddf.io
Apache License 2.0
168 stars
42 forks
issues
Remove unused dependencies
#371
Celebrate-future
opened
3 years ago
0
[SECURITY] Use HTTPS to resolve dependencies in Maven Build
#370
JLLeitschuh
opened
4 years ago
0
Remove MySQL, Rserve, REngine dependencies
#369
Huandao0812
opened
7 years ago
1
[No Jira] Don't try checking S3 account owner, which is not always permitted
#368
ubolonton
closed
8 years ago
5
don't create query from table name when create ddf from jdbc source
#367
Huandao0812
closed
8 years ago
0
ETL APIs for handling Time Series
#366
binhmop
opened
8 years ago
0
[PE-2181] Support wildcard and multi-paths
#365
PangZhi
closed
8 years ago
2
[Feature] remove implicit cache function in DDF
#364
Huandao0812
closed
8 years ago
2
[PE-2161] Improve Cache Behavior
#363
dnsang
closed
8 years ago
5
Backends for DDF
#362
wirawan0
opened
8 years ago
0
[PE-2058] Improve load from S3 speed by using s3a instead of s3n
#361
lebinh
closed
8 years ago
3
Migrate aggregate handler test
#360
hai-adatao
closed
8 years ago
1
CastType support for multiple columns
#359
ducleminh
closed
8 years ago
1
temporary fix for S3DDF/HDFSDDF not recognizing lower-case format names
#358
ubolonton
closed
8 years ago
0
Add back binning with previous signature for backward compatibility
#357
nhanitvn
closed
8 years ago
0
[BIGAPPS-6226] Fix not being able to run SQL on DDF
#356
lebinh
closed
8 years ago
0
Revert "Feature/gc ddf"
#355
ubolonton
closed
8 years ago
0
Revert "Better check for integer number of samples"
#354
nhanitvn
closed
8 years ago
0
Revert "Revert "Use Spark DF/RDD APIs for sampling without replacement instead of SQL""
#353
hai-adatao
closed
8 years ago
0
Revert "Use Spark DF/RDD APIs for sampling without replacement instead of SQL"
#352
hai-adatao
closed
8 years ago
0
Feature/gc ddf
#351
Huandao0812
closed
8 years ago
9
binning API now can return a factor column or an integer one
#350
nhanitvn
closed
8 years ago
2
Refactor and optimize sampling by size
#349
nhanitvn
closed
8 years ago
0
Use Spark DF/RDD APIs for sampling without replacement instead of SQL
#348
nhanitvn
closed
8 years ago
3
Add inPlace flag to fillNA and dropNA
#347
ducleminh
closed
8 years ago
1
Add week UDF which performs the same task as weekofyear
#346
nhanitvn
closed
8 years ago
0
Revert "Add median and median_approx for Spark engine"
#345
hai-adatao
closed
8 years ago
0
fixed bug in InverseFactorIndexer
#344
phvu
closed
8 years ago
0
Fix bug in FactorIndexer
#343
phvu
closed
8 years ago
0
FactorIndexer returns new columns
#342
phvu
closed
8 years ago
1
add default implementation for IGloballyAddressable
#341
Huandao0812
closed
8 years ago
4
Add support to scale just a subset of columns plus an inPlace flag
#340
ducleminh
closed
8 years ago
5
Add median and median_approx for Spark engine
#339
nhanitvn
closed
8 years ago
2
Validate the scala value wrt numeric columns
#338
nhanitvn
closed
8 years ago
0
Enhance Parsing Summary Accuracy & Fix S3 return credential
#337
dnsang
closed
8 years ago
0
Copy factors information in getRandomSampleByNum when replacement=False
#336
phvu
closed
8 years ago
1
applySchema can return a new DDF or modify it inplace
#335
Huandao0812
closed
8 years ago
5
applySchema can return a new DDF or modify it inplace
#334
Huandao0812
closed
8 years ago
0
Update the way we do sampling without replacement
#333
nhanitvn
closed
8 years ago
4
Add inplace param to flatten operation.
#332
ducleminh
closed
8 years ago
3
getColumn should throw exception if column does not exist
#331
Huandao0812
closed
8 years ago
3
Add inplace flag sort to sort operation
#330
ducleminh
closed
8 years ago
0
Add inplace flag to transformrserve methods.
#329
ducleminh
closed
8 years ago
0
Can get a DDF by sampling a Spark DDF by size
#328
nhanitvn
closed
8 years ago
3
Can get a DDF by sampling a Spark DDF by size
#327
nhanitvn
closed
8 years ago
2
Feature: Improve data ingestion
#326
ubolonton
closed
8 years ago
3
Add multiple-column sort functionality to DDF
#325
ducleminh
closed
8 years ago
5
Revert "Revert "Make transformUDFWithNames & castType & removeColumns return new DDF by default""
#324
Huandao0812
closed
8 years ago
0
makeVector to be a static function in Row2LabeledPoint
#323
phvu
closed
8 years ago
0
Revert "Make transformUDFWithNames & castType & removeColumns return new DDF by default"
#322
Huandao0812
closed
8 years ago
0