issues
search
nichollsbr
/
cmsc611-project
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Rewrite Basic-* run scripts
#19
nichollsbr
closed
5 years ago
0
Dataset Test
#18
nichollsbr
opened
5 years ago
0
#3 basic dataset
#17
nichollsbr
closed
5 years ago
0
Fix DateFormat for both dataframes and datasets
#16
nichollsbr
opened
5 years ago
0
Record Total Runtime
#15
nichollsbr
closed
5 years ago
1
Dataframe Test
#14
nichollsbr
opened
5 years ago
0
Basic dataframe and data cleanup
#13
nichollsbr
closed
5 years ago
0
Only log WARN or more
#12
nichollsbr
closed
5 years ago
1
Make sure headers of csv files are not being included in rdd/dataframe
#11
nichollsbr
closed
5 years ago
1
Update jobs to cache data in memory and on hard drive
#10
nichollsbr
opened
5 years ago
0
Update jobs so the data is repartitioned
#9
nichollsbr
closed
5 years ago
0
Update jobs so mapPartitions calls toList to bring everything into memory
#8
nichollsbr
opened
5 years ago
0
Update jobs to compare map vs mapPartitions
#7
nichollsbr
closed
5 years ago
0
Create python notebook to analyze count results
#6
nichollsbr
opened
5 years ago
0
Everyone runs count job locally and upload results to google drive
#5
nichollsbr
opened
5 years ago
0
Update README to include run instructions
#4
nichollsbr
closed
5 years ago
0
Count using Dataset
#3
nichollsbr
closed
5 years ago
0
Count using Dataframe
#2
nichollsbr
closed
5 years ago
1
Test Task Metrics Configuration
#1
nichollsbr
closed
5 years ago
1