emgrasmeder/emc-aws-data-project
Using Python, Boto, Pandas, and maybe Spark to analyze a bit of data on AWS
1 star, 1 fork
issues
#16 Is R/RStudio the way to go? It may be more streamlined (emgrasmeder, closed 9 years ago, 3 comments)
#15 What is a Mapper? What is a Reducer? <EMR> (emgrasmeder, opened 9 years ago, 2 comments)
#14 When is ec2 AMI necessary? (emgrasmeder, opened 9 years ago, 0 comments)
#13 Host a python file on S3 (emgrasmeder, opened 9 years ago, 0 comments)
#12 Repeated files uploaded with the extension ".py~" (emgrasmeder, closed 9 years ago, 1 comment)
#11 "examples" directory (emgrasmeder, opened 9 years ago, 0 comments)
#10 Upload data into AWS instance (yes, across multiple servers or whatever) and start data cleaning with Pandas (mcpeate, opened 9 years ago, 4 comments)
#9 Running OLS with python/pandas (emgrasmeder, opened 9 years ago, 0 comments)
#8 Puppet Show to Vent AWS Frustration (emgrasmeder, opened 9 years ago, 1 comment)
#7 Run "hello world" distributed across n aws instances with EMR/Hadoop (emgrasmeder, opened 9 years ago, 3 comments)
#6 make / understand .gitignore (emgrasmeder, opened 9 years ago, 3 comments)
#5 Programmatically log in with unique credentials safely (emgrasmeder, closed 9 years ago, 3 comments)
#4 PySpark / Spark (emgrasmeder, opened 9 years ago, 0 comments)
#3 Pytesting (emgrasmeder, opened 9 years ago, 0 comments)
#2 Make a pandas file (emgrasmeder, opened 9 years ago, 0 comments)
#1 Make repo look professional (emgrasmeder, opened 9 years ago, 0 comments)