liu431 / Big-Data-Project

CAPP 30123 Class Project
2 stars 5 forks source link
hadoop mrjob-dataproc programming-language sentiment-analysis stackoverflow

Big Data Analysis of the Developer Community

This project investigates the changing sentiments of developers on StackOverflow.

Binder

Project Information

Class: CAPP 30123 (Computer Science with Applications III) @ UChicago

Group name: HackyStacks

Group members: Adam Shelton, Dhruval Bhatt, Li Liu, Sanittawan Tan

Data sets: Stack Overflow Data

Quick Links

Table of Contents

1 Note: The two versions of the code below are almost identical. The main difference is the package used for Geocoding.

2 Note: Different bi-gram generating methods