ESSECAnalytics / BDA

Course: Big Data Analytics (IDSS31165)
http://strategic-business-analytics-chair.essec.edu

Big Data Analytics

Professor: Nicolas Glady
Code: IDSS31165

Welcome to the GitHub repository of the course Big Data Analytics of the ESSEC and Accenture Strategic Business Analytics Chair.

Course Materials

  1. Syllabus
  2. Schedule
  3. Sessions
  4. Case
    • Data (note: the groups will be assigned before Session 3 as indicated in the syllabus.)

The course takes a dynamic approach, and what you are expected to do will be explained during the first session.

Please note that you are expected to have installed R and RStudio before the second session. You should also always download the data and script for a session before joining the class. In any case, you are invited to consult all the material in preparation for each class.

GitHub?

What the hell is GitHub?

GitHub is how people build software.
-- GitHub

GitHub is a web-based Git repository hosting service. It offers all of the distributed version control and source code management (SCM) functionality of Git as well as adding its own features. It provides access control and several collaboration features such as bug tracking, feature requests, task management, and wikis for every project.
-- Wikipedia

Umm OK, but what is Git then?

Git is a version control system (VCS) for tracking changes in computer files and coordinating work on those files among multiple people. It is primarily used for software development, but it can be used to keep track of changes in any files.
-- Wikipedia

In short, GitHub offers a web-based platform for working with Git, a version control system that software developers use to work in teams and keep track of the successive versions of a project's code.

Fine, but why should I need to use GitHub? I'm not a software developer.

For this course, we'll use GitHub as a platform for students to discuss code. Nicolas's R code is available in the Code tab, and you can discuss it in the Issues section. Don't hesitate to ask for help, propose improvements or share your tips. Remember that you should not share the solution, only tips, when answering other students' questions.

Perfect, tell me how to use GitHub.

It's simple.

Global ESSECAnalytics account
Repositories - All repositories hosted by the ESSEC Strategic Business Analytics Chair. This is where code is shared and discussions happen. Each course has its own repository.

BDA repository
Codes - Nicolas's code for the Big Data Analytics (BDA) course is available here.
Discussion - Where you can discuss the code, ask for help and share your thoughts. When opening a new issue, don't forget to add the label referring to the right session (Session 1, Session 5, etc.). You can also add a label to make your question more precise (help, question, bug, tips, etc.). To learn more about how to use this section, read the Issues Tutorial (10min). For past discussions see here. Remember: you should not share the solution, only tips, when answering other students' questions.

I mean... how to use GitHub in general o_O.

GitHub 101: Introduction to GitHub

Good. But it doesn't make sense to use GitHub without learning Git. Can I learn basic Git without wasting my whole day?

To use GitHub properly, you should follow this very simple tutorial. You will learn the basics of committing and branching. It takes 15 minutes to complete.
Simple and interactive Git tutorial for beginners (15min)
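
To give a rough idea of what "commit" and "branching" mean before you start the tutorial, here is a minimal sketch of a first Git session on the command line. It is not taken from the tutorial or from the course material: the file name analysis.R, the branch name experiment and the user identity are placeholders chosen for illustration, and the commands assume that Git is installed on your machine.

```shell
# Work in a throwaway directory so nothing on your machine is affected.
workdir="$(mktemp -d)"
cd "$workdir"

# Create an empty repository in the current directory.
git init -q .

# Identity recorded in commits, set for this repository only (placeholder values).
git config user.name "Student Example"
git config user.email "student@example.com"

# Create a file (hypothetical name), stage it, and record a first commit.
echo 'x <- c(1, 2, 3)' > analysis.R
git add analysis.R
git commit -q -m "Add analysis script"

# Create a branch and switch to it, change the file, and commit on that branch.
git checkout -q -b experiment
echo 'print(mean(x))' >> analysis.R
git commit -q -a -m "Print the mean"

# Show the history of the current branch, one commit per line.
git log --oneline
```

The last command should list two commits, the most recent first. The branch you started from still contains only the first commit, which is what makes branches useful for trying out a change without touching the main version.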

Not enough! I want to go further with Git / GitHub. Any tips?

Working with GitHub means following a Git workflow. You can learn the Git Flow here: Git Flow (5min).

For further information, you can refer to the Course Wiki

This wiki provides you with various resources (mostly about R and Git/GitHub). Feel free to ask for additional resources in the Discussion section of this repo.