Social data science exam project
Project description
Group 23 - Thamina Rauf, Tróndur Møller Sandoy, Martin Schondelmaier, and Rasmus Ruhnau
Carpooling in Denmark
In our exam project we will analyse the carpooling in Denmark, by using data from the service GoMore, which facilitates shared rides between users and allows for users to rent out their car to other users.
Data
For the project we will use data from GoMore.dk. Each registered user has entered information on their hometown, profession and the site shows data on number of shared trips, number of created trips among other things. The profiles are publicly available and by scraping the more than 900.000 profiles, we will be able to obtain a very nice dataset on carpooling in Denmark.
For each ride offered on gomore.dk, there is a site with information on that ride. These can also be scraped to obtain a dataset with distance, start/stop destination, number of people riding along and price for each ride on gomore.dk. This information is limited to recent rides.
Analysis
In the project we will analyze several things from out dataset. Among these we want to examine which regions of Denmark are the most active carpoolers, how many trips the users go on in different regions, and the rate between listed trips and trips that actually have other users on them.
With the information from the dataset on rides, we want to use a regression model to predict 1) the price level on certain routes at certain times, and 2) the probability that a user offering a ride, will find someone to ride along.
Project
The project will be divided into several different parts:
-Introduction
This chapter will serve to introduce the problem and the data source, providing key background info for the project
-Method
This chapter will describe the method used to scrape and clean the data from GoMore.dk
-Analysis
This chapter will include the different analyses of the project, along with figures to show the results
-Conclusion
This chapter will include concluding remarks of the analysis
Social data science exam project Project description Group 23 - Thamina Rauf, Tróndur Møller Sandoy, Martin Schondelmaier, and Rasmus Ruhnau
Carpooling in Denmark In our exam project we will analyse the carpooling in Denmark, by using data from the service GoMore, which facilitates shared rides between users and allows for users to rent out their car to other users. Data For the project we will use data from GoMore.dk. Each registered user has entered information on their hometown, profession and the site shows data on number of shared trips, number of created trips among other things. The profiles are publicly available and by scraping the more than 900.000 profiles, we will be able to obtain a very nice dataset on carpooling in Denmark. For each ride offered on gomore.dk, there is a site with information on that ride. These can also be scraped to obtain a dataset with distance, start/stop destination, number of people riding along and price for each ride on gomore.dk. This information is limited to recent rides. Analysis In the project we will analyze several things from out dataset. Among these we want to examine which regions of Denmark are the most active carpoolers, how many trips the users go on in different regions, and the rate between listed trips and trips that actually have other users on them. With the information from the dataset on rides, we want to use a regression model to predict 1) the price level on certain routes at certain times, and 2) the probability that a user offering a ride, will find someone to ride along. Project The project will be divided into several different parts: -Introduction This chapter will serve to introduce the problem and the data source, providing key background info for the project -Method This chapter will describe the method used to scrape and clean the data from GoMore.dk -Analysis This chapter will include the different analyses of the project, along with figures to show the results -Conclusion This chapter will include concluding remarks of the analysis