datacamp / Brand-Analysis-using-Social-Media-Data-in-R-Live-Training

Live Training Session: Brand Analysis using Social Media Data in R
MIT License

Problem importing and working with tweet datasets in csv and rds formats #2

Open vivekv73y opened 4 years ago

vivekv73y commented 4 years ago

Hi @adelnehme ,

I started working on the solution notebook in GitHub / Colab, and I am having trouble importing the datasets in both RDS and CSV formats.

With the CSV dataset, Colab does not see all the columns in the tweet datasets. There are 90 columns, which I can see in R, but Colab shows only 1 column. As a result, I am not able to continue with the subsequent steps in the solution file.

The solution file currently loads the CSV dataset. The RDS file does not load at all in Colab, even with the correct URL given as the path.

Can you please take a look and help me in resolving this issue? The solution file is called "brand_analysis_solution.ipynb" in the notebook folder.

The CSV and RDS files that I tried are in the data folder and are called tesladf.csv / tesla.rds and users_twt.csv / users1.RDS.

Thanks Vivek

adelnehme commented 4 years ago

Hi @vivekv73y :wave:

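Thanks for following up! The problem is that the datasets need to be imported through their raw links. The regular GitHub page URL returns the HTML page for the file rather than the data itself, which would explain why only 1 column shows up.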

For example, when reading a csv file - the link should be something along the lines of:

users_twt = read.csv("https://github.com/datacamp/Brand-Analysis-using-Social-Media-Data-in-R-Live-Training/blob/master/data/users_twt.csv?raw=true")

Here the query string ?raw=true is appended to the end of the URL. You can also get raw links using this method.
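As a minimal sketch of the raw-link approach above: the helper name to_raw_url is hypothetical (it is not part of the repository or of any package), and it simply appends raw=true to a GitHub file URL when it is missing. The read.csv() call is left commented out because it needs network access.

```r
# Hypothetical helper (an assumption for illustration, not from the repo):
# append raw=true to a GitHub "blob" URL unless it is already present.
to_raw_url <- function(blob_url) {
  if (grepl("raw=true", blob_url, fixed = TRUE)) {
    return(blob_url)
  }
  # Use "&" if the URL already carries a query string, otherwise "?".
  sep <- if (grepl("?", blob_url, fixed = TRUE)) "&" else "?"
  paste0(blob_url, sep, "raw=true")
}

csv_url <- to_raw_url(
  "https://github.com/datacamp/Brand-Analysis-using-Social-Media-Data-in-R-Live-Training/blob/master/data/users_twt.csv"
)

# Requires network access, so it is left commented out here:
# users_twt <- read.csv(csv_url)
# ncol(users_twt)  # a value of 1 suggests the HTML page was read, not the CSV
```

Checking ncol() right after the import is a quick way to tell whether the raw file or the GitHub HTML page was read.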

I'm not sure whether this applies to RDS files as well: using the second method from the Loom on an RDS file immediately downloads the file, and I get an error when I add raw=True to readRDS(). Would it be possible to use just CSV files for the session?
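For the RDS case, one possible workaround (untested against this repository) is to download the file to a temporary location in binary mode and then call readRDS() on the local copy. The helper name read_rds_from_url is hypothetical, and the RDS URL shown in the comment is an assumption built by analogy with the CSV link above.

```r
# Hypothetical helper (an assumption for illustration): download an RDS file
# to a temporary path and read it from there.
read_rds_from_url <- function(rds_url) {
  tmp <- tempfile(fileext = ".rds")
  on.exit(unlink(tmp))  # remove the temporary copy when the function exits
  # mode = "wb" keeps the binary RDS content intact on every platform.
  download.file(rds_url, destfile = tmp, mode = "wb", quiet = TRUE)
  readRDS(tmp)
}

# Assumed URL, following the same pattern as the CSV raw link; requires
# network access, so it is left commented out here:
# tesla <- read_rds_from_url(
#   "https://github.com/datacamp/Brand-Analysis-using-Social-Media-Data-in-R-Live-Training/blob/master/data/tesla.rds?raw=true"
# )
```

Sticking to CSV files, as suggested above, avoids this extra step entirely.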

Cheers,

Adel

vivekv73y commented 4 years ago

Hi @adelnehme ,

Thanks so much. This solution works fine with the CSV format.

I will stick to CSV format and avoid using RDS. I will let you know if any further support is needed.

Many thanks Vivek