Computational-Content-Analysis-2020 / frequently-asked-questions-spring

Questions or doubts about organisation/code for Spring 2020 running of Computational Content Analysis.
0 stars 0 forks source link

Week0 code questions #13

Open Sunnyleee opened 4 years ago

Sunnyleee commented 4 years ago

I have two questions in week 0 code --

First, Section 1: why do we use the following part of the code? Why we have ‘ not a number, not a number,…’ image

Second, Section 3:

read PDF, WORD part, it always opens a file using a link, for example: image

Why we need to use links and how can I convert a PDF or word to HTML link?

I would be grateful for a prompt answer!

Sunny

bhargavvader commented 4 years ago

Hello @Sunnyleee 1) we use that to denote the part of the string which is not a number. the regular expression then finds the number, which is 21... which is right after "numbers". 2) This is just to demonstrate how to directly load pdfs and docs which might happen to be on the internet. You can also load files on your local (link, link)