dgrtwo / tidy-text-mining

Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
http://tidytextmining.com

Mining financial articles - Error in mutate_impl(.data, dots) : 1: failed to load HTTP resource #32

Closed: t1ber1us closed this issue 6 years ago

t1ber1us commented 7 years ago

Hi,

I get the error above in section 5.3.1 when executing the code to download articles for technology stocks. I've both typed it in myself and copied and pasted the text from here, with the same result. I'm guessing this is something in the wider setup of my PC or the world, but I would be grateful for any thoughts you have on how I might fix it.

Thanks,

Ian

juliasilge commented 7 years ago

I believe this is an error from the XML package downloading the finance data; sounds like R is not successfully accessing that data. You might need to do some debugging around accessing the internet through R? Also perhaps try just doing a few of the companies; I have heard some people have trouble with all of those, especially Twitter and IBM.

If it makes you feel better, someone else also got this error when trying to run the code from the book.
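
A minimal sketch of the "try just a few of the companies" idea, in case it helps with debugging. It wraps the download step in `purrr::possibly()` so that one failing symbol does not abort the whole `mutate()` call, and it records which symbols failed. Note that `download_articles()` here is a hypothetical stand-in that simulates a network failure for two symbols; it is not the book's downloader, and the company list is shortened for illustration.

```r
library(dplyr)
library(purrr)
library(tibble)

# Hypothetical stand-in for the real download function (an assumption, not
# the book's code): it fails for two symbols to mimic the HTTP error.
download_articles <- function(symbol) {
  if (symbol %in% c("TWTR", "IBM")) {
    stop("failed to load HTTP resource")
  }
  paste("articles for", symbol)
}

# possibly() returns a wrapped function that yields `otherwise` on error
# instead of stopping the whole pipeline.
safe_download <- possibly(download_articles, otherwise = NULL)

stock_articles <- tibble(
  company = c("Microsoft", "Apple", "Twitter", "IBM"),
  symbol  = c("MSFT", "AAPL", "TWTR", "IBM")
) %>%
  mutate(
    corpus     = map(symbol, safe_download),
    downloaded = !map_lgl(corpus, is.null)
  )

# Symbols whose download failed, to retry or investigate one at a time
failed <- stock_articles$symbol[!stock_articles$downloaded]
```

With the real downloader in place of the stand-in, the `failed` vector would show which companies trigger the error, which narrows down whether the problem is general internet access from R or specific to a few sources.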

rizwaan commented 7 years ago

Looks like there is some sort of limit on the size of the corpus. The code works if there are only 5 elements in the company vector. Couldn't investigate any further, though.

chankamiperera commented 7 years ago

Hi, when I try the code mentioned in the topic modeling part, I get an error like this:

Error in LDA(dtm, k, method = "Gibbs", control = list(nstart = nstart, : Each row of the input matrix needs to contain at least one non-zero entry

Please help me fix it. Thank you.

juliasilge commented 7 years ago

Hello @chankamiperera! I don't think you are using code from our book, because I don't believe we have any Gibbs method fitting at all. If you are running into an error that we might be able to help with, can you make a small, reproducible example to show the error? You also might consider posting your problem on Stack Overflow, because the function LDA() is from the topicmodels package, not our tidytext package, so you might get faster help (or at least help from more people) there. I have SO questions tagged tidytext on a feed so I will always see them.
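
For anyone who lands here with the same message: the error text itself says that at least one row (document) of the matrix has no non-zero counts. One common way that happens is removing terms after building the document-term matrix. The sketch below is only an illustration under that assumption, with made-up toy documents, and uses the tm and slam packages rather than anything from tidytext; it is not a diagnosis of the specific code above.

```r
library(tm)    # DocumentTermMatrix(), Terms(), nDocs()
library(slam)  # row_sums() for sparse matrices

# Toy documents (made up for illustration)
docs <- c("apple banana apple", "cherry", "banana banana")
dtm <- DocumentTermMatrix(VCorpus(VectorSource(docs)))

# Dropping a term (here "cherry") leaves the second document with no terms,
# which is the situation the error message describes.
dtm_small <- dtm[, Terms(dtm) != "cherry"]

# Keep only the documents that still contain at least one term
nonempty <- row_sums(dtm_small) > 0
dtm_ok <- dtm_small[nonempty, ]
```

A matrix filtered this way has no all-zero rows, so it should get past that particular check in `LDA()`. Keep in mind that the dropped documents will then be missing from the model output.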