mannau / tm.plugin.webmining

Retrieve structured, textual data from various web sources.
34 stars 10 forks source link

Failed to load External Entity GoogleNewsSource #14

Closed DFJL closed 8 years ago

DFJL commented 8 years ago

Hi Mannau.Thank you for this amazing package.I develope an aplication of web scrapping and text analyitics based on this package, with the Google News API, but now I want to put it in production I get the following error:

#Query
  elementsOtros<-c("george orwell","bob marley","barack obama","christopher nolan","jose mujica","lionel messi","hadley wickham","john chambers")

elements<-c(elementsLAFT,elementsOtros)

evaluate<-as.vector(as.matrix(elements))

TevLAFT<- WebCorpus(GoogleNewsSource(evaluate))

Error 1: Unknown IO error2: failed to load external entity "http://news.google.com/news?hl=en&q=lionel%20messi&ie=utf-8&num=100&output=rss
DFJL commented 8 years ago

Thank you very much Mario.

ctorrez commented 8 years ago

Hi Mannau. Thank you for your package, I am trying to use your package in my research project, but I get the following error: googlenews <- WebCorpus(GoogleNewsSource("Microsoft"))

Unknown IO errorfailed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss"

Error: 1: Unknown IO error2: failed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss"

The erros comes from this function parser <- function(cr) { tree <- parse(cr, type = "XML", asText = FALSE) nodes <- xpathSApply(tree, path = "//item") xmlns1 <- lapply(nodes, newXMLNamespace, "http://purl.org/dc/elements/1.1/", "dc") nodes } The parse function create the error.

Thanks,

ddten commented 6 years ago

How do you then check all the news as being positive or negative?

keshav-rathi commented 4 years ago

Hi Mannau. I am still getting the same error when using Google news and Yahoo Finance.

googlenews <- WebCorpus(GoogleNewsSource("Microsoft")) Unknown IO errorfailed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss" Error: 1: Unknown IO error2: failed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss"

yahoofinance <- WebCorpus(YahooFinanceSource("MSFT")) Unknown IO errorfailed to load external entity "http://feeds.finance.yahoo.com/rss/2.0/headline?s=MSFT&region=US&lang=en-US" Error: 1: Unknown IO error2: failed to load external entity "http://feeds.finance.yahoo.com/rss/2.0/headline?s=MSFT&region=US&lang=en-US"

keshav-rathi commented 4 years ago

Hi Mannau. I am still getting the same issue when using GoogleNews and YahooFinance.

googlenews <- WebCorpus(GoogleNewsSource("Microsoft")) Unknown IO errorfailed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss" Error: 1: Unknown IO error2: failed to load external entity "http://news.google.com/news?hl=en&q=Microsoft&ie=utf-8&num=100&output=rss"

yahoofinance <- WebCorpus(YahooFinanceSource("MSFT")) Unknown IO errorfailed to load external entity "http://feeds.finance.yahoo.com/rss/2.0/headline?s=MSFT&region=US&lang=en-US" Error: 1: Unknown IO error2: failed to load external entity "http://feeds.finance.yahoo.com/rss/2.0/headline?s=MSFT&region=US&lang=en-US"

Please help