Closed mzeidhassan closed 5 years ago
Yes you just have to set appropriate Sys.locale
For Arabic :
Sys.setlocale("LC_ALL","Arabic")
In this example we fetch extract titles
Rcrawler(Website = "https://government.ae/",no_cores = 4,no_conn = 4, ExtractXpathPat = "//*/h2")
Rcrawler v0.1.9 is just released with a lot of features , Subscribe to our mailing list to receive last updates http://eepurl.com/dMv_7s
Hi Recrawler team,
I am new to R and Recrawler. I would like to know if Recrawler can be used to scrape/crawl bilingual sites, let's say I have this English site: https://government.ae/en and this is the corresponding Arabic one: https://government.ae/ar-ae
How can I use Recrawler to get the bitext from them and save the output in tab-delimited file? Can you crawl only texts based on div tag, CSS selectors or maybe xpath?
Thanks