laurentprudhon / nlptextdoc

Suite of tools to extract and annotate language resources for NLP applications
Other
1 stars 2 forks source link

UriFormatException in Abot.Core.PageRequester.MakeRequest #22

Closed laurentprudhon closed 5 years ago

laurentprudhon commented 5 years ago

fatal: Error occurred during processing of page [http://www.leparisien.fr/bouches-du-rhone-13/deces-a-marseille-d-henri-germain-delauze-fondateur-de-la-comex-et-pionnier-de-la-plongee-industrielle-14-02-2012-1861992.php] fatal: System.UriFormatException: Invalid URI: The format of the URI could not be determined. at System.Uri.CreateThis(String uri, Boolean dontEscape, UriKind uriKind) at System.Uri..ctor(String uriString) at Abot.Core.PageRequester.MakeRequest(Uri uri, Func`2 shouldDownloadContent) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Core\PageRequester.cs:line 106 at Abot.Crawler.WebCrawler.CrawlThePage(PageToCrawl pageToCrawl) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Crawler\WebCrawler.cs:line 888 at Abot.Crawler.WebCrawler.ProcessPage(PageToCrawl pageToCrawl) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Crawler\WebCrawler.cs:line 675

fatal: Error occurred during processing of page [https://www.lemonde.fr/_rprt/p-VDGieT2h0PwvfnZOwc1n0B8uz1jf6VCN3hk8iw2VAxuCqYdjF6cqK-QxPiJz6Oelw7VWmRSieCUcwKzsY4E7j_5i4EgNDsSLW6BdzsR8fAapN-wjGj_5rHg51Upt9nRODgWUFh-5xdwwi7yZtN0aKm81wPefx0aaj5JsYMTDhNcomnTDPn4sJAhY0pK8d_KHoOg2hblo8TgQoU83U7fV5WPrhKboB5yq2HbCZbEGYiYM4xHwcw6OROEUG9ITbhGxvGP7cDp0xu_vOE6-ucoJa27c60mhVWJLw_nqU2PIWV0rS8zArlBCZderouu478LEqCOSUvZWFbPurleQ4vHiTqdUIhG48bd73m8Alnop-ReK_zCo-Y5l-6bDV1UHr7ZAllVTSE0dBigD_baxPQDM0-XDe2RzS07j2y3A?p=awin&s=] fatal: System.UriFormatException: Invalid URI: The format of the URI could not be determined. at System.Uri.CreateThis(String uri, Boolean dontEscape, UriKind uriKind) at System.Uri..ctor(String uriString) at Abot.Core.PageRequester.MakeRequest(Uri uri, Func`2 shouldDownloadContent) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Core\PageRequester.cs:line 106 at Abot.Crawler.WebCrawler.CrawlThePage(PageToCrawl pageToCrawl) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Crawler\WebCrawler.cs:line 888 at Abot.Crawler.WebCrawler.ProcessPage(PageToCrawl pageToCrawl) in C:\Users\laure\OneDrive\Dev\C#\nlptextdoc\nlptextdoc.extract.dependencies\Abot\Crawler\WebCrawler.cs:line 675