Encoding issue - Githubissues

What steps will reproduce the problem?
1. Process the site with Windows-1251 encoidng (for example applico.ru)
2. Get the ShouldCrawlPageLinks event's crawledPage.RawContent or 
crawledPage.HtmlDocument.DocumentNode.OuterHtml values
3. Strings contain many "??????" substrings

What is the expected output? What do you see instead?
Strings should contain UTF encoded characters. They contain "?" characters 
instead 

What version of the product are you using? On what operating system?
Version Abot v1.1.1.0, 2012

Please provide any additional information below.
The issue can be resolved by changing the PageRequester.GetRawHtml method:
            try
            {
                Encoding encoding = Encoding.GetEncoding(response.CharacterSet);
                using (StreamReader sr = new StreamReader(response.GetResponseStream(), encoding))
                {
                    rawHtml = sr.ReadToEnd();
                    sr.Close();
                }
            }

Original issue reported on code.google.com by elisy....@gmail.com on 3 Jan 2014 at 7:34

Merged into: #112

abhishekbhalani / abot

Encoding issue #123