HtmlUnit / NHtmlUnit

.Net port of HtmlUnit
Other
88 stars 21 forks source link

Difference with HtmlUnit while mathjax script processing #32

Open ars18wrw opened 6 years ago

ars18wrw commented 6 years ago

I try to process web pages with mathml using mathjax via nhtmlunit

In java (using htmlunit) everything is fine. I use the next snippet:

WebClient webClient = new WebClient(BrowserVersion.CHROME);
HtmlPage page = webClient.getPage(new File(mathml).toURI().toURL());
webClient.waitForBackgroundJavaScript(300000 * 1000);
String pageAsXml = page.asXml();

and the next html - https://pastebin.com/hvJZmtnH The result is quite good : https://pastebin.com/kHzrM3rJ

However it's not so good in C# (NHtmlUnit):

The code i use:

WebClient webClient = new WebClient(BrowserVersion.CHROME);
webClient.Options.JavaScriptEnabled = true;
webClient.Options.Timeout = 30000;
webClient.Options.RedirectEnabled = true;
HtmlPage page = webClient.GetHtmlPage(mathml);
webClient.WaitForBackgroundJavaScript(50000000);
string pageAsXml = page.AsXml();

The result: https://pastebin.com/E5uXfUZ1

As you can see the javascript wasn't processed.

Please confirm that it's impossible to process such scripts via NHtmlUnit or point me out how to fix the issue.

asbjornu commented 3 years ago

I wouldn't expect this project to receive an update before IKVM is ported to .NET Core. See #35 for details.