Closed kazemcodes closed 1 year ago
I think the issue is because of this
can skrapeit bypass Cloudflare protection?
I think the issue is because of this
the issue is already fix in the html-unit-2.59.0-SNAPHSHOT but there is a new problem in that build, not all sites throw this exception some complex url throws it for example https://pstbn.top/?c865fde3461094d1#2hAyyUKtXm72BHLzyzhq7UBug9YMP1FCgnJccA8YyQ2n
Ok I see. Thx for finding. I will check if we have everything that @rbri suggests, if not I will add it and make new release
Will have a look at https://pstbn.top/?c865fde3461094d1#2hAyyUKtXm72BHLzyzhq7UBug9YMP1FCgnJccA8YyQ2n.
I got here
ScriptException: missing ; before statement (https://pstbn.top/js/zlib-1.2.11.js#6)
Do you see the same?
I got here
ScriptException: missing ; before statement (https://pstbn.top/js/zlib-1.2.11.js#6)
Do you see the same?
I only got this exception
java.lang.NoClassDefFoundError: Failed resolution of: Ljava/awt/datatransfer/ClipboardOwner
I have a question regarding htmlunit, is there any way to make headless browser suspend it request until a certail html tag or some criterial fullfit before fetching the htmls something like this func
I got here
ScriptException: missing ; before statement (https://pstbn.top/js/zlib-1.2.11.js#6)
Do you see the same?
I only got this exception
java.lang.NoClassDefFoundError: Failed resolution of: Ljava/awt/datatransfer/ClipboardOwner
I have a question regarding htmlunit, is there any way to make headless browser suspend it request until a certail html tag or some criterial fullfit before fetching the htmls
actually skrapeit is using the htmlunit 2.59.0 which throws this exception, higher versions actually required higher api which is android O, I havent tested that version
i guess this api requirement has something to do with changes in Rhino
Any news about this issue? I got this error while using BrowserFetcher
java.lang.NoSuchFieldError: No static field INSTANCE of type Lorg/apache/http/conn/ssl/AllowAllHostnameVerifier; in class Lorg/apache/http/conn/ssl/AllowAllHostnameVerifier; or its superclasses (declaration of 'org.apache.http.conn.ssl.AllowAllHostnameVerifier' appears in /system/framework/framework.jar!classes3.dex)
Are you using latest version of skrapeit (1.2.1)?
Since recently fixes the issue for other people, e.g. here https://github.com/skrapeit/skrape.it/issues/185#issuecomment-1145827982
Are you using latest version of skrapeit (1.2.1)?
Since recently fixes the issue for other people, e.g. here #185 (comment)
I just tried with 1.2.1 and got this error
Execution failed for task ':app:mergeDebugJavaResource'.
> A failure occurred while executing com.android.build.gradle.internal.tasks.MergeJavaResWorkAction
> 2 files found with path 'mozilla/public-suffix-list.txt' from inputs:
- /Users/yusuf/.gradle/caches/transforms-3/f245c43f9945c78889e5173b03033420/transformed/jetified-htmlunit-android-2.58.0.jar
- /Users/yusuf/.gradle/caches/transforms-3/96071c01f90e37e991c04a7f8de1ffc4/transformed/jetified-httpclient-4.5.6.jar
Adding a packagingOptions block may help, please refer to
https://google.github.io/android-gradle-dsl/current/com.android.build.gradle.internal.dsl.PackagingOptions.html
for more information
* Try:
Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output. Run with --scan to get full insights.
Adding packagingOptions
and invalid cache - restart did not help
There is also a SO question about this but there is not any answer yet
Are you using latest version of skrapeit (1.2.1)? Since recently fixes the issue for other people, e.g. here #185 (comment)
I just tried with 1.2.1 and got this error
Execution failed for task ':app:mergeDebugJavaResource'. > A failure occurred while executing com.android.build.gradle.internal.tasks.MergeJavaResWorkAction > 2 files found with path 'mozilla/public-suffix-list.txt' from inputs: - /Users/yusuf/.gradle/caches/transforms-3/f245c43f9945c78889e5173b03033420/transformed/jetified-htmlunit-android-2.58.0.jar - /Users/yusuf/.gradle/caches/transforms-3/96071c01f90e37e991c04a7f8de1ffc4/transformed/jetified-httpclient-4.5.6.jar Adding a packagingOptions block may help, please refer to https://google.github.io/android-gradle-dsl/current/com.android.build.gradle.internal.dsl.PackagingOptions.html for more information * Try: Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output. Run with --scan to get full insights.
Adding
packagingOptions
and invalid cache - restart did not helpThere is also a SO question about this but there is not any answer yet
Putting public-suffix to packaging-options solve the compilation error but this time got the same error with @kazemcodes
E/AndroidRuntime: at kotlinx.coroutines.scheduling.CoroutineScheduler.runSafely(CoroutineScheduler.kt:570)
at kotlinx.coroutines.scheduling.CoroutineScheduler$Worker.executeTask(CoroutineScheduler.kt:749)
at kotlinx.coroutines.scheduling.CoroutineScheduler$Worker.runWorker(CoroutineScheduler.kt:677)
at kotlinx.coroutines.scheduling.CoroutineScheduler$Worker.run(CoroutineScheduler.kt:664)
Suppressed: kotlinx.coroutines.DiagnosticCoroutineContextException: [StandaloneCoroutine{Cancelling}@7073866, Dispatchers.Main.immediate]
Caused by: java.lang.ClassNotFoundException: Didn't find class "java.awt.datatransfer.ClipboardOwner" on path: DexPathList[[zip file "/data/app/com.project.skrapeplayground-1r7tHui0F1lYklkZgTKUlQ==/base.apk"],nativeLibraryDirectories=[/data/app/com.project.skrapeplayground-1r7tHui0F1lYklkZgTKUlQ==/lib/arm64, /system/lib64, /hw_product/lib64, /system/product/lib64]]
at dalvik.system.BaseDexClassLoader.findClass(BaseDexClassLoader.java:209)
at java.lang.ClassLoader.loadClass(ClassLoader.java:379)
at java.lang.ClassLoader.loadClass(ClassLoader.java:312)
... 49 more
Recently, htmlunit-android release a new snapshot , that fixed this problem, but right now it requires at least android O as minimum api requirement
please update the html unit to latest snapshot, this problem is fixed in last snapshot
net.sourceforge.htmlunit:htmlunit-android:2.63.0-SNAPSHOT
htmlunit-android:2.63.0 was released some days ago (https://twitter.com/htmlunit)
Big Thx for the great work @rbri. I will bump the version in browserfetcher and release ne version of skrape it
it's a pleasure
skrapeit patch version 1.2.2 has just been published to maven central
Here is the error I get when using BrowseFetcher I think the error is beacuse of hunit-android