-
- os: Kubuntu 23.10
- `lsd --version`: lsd 1.1.2
- `echo $TERM`: xterm-256color
- `echo $LS_COLORS`: *.7z=38;5;40:*.WARC=38;5;40:*.a=38;5;40:*.arj=38;5;40:*.br=38;5;40:*.bz2=38;5;40:*.cpio=38;5;40:…
-
According to [this](https://github.com/commoncrawl/nutch/issues/8#issuecomment-511756689) 2019 analysis, fully 1/3 of WarcRecordWriter's time is being spent in zlib.so. Cloudflare has a performance en…
-
See http://mobile.reuters.com/article/idUSKBN15906G and https://climatecrocks.com/2017/01/24/trump-to-epa-war-is-peace/
-
## Dev Effort
1D - investigation
## Description
We have a large number of PDFs that trigger a Java exception when JHOVE attempts to validate them. An example can be downloaded from: http…
-
#### Describe the bug
Hi there!
There's an XSS vulnerability when you open your index.html if you saved a page with a title containing an XSS vector.
#### Steps to reproduce
1. Save this page f…
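
The class of bug described above (a saved page's title written unescaped into a generated `index.html`) is usually mitigated by HTML-escaping untrusted values at render time. The following is a minimal sketch under that assumption, not the project's actual code: `render_index_entry` and its parameters are hypothetical names introduced for illustration.

```python
from html import escape


def render_index_entry(title: str, href: str) -> str:
    """Build one index list item for a saved page (hypothetical helper).

    Both values come from the saved page and are treated as untrusted,
    so they are HTML-escaped before being placed in markup.
    """
    safe_title = escape(title, quote=True)
    # Escaping keeps the value inside the attribute; it does not by itself
    # block dangerous URL schemes such as "javascript:".
    safe_href = escape(href, quote=True)
    return f'<li><a href="{safe_href}">{safe_title}</a></li>'


if __name__ == "__main__":
    # A title carrying a script tag is rendered as inert text.
    print(render_index_entry("<script>alert(1)</script>", "archive/1/index.html"))
```

Escaping at the point where the markup is assembled, rather than when the title is stored, keeps the stored data unchanged and makes the output safe regardless of how the title was captured.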
-
I have an ARC-enabled project (iOS 4.3+) where I implemented ASIHTTP with code blocks. To avoid retain cycles I have to use `__unsafe_unretained`. It works perfectly in debug mode + (release mode with …
-
We have an issue in warc2zim: when I configure wombat.js to not run inside a Service Worker (`isSW: false` in the call to `_WBWombatInit`), the YouTube player no longer works.
Since we…
-
I am having issues with the message "Disk utilization threshold reached 90%". This is my first time using the zimit image on Docker, and I can't complete a crawl of any website. I have deleted my co…
-
When a request is redirected to a new URL, the downloader middleware cannot resolve the redirect and will always return a 404 status code. An example of this:
```python
class ExampleSpider(Spider)…
```
-
Hi, I've been looking to run some crawls of my organisation's SharePoint/intranet site, but I'm having some issues getting through Microsoft 2FA.
Using --interactive successfully crea…