-
```
An enhanced version of the crawler that has very clean code and no longer
relies on ugly OS commands (getstatusoutput).
This will be achieved by using pysmbc as requested in Issue 2 (Which wi…
```
-
Importance: 3/5.
Currently, our site doesn’t show up on search engines even though we have very relevant (arguably the best) information. This is an issue with our React framework, but we should figure it out …
-
def ofd_to_pdf(self, ofd_file_path):
    output_pdf_path = ofd_file_path[:-4] + ".pdf"
    with open(ofd_file_path, "rb") as f:
        ofdb64 = f.read()
    ofd = OFD() #…
-
JIRA Issue: [KIEKER-1335] Crawler checking for dead links on the web site?
Original Reporter: Andre van Hoorn
***
Maybe some free online services exist? Otherwise should be easy to implement with wg…
-
```js
const HCCrawler = require('headless-chrome-crawler');
const JSONLineExporter = require('headless-chrome-crawler/exporter/json-line');
const FILE_PATH = 'C:\\git\\examples\\result.csv';
c…
```
-
### Repository commit
ac3bd1032c02ff5c2f6eb16f2bf5a1b24d106d1c
### Python version (python --version)
3.12.0
### Dependencies version (pip freeze)
beautifulsoup4==4.12.2
certifi==2023.7.22
chars…
-
Hello, I'm experiencing performance issues with my web crawler after approximately 1.5 to 2 hours of runtime. The crawling speed significantly decreases to about one site per minute or less, and I'm e…
-
https://github.com/mozilla/coverage-crawler/commit/f164a6de4a961277cb4006c6939290526bf5c955
https://github.com/mozilla/coverage-crawler/commit/7e4218a61d06da26477b6d740cd643f1808d29db?w=1
-
Hi there,
I just discovered this project today, via the NGI Search newsletter, and I noticed the `green_energy` property on various regions.
It wasn't obvious to me where this is defined, and I…