-
Hello,
API on related tools has changed quite a bit over 2 years and the project doesn't work out of the box anymore (related to the fact the versions of related tools were not pinned on the requir…
-
Needed to exercise and validate the new Python test suite.
George recommended we look at: Expect or DejaGnu.
-
JIRA Issue: [KIEKER-1335] Crawler checking for dead links on the web site?
Original Reporter: Andre van Hoorn
***
Maybe some free online services exist? Otherwise should be easy to implement with wg…
-
-
def ofd_to_pdf(self, ofd_file_path):
output_pdf_path = ofd_file_path[:-4] + ".pdf"
with open(ofd_file_path, "rb") as f:
ofdb64 = f.read()
ofd = OFD() #…
-
Importance: 3/5.
Currently our site doesn’t show up on search engines even though we have very relevant/the best information. This is an issue with our React framework but we should figure it out …
-
```
An enhanced version of the crawler, that has very clean code and does no longer
rely on ugly os-commands (getstatusoutput).
This will be achieved by using pysmbc as requested in Issue 2 (Which wi…
-
Hello, I'm experiencing performance issues with my web crawler after approximately 1.5 to 2 hours of runtime. The crawling speed significantly decreases to about one site per minute or less, and I'm e…
-
### Repository commit
ac3bd1032c02ff5c2f6eb16f2bf5a1b24d106d1c
### Python version (python --version)
3.12.0
### Dependencies version (pip freeze)
beautifulsoup4==4.12.2
certifi==2023.7.22
chars…
-
```js
const HCCrawler = require('headless-chrome-crawler');
const JSONLineExporter = require('headless-chrome-crawler/exporter/json-line');
const FILE_PATH = 'C:\\git\\examples\\result.csv';
c…