-
```
Web Server: Tomcat
OS: Ubuntu Linux server
Techs: jQuery, JS, Ajax, css, monitoring tools
Additional struts action classes should also be developed to react to the web
client.
```
Original issue…
-
# Summary
I understand Lacus can fetch content from both Tor websites and the normal internet.
An installation without configuring Tor will make the built-in test fail, when in reality web conte…
ajoga updated
1 month ago
-
The google crawler is currently downloading the mrc files as a part of indexing the site. To prevent this from impacting the page score.
Impact: Not fixing this will impact the SEO of the page
-
As a search engine, we should build a general web crawler for internet. It could do:
* find undiscovered website URL
* find schema.org recipe type from undiscovered URL
Please note that this kind…
-
`hoarder_workers | 2024-08-23T19:24:44.650Z error: [Crawler] Failed to connect to the browser instance, will retry in 5 secs
hoarder_workers | 2024-08-23T19:24:49.651Z info: [Crawler] Conne…
-
If for some resources the crawler encounters a ZIM file on a web property, we should immediately block it so that it is not included inside the WARC and then inside the ZIM.
This is probably a page…
-
- This issue will probably be broken up into several
- Should probably email the Professor for advice
-
Having a clicky interface has been a goal for a long time now. There are many users who abhor the command line but are still interested in the tools that use them.
* The remnants of a TkInter inter…
-
**Describe the bug**
I have run Kendra Web Crawler and confirmed that the web crawl is successful, but the SNS (KendraCrawlerSNSTopic) that triggers the CrawlerLambda is not triggered.
https://githu…
-