-
I need to get the full page content (with HTML tags) in my committer. How can I do this?
For now I am getting just the text, without HTML tags and other information.
Maybe there is some class which provides that…
-
Hi,
Currently I have a situation where I want only the "main" content of an HTML document to be parsed. Like this:
```xml
text/html
``…
-
I have a list of objects like this:
``` java
{ "url": "http:www.example.com/url1", "class": "classA" },
{ "url": "http:www.example.com/url2", "class": "classB" },
{ "url": "http:www.example.com/…
-
Our website contains pages with ``, where "foo" is the url of the page itself. HTTP Collector 2.8.0 seems to reject these pages erroneously.
I made this minimal config:
```xml
/home/ron…
-
First, I want to thank you for this great collector.
Second, I want to know: is there a filtering criterion to filter extracted URLs by date, or to stop extraction when a given date is reached?
What I see now is th…
-
Hi,
I split the XML files generated for a zip file using
`
application/zip
`.
However, when I again ran the program in which the zip file was already exi…
-
This is the latest stable release afaik.
My config, not sure if I need these as well...
```
./forescout/wiki-output/logs
"FS HTTP Client"
ht…
-
Hi,
We have a situation where, instead of a default 404 error being returned, a custom page is rendered with a regular 200 status code. The only difference is that the title has a 404 error in …
-
Hi,
I have put a simple text file with a hash (#) in its name, like ``#1.txt``, and when I try to crawl it using the path **smb://localhost/shared/test**, it is not getting crawled. And when I try to c…
-
Hello Pascal,
I have a list of about 180,000 user profiles that I want to index, with each navigation page displaying 10 user profiles at a time. Each user list navigation page is tagged as noindex,…