jesbin / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 0 forks source link

Couldn't find tld-names.txt #207

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Hi,

When i try to execute the sample usage crawl, i am getting a error as couldnt 
fine tld-names.txt.

i am coppied all the files in my local machine.

what is the use of this tld-names.txt and where we need to keep this?

i have found this as file name defined in TLDList.java class.

Please help me out to resolve this issue.

Original issue reported on code.google.com by arunas...@gmail.com on 18 Mar 2013 at 1:54

Attachments:

GoogleCodeExporter commented 9 years ago
The file is visible in your attached screenshot - it resides in the same 
package as the java class.

Original comment by acrocraw...@gmail.com on 26 Mar 2013 at 8:41

GoogleCodeExporter commented 9 years ago
The file is used to determine valid host names during crawling.

Original comment by acrocraw...@gmail.com on 26 Mar 2013 at 8:42

GoogleCodeExporter commented 9 years ago
Not a bug or feature request

Original comment by avrah...@gmail.com on 11 Aug 2014 at 2:35