spritt82 / harvestman-crawler

Automatically exported from code.google.com/p/harvestman-crawler
0 stars 0 forks source link

Error: "I/O operation on closed file" when running the crawler on the same site twice. #19

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. svn up
2. run in the shell: "harvestman -C config-sample.xml"
3. run the same command.

What is the expected output? What do you see instead?

#############
# Expected: #
#############

[hh:mm:ss] *** Log Completed ***

Writing project statistics to crawl database...
Done.
HarvestMan session finished.

###########
# Actual: #
###########

HarvestMan session finished.
[20:37:06] Waiting for threads to finish up...
Traceback (most recent call last):
  File "/usr/lib/python2.5/logging/__init__.py", line 750, in emit
    self.stream.write(fs % msg)
ValueError: I/O operation on closed file
[20:37:06] Done.
Traceback (most recent call last):
  File "/usr/lib/python2.5/logging/__init__.py", line 750, in emit
    self.stream.write(fs % msg)
ValueError: I/O operation on closed file
stefan@vamaveche:~/temp/crawling_test$ [20:37:04] *** Log Completed ***

[1]+  Stopped                 [20:37:04] *** Log Completed ***

What version of the product are you using? On what operating system?

revision 83.

Ubuntu 8.04 x86_64

Please provide any additional information below.

I checked the write access for the directory where I did the tests and I
have full writes. 

The config.xml file is attached.

Original issue reported on code.google.com by andrei.p...@gmail.com on 22 Jul 2008 at 5:46

Attachments:

GoogleCodeExporter commented 9 years ago

Original comment by abpil...@gmail.com on 6 Oct 2008 at 11:24