ckan / datapusher

A standalone web service that pushes data files from a CKAN site's resources into its DataStore
GNU Affero General Public License v3.0

Incomplete Read exception #26

Closed maratsh closed 7 years ago

maratsh commented 10 years ago

I loaded a large (28 MB) file and got an exception after several minutes:

--------------------------------------------------------------------------------
[pid: 20171|app: 0|req: 5/5] 172.18.100.106 () {42 vars in 705 bytes} [Wed Feb  5 17:29:30 2014] GET /job/d0394bb1-1b72-4c3a-8c4c-7a7bc37ca784 => generated 446 bytes in 16 msecs (HTTP/1.1 200) 2 headers in 72 bytes (1 switches on core 0)
[pid: 20171|app: 0|req: 6/6] 172.18.100.106 () {42 vars in 705 bytes} [Wed Feb  5 17:29:35 2014] GET /job/d0394bb1-1b72-4c3a-8c4c-7a7bc37ca784 => generated 735 bytes in 19 msecs (HTTP/1.1 200) 2 headers in 72 bytes (1 switches on core 0)
[pid: 20171|app: 0|req: 7/7] 172.18.100.106 () {40 vars in 578 bytes} [Wed Feb  5 17:44:58 2014] POST /job => generated 497 bytes in 47 msecs (HTTP/1.1 200) 2 headers in 72 bytes (1 switches on core 0)
[pid: 20171|app: 0|req: 8/8] 172.18.100.106 () {40 vars in 578 bytes} [Wed Feb  5 17:45:04 2014] POST /job => generated 497 bytes in 26 msecs (HTTP/1.1 200) 2 headers in 72 bytes (1 switches on core 0)
Wed Feb  5 17:53:17 2014 - SIGPIPE: writing to a closed pipe/socket/fd (probably the client disconnected) on request /job/d0394bb1-1b72-4c3a-8c4c-7a7bc37ca784 (ip 172.18.100.106) !!!
--------------------------------------------------------------------------------
ERROR in scheduler [/var/www/ckan/devenv/lib/python2.6/site-packages/apscheduler/scheduler.py:520]:
Job "push_to_datastore (trigger: RunTriggerNow, run = True, next run at: None)" raised an exception
--------------------------------------------------------------------------------
Traceback (most recent call last):
  File "/var/www/ckan/devenv/lib/python2.6/site-packages/apscheduler/scheduler.py", line 512, in _run_job
    retval = job.func(*job.args, **job.kwargs)
  File "/var/www/ckan/devenv/src/datapusher/datapusher/jobs.py", line 261, in push_to_datastore
    f = cStringIO.StringIO(response.read())
  File "/usr/lib64/python2.6/socket.py", line 354, in read
    data = self._sock.recv(rbufsize)
  File "/usr/lib64/python2.6/httplib.py", line 522, in read
    return self._read_chunked(amt)
  File "/usr/lib64/python2.6/httplib.py", line 571, in _read_chunked
    value.append(self._safe_read(amt))
  File "/usr/lib64/python2.6/httplib.py", line 621, in _safe_read
    raise IncompleteRead(''.join(s), amt)
IncompleteRead: IncompleteRead(7483 bytes read, 709 more expected)

All nginx timeouts are set to half an hour, and max_file_size is set to 1G in both datapusher and CKAN.
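
For context on the traceback: the failing line reads the whole response body with a single `response.read()`, and `IncompleteRead` is raised when the connection closes before a chunk of the chunked response has fully arrived. Below is a minimal sketch of reading the body in pieces and keeping whatever was received, so the failure can at least be detected and reported. It assumes Python 3 (`http.client`; on Python 2 the exception lives in `httplib`), and `read_in_chunks` is a hypothetical helper, not something datapusher provides. It does not address why the upstream side closes the connection.

```python
import io
from http.client import IncompleteRead  # httplib.IncompleteRead on Python 2


def read_in_chunks(response, chunk_size=64 * 1024):
    """Copy a file-like HTTP response into memory piece by piece.

    Hypothetical helper for illustration only.
    Returns (buffer, complete). `complete` is False when the connection
    dropped mid-body; `buffer` then holds the bytes received so far.
    """
    buf = io.BytesIO()
    complete = True
    try:
        while True:
            chunk = response.read(chunk_size)
            if not chunk:  # empty read means end of body
                break
            buf.write(chunk)
    except IncompleteRead as exc:
        # exc.partial carries the bytes that arrived before the drop
        buf.write(exc.partial)
        complete = False
    buf.seek(0)
    return buf, complete
```

A caller could check `complete` and retry the download or fail the job with a clear message instead of an unhandled exception in the scheduler.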