rug-compling / alpinocorpus

Library for handling Alpino corpora
GNU Lesser General Public License v2.1
8 stars 1 forks source link

Caching in RemoteCorpusReader without query #19

Closed pebbe closed 11 years ago

pebbe commented 12 years ago

At the time of writing, this is in branch 'RemoteCorpus'.

When asking RemoteCorpusReader for the list of entries without query, and then the process is interrupted, and then later it is done again, RemoteCorpusReader restarts retrieving entries from the server from the beginning.

Should RemoteCorpus be changed, to start retrieving entries from the point it was cut off in the previous run?

pebbe commented 12 years ago

I implemented this with a 'start' parameter for the server.py in alpinocorpus-python.

Next question: in alpinocorpus-python, the skipping is done in server.py. Should it be done in the C-code called by server.py instead?

danieldk commented 11 years ago

I don't think this should be done in the C code, since the iterators are forward iterators that do not have such notions.

Closing this bug now...