Closed GoogleCodeExporter closed 9 years ago
How big is your single_urdu_file? If it's large can you test this out with a
much smaller file?
Only using 360mb of RAM doesn't necessarily indicate a problem, because sarse
elemental vectors and incremental reading from disk is designed to keep the
memory footprint small. But unless it's a huge huge file I'm surprised at 1000
minutes.
Original comment by dwidd...@gmail.com
on 14 Feb 2012 at 8:55
My file is only of 350 MB. I'll try it with a small file now, but I guess 350MB
is pretty small anyway ?
Original comment by manaal...@gmail.com
on 14 Feb 2012 at 8:59
ok, it ran with a smaller corpus to success. I guess it just needs more time !
Original comment by manaal...@gmail.com
on 14 Feb 2012 at 9:03
Another way to speed it up would be to use fewer dimensions. This is a trade
off between computational performance and semantic performance, of course.
I'm going to mark this as "Done" for now - we'd like things to go faster of
course, but at least we don't think there's a non-terminating loop causing a
bug somewhere.
Thanks for your patience!
Original comment by dwidd...@gmail.com
on 15 Feb 2012 at 5:40
Original issue reported on code.google.com by
manaal...@gmail.com
on 14 Feb 2012 at 7:16