The original langdetect in C++ has a very nice "early abort" efficiency optimization. Could "detect" accept some form of lazy-loading (I'd suggest being able to pass a python file object), so it can read only as much of a file as it needs to detect the language the file is in?
The original langdetect in C++ has a very nice "early abort" efficiency optimization. Could "detect" accept some form of lazy-loading (I'd suggest being able to pass a python file object), so it can read only as much of a file as it needs to detect the language the file is in?