Open MSZ-MGS opened 1 year ago
I think this is already possible using the n_threads parameter when initializing the GPT4All model. Usage and tuning is shown in this issue (they are technically using GPT4All through Langchain but it works the same directly see here line 70). I'll admit I'm not familiar with LocalDocs but you should find a n_threads or similar parameter there.
Issue you'd like to raise.
Whenever I change the advanced settings of the LocalDocs e.g. number of the document snippets per prompt or snippet size, the G4A got freeze for several minutes. I think the G4A is reprocessing all documents again. Anyhow, when I checked the TaskManager, I noticed that the processing load is only on one of the CPU cores as shown below:
Suggestion:
If it is possible, multi-thread the load over the number of cores available.