botpress / nlu

This repo contains all ML/NLU-related code written by Botpress for the Node.js environment, including the Botpress Standalone NLU Server.

svm training improvement 4 #92

Closed franklevasseur closed 2 years ago

franklevasseur commented 2 years ago

About

This PR is part of a sequence of PRs named svm training improvement $n, each presenting an improvement or a combination of improvements intended to make training faster and consume less memory.

⚠️⚠️ Do not merge this PR: we first need to compare it with the other attempts. ⚠️⚠️

Description

This PR essentially reverts #88 and its commit:

fix(nlu-engine): launch svm trainings one after the other (cec8a7a7b8f28bb567d80050e4f3b3e529d0f558)

There's no real need to run the grid search serially, as the number of folds is greatly reduced by #91.
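To illustrate the difference between the two launch strategies, here is a minimal sketch. It is not the code from this repository: `GridPoint`, `trainOne`, `gridSearchSerial`, `gridSearchConcurrent` and `best` are hypothetical names, the "training" is a placeholder that just waits and returns a made-up score, and it assumes each training is asynchronous (for example, dispatched to a worker thread or process).

```typescript
// Hypothetical grid point and result types, for illustration only.
type GridPoint = { C: number; gamma: number }
type TrainResult = { point: GridPoint; score: number }

const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms))

// Placeholder for one SVM training: waits a bit and returns a made-up score.
// A real training would be CPU-bound and run outside the main thread.
async function trainOne(point: GridPoint): Promise<TrainResult> {
  await sleep(10)
  const score = 1 / (1 + Math.abs(point.C - 1) + Math.abs(point.gamma - 0.1))
  return { point, score }
}

// Serial: each training starts only after the previous one has finished.
async function gridSearchSerial(grid: GridPoint[]): Promise<TrainResult[]> {
  const results: TrainResult[] = []
  for (const point of grid) {
    results.push(await trainOne(point))
  }
  return results
}

// Concurrent: all trainings are launched at once and awaited together.
async function gridSearchConcurrent(grid: GridPoint[]): Promise<TrainResult[]> {
  return Promise.all(grid.map((point) => trainOne(point)))
}

// Picks the result with the highest score.
function best(results: TrainResult[]): TrainResult {
  return results.reduce((a, b) => (b.score > a.score ? b : a))
}
```

Both variants return the same results in the same order. The serial one keeps at most one training in flight, which bounds peak memory, while the concurrent one trades memory for wall-clock time, a trade that becomes cheaper when there are fewer trainings to launch.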

Performance

On clinc150 using local lang server with dimension 100:

| branch | memory used (MB) | time to train (s) |
| --- | --- | --- |
| master | ~800 | 101 |
| this | ~700 | 82 |

On John Doe* using remote lang server https://lang-01.botpress.io:

| branch | memory used (GB) | time to train (min) |
| --- | --- | --- |
| master | ~40 | 20 |
| this | ~1.8 | 5 |