Add UCI Adult example - Githubissues

Tradeshift / blayze

A fast and flexible Naive Bayes implementation for the JVM

MIT License

19 stars 11 forks source link

To study the acc change with training size change. one can use the following test script.

@Test
    fun can_fit_uci_adult_dataset() {
        val train = uciAdult("adult.train.txt")
        val test = uciAdult("adult.test.txt")

        val index = arrayListOf(0, 5, 10, 15, 20, 25, 30, 40, 50, 60, 80, 100, 200, 500, 1000, 2000, 10000, train.size)
        var acc = 0.0
        var model = Model()

        for (i in 1 until index.size) {
            model = model.batchAdd(train.subList(index[i - 1], index[i]))
            acc = test
                    .parallelStream()
                    .map {
                        if (it.outcome == model.predict(it.inputs).maxBy { it.value }?.key) {
                            1.0
                        } else {
                            0.0
                        }
                    }
                    .toList()
                    .average()

            println(acc)
        }
        Assert.assertTrue("expected $acc > 0.83", acc > 0.83)
    }

Tradeshift / blayze

Add UCI Adult example #20