-
Caret allows optimizing the parameter min.node.size of ranger, but not the parameter nodesize of randomForest package. Is there some rationale for deciding which parameters are optimizable as hyperpar…
-
Running:
python main.py --config cornell-movie-dialogs --mode train
to the end (100000 steps) will result in a training loss of about 2.6, test loss of 8.4.
Which hyperparameters did you use? T…
lk251 updated
6 years ago
-
I'm messing around with Concrete ML these days and I was wondering if this is possible. Furthermore, after compiling the FHE model equivalent, what should be the size of the test sample to run it in s…
-
### Feature description
Add LSTM Algorithm to Neural Network Algorithms
**Feature Description:**
I would like to propose adding an LSTM (Long Short-Term Memory) algorithm to the existing neural…
-
Hello, the learning rate of cora, citeseer, and pubmed is 0.02 in the paper. But it seems to be 0.2 in the implementation. Which one is correct?
In addition, can you provide the code of reddit, or th…
-
When using a logger, I get the following error during the "Logging hyperparameters!" step:
"AttributeError: 'str' object has no attribute '__name__'"
I noticed that it's happening when evaluating …
-
Supplementary figure 3 talks about manual fine tuning of three hyperparameters : alpha, beta1 and beta2. I couldn't find these parameters in the code. Could you tell me how these parameters could be s…
-
Hi,
If I've ran TabularPredictor.fit() and I've got a model that I like with all the HP optimsations and feature selections, how do I then refit that exact set of models and weighted ensemble with …
-
Hello,
First of all, I want to thank you for this amazing crate! It is truly a joy to work on LLM using Rust 😄 .
I recently wrote an [API that serves Llama-2](https://github.com/AmineDiro/cria) m…
-
Implement the DETR (DEtection TRansformer) architecture.