-
hi author, I could not find the train/test split files, which displays "404 error", could you please upload the file? Thanks!
-
**Describe the bug**
I am currently trying to implement my own variant of splafacto which include a custom data parser. However, while I am trying to run ns-install-cli I got an error saying `subproc…
-
请问你是怎么解决“def build_vocabulary(spacy_de, spacy_en):
def tokenize_de(text):
return tokenize(text, spacy_de)
def tokenize_en(text):
return tokenize(text, spacy_en)
pr…
-
Hello!
I have been experimenting with running model using data split at sequence level rather than student level. But whenever I set 'multigs'=true, I can fit the model but I can NOT evaluate the …
-
- [ ] __init__
- [x] reset_estimator
- [ ] process_imbalance_sampler
- [ ] calibrateModel
- [ ] fit
- [x] predict
- [ ] predict_proba
- [ ] grid_search_param_tuning
- [ ] train_val_test_split
…
-
Namespace(dropout=0.2, evaluate_every=500, gpu=-1, grad_norm=1.0, graph_batch_size=30000, graph_split_size=0.5, lr=0.01, n_bases=4, n_epochs=10000, negative_sample=1, regularization=0.01)
load data f…
-
## Problems
1. Splitting graphs is a bit more complicated than normal data. Graphs can be split based on node, edge or whole graph itself. We should be able to support all of that with minimal alte…
-
It looks like `to_batches` isn't respecting the filter kwarg
Repro
```
import lance
ds = lance.dataset(path)
fragments = ds.get_fragments()
for batch in fragments[0].to_batches(
batch…
tonyf updated
2 months ago
-
Hi,I am wondering how to split the train and test datasets?
In your paper,it said that "The dataset is randomly partitioned into a training set and testing set".
Dose randomly mean the Font content …
-
In these datasets, node in the train set may be related to the node in test/val set, so I want to know how to sample to avoid data leakage