-
Hi,
Thanks for your efforts in creating/curating these datasets! These are priceless and greatly advance NLP for Indian languages.
I tried adding them into `mtdata` https://github.com/thammegowd…
-
Over in https://github.com/freelawproject/courtlistener/issues/4312#issuecomment-2339308370, we have a little discussion about a new network ranking algorithm that @mattdahl is working on.
He says:…
-
See dotnet/runtime#14343 and dotnet/runtime#15603.
RsaProtectedConfigurationProvider is the default protected provider and the only one that has a hope of being cross platform. I've ported the code…
-
Take the pages from radiopedia, which contains about 16k articles under a CC licence. They could be used in a simple Q-A setting where the question is a rephrased section name and the answer is the paragraph from that…
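A minimal sketch of the section-to-question idea, assuming each article has already been parsed into a title plus a mapping from section name to paragraph text. The function name `sections_to_qa` and the question template are hypothetical and only illustrate one possible rephrasing:

```python
from typing import Dict, List, Tuple


def sections_to_qa(title: str, sections: Dict[str, str]) -> List[Tuple[str, str]]:
    """Turn (section name, paragraph) entries into (question, answer) pairs."""
    pairs: List[Tuple[str, str]] = []
    for name, paragraph in sections.items():
        answer = paragraph.strip()
        if not answer:
            # Skip sections with no text; they cannot serve as answers.
            continue
        # Hypothetical template: rephrase the section name as a question.
        question = f"What is the {name.lower()} of {title}?"
        pairs.append((question, answer))
    return pairs


if __name__ == "__main__":
    # Placeholder article content, not taken from any real page.
    demo = {"Epidemiology": "Placeholder paragraph.", "Notes": "   "}
    for q, a in sections_to_qa("Example topic", demo):
        print(q, "->", a)
```

A fixed template like this only suits section names that read naturally as noun phrases; other sections would need their own phrasing.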
-
While using the preprocessed data from http://www.statmt.org/lm-benchmark/ I noticed that some of the training data was duplicated in the heldout (aka test) set. This is in addition to _train/news…
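One way such an overlap could be checked is sketched below, assuming the data is available as plain text with one sentence per line. The helper name `find_overlap` is hypothetical and is not part of the benchmark's own tooling:

```python
from typing import Iterable, List


def find_overlap(train_lines: Iterable[str], heldout_lines: Iterable[str]) -> List[str]:
    """Return heldout lines that also occur verbatim in the training lines."""
    # Strip surrounding whitespace so trailing newlines do not hide exact matches.
    train_set = {line.strip() for line in train_lines}
    return [line.strip() for line in heldout_lines if line.strip() in train_set]


if __name__ == "__main__":
    # Toy data standing in for the real training and heldout shards.
    train = ["the cat sat .\n", "a dog barked .\n"]
    heldout = ["a dog barked .\n", "birds sing .\n"]
    print(find_overlap(train, heldout))
```

This only catches exact duplicates after whitespace stripping; near-duplicates would need a fuzzier comparison.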
-
## Describe the bug
1. Add `**kwargs` to allow `formatted_doc` to be passed into the metric computation, to address the following:
```
metrics = compute_metric(results=sample_responses, formatted_doc=d…
```
-
See https://github.com/obophenotype/cell-ontology/issues/572#issuecomment-762830332
This is dependent on all relevant terms being added to PATO.
-
Most comprehensive contracts specify both what law will govern disputes arising under the contract (choice of law) and the mechanism for resolving such disputes (choice of forum). Kleros presumably h…
-
Hello, is there any information about the dataset used for training the model? It's pretty important for some use cases (like publishing games on Steam) to be sure that all training data is licensed appropriatel…
-
I choose to view Vulcain as a possible level 4 of the REST maturity model. So I'm wondering whether it should be possible to specify accepted media types on pushes / preloads via Preload and Fields in order to fi…