-
Depuis la version lodex 14.0.40, le format lodex-field ne permet plus d’afficher les informations choisies.
Problèmes pour les ressources des jeux de données publiés sur data.istex :
- corpus scient…
-
Apparently we already use some monolingual data from there as a custom corpus based on @gregtatum's investigation. Also we have a tool to list the available data https://github.com/mozilla/firefox-tra…
-
Hi, I am trying to take a peak by count matrix generated using another software (SnapATAC2) which similar to ArchR produces a tile matrix and make this compatible with SCENIC+. The link showing a tuto…
-
Medusa seems to save an empty string in the corpus for a "0x00" string input, which makes correctly parsing the corpus input values more difficult.
Example property:
```solidity
function chec…
-
Diskussion som starta på [DIGGs projektyta för Persistenta identifierare](https://github.com/diggsweden/persistent-identifiers-investigation/issues/4)
* ämnet om att mäta metadatakvalitet berör detta…
-
User can download/export his query result or filtered corpus data as sub-corpus, just like CQPweb does as below:
![image](https://github.com/INL/corpus-frontend/assets/1741341/2198f575-4c39-42be-8081…
-
Hi @kaituoxu !
I wanted to train tacotron2 on jsut. Can your implementation be trained on jsut?
I have the jsut preprocessed. I used deepvoice3's preprocess module!
-
I know, there are lots of `hrefs` and `[=...=]` to change as well as the visible text. And probably some number of other documents that we cannot change (now, if not ever), but which can be supported …
-
InkVisitorbot - research assistant working in InkV with corpus data.
Input = non-CASTEMOed (or partially CASTEMOed) full-texts.
Human language queries of full-text corpus, using [word embeddigs]…
-
I have been trying to use this lib to perform some basic cleanup and processing on text datasets. I am having trouble figuring out how to get the documents/text back out of the corpus after some funct…