-
From Shawn Douglas' site:
Check out some other projects from the community:
> MrDNA from the Aksimentiev Lab at UIUC
I think this above one has a Python implementation which might make it more co…
-
I'm copying the relevant manual text below for ease of reference. Comments describing my confusion are added in bold in square brackets
//
### --UTR=on
Generate UTR training examples for AU…
-
I have identified discrepancies between the sequences extracted from the `hg38.ml.fa` reference genome using BED file coordinates and those stored in the tfrecords within the Basenji dataset hosted on…
-
Collection of issues for addition to MIxS to create SIP checklist (SIP-MIMS & SIP-MIMARKS)
Some term descriptions, examples, and syntax are still being updated (2023-06-05)
- [ ] [New term proposa…
-
Good morning,
Thank you for sharing the paper, code and pre-trained model for NLP text data. Your research work results are impressive. Because I am developing embeddings solutions for genes and pr…
-
I've added the Sample Collection chapter. It is currently just a place holder, but I'm going to add information about collecting samples for DNA, RNA, and microbiome (maybe). @sdhutchins I'm also go…
-
Hi,
first of all I want to say thanks for developing that super fast basecaller. It is even fast enough to use it for live basecalling. And my intention is to use it in a ReadUntil context. My ques…
-
Hi @Moeinh77,I noticed that the longest token length for bert is 512, while the length of the training data is much greater than 512. So does it mean that when processing data, it can be directly trun…
-
Currently we have a high level of the IA facets at https://embl-design-language.github.io/Springboard/information-architecture/#facet-structure-and-categories
They are:
```
1. Who
- people
…
-
Hi! I have read your paper about BERTax. It is wonderful and very inspiring. I'm interested in training a BERTax model for my own application: predict the phylum, class, order, family, genus, and spec…