-
I worked a bit on the labelling matching with ENA / NCBI as I had a lot of mismatch issues with the development of the FAIR Data Station. I forked the repo and did an analysis on the terms in MIxS and…
-
## Expected Behavior
- I would like to use metaeuk easy-predict to taxtocontig workflow against UniRef90 to annotate contigs from a metagenome and output a blast outfmt 6 table as input for Blobtoo…
-
I am trying to calculate the perplexity on `minerva_math`, here are my task yaml config.
```yaml
group:
- math_word_problems_ppl
task: minerva_math_algebra_ppl
dataset_path: EleutherAI/hendryck…
-
**Replace**
Data decontamination is the process of removing evaluation data from the training dataset. This important step in data preprocessing ensures the integrity of model evaluation, ensuring …
-
### Intro paragraph
## START
- the great celebrity twitter bitcoin scam https://www.theverge.com/2020/7/15/21326200/elon-musk-bill-gates-twitter-hack-bitcoin-scam-compromised
Can talk about…
-
"March 4, 2020 -- X-ray may not be the best imaging tool for detecting novel coronavirus disease (COVID-19). Almost three-quarters of a small cohort of South Korean patients with COVID-19 pneumonia ha…
-
Thanks for posting this public rebuttal! Good science is open science.
There's been some suggestion that removing environmental contaminants, as done in the original paper, removes the cancer sub-t…
-
[Determining open cluster membership, Stott (2018)](https://www.aanda.org/articles/aa/full_html/2018/01/aa28568-16/aa28568-16.html)
The algorithm is really interesting.
-
It is probably caused by the lane merge in the beginning.
In hindsight, each R1 and R2 should have probably been quality controlled on lane level and then merged after host decontamination... if merg…
-
Hi Chenhao,
we discovered an issue with the contamination pipeline from this repo. Shortly, BWA default seed (k=19) allows 'random' alignments of bacterial reads against the human genome. This issu…