-
The new version of RStudio includes a Terminal interface. I think people doing genomics data analysis use a lot of bash scripts and often want to go back and forth to raw fastq read files (i.e. do a l…
-
Could clobber it down to 16 bits for the calculation, but a lot of example data are recorded on an Eiger with an exposure time that allows 32-bit readout. In the cases I looked at the data would fit…
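
A rough sketch of the check being described, in Python, assuming unsigned counts and assuming that "clobber" means saturating at the 16-bit maximum. The helper names and the sample values are made up for illustration and are not taken from any detector library or real data set.

```python
UINT16_MAX = 2**16 - 1  # 65535, largest value an unsigned 16-bit integer holds


def fits_in_uint16(counts):
    """Return True if every count can be stored as unsigned 16-bit without loss."""
    return all(0 <= c <= UINT16_MAX for c in counts)


def clip_to_uint16(counts):
    """Saturate counts to the unsigned 16-bit range (lossy for larger values)."""
    return [min(max(c, 0), UINT16_MAX) for c in counts]


# Made-up pixel counts, not real detector data.
frame = [0, 12, 40_000, 70_000]

# Only clip when narrowing would otherwise lose information.
narrowed = frame if fits_in_uint16(frame) else clip_to_uint16(frame)
```

Checking first keeps the narrowing lossless for frames that already fit, and makes the lossy case explicit for those that do not.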
-
Ingest new bundles for the HDCA 10x dataset in prod.
In the original submission, the bundling was incorrect. To fix this, since re-bundling is not possible at the moment, I would like t…
-
There are lots of "tidy" data sets in packages, but it would be nice to have a package containing many different types of untidy data, to provide a convenient way to practice data wrangling skills.
…
-
[README.md for hw06](https://github.com/suminwei2772/STAT545-547-hw-Wei-Lisa/blob/master/hw06/README.md)
[hw06_data_wrangling.md](https://github.com/suminwei2772/STAT545-547-hw-Wei-Lisa/blob/master…
-
In https://github.com/estuary/connectors/pull/1563 and https://github.com/estuary/connectors/pull/1572, support for base64-encoded strings to be materialized as binary columns was added to `materializ…
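
For illustration only, a minimal Python sketch of what materializing a base64-encoded string as a binary value amounts to: the string is decoded to raw bytes before it is written. The helper name `to_binary_column_value` is hypothetical and is not taken from the connector code in those PRs.

```python
import base64
import binascii


def to_binary_column_value(encoded: str) -> bytes:
    """Decode a base64-encoded string into raw bytes (hypothetical helper)."""
    try:
        # validate=True rejects characters outside the base64 alphabet
        # instead of silently discarding them.
        return base64.b64decode(encoded, validate=True)
    except binascii.Error as exc:
        raise ValueError(f"not valid base64: {encoded!r}") from exc


# "aGVsbG8=" is the base64 encoding of the ASCII bytes for "hello".
payload = to_binary_column_value("aGVsbG8=")
```

Strict validation is a design choice in this sketch: it surfaces malformed input as an error rather than storing partially decoded bytes.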
-
This is a long-standing issue. Historically we have avoided wrangling projects that reuse primary data from other projects as we do not have a method of representing them in our metadata schema. We ha…
-
IMHO, the current `README` is quite dull and long-winded, and doesn't provide much insight into how this package can be useful for the users.
What we need is for it to feature a visual schematic l…
-
Following on from #46, we should create a function use diagram to keep in the package documentation.
I did start creating a PR (#54) for this, but stopped short and only implemented the static func…
-
R lesson showing how to use detection extracts in parquet files and why they're better than CSV