OmicsML / CellPLM

Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells.
BSD 2-Clause "Simplified" License
67 stars 6 forks source link

Request for Update on Dataset Details described in Table 6 #15

Closed litxiaoyao closed 2 months ago

litxiaoyao commented 4 months ago

屏幕截图 2024-07-10 153204

litxiaoyao commented 4 months ago

CELLxGENE census version 15 May 2023 has 33M cells, however you said only used 11M, what's the difference? 屏幕截图 2024-07-10 153342

thanks

lemousehunter commented 1 month ago

Hi, I'm wondering if the links for the dataset / its subsets that have been used for pretraining have been included in this repo. And if so, could anyone help to ping it here please :) Thanks!