ekunger / iSeroSnFR

Code associated with
4 stars 0 forks source link

Is it possible that you can upload the random forest model and instructions about it? #3

Open InvincibleZZH opened 1 year ago

ekunger commented 1 year ago

I used the random forest model through R. There is a package called "rattle" (if you install it, there are quite a few dependencies, and some might be out of date, so you might have to manually install updated dependencies). Rattle is a GUI, so it's pretty easy to use. On the first tab you upload your data. My data is available if you want to use it or see the formatting. There is another tab where you can generate models for your data. Random forest is one of the options, and you can set different parameters. The parameters I used are listed in the paper and below, but there's nothing special about them, I just played with different numbers until I got results that made sense and that I could test.

You can also run a random forest directly through R using the package randomForest https://cran.r-project.org/web/packages/randomForest/randomForest.pdf

https://rattle.togaware.com/

randomForest(formula = x5HT ~., data = crs$dataset[, c(crs$input, crs$target)], ntree = 500, mtry = 14, importance = TRUE, replace = FALSE, na.action = randomForest::na.roughfix) Type of random forest: regression Number of trees: 500 No. of variables tried at each split: 14 Missing value imputation is active.

InvincibleZZH commented 1 year ago

do I need your dataset to predict the important site in my own protein?

InvincibleZZH commented 1 year ago

No description provided.

and thank you very much for your instructions! it helped my research a lot!

ekunger commented 1 year ago

Not at all. It's just an example of how I formatted it.

InvincibleZZH commented 1 year ago

Not at all. It's just an example of how I formatted it.

so where can i download your dataset?

ekunger commented 1 year ago

https://www.cell.com/cms/10.1016/j.cell.2020.11.040/attachment/1a31be91-e1eb-4ff0-9795-6bcd3ec79ab0/mmc1.xlsx

ekunger commented 1 year ago

I'm super curious what you're working on. I'm happy to help in any way I can!

ekunger commented 1 year ago

iSero RF data (Round 0) is probably the tab you want to start with.

InvincibleZZH commented 1 year ago

https://www.cell.com/cms/10.1016/j.cell.2020.11.040/attachment/1a31be91-e1eb-4ff0-9795-6bcd3ec79ab0/mmc1.xlsx

thank you very much! there are also two issues troubling me, I still cant edit a right xxxx.res in step two backrub 😭

InvincibleZZH commented 1 year ago

I'm super curious what you're working on. I'm happy to help in any way I can!

1129804456@qq.com this is my email, can we talk about my research through this privately,sir?

ekunger commented 1 year ago

Unfortunately my labmate is the one who did all the Rosetta modeling, so I know a lot less about that. I'm not sure if he's on Github.

InvincibleZZH commented 1 year ago

Unfortunately my labmate is the one who did all the Rosetta modeling, so I know a lot less about that. I'm not sure if he's on Github.

thats OK!I will keep trying to get a right file,its quiet interesting and challenging!