Open bes827 opened 3 years ago
I now tried just typing the exact text I need removed (rather than regex as these are only 4) and seems like it is working
pm$Abstract= stringr::str_remove_all(pm$Abstract, "Label=\"INTRODUCTION\" NlmCategory=\"BACKGROUND\":")
pm$Abstract= stringr::str_remove_all(pm$Abstract, "Label=\"METHODS\" NlmCategory=\"METHODS\":")
pm$Abstract= stringr::str_remove_all(pm$Abstract, "Label=\"RESULTS\" NlmCategory=\"RESULTS\":")
pm$Abstract= stringr::str_remove_all(pm$Abstract, "Label=\"DISCUSSION\" NlmCategory=\"CONCLUSIONS\":")
Which version of RISmed are you using? And please provide an example PMID to reproduce the type of Abstract you are describing.
I am using version 2.2
and this is the PMID of the abstract above: 31934897
thank you
@skoval is there a way to install the previous version of RISmed instead of the current one? When I go the archive, I can only find the versions from 2017 https://cran.r-project.org/src/contrib/Archive/RISmed/
thank you
I am now trying to use a regex to remove the labels from the abstract body so I can get the "clean" text. For example, need to remove
Label=\"INTRODUCTION\" NlmCategory=\"BACKGROUND\":
as well as all the labels for other abstract sections (methods, results and conclusion)I tried the following regex but did not work: x$Abstract= stringr::str_remove_all(x$Abstract, "[A-Z]+\:")
do you have any suggestions?
thank you
this is an example: