dainlp / acl2020-transition-discontinuous-ner

65 stars 9 forks source link

About preprocessing ShARe datasets #6

Closed LeeSureman closed 3 years ago

LeeSureman commented 3 years ago

I download two ShARe datasets, And it seems you only provide the script of preprocessing CADEC. So I want to know how I preprocess the ShARe datasets and transform it into the form like your preprocessed CADEC (train, dev and test)

dainlp commented 3 years ago

Hi, please find scripts under data/share2013 folder

LeeSureman commented 3 years ago

Thank U~

LeeSureman commented 3 years ago

It seems the data/share2013/script.sh is not used to process the original share 2013 form. I don't find 'train/ann' , 'test/Gold_SN2012' etc in original share2013 65GONRU{{LKY9JC0@UZTJHT

dainlp commented 3 years ago

The train folder corresponds to 'Task1TrainSetCorpus199.zip' and test folder is 'Task1Gold_SN2012'

LeeSureman commented 3 years ago

maybe I don't understand, for example, 'train/text' and 'train/ann' both correspond to 'Task1TrainSetCorpus199.zip' ?

LeeSureman commented 3 years ago

And I also want to know about preprocessing share2014, thank you ~

dainlp commented 3 years ago

You can unzip Task1TrainSetCorpus199.zip and change the folder name to train.

share 2014 use almost the same code as 2013, I upload the dev.list there.

Send you an email as well.

131250208 commented 3 years ago

@LeeSureman Hi, I am a newbie for this task. I cannot find how to download the two ShARe datasets by the link provided. Seems it is not called ShARe in the link. Could you tell me what should I type to search for the two datasets? Or could you give me a direct link to the datasets? Thank you!

LeeSureman commented 3 years ago

this is useful to you. https://github.com/daixiangau/acl2020-transition-discontinuous-ner/issues/4

131250208 commented 3 years ago

this is useful to you.

4

Thank you!

zlh-source commented 3 years ago

Hello, ShARe datasets needs to submit some applications when downloading, but I don’t know how to fill in one item, can you tell me how to fill it out? Or send this dataset directly to my mailbox 2650603623@qq.com. Thank you!

59a5171c4024e91378ec6ed00d0e10c
dainlp commented 3 years ago

You mean you don't know how to fill the 'research topic' item? That item asks for a short description of your research project. For example, I was working on clinical information extraction project, and I focus on recognize discontinuous concepts from clinical notes.

zlh-source commented 3 years ago

You mean you don't know how to fill the 'research topic' item? That item asks for a short description of your research project. For example, I was working on clinical information extraction project, and I focus on recognize discontinuous concepts from clinical notes.

There are still problems. This requires me to provide a "reference", but what is the "reference"?

c967cdfa45d2e0af7bb3a6f1c7f6833
dainlp commented 3 years ago

I guess that warning does not relate to this field; there may be a field asking you to provide contact information about your supervisors or principal investigator (PI) of your research project.

zlh-source commented 3 years ago

I guess that warning does not relate to this field; there may be a field asking you to provide contact information about your supervisors or principal investigator (PI) of your research project.

Thank you, the problem has been resolved.

GDUTT1 commented 3 years ago

I download two ShARe datasets, And it seems you only provide the script of preprocessing CADEC. So I want to know how I preprocess the ShARe datasets and transform it into the form like your preprocessed CADEC (train, dev and test)

Hi, I have searched for datasets ShARe 2013 and 2014 for a long time but I couldn't find yet. Could please send me a copy of datasets ShARe 2013 and 2014. I would appreciate it a lot.

dainlp commented 3 years ago

sorry, i cannot distribute the data, you can download it here: https://physionet.org/content/shareclefehealth2013/1.0/ and https://physionet.org/content/shareclefehealth2014task2/1.0/

you can register as a normal user and then apply for becoming credentialed user.

FrankZhao1999 commented 11 months ago

I download two ShARe datasets, And it seems you only provide the script of preprocessing CADEC. So I want to know how I preprocess the ShARe datasets and transform it into the form like your preprocessed CADEC (train, dev and test)

Hi,can u send me a copy of ShARe2013 and ShARe2014, I can't download these datasets directly. I really need to do this experiment at this term. I would be realy grateful if u can send them to me. Thank u very much!!!! My email address is FrankZhao1999@163.com