lizzieinvancouver / egret

1 stars 0 forks source link

Papers to double check #17

Open DeirdreLoughnan opened 3 months ago

DeirdreLoughnan commented 3 months ago

In cleaning the data, we might come across papers that we should double check and ensure all the relevant data was entered.

I think we should check: Gremer20 Chichizola18 Lee21 Tilki07 Wytsalucy21

ngoj1 commented 3 months ago

Hi Deirdre I did Tilki07, which section in the table did you feel was missing in the datasheet? I thought I had covered all the treatments, but since so many of them were compounded on each other it's definitely possible I missed something.

DeirdreLoughnan commented 3 weeks ago

In cross referencing data with a pdf, I found that Rostamipoor20 is not in english, but the data from the figures has been scraped.

My thought is that we should remove it, since we can not read the text to get the important treatment information. Happy to discuss this as a group!

buniwuuu commented 6 days ago

Hi @DeirdreLoughnan! I am going over the papers in the folder again and realized there are two cho18, but only one of them is scraped. Should I try scraping the data into my file?
Also, some of papers in the missing pdf file were not in the meta source list.

*update June 25th:

DeirdreLoughnan commented 5 days ago

@buniwuuu Thanks for looking into this.

There are a few duplicates in the source tab that were not entered for various reasons, but I agree that the second cho18 looks like it should have been entered. Could you please enter it and add it to your file?

But yan16_2 was entered by DK, but yang16_1 was not, since it is on apples, therefore I did not update the datasetID.

What meta source list are you looking at? The source tab in egret.xlsx? I see both yang20 and zhang21 in that tab.

buniwuuu commented 5 days ago

@DeirdreLoughnan I will scrape Cho18! I finished checking the missing pdfs of the scraped papers:

About yang20 and zhang21 I missed the second one in the source tab so it's all good!

Also, do we want all of the papers we've reviewed in the folder or just the ones we scraped?

DeirdreLoughnan commented 5 days ago

@buniwuuu Awesome!

We just need the ones we scraped! I think some people added papers that were downloaded and not scraped. I noticed a few that were in other languages in the folder, for example. I think it is fine to leave these in the folder for now.

According to my notes (please double check) we are also missing Farhadi13.

ngoj1 commented 5 days ago

basbag09, basaran12, Aldrige 199X, he09, huang14, Washitani89, Fetouh14, bibby53, geszprych02 I will hunt them down and add them to the folder.

I sent in a request for geszprych02 last week, it's still being processed by the library, so no need to look for that one!

buniwuuu commented 2 days ago

@DeirdreLoughnan I couldn't find basbag09, basaran12, Aldrige 199X, and bibby53 in the source tab... but all the other ones are uploaded!

@ngoj1 thanks for uploading geszprych02!