masonnlp / hacking_metamap

CRUD code for integrating metamap
Apache License 2.0

MetaMap Indexing #2

Open ArchelOlazabal opened 3 years ago

ArchelOlazabal commented 3 years ago

So I would need to look at CSV files and how CSV parsing works.

For the formatting I just need to talk to Rahul about the text manipulation correct?

armaninspace commented 3 years ago

@Archel saw your pull req. I think it would help if you open an issue and assign it to me. Also, perhaps think about adding anything you learn to a wiki. In summary, I see two problems:

  1. You need to open the CSV file with a CSV parser. First understand what a CSV is; look up any basic tutorial, such as https://docs.python.org/3/library/csv.html

  2. I believe meta-map requires input in a certain format, ID|Text, so you have to prepare this line, perhaps as follows (check with @Rahul95):

     a. make sure the text has no pipe (the | char): replace pipes with a space or another char
     b. not sure what ID is: check with @Rahul95
     c. concatenate ID with text and then send it

@Archel if you create an issue in the project and assign it to me, we can then have a convo there.
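A rough sketch of steps a and c, assuming the ID|Text format really is what meta-map expects (still to be confirmed with @Rahul95). The function name `to_metamap_line` and the sample ID are made up for illustration; what the ID should actually be is still an open question:

```python
def to_metamap_line(record_id, text, replacement=" "):
    """Build one 'ID|Text' line (format assumed from this thread, not confirmed)."""
    # Step a: the pipe is the field separator, so it must not appear in the text.
    clean_text = str(text).replace("|", replacement)
    # Step c: concatenate the ID and the cleaned text with a single pipe.
    return f"{record_id}|{clean_text}"


# "0001" is a placeholder ID used only for this example.
line = to_metamap_line("0001", "chest pain|shortness of breath")
print(line)
```

After cleaning, the only pipe left in the line is the one separating the ID from the text.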

armaninspace commented 3 years ago

So I would need to look at CSV files and how CSV parsing works.

--> Yes, I think understanding basic data formats is essential: CSV for now, and later JSON, XML, Parquet, Pickle

--> CSV is the easiest. I would check out the csv parsing page and try to get comfortable

--> Also note you can generally use Excel, Google Sheets, or other spreadsheet software to explore CSV files; they're essentially big spreadsheets
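To get comfortable with the parser, a small sketch like this can help: it reads a CSV with Python's standard `csv.DictReader` and returns the column names plus the first few rows. The sample data and the column names `note_id` and `note_text` are invented here, not taken from the project's files:

```python
import csv
import io

# Invented sample standing in for a real CSV file.
SAMPLE_CSV = (
    "note_id,note_text\n"
    '1,"Patient reports chest pain, no fever"\n'
    '2,"History of asthma"\n'
)


def peek_csv(source, max_rows=5):
    """Return (column names, first max_rows rows) from an open CSV source."""
    reader = csv.DictReader(source)  # the first row is treated as the header
    rows = []
    for index, row in enumerate(reader):
        if index >= max_rows:
            break
        rows.append(row)
    return reader.fieldnames, rows


columns, rows = peek_csv(io.StringIO(SAMPLE_CSV))
print(columns)
for row in rows:
    print(row)
```

For a file on disk, the same function should work with `open(path, newline="")` in place of `io.StringIO`, as the csv documentation recommends.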

For the formatting I just need to talk to Rahul about the text manipulation correct?

--> Not formatting, but what the input is

--> from your code, you seem to be sending the entire CSV line to the meta_map API

--> you need a pretty good, ideally exact, understanding of what the API is looking for (the hard part)

--> once you know that, the rest is easy
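To illustrate the difference between the entire CSV line and the text the API presumably wants, here is a small sketch. The column names and sample row are hypothetical, and which column actually holds the text has to be checked against the real data:

```python
import csv
import io

# Hypothetical data: one header row and one record with three columns.
RAW = 'note_id,note_text,author\n7,"Pt c/o headache, nausea",rk\n'


def text_fields(source, text_column):
    """Yield only the chosen text column from each parsed CSV row."""
    for row in csv.DictReader(source):
        yield row[text_column]


# The raw line still carries the other columns, the commas, and the quotes.
raw_line = RAW.splitlines()[1]
# The parsed field is just the free text.
parsed = list(text_fields(io.StringIO(RAW), "note_text"))

print(repr(raw_line))
print(parsed)
```

Sending `raw_line` passes the ID, the author, and the CSV quoting along with the note, while `parsed` contains only the note text.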

ArchelOlazabal commented 3 years ago

Ok, sounds good. Any more issues I will let you know.

armaninspace commented 3 years ago

Great!

On Wed, Mar 3, 2021 at 10:25 AM ArchelOlazabal notifications@github.com wrote:

Ok, sounds good. Any more issues I will let you know.