nasa-petal / PeTaL-labeller

The PeTaL labeler labels journal articles with biomimicry functions.
https://petal-labeller.readthedocs.io/en/latest/
The Unlicense

Python MATCH data preparation from Lens #46

Closed dsmith111 closed 3 years ago

dsmith111 commented 3 years ago

Summary

This script reformats the Lens data pulled in through their API to fit MATCH's data standards. It also pulls labels from our labeled CSV data to label this Lens API dataset.
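As a rough illustration of the labeling step described above, here is a minimal sketch, not the script in this PR. It assumes the Lens records and the labeled CSV can be joined on a normalized title. The CSV column names (`title`, `labels`), the record fields (`lens_id`, `title`, `abstract`), the output keys (`paper`, `text`, `label`), and the sample label names are all hypothetical placeholders, so check them against MATCH's actual input format before relying on them.

```python
import csv
import io
import json


def load_labels(csv_file):
    """Map a normalized title to its list of labels (column names are hypothetical)."""
    labels = {}
    for row in csv.DictReader(csv_file):
        key = row["title"].strip().lower()
        labels[key] = [part.strip() for part in row["labels"].split(";") if part.strip()]
    return labels


def to_json_lines(lens_records, labels):
    """Yield one JSON line per Lens record that has a match in the labeled data."""
    for rec in lens_records:
        key = rec["title"].strip().lower()
        if key not in labels:
            continue  # no label available for this record, so skip it
        yield json.dumps({
            "paper": rec["lens_id"],                      # hypothetical output key
            "text": f'{rec["title"]} {rec["abstract"]}',  # hypothetical output key
            "label": labels[key],                         # hypothetical output key
        })


# Made-up sample data standing in for the labeled CSV and the Lens API response.
labeled_csv = io.StringIO(
    "title,labels\n"
    "Gecko adhesion,attach_temporarily;move_on_solids\n"
)
lens_records = [
    {"lens_id": "000-111", "title": "Gecko Adhesion", "abstract": "Setae grip walls."},
    {"lens_id": "000-222", "title": "Unlabeled paper", "abstract": "No match."},
]
lines = list(to_json_lines(lens_records, load_labels(labeled_csv)))
```

Matching on a lowercased title is only one possible join key; a DOI or Lens ID would be more robust if both datasets carry one.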

Related Issues

Backwards incompatibilities

None.

New Dependencies

Imports not used in other files (only nltk is a third-party package; re, ast and string are part of the Python standard library):
re - (RegEx for removing/replacing punctuation)
nltk - (NLTK for intelligently tokenizing words and providing the corpus of stopwords)
ast - (AST for safely parsing stringified lists)
string - (For providing the full set of special characters)
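To show how these modules could fit together, here is a minimal sketch of the cleaning steps, under stated assumptions and not the code in this PR. The helper names `parse_list` and `clean_text` are hypothetical. To stay runnable without NLTK's one-time corpus download, the sketch swaps in a tiny hard-coded stopword set and whitespace splitting where the script would use NLTK's stopword corpus and tokenizer.

```python
import ast
import re
import string

# Stand-in for NLTK's English stopword corpus (assumption: kept tiny for illustration).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is"}

# Character class matching any single ASCII punctuation character.
PUNCT_RE = re.compile(f"[{re.escape(string.punctuation)}]")


def parse_list(cell):
    """Turn a stringified list such as "['a', 'b']" into a real list without eval()."""
    value = ast.literal_eval(cell)
    return value if isinstance(value, list) else [value]


def clean_text(text):
    """Lowercase, replace punctuation with spaces, and drop stopwords."""
    no_punct = PUNCT_RE.sub(" ", text.lower())
    tokens = no_punct.split()  # NLTK's tokenizer would go here in the real script
    return " ".join(tok for tok in tokens if tok not in STOPWORDS)


cleaned = clean_text("The gecko's foot: adhesion, in detail!")
parsed = parse_list("['attach', 'move']")
```

`ast.literal_eval` only accepts Python literals, so a malformed or malicious cell raises an error instead of executing code, which is the reason to prefer it over `eval` for stringified lists.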

bruffridge commented 3 years ago

Can you include a README that describes how to run your script? Also, can you commit your output file to the repo so I can review it?

bruffridge commented 3 years ago

I see you sent the output in Discord, checking it out now.