amchagas / open-hardware-supply

Having a closer look at how OSH papers are evolving over time
MIT License

Write Python code to analyze text #2

Closed matiasandina closed 2 years ago

matiasandina commented 4 years ago
  1. Explore shared code
  2. Decide what is useful
  3. Implement what's lacking
  4. Tabulate according to keywords (coarse filter)
  5. Manual check to see whether the filter is good
  6. Data analysis
matiasandina commented 4 years ago

After a brief look at the code, there are two scripts that could be useful: pdf_controller.py and models.py. If we get the papers in .pdf format, we can parse them with pdf_controller. It looks like the functions to retrieve keywords (see #3) and to tabulate the results will have to be written by us.
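A rough sketch of what the missing tabulation step (the coarse keyword filter) could look like, assuming the text of each paper has already been extracted into a plain string. The function names `count_keywords` and `tabulate`, the sample texts, and the keyword list are all hypothetical and not part of the shared code; how pdf_controller actually returns text is not shown here.

```python
import re
from collections import Counter


def count_keywords(text, keywords):
    """Count case-insensitive whole-phrase matches of each keyword in text."""
    counts = Counter()
    for kw in keywords:
        # \b keeps "open hardware" from matching inside longer words.
        pattern = r"\b" + re.escape(kw) + r"\b"
        counts[kw] = len(re.findall(pattern, text, flags=re.IGNORECASE))
    return counts


def tabulate(papers, keywords):
    """Build one row per paper: keyword counts plus a coarse pass/fail flag."""
    rows = []
    for name, text in papers.items():
        counts = count_keywords(text, keywords)
        row = {"paper": name}
        row.update({kw: counts[kw] for kw in keywords})
        # Coarse filter: keep the paper for manual checking if any keyword hits.
        row["any_match"] = any(counts[kw] > 0 for kw in keywords)
        rows.append(row)
    return rows


# Hypothetical inputs: already-extracted text keyed by file name.
papers = {
    "paper_a.pdf": "We present open hardware for microscopy. "
                   "The open hardware design is shared.",
    "paper_b.pdf": "A proprietary spectrometer was used.",
}
keywords = ["open hardware", "open source"]

table = tabulate(papers, keywords)
for row in table:
    print(row)
```

Rows where `any_match` is true would then go to the manual check, which is where we find out whether the keyword list is too loose or too strict.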

amchagas commented 2 years ago

The dataset we have and how we deal with it have changed quite a bit. I am closing this issue as it no longer seems relevant.