Closed evamaxfield closed 7 months ago
Specific tasks 2024-03-07:
fine-tune-mpnet-three-sentences
so need to train and keep that one for use
Specific tasks 2024-03-08:
Spent the morning working on the ICSSI PLoS dataset. Have a basic pipeline working for extracting key details and filtering out data from the whole corpus. Also did some minor EDA on a sample of ~20,000 JATS XMLs (iirc): https://github.com/evamaxfield/aligning-credit/blob/main/eda.ipynb
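For reference, a rough sketch of what an extract-then-filter step over JATS XML can look like. This is not the actual pipeline code from the repo: the function names, the choice of fields (DOI, title, author names), the filter rule, and the sample document (including its placeholder DOI) are all assumptions made up for illustration. It only uses the standard library `xml.etree.ElementTree` and the standard JATS element names (`article-meta`, `article-id`, `contrib-group`).

```python
import xml.etree.ElementTree as ET

# Made-up minimal JATS document; the DOI is a placeholder, not a real article.
SAMPLE_JATS = """<article>
  <front>
    <article-meta>
      <article-id pub-id-type="doi">10.1371/journal.pone.0000000</article-id>
      <title-group>
        <article-title>An <italic>example</italic> article</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name><surname>Doe</surname><given-names>Jane</given-names></name>
        </contrib>
        <contrib contrib-type="editor">
          <name><surname>Roe</surname><given-names>Richard</given-names></name>
        </contrib>
      </contrib-group>
    </article-meta>
  </front>
</article>"""


def extract_details(jats_xml: str) -> dict:
    """Pull DOI, title, and author names out of one JATS XML string."""
    root = ET.fromstring(jats_xml)
    meta = root.find("./front/article-meta")
    if meta is None:
        return {"doi": None, "title": None, "authors": []}

    doi_el = meta.find("./article-id[@pub-id-type='doi']")
    title_el = meta.find("./title-group/article-title")

    authors = []
    # Only keep contributors explicitly marked as authors (skips editors).
    for contrib in meta.findall("./contrib-group/contrib[@contrib-type='author']"):
        surname = contrib.findtext("./name/surname", default="")
        given = contrib.findtext("./name/given-names", default="")
        full_name = f"{given} {surname}".strip()
        if full_name:
            authors.append(full_name)

    return {
        "doi": doi_el.text.strip() if doi_el is not None and doi_el.text else None,
        # itertext() keeps text nested inside inline markup such as <italic>.
        "title": "".join(title_el.itertext()).strip() if title_el is not None else None,
        "authors": authors,
    }


def meets_requirements(details: dict) -> bool:
    """Hypothetical filter: keep articles with a DOI and at least one author."""
    return details["doi"] is not None and len(details["authors"]) > 0


details = extract_details(SAMPLE_JATS)
print(details, meets_requirements(details))
```

The real requirements are whatever the dataset needs; `meets_requirements` is just a stand-in showing where that check would plug in.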
data which meets our requirements tends (so far with this trial sample) to:
Stuff to do to wrap up the day:
RA Projects
AwardFindR / award-pynder
Research Software
EAGER / Software Sustainability
RS-Graph / Database for RS Research
Civic
Public Comment Segmentation
ML for PIT
PhD Activities
Writing
Reading
Read this week:
Plan to read this week: