TutteInstitute / vectorizers

Vectorizers for a range of different data types
BSD 3-Clause "New" or "Revised" License
97 stars 23 forks source link

Docstring might need updating for `construct_timed_document_frequency` #115

Closed cakiki closed 1 year ago

cakiki commented 1 year ago

I think the current docstring for construct_timed_document_frequency is copied over from construct_document_frequency: https://github.com/TutteInstitute/vectorizers/blob/master/vectorizers/preprocessing.py#L40-L56 and might need adjusting.

"timed" here probably refers to a sequence of (token, timestamp) tuples. I'm guessing both the docstring and the first parameter need to be changed? I'll send a tentative PR.