thanks for the great package. I think it would be useful to have a constructor message that specifies the spacy model string to load (instead of a generic language attribute and a lookup).
For example, I'm currently using scispacy's en_core_sci_sm to parse text instead of en_core_web_sm. I also think en_core_web_sm might be a bad default choice, given that most of the people using quickumls might use it to parse biomedical text (although I don't have any numbers on performance).
This would also solve #68
I currently have a workaround, like this:
Hi,
thanks for the great package. I think it would be useful to have a constructor message that specifies the spacy model string to load (instead of a generic
language
attribute and a lookup). For example, I'm currently using scispacy'sen_core_sci_sm
to parse text instead ofen_core_web_sm
. I also thinken_core_web_sm
might be a bad default choice, given that most of the people using quickumls might use it to parse biomedical text (although I don't have any numbers on performance).This would also solve #68 I currently have a workaround, like this:
I'd envision something like this: