Closed BradKML closed 1 year ago
thanks for bringing our attention to pke
!
this issue is similar to #78 for which we have made already great progress with 2 contributions:
PositionRank
and BiasedRank
BaseTextRank
and BaseTextRankFactory
to enable integration of more flavoursRegarding the graph based models of pke
, I can see this:
TextRank
can be achieved with our BaseTextRank(edge_weight=0)
SingleRank
can be achieved with our BaseTextRank()
or BaseTextRank(edge_weight=1.0)
PositionRank
can be achieved with our PositionRank
the following ones are missing:
I was not aware of these 3 papers and approaches so thank you. Do you have experience with them in practice and are they good? Would you be open to contribute them?
I am mainly reporting them for notes in Documentation, but if I can I would contribute
Also some extra note: https://github.com/miso-belica/sumy/blob/master/docs/alternatives.md
To reiterate the current algorithms that are not included:
Looking at
Also check the algorithms listed in pke
https://github.com/boudinfl/pke which has an excellent range of implementations. FWIW, that library is GPL and not implemented as a spaCy pipeline, so there's some room for algorithm implementations both there (for research) and here (for production deployments).
The models and algorithms in https://github.com/boudinfl/pke#implemented-models are similar to Textrank but not sped up by SpaCy, so it might be a good idea to include them in PyTextRank
PS: There are also other non TextRank-esque algorithms to consider when making this assessment: