thevermeer / pg_ts_semantic_headline

Improved highlighting of text search content for multi-word phrases and eliminating partial matches; Includes a methodology for delivering highlighted search result content 5x-10x faster that the PostgreSQL built-in ts_headline function
GNU General Public License v3.0
5 stars 0 forks source link

Thanks! #1

Open brittag opened 7 months ago

brittag commented 7 months ago

Not a bug, just wanted to thank you for posting your work! My team is using Postgres full text search with some long documents in some of our rows (such as the raw text of Federal Register rules that range from 2 to 200+ pages when they're in PDF format), and we're seeing similar challenges with ts_headline performance and highlighting behavior. It's great to see specific concepts for improving ts_headline, and I hope it's helpful for Postgres contributors.

thevermeer commented 6 months ago

@brittag - Thanks for writing. We have clearly encountered a number of the same issues. I too hope that https://github.com/thevermeer/pg_large_text_search works with this extension to better PGSQL's performance and functionality with large document indices.

I have been dealing with some big, family issues, and progress is a little slow, but we are going to also put out a small application that brings the two extensions together and creates a search index for the 50 most downloaded books from the Project Gutenberg Free Library (https://www.gutenberg.org/)

All told, this project has been a lot of fun, and we greatly appreciate the encouragement!

All the best,

JV