-
```
Add a type for generic segmentation, e.g. chapters, acts, etc.
Existing div-like types like Paragraph, Document (?) should be changed to
inherit from this type. Sentence and token are probably n…
-
The version of HKCanCor published on [HuggingFace](https://huggingface.co/datasets/nanyang-technological-university-singapore/hkcancor/tree/main) by NTU is different from the version offered by this l…
-
I am currently using Ragas to evaluate my RAG application, which is built with LlamaIndex. I've encountered a few issues in the generated results:
1. When generating queries using `TestsetGenera…
-
**Fleet version**: 4.56.0
**Web browser and operating system**: Safari or Chrome on macOS
### 💥 Actual behavior
Following the configuration steps found here: https://github.com/fleetdm/f…
-
Hi, thank you for your work and for sharing the code. I found that the results of the compared methods, especially in the SIIM segmentation part, are very different from the results in the original paper, e.g., MG…
-
## How to reproduce the behaviour
[Colab notebook demonstrating problem](https://colab.research.google.com/drive/14FFYKqjRVRbN7aAVmHUYEao9CwahY0We?usp=sharing)
When parsing a sentence that con…
-
**tl;dr:** `calculate_position` should not use the lengths of graphemes as provided by unicode-width, but instead use the sum of the widths of the codepoints.
At least on Unix, when calculating the…
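
To make the distinction in the tl;dr concrete, here is a minimal Python sketch. It is not the library's code: the helper names `codepoint_width` and `summed_width` are hypothetical, and the standard `unicodedata` module is only a rough stand-in for the width tables in unicode-width. It shows what "sum of the widths of the codepoints" means for a string that forms a single grapheme cluster.

```python
import unicodedata


def codepoint_width(ch: str) -> int:
    """Approximate column width of one code point (assumption: 0, 1 or 2)."""
    # Combining marks and format characters (e.g. ZERO WIDTH JOINER) take no column.
    if unicodedata.combining(ch) or unicodedata.category(ch) in ("Mn", "Me", "Cf"):
        return 0
    # East Asian Wide and Fullwidth characters take two columns.
    if unicodedata.east_asian_width(ch) in ("W", "F"):
        return 2
    return 1


def summed_width(text: str) -> int:
    """Sum the per-code-point widths, ignoring grapheme cluster boundaries."""
    return sum(codepoint_width(ch) for ch in text)


# One grapheme cluster: MAN + ZWJ + WOMAN + ZWJ + GIRL.
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"

print(summed_width("abc"))       # 3
print(summed_width("e\u0301"))   # 1: base letter plus a combining accent
print(summed_width(family))      # 6: three wide code points, two zero-width joiners
```

A width computed per grapheme would typically treat the family sequence as one two-column cell, so the two approaches disagree on exactly this kind of input.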
-
Hi, thanks for this amazing work. I have some questions.
The annotation is based on the abstract level, but when you use the PubMedBERT model for relation extraction, how do tokenizers do the sentence segmen…
-
Hi again :)
I was checking my own implementation of the factCC scoring you described in the paper against your data, and noticed that for 90 cases we derived different scores.
I suspect this is …