-
As far as I can tell, the current implementation of column lineage extraction does not address real-world applications.
The two most common applications I can think of are:
1) Impact analysis when …
-
A seed corpus can significantly improve some harnesses. We should explore generating seeds with LLMs. Some ideas:
1) Integrate into the workflow so it becomes possible to experimen…
-
Hi, the retrieval corpus used by HotpotQA and the other datasets mentioned in the paper seems to be different. Is that correct? I have obtained the pre-processed wikipedia2017 corpus from other issues.
**May …
-
**Debugging checklist**
[x] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensu…
-
"Original"
``` python
def smote(data,num,k=5):
    corpus=[]
    nbrs = NearestNeighbors(n_neighbors=k+1, algorithm='ball_tree').fit(data)
    distances, indices = nbrs.kneighbors(data)
    for i in …
```
-
**Summary**
I am developing an NLP tool that reads distances from a corpus with billions of entries. I think Databend with OSS would be a good fit for me. We don't have many transactions, only writes to the…
-
### Feature request
This is a BERT-based model; however, when I try to run it, the message says the model is not supported. https://huggingface.co/meta-llama/Prompt-Guard-86M/tree/main
### Motivation
LLM-pow…
-
Medusa seems to save an empty string in the corpus for a "0x00" string input, which makes correctly parsing the corpus input values more difficult.
Example property:
```solidity
function chec…
```
-
Corpus: given a corpus, generate unigram and bigram data.
This data can be used in tasks such as word prediction and spelling correction.
The analysis task is captured here; take a Wikipedia, Pro…
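
As a rough illustration of the unigram and bigram counting described above, here is a minimal Python sketch. The `tokenize` and `ngram_counts` helpers and the two-sentence `corpus` are hypothetical stand-ins, not part of any existing code in this project. It assumes simple lowercase word tokenization and counts bigrams within a sentence only, never across sentence boundaries.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens (assumed scheme)."""
    return re.findall(r"[a-z']+", text.lower())


def ngram_counts(tokens, n):
    """Count every run of n consecutive tokens, keyed by a tuple of words."""
    return Counter(zip(*(tokens[i:] for i in range(n))))


# Hypothetical toy corpus; a real run would stream sentences from a larger source.
corpus = ["the cat sat on the mat", "the cat ate"]

unigrams = Counter()
bigrams = Counter()
for sentence in corpus:
    tokens = tokenize(sentence)
    unigrams.update(ngram_counts(tokens, 1))
    bigrams.update(ngram_counts(tokens, 2))

print(unigrams.most_common(3))
print(bigrams.most_common(3))
```

Keeping the counts in `Counter` objects makes it straightforward to merge results from several corpus shards with `update` before using them for word prediction or spelling correction.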
-
When running
```
from taxii2client.v21 import Collection
collection = Collection('https://attack-taxii.mitre.org/api/v21/collections/x-mitre-collection--1f5f1533-f617-4ca8-9ab4-6a02367fa019')
obje…
```