-
[x] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
Hi, I'm currently getting into evaluation of my RAG-system but had a prob…
-
Currently the generated synthetic data has the q/a/context in its own columns, the new training api assumes the datasets are formatted in [messages format](https://huggingface.co/docs/transformers/en/…
-
**Background**
In the field of multilingual large models, especially for non-English corpora, there is often a problem of insufficient data quantity and poor quality. High-quality training data is cr…
-
I know that @sckott has done the `charlatan` package, which does some basic generation of patient variables. When we teach learning with biomedical data, we often need to use synthetic datasets becaus…
-
Hello all,
Firstly, I want to express my appreciation for the fantastic work @wenbowen123 has done on the project.
My aim is to utilize bundlesdf/run_nerf.py to generate a textured obj file. Thi…
-
### There are some exsting datasets that we can leverage directly such as -
- https://www.openslr.org/104/ contains aligned Hindi-English extracted from spoken tutorials of technical topics and lec…
-
* Implement creation of *synthetic datasets*, where trust of sources is given
upfront
* Initially create only binary variables, possibly extend this later
-
Hi, I am trying to use `TestsetGenerator` to produce a synthetic dataset paired with `LlamaIndex` and 'Ollama', it successfully completes the embedding process, but before startin the generation proce…
-
https://openaccess.thecvf.com/content_CVPR_2020/papers/Yao_BlendedMVS_A_Large-Scale_Dataset_for_Generalized_Multi-View_Stereo_Networks_CVPR_2020_paper.pdf
BlendedMVS: A Large-scale Dataset for Gene…
-
https://openaccess.thecvf.com/content_CVPR_2020/papers/Yao_BlendedMVS_A_Large-Scale_Dataset_for_Generalized_Multi-View_Stereo_Networks_CVPR_2020_paper.pdf
BlendedMVS: A Large-scale Dataset for Gene…