-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…
-
Tabular data are treated very differently than data for NLP, audio, vision, etc. and therefore the worflow for tabular data in `datasets` is not ideal.
For example for tabular data, it is common to…
-
This idea will most likely be implemented in `unified-doc-cli`
## Goals
The internet is a connection of files. `unified-doc` aims to bridge working with different files with unified document APIs…
-
- Plan for the deployment and release of Jada Helpifyr.
- Establish processes for ongoing maintenance and updates to ensure the personal assistant remains up to date and relevant.
┆Issue is synchroni…
-
#### 1. Personalized Nutrition & Diet Plan Flow
**Data Collection:**
- **User Data Input:** Collect user-specific data such as age, weight, height, dietary restrictions, goals, preferences, acti…
-
**翻译活动已改为校对活动,请重新查看[参与方式](https://github.com/apachecn/hbase-doc-zh/blob/master/CONTRIBUTING.md)!**
留言格式:昵称 + QQ + 章节
| 章节 | 贡献者 | 进度 |
| --- | --- | --- |
| [Preface](https://github.com/apache…
-
There have been feedbacks about the way the scripts are currently written that can be improved, so I'd like to start a thread to collect feedback and suggestions on how to improve, and potentially com…
-
### Problem Statement
Nowadays remote model servers like AWS SageMaker, BedRock, or OpenAI, Cohere, etc all support batch predict APIs, which allow users to send large amount of synchronous request…
-
Hello,
samplers documentation: https://keras.io/api/keras_nlp/samplers/
that is the most unclear documentation I've ever seen in my entire life! what is this????? someone should explain the docume…
-
### Ticket Contents
As part of the [BIRD initiative ](https://billionreaders.org/same-language-subtitling-sls-for-a-billion-readers/), we aim to create a tool which can speed up the adoption of Sam…