dglazkov / polymath

Create a tool to generate embeddings given text #1

Open jkomoros opened 1 year ago

jkomoros commented 1 year ago

To start, it should produce a .pkl file according to the format described in https://github.com/dglazkov/polymath/blob/main/format.md

It should accept a JSON document with a structure like:

{
  chunks: [
    {
      text: '<raw text>',
      // all of the following are optional
      url: '<URL>',
      image_url: '<IMAGE_URL>',
      title: '<TITLE>',
      description: '<DESCRIPTION>',
    }
  ]
}
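
A rough sketch of how such a tool might read that input, assuming each entry in `chunks` is an object and that the document arrives as strict JSON (quoted keys, no comments). The names `load_chunks`, `build_records`, and `write_pickle` are hypothetical, `embed` stands in for whatever embedding call the tool ends up using, and the list-of-dicts layout written to the pickle is only a placeholder, not the layout from format.md:

```python
import json
import pickle
from typing import Callable, List

# Optional per-chunk fields named in the issue.
OPTIONAL_KEYS = ("url", "image_url", "title", "description")


def load_chunks(raw_json: str) -> List[dict]:
    """Parse the input document and return its validated chunks."""
    doc = json.loads(raw_json)
    if not isinstance(doc, dict) or not isinstance(doc.get("chunks"), list):
        raise ValueError("expected an object with a top-level 'chunks' list")
    cleaned = []
    for i, chunk in enumerate(doc["chunks"]):
        # 'text' is the only required field.
        if not isinstance(chunk, dict) or not isinstance(chunk.get("text"), str):
            raise ValueError(f"chunk {i} needs a string 'text' field")
        item = {"text": chunk["text"]}
        # Copy over only the optional fields that are present.
        for key in OPTIONAL_KEYS:
            if key in chunk:
                item[key] = chunk[key]
        cleaned.append(item)
    return cleaned


def build_records(chunks: List[dict], embed: Callable[[str], List[float]]) -> List[dict]:
    """Attach an embedding to each chunk using the caller-supplied function."""
    return [{**chunk, "embedding": embed(chunk["text"])} for chunk in chunks]


def write_pickle(records: List[dict], path: str) -> None:
    """Serialize the records to a .pkl file (placeholder layout, see above)."""
    with open(path, "wb") as f:
        pickle.dump(records, f)
```

Passing `embed` in as a parameter keeps the parsing and serialization testable without committing to a particular embedding provider.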
jkomoros commented 1 year ago