HazyResearch / evaporate

This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"

Small tutorial request in readme.md for usability #22

Open brando90 opened 1 year ago

brando90 commented 1 year ago

It would be nice to have a small tutorial in the README on how to use this, rather than having to go through 476 lines of the profiler code.

An example use case for me is:

  1. Create a schema from a PDF/text file so I can use it with a pandas DataFrame (I want to process the PDF page by page).
  2. Populate the schema and update it (page by page).

The input will be a pdf, text file or file.
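
A minimal sketch of the page-by-page workflow described above, to make the request concrete. It does not use evaporate's API: `extract_fields` is a hypothetical stand-in for the extraction step (here just a naive `key: value` line parser), `build_table` is an illustrative name, and the page texts are assumed to have already been pulled out of the PDF or text file.

```python
import re

# Hypothetical stand-in for the extraction step; NOT evaporate's API.
# It only recognises lines of the form "key: value".
def extract_fields(page_text):
    fields = {}
    for line in page_text.splitlines():
        match = re.match(r"\s*([^:]+?)\s*:\s*(.+?)\s*$", line)
        if match:
            fields[match.group(1).lower()] = match.group(2)
    return fields

def build_table(pages):
    """Grow the schema and collect one row per page."""
    schema = []  # attribute names, in order of first appearance
    rows = []
    for page_number, text in enumerate(pages, start=1):
        fields = extract_fields(text)
        # Update the schema with attributes first seen on this page.
        for name in fields:
            if name not in schema:
                schema.append(name)
        rows.append({"page": page_number, **fields})
    # Backfill attributes that earlier pages did not have.
    for row in rows:
        for name in schema:
            row.setdefault(name, None)
    return schema, rows

# Assumed input: one string per page.
pages = ["Name: Alice\nAge: 30", "Name: Bob\nCity: Paris"]
schema, rows = build_table(pages)
```

The resulting rows can then be loaded with `pandas.DataFrame(rows, columns=["page"] + schema)`, which gives one row per page and one column per discovered attribute.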

hasalams commented 1 year ago

I am also looking for a simple tutorial. It is not easy to follow how to run this in the current format.