duggurd / graduation_project

Training Data #2

Closed by duggurd 1 year ago

duggurd commented 1 year ago

Inspect the proposed datasets, then download and prepare them for training. Training data sources.

  1. Extract data
  2. Transform data
    • Feature engineering
    • Clean data
    • 11 - TF-IDF as a feature?

  3. Ingest into database?
  4. Store locally?
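To make the "TF-IDF as a feature?" question concrete, here is a minimal sketch of the computation in plain Python. It assumes whitespace tokenisation and the unsmoothed form tf × log(N / df). The `tfidf` function and the sample reviews are hypothetical and not taken from the notebook.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document (unsmoothed TF-IDF)."""
    # Assumption: lowercase + whitespace split is enough for a first look.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            # Term frequency (relative) times inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


# Hypothetical sample reviews, for illustration only.
reviews = [
    "great product works great",
    "terrible product broke quickly",
    "works as described",
]
weights = tfidf(reviews)
```

Terms that occur in every document get weight 0 under this variant. If scikit-learn is used instead, `TfidfVectorizer` applies a smoothed IDF and L2 normalisation by default, so its numbers will differ from this sketch.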

https://github.com/duggurd/graduation_project/blob/main/src/etl/amazon_reviews_transform.ipynb
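For the "ingest into database / store locally" question, one option that covers both is a local SQLite file. The sketch below assumes a hypothetical `reviews` table with `text` and `rating` columns; the actual schema would depend on what the transform step produces.

```python
import sqlite3


def ingest_reviews(conn, rows):
    """Create the (hypothetical) reviews table if needed and insert rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS reviews ("
        "id INTEGER PRIMARY KEY, text TEXT NOT NULL, rating INTEGER)"
    )
    # Parameterised inserts avoid quoting problems in review text.
    conn.executemany(
        "INSERT INTO reviews (text, rating) VALUES (?, ?)", rows
    )
    conn.commit()


# ":memory:" keeps the example self-contained; pass a file path to persist.
conn = sqlite3.connect(":memory:")
ingest_reviews(conn, [("great product", 5), ("broke quickly", 1)])
row_count = conn.execute("SELECT COUNT(*) FROM reviews").fetchone()[0]
```

Replacing `":memory:"` with a file path keeps the data on disk without running a separate database server.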

duggurd commented 1 year ago

TODO: