North-Seattle-College / ad440-winter2022-tuesday-repo

North Seattle College AD 440 Winter 2020 Cloud Practicum class repoitory
Apache License 2.0
0 stars 4 forks source link

change classifier and add more for demo #170

Closed zaks2318 closed 2 years ago

zaks2318 commented 2 years ago

Documentation:

Closes #153

Change classifier to LogisticRegression Add confusion matrix for demo the classifier.

User story, acceptance criteria will show on the related issue

Task Breakdown Description Approx. Time
Researching for LogisticRegression 1 hour
add more content to the note 2 hours
Frontend Team Meets 1 hours

How to deploy the functionality locally or in the cloud?

It can run locally after download the file.

How to configure the functionality?

It needs to use Jupyter notebook, pandas, NumPy, nltk, matplotlib, sklearn, and seaborn python libraries. Thus, you can run it after downloading them with pip. Also, we use "floopData02-16-2022-21-06-37.json" as data, so go to AWS S3 dashboard and the file is in the bucket called floop-dataset.

How to access the implemented functionality?

Open the file in Jupyter notebook, and run each grid.

What input to provide?

The input will be floopData02-16-2022-21-06-37.json file.

What output is expected?

The output will be DataFrames and graphics of the floopData02-16-2022-21-06-37.json. Also, the prediction and knn.score.

Initial estimated time

5 hours