Open toddysm opened 2 years ago
As a developer, I am expected to demo the insights generated from the floop_dataset and to try clustering algorithms that group feedback into categories.
Estimated time to complete: 4 hours
Actual time to complete: 4 hours
The dataset we currently have contains only two data points:
I started with some exploratory data analysis (EDA) on this dataset to understand its patterns better. We discovered that the majority of conversations consist of only one message. This means that feedback is most often one-way in nature: the student does not reply to the teacher on the feedback submitted.
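The conversation-length check described above can be sketched roughly as follows. The schema of the floop_dataset is not shown in this issue, so the `(conversation_id, sender, text)` tuples and the sample rows are hypothetical, and the helper names are made up for illustration.

```python
from collections import Counter

# Hypothetical records: (conversation_id, sender, text).
# The real floop_dataset schema is not shown here; these fields are assumptions.
messages = [
    ("c1", "teacher", "Great work on the intro."),
    ("c2", "teacher", "Please check your units."),
    ("c2", "student", "Fixed, thanks!"),
    ("c3", "teacher", "Can you explain step 3?"),
    ("c4", "teacher", "Nice diagram."),
]


def conversation_lengths(records):
    """Map each conversation id to its number of messages."""
    return Counter(conv_id for conv_id, _sender, _text in records)


def length_distribution(records):
    """Map each message count to the number of conversations with that count."""
    return Counter(conversation_lengths(records).values())


def single_message_share(records):
    """Fraction of conversations that contain exactly one message."""
    lengths = conversation_lengths(records)
    if not lengths:
        return 0.0
    return sum(1 for n in lengths.values() if n == 1) / len(lengths)


if __name__ == "__main__":
    print(dict(length_distribution(messages)))
    print(f"single-message share: {single_message_share(messages):.0%}")
```

A high single-message share on the real data would correspond to the one-way pattern described above.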
There could be several reasons why most conversations stop after one message:
Clustering on the text alone does not seem sufficient to answer all our questions. A better idea would be to combine features such as Question/No-Question, Emotion, and Sentiment with the text in the clustering model, and then come up with a strategy for increasing the length of conversations between the teacher and the student.
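One way the combined-feature idea could look is sketched below. This is not the notebook's code. The sample rows, the label vocabularies in `EMOTIONS` and `SENTIMENTS`, the weights, and the tiny k-means with a fixed "first k points" initialisation are all assumptions, chosen so the example runs on the standard library alone. A real run would more likely use a library clusterer with a proper initialisation.

```python
import math
import re

# Hypothetical pre-labelled feedback rows; the labels are assumed to come from
# earlier question/emotion/sentiment steps that are not shown in this issue.
feedback = [
    {"text": "Great job on the essay", "is_question": False,
     "emotion": "joy", "sentiment": "positive"},
    {"text": "Can you explain why this step is needed?", "is_question": True,
     "emotion": "confusion", "sentiment": "neutral"},
    {"text": "Nice work, well structured", "is_question": False,
     "emotion": "joy", "sentiment": "positive"},
    {"text": "What does this term mean?", "is_question": True,
     "emotion": "confusion", "sentiment": "neutral"},
]

EMOTIONS = ["joy", "confusion", "frustration"]   # assumed label set
SENTIMENTS = ["negative", "neutral", "positive"]  # assumed label set


def tokenize(text):
    return re.findall(r"[a-z']+", text.lower())


def build_vocab(rows):
    return sorted({tok for row in rows for tok in tokenize(row["text"])})


def featurize(row, vocab, text_weight=1.0, label_weight=1.0):
    """Concatenate an L2-normalised bag of words with one-hot label features."""
    tokens = tokenize(row["text"])
    counts = [tokens.count(word) for word in vocab]
    norm = math.sqrt(sum(c * c for c in counts)) or 1.0
    text_part = [text_weight * c / norm for c in counts]
    label_part = [1.0 if row["is_question"] else 0.0]
    label_part += [1.0 if row["emotion"] == e else 0.0 for e in EMOTIONS]
    label_part += [1.0 if row["sentiment"] == s else 0.0 for s in SENTIMENTS]
    return text_part + [label_weight * v for v in label_part]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iter=50):
    """Plain k-means; centroids start at the first k points (a simplification)."""
    centroids = [list(p) for p in points[:k]]
    assign = []
    for _ in range(max_iter):
        assign = [min(range(k), key=lambda j: sq_dist(p, centroids[j]))
                  for p in points]
        updated = []
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            # Keep the old centroid if a cluster ends up empty.
            updated.append([sum(col) / len(members) for col in zip(*members)]
                           if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return assign


vocab = build_vocab(feedback)
points = [featurize(row, vocab) for row in feedback]
labels = kmeans(points, k=2)

if __name__ == "__main__":
    for row, label in zip(feedback, labels):
        print(label, row["text"])
```

The `text_weight` and `label_weight` parameters are one possible knob for balancing how much the raw text versus the derived labels drive the grouping. Suitable values would need to be tuned on the real data.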
The analysis was done on a SageMaker notebook that can be found here