CornellNLP / ConvoKit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
https://convokit.cornell.edu/documentation/
MIT License
556 stars 129 forks source link

Add Documentation for FORA Corpus and Update Related Datasets and Config Files #237

Closed yash-chatha closed 1 month ago

yash-chatha commented 1 month ago

Description

I added the fora.rst file with detailed documentation for the FORA Corpus, ensuring that it is properly reflected in the ConvoKit website. In addition, I updated the datasets.rst file to include the FORA dataset, and made necessary changes to the download_config.json file to ensure the correct handling and downloading of the dataset.

Motivation and Context

This change is required to integrate the new FORA Corpus into the ConvoKit website's documentation and ensure that it can be downloaded and utilized by users. This update solves the problem of incomplete dataset documentation and configuration by ensuring that the new corpus is fully supported.

How has this been tested?

These changes were tested by verifying that the documentation renders correctly in the local development environment and by confirming that the updates to the configuration file allow for successful dataset download. I ran tests locally to ensure that these changes did not interfere with existing functionality.

Other information

N/A