C Keyword Extractor using TF/IDF
The purpose of this program is to learn how TF/IDF works, to gain experience in C and to help the Public Health Agency of Canada (PHAC) to classify their blog posts by extracting their keywords automatically.
The intended use of this program is to generate relevant keywords based on PHAC Data Blogs and allow the department to use these keywords to optimise their search function on their website.
We have just created the project. The project is in its infancy. Research is being conducted to solidify the documentation and design of the program.
The C9 environment offers two compilers gcc and mingw64. gcc can be used to create linux executables while mingw64 is used to create Windows executable. To compile a program with gcc use the following command: