aicoe-aiops / fedora-mailing-list-analysis

This will be the repo for the Fedora mailing list sentiment analysis project

NBs for cleaning and storing email data #11

Open cdolfi opened 3 years ago

cdolfi commented 3 years ago

These two notebooks take the data from HyperKitty, clean it and save it as CSVs, and then combine the CSVs into one large data set.
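As a rough illustration of the final "combine" step only (this is not the notebooks' actual code), the sketch below concatenates several cleaned CSVs into one file using the Python standard library. The function name `combine_csvs` and the assumption that every CSV shares the same header row are hypothetical and not taken from the PR.

```python
import csv


def combine_csvs(csv_paths, out_path):
    """Concatenate CSV files that share a header into one file.

    Hypothetical helper: assumes every input has the same column names.
    Returns the number of data rows written.
    """
    fieldnames = None
    writer = None
    rows_written = 0
    with open(out_path, "w", newline="", encoding="utf-8") as out_file:
        for path in csv_paths:
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.DictReader(in_file)
                if reader.fieldnames is None:
                    # Completely empty input file: nothing to copy.
                    continue
                if writer is None:
                    # The first non-empty file defines the output header.
                    fieldnames = reader.fieldnames
                    writer = csv.DictWriter(out_file, fieldnames=fieldnames)
                    writer.writeheader()
                elif reader.fieldnames != fieldnames:
                    raise ValueError(f"{path} has a different header")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

Streaming row by row keeps memory use flat even when the combined data set is large; a mismatched header raises an error instead of silently misaligning columns.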

review-notebook-app[bot] commented 3 years ago

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

tumido commented 3 years ago

@MichaelClifford how does this compare to https://github.com/aicoe-aiops/mailing-list-analysis-toolkit ? Is this essentially the same thing?

MichaelClifford commented 3 years ago

@tumido these projects are somewhat similar :) insofar as they both perform some analysis on the Fedora mailing list. I think the main difference at the moment is that this is a specific "discriminatory text detection" project @cdolfi is working on for the OSPO team, while https://github.com/aicoe-aiops/mailing-list-analysis-toolkit is less specific in its analysis goals.

Does that answer your question?

tumido commented 3 years ago

I wonder if there would be any benefit in unifying them. @cdolfi can benefit from the automation (automated analysis runs and new data collection) we already have set up for the other project, and the other project can benefit from the additional analysis it would become capable of... WDYT?

MichaelClifford commented 3 years ago

@tumido I would be happy to have this work integrated into the toolkit analysis, as it appears to represent another analysis type that would likely be useful for many mailing lists. That said, @cdolfi is the owner of this project, so I will leave it up to her if/when she wants to contribute this work to the mailing list toolkit project. @cdolfi if you decide you'd like us to merge these two efforts, let us know and we'd be happy to connect and figure out the best way to perform the integration.

tumido commented 3 years ago

FTR: I don't want to block this PR by any means or question it in any way. I just got curious about the similarity of these projects. :slightly_smiling_face:

cdolfi commented 3 years ago

@tumido @MichaelClifford I talked with @oindrillac today about working toward merging these two efforts together. I think this is a great idea and would definitely want to connect to see what the best way to do that is. I feel like the work each of us is doing is complementary and leaves a lot of room to avoid doing the same work twice.

MichaelClifford commented 3 years ago

Great! And sorry for hijacking this PR. I've opened an issue here to discuss this integration stuff further. Let's move this discussion there and use this thread to discuss the PR. :smile:

tumido commented 3 years ago

/hold please read https://chat.google.com/room/AAAAQcVnQvs/kDNfLz86AMs before unholding