Watchful1 / PushshiftDumps

Example scripts for the pushshift dump files
MIT License
289 stars 55 forks source link

How to filter combinations of keywords instead of just single keywords? (about combine_folder_multiprocess.py) #16

Open Jacobzwj opened 10 months ago

Jacobzwj commented 10 months ago

Thank you for sharing your code. It's been incredibly helpful in extracting the specific Reddit data I need!

However, I've encountered an issue. I can successfully use the command: python3 combine_folder_multiprocess.py reddit --field title --value cold,fever --output pushshift to fetch submissions with titles containing either "cold" or "fever".

But, when I try to search for specific keyword combinations like "common cold" or "fever symptoms" using: python3 combine_folder_multiprocess.py reddit --field title --value common cold,fever symptoms --output pushshift I encounter the following error:

combine_folder_multiprocess.py: error: unrecognized arguments: cold,fever symptoms

Could you advise on how to filter for phrases (some words with space) instead of single words?

Looking forward to your guidance, and thanks in advance!