Open bhpayne opened 10 months ago
in the src directory I added
pip install -r requirements.txt
python decompress_model.py
python nltk_downloads.py
python get_symbol_defs.py
I am confident that on average you should get at least 10% of the variable definitions with the get_symbol_defs.py file.
The concordance dict can be used for additional processing as it extracts every sentence where a variable is used.
As an example, suppose the following is in a paper:
For this paper,
c
is the number of cowsb
is the number of batsThe relevance of picking these variable definitions out is to then find other papers with that same variable, even if the symbol being used is different. (In another paper where
w
is the number of cows.)Success here is