sayantikabanik / capstone_isb

Using a combination of multiple facets of Health, Engagement and Productivity to estimate occupational burnout

Fixing Summarization Output #40

Closed anuraagbhavaraju closed 2 years ago

anuraagbhavaraju commented 2 years ago

Current: _stext(filtered_output.reviewText)

Proposed Change:

    def compile_text(df, text_column):
        # Concatenate all non-missing texts in the column into one string
        compiled_text = ''
        for text in df[text_column].fillna('NA.'):
            if text != 'NA.' and text != 'NA':
                compiled_text = compiled_text + ' ' + text
        return compiled_text

    compiled_text = compile_text(filtered_output[:1000], 'reviewText')
    _stext(compiled_text)
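For reference, the same idea can be sketched without building the string in a loop. This is only a possible variant, not the code in the repo: it takes any iterable of values (a pandas Series such as `filtered_output['reviewText']` should work, since iterating it yields the cell values), and the names `PLACEHOLDERS` and `max_items` are made up for this example. Unlike the proposal above, it also strips surrounding whitespace and does not leave a leading space in the result.

```python
from typing import Iterable, Optional

# Values treated as "missing" (assumption: same markers as in the proposal above)
PLACEHOLDERS = {"", "NA", "NA."}


def compile_text(values: Iterable[object], max_items: Optional[int] = None) -> str:
    """Join the usable text entries of a column into a single string."""
    parts = []
    for i, value in enumerate(values):
        # Only scan the first max_items entries, mirroring the [:1000] slice
        if max_items is not None and i >= max_items:
            break
        # None and NaN are not strings, so missing cells are skipped here
        if not isinstance(value, str):
            continue
        text = value.strip()
        if text in PLACEHOLDERS:
            continue
        parts.append(text)
    return " ".join(parts)


# Hypothetical usage, with the names from this issue:
# compiled_text = compile_text(filtered_output['reviewText'], max_items=1000)
# _stext(compiled_text)
```

Collecting the pieces in a list and joining once avoids repeated string concatenation, which can matter when a column holds many long reviews.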

anuraagbhavaraju commented 2 years ago

@sayantikabanik can we try the proposed approach to pre-process the reviewText/pros/cons column data before summarizing it?

sayantikabanik commented 2 years ago

Feel free to make the changes and raise a PR