carissalow / rapids

Reproducible Analysis Pipeline for Data Streams
http://www.rapids.science/
GNU Affero General Public License v3.0
37 stars 20 forks

Bugfix - Improper PANDAS column type #185

Closed DouglasBellew closed 2 years ago

DouglasBellew commented 2 years ago

src/features/phone_keyboard/rapids: When the “before_text” column of the keyboard data consisted only of empty or numeric values (including NaN, “not a number”), pandas inferred a numeric type for that column instead of string. This change explicitly sets the “before_text” and “current_text” columns to text when the pandas dataframe is created, so the string processing later in the file no longer crashes the run when it applies the “str” vector operation (Line 27) to an incompatible column type.
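
For illustration only, here is a minimal sketch of the failure and of one way to pin the column types. It assumes the data is loaded with `pd.read_csv`; the sample CSV, the `timestamp` column, and the `load_keyboard_data` helper are hypothetical and are not taken from the RAPIDS source, which may set the types differently.

```python
import io

import pandas as pd

# Hypothetical sample: every before_text value is either empty or numeric.
CSV = "timestamp,before_text,current_text\n1,,1\n2,12,123\n3,123,1234\n"


def load_keyboard_data(source):
    """Read keyboard rows, forcing both text columns to a string type.

    Hypothetical helper: the column names come from the description above,
    everything else is an assumption made for this sketch.
    """
    return pd.read_csv(source, dtype={"before_text": str, "current_text": str})


# Without an explicit dtype, pandas infers a numeric column here,
# so the .str accessor is not available on it.
inferred = pd.read_csv(io.StringIO(CSV))
try:
    inferred["before_text"].str.len()
    str_accessor_failed = False
except AttributeError:
    str_accessor_failed = True

# With the dtype pinned, string operations work; empty cells stay missing.
typed = load_keyboard_data(io.StringIO(CSV))
lengths = typed["before_text"].str.len()
```

Pinning the type at load time keeps the later string operations independent of whatever values a particular participant's data happens to contain.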

JulioV commented 2 years ago

Thanks a lot for the fix!