ncsa / standalone-smm-analytics

dockerized version of analytics algorithms
Apache License 2.0

Youtube data doesn't work with preprocessing #121

Open longshuicy opened 1 month ago

longshuicy commented 1 month ago

{"ERROR":{"message":"\"['_source.id_str'] not in index\"","traceback":"Traceback (most recent call last):\n File \"./rabbitmq_handler.py\", line 45, in rabbitmq_handler\n output = algorithm(df, params)\n File \"/scripts/algorithm.py\", line 17, in algorithm\n PP = Preprocess(df, params['column'])\n File \"/scripts/preprocessing.py\", line 31, in __init__\n df_new = df[df[column] != ''][[self.id_column, column]].dropna()\n File \"/usr/local/lib/python3.7/site-packages/pandas/core/frame.py\", line 3464, in __getitem__\n indexer = self.loc._get_listlike_indexer(key, axis=1)[1]\n File \"/usr/local/lib/python3.7/site-packages/pandas/core/indexing.py\", line 1314, in _get_listlike_indexer\n self._validate_read_indexer(keyarr, indexer, axis)\n File \"/usr/local/lib/python3.7/site-packages/pandas/core/indexing.py\", line 1377, in _validate_read_indexer\n raise KeyError(f\"{not_found} not in index\")\nKeyError: \"['_source.id_str'] not in index\"\n"}}
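Reading the traceback, the failure seems to come from the column selection in preprocessing.py: the ID column resolves to `_source.id_str`, and the YouTube DataFrame apparently has no column by that name. Below is a minimal sketch of one possible guard, not the project's actual fix. The helper names `pick_id_column` and `select_text`, the candidate list, and the YouTube column names `id.videoId` and `snippet.title` are all assumptions made for illustration; only `_source.id_str` and the filter expression come from the traceback.

```python
import pandas as pd

# Hypothetical candidate ID columns. Only '_source.id_str' appears in the
# traceback; the other names are assumptions about flattened YouTube data.
CANDIDATE_ID_COLUMNS = ["_source.id_str", "id.videoId", "id"]


def pick_id_column(df, candidates=CANDIDATE_ID_COLUMNS):
    """Return the first candidate that exists in df.columns, or None."""
    for name in candidates:
        if name in df.columns:
            return name
    return None


def select_text(df, column, id_column=None):
    """Mirror the filter from the traceback, but only select existing columns."""
    if column not in df.columns:
        raise KeyError(f"text column {column!r} not found in data")
    # Fall back to the text column alone when no ID column is available.
    cols = [column] if id_column is None else [id_column, column]
    # Same logic as the failing line: drop empty strings, then drop missing values.
    return df[df[column] != ""][cols].dropna()


# Toy YouTube-like frame (made-up values) with no '_source.id_str' column.
youtube_df = pd.DataFrame(
    {"id.videoId": ["a", "b", "c"], "snippet.title": ["x", "", None]}
)
cleaned = select_text(youtube_df, "snippet.title", pick_id_column(youtube_df))
```

With a guard along these lines, a missing ID column would no longer raise `KeyError` during selection. Whether falling back to another column is the right behavior for YouTube data is a design decision for the maintainers.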