Filter expansion issues with HBase 2.1

We have found using literal_or OpenTSDB filters were taking ages to complete and HBase nodes CPU utilization were higher than before (running only these type of queries), much higher than using solely regexp or wildcard even. This was not a problem using iliteral_or. Simply switching from literal_or to iliteral_or the query times went down significantly (it was a jaw dropping moment). So we ended up changing tsd.query.filter.expansion_limit=0, which seemingly eliminated the problem (to make it more transparent), but as expected the OpenTSDB instances are fetching way more data than before (on the other hand it stopped killing the backend).

We suspect this could be related to the ColumnPrefixFilter logic change: https://issues.apache.org/jira/browse/HBASE-21620, the suggestion to the reported performance impact was to use more specific HBase filters https://issues.apache.org/jira/browse/HBASE-22448?focusedCommentId=16846481&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16846481 MultipleColumnPrefixFilter in this case.

If it is caused by the above change would it be possible to revise the HBase filters OpenTSDB uses?

OpenTSDB / opentsdb

Filter expansion issues with HBase 2.1 #1968