Review Board 1.7.22


HIVE-4548 Speed up vectorized LIKE filter for special cases abc%, %abc and %abc%

Review Request #11222 - Created May 17, 2013 and submitted

Teddy Choi
vectorization
HIVE-4548
Reviewers
hive
hive-git
I edited FilterStringColLikeStringScala.java as Eric Hanson wrote.

For none-complex patterns, it calls a static method that doesn't call others and uses its given byte arrays only. For complex patterns, it reuses a ByteBuffer and a CharBuffer for decoding UTF-8 to avoid object constructions.

There is 30%~170% performance improvement for all cases. Its benchmark result is on https://issues.apache.org/jira/browse/HIVE-4548#comment-13660750.

It still can be more efficient by using a template-driven approach. I'll apply it soon.

 
Review request changed
Updated (Aug. 6, 2013, 5:06 a.m.)
  • changed from pending to submitted