AugustNagro / utf8.java

Vectorized UTF-8 Validation for Java
62 stars 7 forks source link

Leverage changes from the V2 incubator API? #7

Open LifeIsStrange opened 2 years ago

LifeIsStrange commented 2 years ago

https://openjdk.java.net/jeps/414 The JEP mention improvements such as:

Enhancements to the API to support operations on characters, such as for UTF-8 character decoding. Specifically, we add methods for copying characters between short vectors and char arrays, and new vector comparison operators for unsigned comparisons with integral vectors.

I have no idea whether this is directly useful or not for this lib :) @AugustNagro friendly ping

AugustNagro commented 2 years ago

Thanks for the ping @LifeIsStrange

Is this the latest javadocs or is there a newer link?

I don't think java character ops will help with UTF8 validation, but maybe the java simd-json port can use them.

I'm waiting to try porting simd-json until the api and performance stabilizes more. Since Oracle said the Vector api depends on Project Valhalla (Java's value type / structs), it's going to be a few years...