lum-ai / odinson

Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
https://lum.ai/odinson/docs/
Apache License 2.0

Endianness #288

Open BeckySharp opened 3 years ago

BeckySharp commented 3 years ago

PLAN:

ALSO:

OTHER:

FYI @kwalcock thanks for the thoughts in processors land, do you have thoughts on this?

kwalcock commented 3 years ago

The default should probably be whatever Intel chips use, so that there is no need to record it in the file, IMO. If a machine with a different endianness is found, the bytes can be swapped on the way in and swapped back on the way out. Those machines will just be slightly (I assume) slower. If the memory layout can vary, there are more than two possibilities, but that's going too far.
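A minimal sketch of that idea in Java, assuming the file holds 32-bit integers and that little-endian is the agreed on-disk order, so nothing about byte order is stored in the file itself. The class and method names (`FixedOrderInts`, `encode`, `decode`) are hypothetical and are not part of Odinson.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

class FixedOrderInts {
    // Assumed convention: the file is always little-endian, whatever the host is.
    static final ByteOrder FILE_ORDER = ByteOrder.LITTLE_ENDIAN;

    // Serialize ints in the fixed file order.
    static byte[] encode(int[] values) {
        ByteBuffer buf = ByteBuffer.allocate(values.length * Integer.BYTES).order(FILE_ORDER);
        for (int v : values) {
            buf.putInt(v);
        }
        return buf.array();
    }

    // Deserialize ints; the buffer swaps bytes itself when the host order differs.
    static int[] decode(byte[] bytes) {
        ByteBuffer buf = ByteBuffer.wrap(bytes).order(FILE_ORDER);
        int[] out = new int[bytes.length / Integer.BYTES];
        for (int i = 0; i < out.length; i++) {
            out[i] = buf.getInt();
        }
        return out;
    }

    public static void main(String[] args) {
        byte[] bytes = encode(new int[] {1, 0x01020304});
        int[] back = decode(bytes);
        System.out.println(back[0] + " " + Integer.toHexString(back[1]));
    }
}
```

Because the order is set explicitly on every buffer, the round trip gives the same values on any host; only hosts whose native order differs from `FILE_ORDER` pay for the swap.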

If it is true that "The Intel x86 and AMD64 / x86-64 series of processors use the little-endian format." and also that "Internally, Java uses Big Endian.", then we should be able to look at the output and notice the contradiction, and make sure I'm not crazy.
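One way to look at the output, sketched in Java: dump the bytes of a known integer in each order and compare them with what the platform reports. A `ByteBuffer` starts out big-endian no matter what the hardware is, while `ByteOrder.nativeOrder()` reports the host's order. The class `EndianCheck` and its `hex` helper are hypothetical names used only for this illustration.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

class EndianCheck {
    // Hex dump of a 32-bit int as laid out in the given byte order.
    static String hex(int value, ByteOrder order) {
        byte[] bytes = ByteBuffer.allocate(Integer.BYTES).order(order).putInt(value).array();
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        int v = 0x01020304;
        // What the hardware uses versus what a fresh ByteBuffer uses.
        System.out.println("native order:         " + ByteOrder.nativeOrder());
        System.out.println("default buffer order: " + ByteBuffer.allocate(Integer.BYTES).order());
        // The same value laid out both ways, plus the host's own layout.
        System.out.println("big-endian bytes:     " + hex(v, ByteOrder.BIG_ENDIAN));
        System.out.println("little-endian bytes:  " + hex(v, ByteOrder.LITTLE_ENDIAN));
        System.out.println("native bytes:         " + hex(v, ByteOrder.nativeOrder()));
    }
}
```

If the first two lines disagree on a given machine, both quoted statements hold at once: the hardware is little-endian, and Java's buffers still default to big-endian unless the order is set explicitly.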