google-research / long-range-arena

Long Range Arena for Benchmarking Efficient Transformers
Apache License 2.0
710 stars 77 forks source link

Is it really byte-level? #61

Open LuCeHe opened 11 months ago

LuCeHe commented 11 months ago

From your paper it seems like the byte-level classification decomposes a character i.e. 'C' into its binary representation, something like 000101110, but your code gives back 68, which I think it's not what you intended, cause that is simply a char level representation.

Am I wrong?

Your dataset would be still fulfilling its purpose of using very long sequences, but I think it's not char-byte-level, but char-level.