Open eloff opened 9 years ago
The usual way to implement jump tables on amd64 is something like
tableStart: .word label-tableStart
and then the code that uses the table loads the word into a register, adds tableStart, and jumps to the register.
A related compiler request: #5496.
In hand-written asm you can do jump tables, though slighly less efficient than you may want (requires 1 more instruction for dispatch).
In short:
TEXT
blocks instead of labels.$sym<>(SB)
syntax.JMP
to execute it. This can be done in 2 instruction on x86: load index into register, for example, CX
, then do JMP [CX*8+table]
(if it's 32-bit platform, you need to scaling of 4). For Go asm, you need to use explicit register base instead of table
reference as a displacement. This means you load $table<>(SB)
into register, for example AX
and then do JMP (AX)(CX*8)
.(full read, compete example included: go-asm-dispatch-tables)
But please, measure result performance carefully. Branch tables are not always worth it. And the results will depend on the host machine for sure.
Good luck.
If this solution is acceptable, this issue should probably be closed.
Or do we really want "real" labels as data support in cmd/asm
or in x86 backend specifically?
Thanks for the write up!
Would "real" labels as data support needed for #5496?
@martisch : Probably not. The compiler "assembles" its own instructions, so the assembler proper would need no changes. We might need some common support in the obj library.
It would be weird for the compiler to generate code that the assembler can't, though.
I've noticed a couple comments in Go assembly routines "TODO replace this with a jump table in the future" and when I tried implementing an assembly routine with a jump table I realized why it hasn't been tried yet. The assembler does not support using labels in data, so there's no easy way to construct the jump table. My incomplete understanding of x64 architecture is that jumps are signed 32 bit offsets from the instruction pointer, which seems like it would make it difficult to implement. Maybe there's also a way to do a jump to a 64bit virtual address, I don't know.
Anyway if it's not too difficult, this would be a nice to have.