Is there a way to decode from byte stream?

Evian-Zhang commented 9 months ago

When implementing an emulator like QEMU (not KVM mode), there is a decoding stage where instructions are decoded to determine what to do with this instruction, and iced-x86 is the fastest decoder I have ever met, which is the best choice to do this work.

However, it is hard to extract a &[u8] slice which contains the instruction bytes in a common emulator's framework, where the memory itself is also emulated. Though we CAN access any address with any length by issuing emulated page faults and coping the memory values to a temp buffer, and I do know that x86 instruction has a max length (16 maybe?), it may be not a best practice to always get a slice of bytes with this maximum length, since there will always be some bytes unnecessary.

In QEMU, its self-implemented instruction decoder (you can see it here) uses APIs like x86_ldub_code to get bytes to decode, which means "load unsigned byte in code section". This pattern may be more appropriate in this situation, and I wonder if iced-x86 could have a stream decoder which does the same thing, i.e., not directly access a slice, but call .next() on an iterator of u8 to get next byte or peek the next byte.

weltkante commented 9 months ago

Whats preventing you to fetch a bunch of bytes into a buffer and process that? Its usually the most efficient way to handle things (and actually what most stream implementations are doing anyways), so using a stream is just a wrapper around the buffered read concept, something you can easily implement yourself, and not providing any performance benefit on top of that.

PS: I'm not related to this repo, just subscribed to it.

Evian-Zhang commented 9 months ago

@weltkante Thank you for your quick response:)

From my experience dealing with emulated memory model, I cannot come up with an "easily"-implemented buffered reader of bytes in emulated memories. A &[u8] must provide abilities to get total length, access elements at any index, and make sure that these operations have good performance. To do so, we must copy 16 bytes from the emulated memory (by copying rather than referencing, this is because the backend may be not a real "memory" thing), decode it, get the updated ip, and discard remained bytes, and re-copy the next 16 bytes from the emulated memory, and always deal with page fault. It is not as easy as the QEMU-used APIs, i.e., only get needed memories byte-by-byte, without losing any performance for discarded bytes.

I admit that the above explanation is hard to understand, I'm happy to explain any details I did not explain clearly.

weltkante commented 9 months ago

I don't think it'll solve your performance issues if you can't do bulk data transfers, but ignoring that it sounds like you could just implement CodeReader subclass and implement its ReadByte override? or use StreamCodeReader? Then use a Decoder.Create overload taking a CodeReader?

[edit] thinking about, you didn't mention which language you're using, for C# the solution seems so obvious that you're probably using a different version of the API?

Evian-Zhang commented 9 months ago

Thank you for notifying me this API. Yes, StreamCodeReader is a perfect API for my problem. However, I'm using Rust, and the Rust version doesn't have such API. Strange...

wtfsck commented 9 months ago

Yes it's not possible at the moment with the Rust version but something that's useful so will add it to my to do list.

icedland / iced

Is there a way to decode from byte stream? #517