[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH] x86/emul: Correct the decoding of mov to/from cr/dr
>>> On 06.03.17 at 14:30, <andrew.cooper3@xxxxxxxxxx> wrote: > On 06/03/17 13:10, Jan Beulich wrote: >>>>> On 06.03.17 at 11:30, <andrew.cooper3@xxxxxxxxxx> wrote: >>> The mov to/from cr/dr behave as if they were encoded with Mod = 3. When >>> encoded with Mod != 3, no displacement or SIB bytes are fetched. >> Would mind letting us know how you became aware of this oddity? >> It's clearly both unexpected > > Most definitely. > >> and undocumented, > > Actually, the contrary. It is explicitly called out in both Intel and > AMDs instruction manual, which is why I noticed it. Ah, well hidden in the middle of text. Using standard opcode representation for non-standard encodings is far from helpful - that's what I look at mainly (and the text becomes important only when there _are_ oddities there). > I have confirmed that hardware doesn't fetch disp or SIB bytes on my > oldest and newest Intel and AMD boxes. So did I. >> and I wonder whether there are any other opcodes behaving the same. > > Not that I have spotted. Neither me, but there must be something in the decoder special casing those insns, and it's easily imaginable for such special cases to also apply elsewhere. >>> --- a/tools/tests/x86_emulator/test_x86_emulator.c >>> +++ b/tools/tests/x86_emulator/test_x86_emulator.c >>> @@ -894,6 +894,27 @@ int main(int argc, char **argv) >>> } >>> printf("okay\n"); >>> >>> + printf("%-40s", "Testing mov %%cr4,%%esi (bad ModRM)..."); >>> + /* >>> + * Mod = 1, Reg = 4, R/M = 6 would normally encode a memory reference >>> of >>> + * disp8(%esi), but mov to/from cr/dr are special and behave as if they >>> + * were encoded with Mod == 3. >>> + */ >>> + instr[0] = 0x0f; instr[1] = 0x20, instr[2] = 0146; >> I can guess why you've done it this way, but I'd prefer if we didn't >> start using octal numbers. I for one am very used to reading ModRM >> bytes in their hex representation. > > The argument ought to be for which representation is easier. I'd accept > an argument for consistency with the rest of the code, but are you > seriously saying you think that hex is easier to read/manipulate than > octal for ModRM/SIB bytes? Yes, and very seriously in fact. Nevertheless I admit that this is a matter of what one is used to. The only alternative to hex I'd freely accept would be to use bit fields to fill the pieces individually. >>> --- a/xen/arch/x86/x86_emulate/x86_emulate.c >>> +++ b/xen/arch/x86/x86_emulate/x86_emulate.c >>> @@ -2086,7 +2086,8 @@ x86_decode_twobyte( >>> } >>> /* fall through */ >>> case 0x21: case 0x23: /* mov to/from dr */ >>> - generate_exception_if(lock_prefix || ea.type != OP_REG, EXC_UD); >>> + ASSERT(ea.type == OP_REG); /* Early operand adjustment ensures >>> this. */ >>> + generate_exception_if(lock_prefix, EXC_UD); >>> op_bytes = mode_64bit() ? 8 : 4; >>> break; >>> >>> @@ -2427,6 +2428,23 @@ x86_decode( >>> break; >>> } >>> } >>> + else if ( ext == ext_0f ) >>> + { >>> + switch ( b ) >>> + { >>> + case 0x20: /* mov cr,reg */ >>> + case 0x21: /* mov dr,reg */ >>> + case 0x22: /* mov reg,cr */ >>> + case 0x23: /* mov reg,dr */ >>> + /* >>> + * Mov to/from cr/dr ignore the encoding of Mod, and >>> behave as >>> + * if they were encoded as reg/reg instructions. No futher >>> + * disp/SIB bytes are fetched. >>> + */ >>> + modrm_mod = 3; >>> + break; >>> + } >>> + } >> Just like done by >> https://lists.xenproject.org/archives/html/xen-devel/2017-03/msg00588.html >> this calls for the outer if/else to be morphed into a switch(). > > I did that first, but it was rather complicated to read. I can switch > back to that if you'd prefer. Well, if you don't here then that other patch will in any case. Jan _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx https://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |