aniani/nasm - nasm - SDF GIT Society

mirror of https://github.com/netwide-assembler/nasm.git synced 2025-11-08 23:27:15 -05:00

Author	SHA1	Message	Date
H. Peter Anvin	c9457d42a6	WIP checkpoint: more matching changes, starting to work on patterns This is a WIP checkpoint; not all tests pass yet. More matching changes, and hopefully something much closer to what really is desired now. The number of required patterns is now much smaller. However, a lot of changes are needed to the patterns. Since some patterns are repeated all over the place, clean up the x86/addflags.pl script and make it able to generate macro-based common patterns; first use being the patterns for the "basic 8" arithmetic patterns. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-11 21:28:57 -07:00
H. Peter Anvin	bff94fbd39	Major changes to a number of subsystems to improve matching Work through a number of changes toward making matching a lot saner, both to reduce the number of patterns to generate for APX but also to make a number of code patterns simpler. This replaces a fair number of byte codes. Improve a number of error messages, especially related to overflows. Move process_insn() from nasm.c to assemble.c, as it really is the primary entry point to the assembler module. Reorder some prefixes. In particular, F2/F3 override 66 when used as a mandatory prefix, so it makes more sense for them to be closer to the opcode. Move a lot more information into struct insn. It is better to have it in one place; memory consumption is not an issue because struct insn is transient information. Get rid of "optimization levels" and replace it with a mask of flags. That was already halfway done; complete the job. Replace seg:offset in struct out_data with a struct location. It would be better to extend this to more places, too. The ARx and SMx flags are now explicit bitmasks, instead of having a couple of hard-coded ranges. Add __func__ to assert or panic messages. Because of prefix and message changes, a number of travis tests had to be audited and updated. Fix a number of instruction patterns which had .128 when they ought to be .lig. This is no longer a minor issue with the disassembler: for AVX10, the pattern vector length determines how SAE/RC are encoded, and there is no valid 128-bit encoding. However, with .lig the 512-bit encoding can be used. Separate "o64nw" into two pieces: opsize 64 and "nw" = "REX.w not necessary". The latter can be included in non-64-bit patterns. "o64" still set REX.W since that is still the common thing. New "osz" bytecode: emit an OSP or REX.W depending on the current mode and operand size. Useful for special cases like "nop" where "o64 nop" probably wants to be encoded as "48 90". Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-07 17:13:44 -07:00
H. Peter Anvin	19a6fca486	insns.dat: add MOVSX -> CBW/CWDE/CDQE optimiztion; add MOVZX[D] Add MOVSX[D] -> CBW/CWDE/CDQE optimization patterns when the suitable form of the AX register are referenced. Add MOVZX reg64,rm32 pattern which converts to a 32-bit MOV. Add MOVZXD reg64,rm32 alias pattern for consistency. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-04 16:29:08 -07:00
H. Peter Anvin	065f60f062	insns.dat: add {nf} forms of the non-flag-modifying -x instructions For the instructions ending in -x because they don't modify the flags, also accept {nf}. Add 2-operand relaxed versions like for most other instructions, too. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-04 16:01:00 -07:00
H. Peter Anvin	804a1f215b	x86/addflags.pl: for instructions with explicit NF, don't add FL For instructions explicitly tagged NF, don't add an FL annotation. Used for things like ror{nf} -> rorx. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-04 15:58:48 -07:00
H. Peter Anvin	d55f780b96	insns.pl: fix the types for the disassembly tables Make sure to get the right type for the disassembly tables. This is now a fixed-depth tree, so there is no reason to use union types or anything along those lines. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-04 15:57:14 -07:00
H. Peter Anvin	84f2553d7f	x86/insns.pl: redo the way tables are generated for disassembly Change the generation of tables for disassembly to be map-based. This also makes the code a bit more regular. This is the first step at catching up with APX support in the disassembler. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 21:24:59 -07:00
H. Peter Anvin	c79b6a97f0	insns.dat: script to auto-generate ZU and FL flags These are mostly structured. Leave this as an explicit processing step, at least until the correctness of these flags have been fully verified; perhaps after that as well. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 19:07:57 -07:00
H. Peter Anvin	dc76618f5d	insns.pl: don't generate NDD ZU patterns if not necessary If the patterns are inherently ZU, then there is no reason to also generate NDD forms. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 18:18:11 -07:00
H. Peter Anvin	c2eade6379	insns.pl: fix the generation of NDD patterns for {zu} When specified with {zu}, allow generation of NDD patterns if applicable. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 18:05:51 -07:00
H. Peter Anvin	1286a2da4e	Tidy up handling of modr/m and compressed immediates Merge a bunch of common code in the handling of modr/m generation. Make the handing of compressed disp8 simpler and more transparent by exporting a the shift factor for the compressed immediate in ea_data. For the case of no compression, the shift factor is simply 0; there is no need to distinguish "compressed" from "uncompressed". The tidied up version of the disp8 code is simple enough that it makes more sense to inline it. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 16:24:49 -07:00
H. Peter Anvin	b5e613fdf8	Allow more flexiblity for {nf} and {zu} The {nf} and {zu} prefixes (or suffixes) can be used on a number of instructions without actually change the encodings (either they don't touch the flags at all, or they write a 32- or 64-bit register already.) Make this a bit more flexible, by adding an FL instruction flag for the instructions which actually touch the flags, and a ZU instruction flag for the instructions which zero the upper half. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 17:23:06 -07:00
H. Peter Anvin	b32f572f2d	insns.dat: tag instructions from REX2-excluded opcode spaces as NOAPX For the instruction space that are explicitly excluded by REX2, add NOAPX tags. This was done with an automated script. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 16:46:53 -07:00
H. Peter Anvin	8c4ccba365	insns.dat: mark a few old Cyrix instructions obsolete Mark a few old Cyrix instructions obsolete that conflict with opcode map prefixes. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 16:43:38 -07:00
H. Peter Anvin	dda9152b35	apx: smarter determination of REX2 prefix eligibility REX2 encoding is mostly default, so flag the instruction patters which do not support REX2 instead. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 16:18:17 -07:00
H. Peter Anvin	fd08822070	apx: implement JMPABS Implement the JMPABS instruction, which can also be specified as JMP ABS for consistency. Since ABS is already a keyword, this does not pollute the namespace. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-30 17:32:00 -07:00
H. Peter Anvin	973fe19a1b	WIP: apx: now can encode most instructions; handle subcases in EVEX EVEX encoding is really messy, with the 4th register bit in one of several places depending on which type of register it is. It seems to work now. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-30 17:03:58 -07:00
H. Peter Anvin	2e4df506e0	WIP: APX: can now generate code for tested APX instructions a Support generating code for APX instruction and add support for the {nf} prefix. No disassembler support yet, and only a handful instructions encoded. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-29 20:58:04 -07:00
H. Peter Anvin	318a0b9244	WIP: apx: byte code and byte code compiler changes Change the byte code format and the byte code compiler to be able to generate various kinds of APX-format instructions. THE NEW BYTE CODES ARE NOT YET IMPLEMENTED IN THE ASSEMBLER OR DISASSEMBLER. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-28 21:57:31 -07:00
H. Peter Anvin	1618fa745b	apx: support parsing special constants like {dfv=} {dfv=} is basically a constant (immediate). Treat it as such during parsing, except that if "naked" (not in an expression), it has special matching properties and does not need a terminal comma. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-28 16:57:24 -07:00
H. Peter Anvin	6389ac8e47	scanner: generalize the handling of {dfv=} Change the handling of {dfv=} to a more general "braced constant" expression, to be tagged with an instruction flag to make sure they match the instruction in question. This really ought to be an operand flag, but the opflags are precious; as the CCMP/CTEST instructions can also take an immediate it probably is necessary to invent a "special immediate" operand type that can fold these together. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-27 18:06:51 -07:00
H. Peter Anvin	4f0d89dbe6	apx: modify the tokenizer to be able to handle {dfv=...} The {dfv=} prefix sequences for the CCMP and CTEST instructions need special handling in the parser. This means a fair bit of new magic in the handler of the parser, but it just adds to the fun. Try to make this as general as possible, so we can use it for other things. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-27 17:57:08 -07:00
H. Peter Anvin	a556ea3edf	regs: clean up and somewhat automate register range flags Doing the register range flags by hand is a bit more work than necessary when dealing with APX, so auto-generate the flags for ranges {0, 1-15, 16+} using 3 bits. In theory we could handle even more automagically by splitting ranges up further: the existing ranges are sets of {0, 1, 2, 3, 4-5, 6-7, 8-15, 16-31} which would require 7 bits, although it would remove most of the subclass bits for registers; it would require separating the subclass bits for EAs from the ones for registers (which might be a good idea anyway...) Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-23 16:04:47 -07:00
H. Peter Anvin	49640ed315	x86: move the bytecode defintion into a separate file in x86/ At least three files (asm/assemble.c, disasm/disasm.c, and x86/insns.pl) depend on the bytecode defintions. It makes a lot more sense for them to live in an explicit documentation file in the x86/ directory. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-23 12:47:25 -07:00
H. Peter Anvin	adf4f5cd5e	x86/insns.dat: add missing semicolon in comment Comment was missing a semicolon; fix to avoid unnecessary warning and to make sure the documentation is generated correctly. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:37:09 -08:00
Tomasz Kantecki	b0ab00b6a7	x86: SM4-NI VEX support Add VEX-encoded SM4-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:24:38 -08:00
Tomasz Kantecki	5cab6596bc	x86/insns.dat: SM3-NI VEX support Add VEX-encoded SM3-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:23:30 -08:00
Tomasz Kantecki	5f684412c7	x86/insns.dat: SHA512-NI VEX support Add support for VEX-encoded SHA512-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:21:21 -08:00
H. Peter Anvin	24275695ff	x86/insns.dat: PTWRITE requires the F3 prefix PTWRITE takes an f3 prefix, not np. Reported-by: Markus T. Metzger <markus.t.metzger@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:15:56 -08:00
H. Peter Anvin	b4300ac280	x86: SMAP instructions are NP The SMAP instructions are np; notably the prefixed versions of CLAC are ERETU/ERETS. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-12-14 17:57:27 -08:00
H. Peter Anvin	dd52f386b9	x86: implement FRED: ERETS, ERETU, LKGS Kind of embarrassing... I had not implemented the FRED instruction, despite personally being one of the architects of FRED ;) Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-12-14 17:04:49 -08:00
H. Peter Anvin	e993b75aa6	XCHG: adjust lock prefix warning, add specific warning for LOCK XCHG "LOCK XCHG reg,mem" would issue a warning for being unlockable, which is incorrect. In this case the RM encoding is simply an alias for the MR encoding. Add a "LOCK1" bit to deal with that. However, XCHG is always locked, so create a new warning to explicitly flag a user-specified LOCK XCHG; default off. Consider optimizing that prefix away in the future, but for now, let's stick to the user-requested code sequence. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-10-12 14:53:40 -07:00
H. Peter Anvin	9f31c84405	insns: handle late-introduced VEX encoded instructions For VEX instructions created after the corresponding EVEX instructions, we need the user to either explicitly declare them {vex} or specifying "cpu latevex". Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-12-06 13:38:33 -08:00
H. Peter Anvin	5a25ad12b2	insns: fix instruction flags for the ENQCMD instructions Set a more complete set of flags for the ENQCMD family instructions. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-14 17:53:06 -08:00
H. Peter Anvin	7c784b0ddb	insns: add HRESET instruction Add the HRESET instruction Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-14 17:45:29 -08:00
H. Peter Anvin	4369faf827	insns: add vector instructions from ISE 046, Sept 2022 Add vector instructions from the Intel Instruction Set Extensions document, version 046, September 2022. Still need to check for missing instructions that have already passed through the ISE into the SDM. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-14 17:28:52 -08:00
H. Peter Anvin	2b01ddf2ec	x86/insns.dat: non-vector instructions from ISE 319433-046 2022-09 Additional nonvector instructions from the Intel Instruction Set Extensions document 319433-046 September 2022. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-12 13:15:03 -08:00
H. Peter Anvin	a3fd34ab80	x86/insns.pl: sort conditional instructions alphabetically Makes the build a bit more predictable and debuggable. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-12 12:59:23 -08:00
H. Peter Anvin	a2eabbe1d7	insns: drop special handling of conditional instructions Instead of handling conditional instructions ad hoc, generate individual instruction patterns as normal. This simplifies the code and makes CMPccXADD support simpler (otherwise it would be necessary to hack in the handling of a condition code in the middle of an instruction.) Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-12 12:37:37 -08:00
H. Peter Anvin	1e50772539	Merge remote-tracking branch 'Gramner/vpexpand'	2022-11-07 16:28:12 -08:00
H. Peter Anvin	b18e870d90	Merge remote-tracking branch 'ElyesH/typos'	2022-11-07 12:39:44 -08:00
Iouri Kharon	21d8dbfabb	restire: Support of AVX512-FP16 Instructions Add support for AVX512-FP16 instructions and the associated handling. Allow "mapN" syntax as well as "mN" syntax to match the documentation. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-07 12:21:23 -08:00
H. Peter Anvin	7a2b5c9221	x86/insns.dat: fix VCVTNEPS2BF16 The VCVTNEPS2BF16 instruction was incorrectly specified as VCVTNE2S2BF16. Fortunately, the correct opcode for the latter was specified first, so it would emit the correct result when that instruction was specified. Fixes: https://bugzilla.nasm.us/show_bug.cgi?id=3392821 Reported-by: Agner <agner@agner.org> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-07 10:03:45 -08:00
H. Peter Anvin	bb1233ccde	Add FRED instructions Add the FRED instructions: ERETU, ERETS, LKGS Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-10-05 13:31:30 -07:00
Henrik Gramner	3578cd907f	x86/insns.dat: Fix VPCOMPRESSB and VPCOMPRESSW disp8 Signed-off-by: Henrik Gramner <henrik@gramner.com>	2022-01-25 04:14:23 +01:00
Henrik Gramner	1b84b4e276	x86/insns.dat: Fix VPEXPANDB and VPEXPANDW encoding Signed-off-by: Henrik Gramner <henrik@gramner.com>	2022-01-25 01:53:31 +01:00
Elyes HAOUAS	cdf7ad02c2	Fix some typos while on it, remove unneeded white spaces. Signed-off-by: Elyes HAOUAS <ehaouas@noos.fr>	2022-01-09 17:34:35 +01:00
Cyrill Gorcunov	e2ed7b7e12	x86/insns: add VMGEXIT The instruction supports two forms with [f2] and [f3]. I guess we might add aliases as VMGEXIT2 and VMGEXIT3. For now simly leave a second form for ndisasm sake. https://bugzilla.nasm.us/show_bug.cgi?id=3392755 Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2021-05-13 09:14:10 +03:00
Cyrill Gorcunov	c4babdf2db	x86/insns: add RMPADJUST https://bugzilla.nasm.us/show_bug.cgi?id=3392754 Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2021-05-13 01:29:32 +03:00
Cyrill Gorcunov	1430995095	x86/insns: add PVALIDATE https://bugzilla.nasm.us/show_bug.cgi?id=3392753 Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2021-05-13 01:27:07 +03:00

... 2 3 4 5 6

271 Commits