aniani/nasm - nasm - SDF GIT Society

mirror of https://github.com/netwide-assembler/nasm.git synced 2025-10-10 00:25:06 -04:00

Author	SHA1	Message	Date
InstLatx64	172c4b2342	Missing AVX-VNNI_INT{,8,16} instructions -- AVX-VNNI_INT{,8,16} instructions: VPDP{B,W}{SS,SU,US,UU}{D,DS} - AVX-VNNI_INT{,8,16} test files Checked with XED version: [v2025.06.08]	2025-10-07 09:52:13 +02:00
InstLatx64	62f5f6990f	AMX-COMPLEX support -- TCMMIMFP16PS, TCMMRLFP16PS instructions -- AMX.asm fix: Similar to GATHER instructions, 3-operand AMX instructions cannot have the same operand more than once Checked with XED version: [v2025.06.08]	2025-10-06 19:17:43 +02:00
Yongjie Sheng	e548c76ab3	add AMX instruction TDPFP16PS	2025-09-24 19:52:27 +08:00
Yongjie Sheng	8a30c94a09	add aes key locker instructions	2025-09-24 11:25:51 +08:00
H. Peter Anvin	c0be53fc85	insns.dat: fix flags for the MSR instructions - The MSR immediate instructions are under a separate flag - All MSR instructions are privileged Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2025-09-23 12:12:12 -07:00
Maciej Wieczor-Retman	3edef01637	insns: avx: amx: Add missing instructions from ISE june 2025 Add all the missing instructions / instruction variants that are specified in the 2025 June Intel ISE. Signed-off-by: Maciej Wieczor-Retman <maciej.wieczor-retman@intel.com>	2025-09-23 18:56:26 +02:00
Maciej Wieczor-Retman	f0efb28d98	assemble: apx: Add NF forbidden flag and fix SBB and ADC ADC and SBB don't support using the {nf} prefix. They are the only one in the arithmetic instructions group that are this way. Add a flag that will warn when an instructions wants to use {nf} but doesnt' support it. Signed-off-by: Maciej Wieczor-Retman <maciej.wieczor-retman@intel.com>	2025-09-19 14:53:04 +02:00
H. Peter Anvin	9aa48acb3e	Merge remote-tracking branch 'yongjie/master'	2025-09-13 20:10:04 -07:00
Yongjie Sheng	e7a3279828	add avx10_2 instruction VPDPWxxxx family and AVX10_VNNIINT flag	2025-09-13 09:05:56 +08:00
Maciej Wieczor-Retman	ed290acf80	insns: travis: apx: APX MOVRS instruction Add the APX database entry for MOVRS and relevant test cases. Signed-off-by: Maciej Wieczor-Retman <maciej.wieczor-retman@intel.com>	2025-09-11 14:17:32 +02:00
H. Peter Anvin	0852ca5694	disasm: handle NOP disassembly, remove debug message NOP disassembly is a little "special" because it sits as part of the XCHG instructions. Add a flag to bail out of the disassembler search early, and ignore the 0330 bytecode. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2025-09-02 20:01:36 -07:00
H. Peter Anvin	acd01496d7	asm: distinguish between VEX.V as an immediate and a prefix; fix WW If VEX.V is an immediate, it should not be subject to register range checks. If the WW flag is set, REX_W needs to be OR'd in, not XOR'd, because the map might have the W bit set for matching purposes. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2025-09-02 15:38:49 -07:00
H. Peter Anvin	3efdd3cf9a	assemble: rex2.w; hinted Jcc in 64-bit mode; UDB - rex2.w is used as a opcode extension (JMPABS), not rex2.x1 as an earlier version of the spec had. - Segment prefixes used as Jcc hints are valid in 64-bit mode. - Avoid duplicate warning messages for ignored/invalid prefixes. * emit_prefixes() is called twice during code generation. - Add the UDB #UD opcode in 64-bit mode; SALC is 16/32-bit only. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2025-08-29 13:55:30 -07:00
H. Peter Anvin	863bddbdcb	iflags: add NOREX flag Add a NOREX flag to indicate that an instruction pattern is not compatible with REX encoding. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2024-08-22 23:41:32 -07:00
H. Peter Anvin	ecbd1c81b3	insns: fix MOVBE CPUID flag, BSWAP 16-bit XCHG patterns Add the MOVBE CPUID flag, add helper patterns for 16-bit BSWAP emulation. Unfortunately using ROL/ROR for registers other than the ones for which XCHG can work clobbers the flags. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2024-08-22 23:32:42 -07:00
H. Peter Anvin	2b2f1fc98a	More macroizing and sorting of instructions into categories More work on cleaning up instruction patterns, fixing matchig corner cases, and tidying up the organization of insns.dat. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2024-08-22 23:22:59 -07:00
H. Peter Anvin	253ff4f370	insns: tag pseudo-instructions explicitly; change insnsa.c format Tag pseudo-instructions explicitly and don't set any CPU level flag for those. Change insnsa.c to have (length, pointer) rather than using an ever increasing in size sentinel at the end of each table. This also means that empty tables (Dx, INCBIN) can be omitted entirely. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-21 12:50:31 -07:00
H. Peter Anvin	58024b4611	insns: more instruction macroizing/fixups; remote FUTURE tags Add more instruction macros and fix problems. Adjust some matching problems. Remove all FUTURE tags from the instruction list, and add a bunch of new CPUID tags. Hopefully a small step toward actually getting CPU feature selection working properly in the future. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-21 11:48:47 -07:00
H. Peter Anvin	75f6f4cdb2	WIP: more matching and template work Further work on a better matching system. Still a work in progress, however. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2024-08-20 12:59:07 -07:00
H. Peter Anvin	3b55b62f02	apx: implement the mechanism for evex.zu Implement the mechanism needed to handle {zu} suffixes that actually set ND (IMUL, SETcc). Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-14 15:44:38 -07:00
H. Peter Anvin	c9457d42a6	WIP checkpoint: more matching changes, starting to work on patterns This is a WIP checkpoint; not all tests pass yet. More matching changes, and hopefully something much closer to what really is desired now. The number of required patterns is now much smaller. However, a lot of changes are needed to the patterns. Since some patterns are repeated all over the place, clean up the x86/addflags.pl script and make it able to generate macro-based common patterns; first use being the patterns for the "basic 8" arithmetic patterns. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-11 21:28:57 -07:00
H. Peter Anvin	bff94fbd39	Major changes to a number of subsystems to improve matching Work through a number of changes toward making matching a lot saner, both to reduce the number of patterns to generate for APX but also to make a number of code patterns simpler. This replaces a fair number of byte codes. Improve a number of error messages, especially related to overflows. Move process_insn() from nasm.c to assemble.c, as it really is the primary entry point to the assembler module. Reorder some prefixes. In particular, F2/F3 override 66 when used as a mandatory prefix, so it makes more sense for them to be closer to the opcode. Move a lot more information into struct insn. It is better to have it in one place; memory consumption is not an issue because struct insn is transient information. Get rid of "optimization levels" and replace it with a mask of flags. That was already halfway done; complete the job. Replace seg:offset in struct out_data with a struct location. It would be better to extend this to more places, too. The ARx and SMx flags are now explicit bitmasks, instead of having a couple of hard-coded ranges. Add __func__ to assert or panic messages. Because of prefix and message changes, a number of travis tests had to be audited and updated. Fix a number of instruction patterns which had .128 when they ought to be .lig. This is no longer a minor issue with the disassembler: for AVX10, the pattern vector length determines how SAE/RC are encoded, and there is no valid 128-bit encoding. However, with .lig the 512-bit encoding can be used. Separate "o64nw" into two pieces: opsize 64 and "nw" = "REX.w not necessary". The latter can be included in non-64-bit patterns. "o64" still set REX.W since that is still the common thing. New "osz" bytecode: emit an OSP or REX.W depending on the current mode and operand size. Useful for special cases like "nop" where "o64 nop" probably wants to be encoded as "48 90". Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-07 17:13:44 -07:00
H. Peter Anvin	1286a2da4e	Tidy up handling of modr/m and compressed immediates Merge a bunch of common code in the handling of modr/m generation. Make the handing of compressed disp8 simpler and more transparent by exporting a the shift factor for the compressed immediate in ea_data. For the case of no compression, the shift factor is simply 0; there is no need to distinguish "compressed" from "uncompressed". The tidied up version of the disp8 code is simple enough that it makes more sense to inline it. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-08-03 16:24:49 -07:00
H. Peter Anvin	b5e613fdf8	Allow more flexiblity for {nf} and {zu} The {nf} and {zu} prefixes (or suffixes) can be used on a number of instructions without actually change the encodings (either they don't touch the flags at all, or they write a 32- or 64-bit register already.) Make this a bit more flexible, by adding an FL instruction flag for the instructions which actually touch the flags, and a ZU instruction flag for the instructions which zero the upper half. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 17:23:06 -07:00
H. Peter Anvin	dda9152b35	apx: smarter determination of REX2 prefix eligibility REX2 encoding is mostly default, so flag the instruction patters which do not support REX2 instead. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-31 16:18:17 -07:00
H. Peter Anvin	318a0b9244	WIP: apx: byte code and byte code compiler changes Change the byte code format and the byte code compiler to be able to generate various kinds of APX-format instructions. THE NEW BYTE CODES ARE NOT YET IMPLEMENTED IN THE ASSEMBLER OR DISASSEMBLER. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-28 21:57:31 -07:00
H. Peter Anvin	6389ac8e47	scanner: generalize the handling of {dfv=} Change the handling of {dfv=} to a more general "braced constant" expression, to be tagged with an instruction flag to make sure they match the instruction in question. This really ought to be an operand flag, but the opflags are precious; as the CCMP/CTEST instructions can also take an immediate it probably is necessary to invent a "special immediate" operand type that can fold these together. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-07-27 18:06:51 -07:00
Tomasz Kantecki	b0ab00b6a7	x86: SM4-NI VEX support Add VEX-encoded SM4-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:24:38 -08:00
Tomasz Kantecki	5cab6596bc	x86/insns.dat: SM3-NI VEX support Add VEX-encoded SM3-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:23:30 -08:00
Tomasz Kantecki	5f684412c7	x86/insns.dat: SHA512-NI VEX support Add support for VEX-encoded SHA512-NI instructions. Signed-off-by: Tomasz Kantecki <tomasz.kantecki@intel.com> Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2024-01-29 16:21:21 -08:00
H. Peter Anvin	b4300ac280	x86: SMAP instructions are NP The SMAP instructions are np; notably the prefixed versions of CLAC are ERETU/ERETS. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-12-14 17:57:27 -08:00
H. Peter Anvin	dd52f386b9	x86: implement FRED: ERETS, ERETU, LKGS Kind of embarrassing... I had not implemented the FRED instruction, despite personally being one of the architects of FRED ;) Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-12-14 17:04:49 -08:00
H. Peter Anvin	e993b75aa6	XCHG: adjust lock prefix warning, add specific warning for LOCK XCHG "LOCK XCHG reg,mem" would issue a warning for being unlockable, which is incorrect. In this case the RM encoding is simply an alias for the MR encoding. Add a "LOCK1" bit to deal with that. However, XCHG is always locked, so create a new warning to explicitly flag a user-specified LOCK XCHG; default off. Consider optimizing that prefix away in the future, but for now, let's stick to the user-requested code sequence. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2023-10-12 14:53:40 -07:00
H. Peter Anvin	9f31c84405	insns: handle late-introduced VEX encoded instructions For VEX instructions created after the corresponding EVEX instructions, we need the user to either explicitly declare them {vex} or specifying "cpu latevex". Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-12-06 13:38:33 -08:00
H. Peter Anvin	7c784b0ddb	insns: add HRESET instruction Add the HRESET instruction Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-14 17:45:29 -08:00
H. Peter Anvin	4369faf827	insns: add vector instructions from ISE 046, Sept 2022 Add vector instructions from the Intel Instruction Set Extensions document, version 046, September 2022. Still need to check for missing instructions that have already passed through the ISE into the SDM. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-14 17:28:52 -08:00
H. Peter Anvin	2b01ddf2ec	x86/insns.dat: non-vector instructions from ISE 319433-046 2022-09 Additional nonvector instructions from the Intel Instruction Set Extensions document 319433-046 September 2022. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-12 13:15:03 -08:00
H. Peter Anvin	a2eabbe1d7	insns: drop special handling of conditional instructions Instead of handling conditional instructions ad hoc, generate individual instruction patterns as normal. This simplifies the code and makes CMPccXADD support simpler (otherwise it would be necessary to hack in the handling of a condition code in the middle of an instruction.) Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-12 12:37:37 -08:00
H. Peter Anvin	b18e870d90	Merge remote-tracking branch 'ElyesH/typos'	2022-11-07 12:39:44 -08:00
Iouri Kharon	21d8dbfabb	restire: Support of AVX512-FP16 Instructions Add support for AVX512-FP16 instructions and the associated handling. Allow "mapN" syntax as well as "mN" syntax to match the documentation. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-11-07 12:21:23 -08:00
H. Peter Anvin	bb1233ccde	Add FRED instructions Add the FRED instructions: ERETU, ERETS, LKGS Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2022-10-05 13:31:30 -07:00
Elyes HAOUAS	cdf7ad02c2	Fix some typos while on it, remove unneeded white spaces. Signed-off-by: Elyes HAOUAS <ehaouas@noos.fr>	2022-01-09 17:34:35 +01:00
H. Peter Anvin	d988ce719c	Fix inefficient encoding of MPX instructions BNDMK, BNDLDX, and BNDSTX are split-SIB (MIB) instructions, but do not require a SIB encoding. However, TILELOAD* and TILESTORE* do require a SIB in all cases. Split the MIB flag into MIB (split address) and SIB (SIB required) flags. This fixes travis test mpx. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2020-08-13 17:21:00 -07:00
H. Peter Anvin	b31a4c9906	Add support for new instructions from ISE June 2020 Add support for new instructions as defined in the Instruction Set Extensions manual as of June 2020. Signed-off-by: H. Peter Anvin <hpa@zytor.com>	2020-07-16 21:52:15 -07:00
Henrik Gramner	bca6b26a7e	insns.dat: Add Intel Control-Flow Enforcement Technology (CET) instructions Add instructions for Intel Control Flow Enforcement Technology (CET). Signed-off-by: Henrik Gramner <henrik@gramner.com> Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2020-06-27 16:12:37 -07:00
H. Peter Anvin (Intel)	02b60ddd1c	LEA: allow immediate syntax; ignore operand size entirely The memory operand size of LEA doesn't matter in any way as it isn't "real memory". Add an ANYSIZE option to ignore sizes entirely. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2019-08-14 15:23:00 -07:00
H. Peter Anvin (Intel)	5b39461178	obsolete handing: handle a few more subcases in a useful way Distinguish instructions which have once been valid (OBSOLETE) from those that never saw the light of day (NEVER). Futhermore, flag instructions which devolve to an architectural noop from those with undefined behavior and possibly recycled opcodes. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2019-08-09 14:52:16 -07:00
H. Peter Anvin (Intel)	67289aefb5	iflags.ph: add file missing from commit `418138c8f2` Add file missing from commit `418138c8f2`: iflags: move definitions to a separate file; auto-generate more Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com>	2019-08-07 00:56:39 -07:00

48 Commits