DOS MZ executable

DOS MZ executable
DOS MZ executable
Filename extension	.exe, .com, .dll, .sys
Internet media type	application/x-dosexec, application/x-msdos-program, application/x-ms-dos-executable
Magic number	4D 5A (MZ in ASCII)
Type of format	Binary, executable
Extended to	New Executable; Linear Executable; Portable Executable

The DOS MZ executable format is the executable file format used for .EXE files under the DOS and Windows operating systems.

The file can be identified by the ASCII string "MZ" (hexadecimal: 4D 5A) at the beginning of the file (the "magic number"). "MZ" are the initials of Mark Zbikowski, one of the leading developers of MS-DOS.^[1]

The MZ DOS executable file is newer than the COM executable format and differs from it. The DOS executable header contains relocation information, which allows multiple segments to be loaded at arbitrary memory addresses, and it supports executables larger than 64 KB; however, the format still requires relatively low memory limits. These limits were later bypassed using DOS extenders.

Segment handling

The environment of an EXE program run by DOS is found in its Program Segment Prefix.

EXE files normally have separate segments for the code, data, and stack. Program execution begins at address 0 of the code segment, and the stack pointer register is set to whatever value is contained in the header information (thus if the header specifies a 512 byte stack, the stack pointer is set to 200h). It is possible to not use a separate stack segment and simply use the code segment for the stack if desired.

The DS (data segment) register normally contains the same value as the CS (code segment) register and is not loaded with the actual segment address of the data segment when an EXE file is initialized; it is necessary for the programmer to set it themselves, generally done via the following instructions:

    MOV AX, @DATA
    MOV DS, AX

Termination

In the original DOS 1.x API, it was also necessary to have the CS register pointing to the segment with the PSP at program termination; this was done via the following instructions:

    PUSH DS
    XOR AX, AX
    PUSH AX

Program termination would then be performed by a RETF instruction, which would retrieve the original segment address with the PSP from the stack and then jump to address 0, which contained an INT 20h instruction.

The DOS 2.x API introduced a new program termination function, INT 21h Function 4Ch which does not require saving the PSP segment address at the start of the program, and Microsoft advised against the use of the older DOS 1.x method.

Compatibility

MZ DOS executables can be run from DOS and Windows 9x-based operating systems. 32-bit Windows NT-based operating systems can execute them using their built-in Virtual DOS machine (although some graphics modes are unsupported). 64-bit versions of Windows cannot execute them. Alternative ways to run these executables include DOSBox and DOSEMU.

MZ DOS executables can be created by linkers, like Digital Mars Optlink, MS linker, VALX or Open Watcom's WLINK; additionally, FASM can create them directly.

References

^ Inside Windows: An In-Depth Look into the Win32 Portable Executable File Format - MSDN Magazine, February 2002 Archived 2018-07-11 at the Wayback Machine. "Every PE file begins with a small MS-DOS executable. ... The first bytes of a PE file begin with the traditional MS-DOS header, called an IMAGE_DOS_HEADER. The only two values of any importance are e_magic and e_lfanew. ... The e_magic field (a WORD) needs to be set to the value 0x5A4D. ... In ASCII representation, 0x5A4D is MZ, the initials of Mark Zbikowski, one of the original architects of MS-DOS."

External links

OSDev Wiki - MZ format details

[1] Inside Windows: An In-Depth Look into the Win32 Portable Executable File Format - MSDN Magazine, February 2002 Archived 2018-07-11 at the Wayback Machine. "Every PE file begins with a small MS-DOS executable. ... The first bytes of a PE file begin with the traditional MS-DOS header, called an IMAGE_DOS_HEADER. The only two values of any importance are e_magic and e_lfanew. ... The e_magic field (a WORD) needs to be set to the value 0x5A4D. ... In ASCII representation, 0x5A4D is MZ, the initials of Mark Zbikowski, one of the original architects of MS-DOS."

[1]

v t e Disk operating systems (DOS)
API Timeline Comparison Commands Games
MS-DOS, IBM PC DOS, compatible systems	MS-DOS Multitasking MS-DOS 4.0/4.1 MS-DOS 7 IBM PC DOS DOS/V DR-DOS H-DOS Novell DOS ROM-DOS SISNE plus PTS-DOS FreeDOS
Other x86	4680 OS 4690 OS 86-DOS ADOS Concurrent CP/M-86 Concurrent DOS CP/M-86 CP/K Datapac System Manager DOS Plus K8918-OS FlexOS MP/M-86 Multiuser DOS NetWare PalmDOS Novell DOS OpenDOS PC-MOS/386 REAL/32 SB-86 SCP1700 Towns OS TurboDOS
Other platforms	AmigaDOS AMSDOS ANDOS Apple DOS Apple ProDOS Apple SOS Atari DOS Atari TOS BW-DOS Commodore DOS Concurrent DOS 68K Concurrent DOS V60 CP/M Cromemco DOS CSI-DOS DEC BATCH-11/DOS-11 DIP DOS DOS/360 DOS XL Edos EOS FLEX GEMDOS IDEDOS IMDOS iS-DOS ISIS MDOS MicroDOS MP/M MSX-DOS MyDOS NewDos/80 OS/M PTDOS RealDOS SB-80 SCP Sinclair QDOS RDOS SmartDOS SpartaDOS SpartaDOS X Technical Support SuperDOS Top-DOS TR-DOS TRSDOS TurboDOS UDOS Z-DOS Z80-RIO
Category List

Offset (hex)	Size (bytes)	Field Name	Description
00	2	Signature	The magic number 'MZ' (ASCII 4Dh 5Ah) or 'ZM' (5Ah 4Dh), identifying the file as a valid MZ executable. This signature honors Mark Zbikowski, a key Microsoft engineer involved in its design.^[16]^[4]
02	2	Bytes in last page	The number of bytes in the final 512-byte page of the file, ranging from 1 to 511; a value of 0 indicates the last page is full (512 bytes). This field, combined with the total pages field, allows calculation of the exact file size.^[16]^[5]
04	2	Total pages	The number of 512-byte pages in the file, including any partial last page. This value represents the file size in units of 512 bytes.^[16]^[5]
06	2	Relocation entries count	The number of entries in the relocation table, which specifies offsets requiring segment address adjustments during loading. A value of 0 indicates no relocations are needed.^[16]^[4]
08	2	Header paragraphs	The size of the header (including the relocation table) in paragraphs of 16 bytes each. This field indicates the starting offset of the loadable program image within the file. For example, a value of 4 means the header occupies the first 64 bytes.^[16]^[5]
0A	2	Minimum extra paragraphs	The minimum number of additional 16-byte paragraphs of memory the program requests beyond what is needed for the load image and program segment prefix (PSP). If insufficient memory is available, loading fails.^[16]^[4]
0C	2	Maximum extra paragraphs	The maximum number of additional 16-byte paragraphs the program can utilize beyond the minimum. DOS attempts to allocate as much as possible up to this limit from available memory. Historically, a value of FFFFh requests all remaining memory.^[16]^[5]
0E	2	Initial SS	The initial value for the stack segment register (SS), specified as a relocatable offset relative to the start of the load image. This helps establish the program's stack area in memory.^[16]^[4]
10	2	Initial SP	The initial value for the stack pointer register (SP), defining the top of the stack upon program entry. Typically set to point to the end of the allocated stack space.^[16]^[5]
12	2	Checksum	A simple integrity check computed as the one's complement of the sum of all 16-bit words in the file; the sum of all words including this checksum should equal zero. This field is often set to zero and not always verified by DOS.^[16]^[4]
14	2	Initial IP	The initial value for the instruction pointer register (IP), indicating the offset within the code segment where execution begins.^[16]^[5]
16	2	Initial CS	The initial value for the code segment register (CS), specified as a relocatable offset relative to the start of the load image. This points to the base of the program's code in memory.^[16]^[4]
18	2	Relocation table offset	The byte offset from the start of the file to the beginning of the relocation table. If no relocations exist, this may point to the load image or be unused.^[16]^[5]
1A	2	Overlay number	A value used for overlay management in segmented programs; 0 indicates the main executable module, while nonzero values denote overlays loaded on demand. This field is typically 0 for non-overlay programs.^[16]^[4]

Knowledge Base

Talk Channels

Special Pages

DOS MZ executable

DOS MZ executable

DOS MZ executable

Segment handling

Termination

Compatibility

See also

Further reading

References

External links