Code page 850

Code page 858
MIME / IANA	IBM00858
Alias(es)	CCSID00858, CP00858, PC-Multilingual-850+euro
Transforms / Encodes	ISO 8859-1
Preceded by	Code page 850

Code page 850
	Code page 850 character set with 9×14 glyphs, as usually rendered by Enhanced Graphics Array (EGA)
MIME / IANA	IBM850
Alias(es)	cp850, 850, csPC850Multilingual, DOS Latin 1, OEM 850
Languages	English, various others
Classification	Extended ASCII, OEM code page
Extends	US-ASCII
Based on	OEM-US
Transforms / Encodes	ISO/IEC 8859-1 (reordered)
Other related encodings	Code page 858 (PC DOS 2000's "modified code page 850"), code page 437

Code page 850 (CCSID 850) (also known as CP 850, IBM 00850,^[2] OEM 850,^[3] DOS Latin 1^[4]) is a code page used under DOS operating systems^[a] in Western Europe.^[5] Depending on the country setting and system configuration, code page 850 is the primary code page and default OEM code page in many countries, including various English-speaking locales (e.g. in the United Kingdom, Ireland, and Canada), whilst other English-speaking locales (like the United States) default to the hardware code page 437.^[6]

Code page 850 differs from code page 437 in that many of the box-drawing characters, Greek letters, and various symbols were replaced with additional Latin letters with diacritics, thus greatly improving support for Western European languages (all characters from ISO 8859-1 are included). At the same time, the changes frequently caused display glitches with programs that made use of the box-drawing characters to display a GUI-like surface in text mode.

After the DOS era, successor operating systems largely replaced code page 850 with Windows-1252,^[b] later UCS-2 and UTF-16,^[c] and finally UTF-8. However, legacy applications, especially command-line programs, may still depend on support for older code pages.

Character set

Each non-ASCII character appears with its equivalent Unicode code-point. Differences from code page 437 are limited to the second half of the table, the first half being the same.

Code page 850^[3]^[7]^[8]^[9]^[10]
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
0x 0	NUL	☺︎ 263A	☻ 263B	♥︎ 2665	♦︎ 2666	♣︎ 2663	♠︎ 2660	• 2022	◘ 25D8	○ 25CB	◙ 25D9	♂︎ 2642	♀︎ 2640	♪ 266A	♫ 266B	☼ 263C
1x 16	► 25BA	◄ 25C4	↕︎ 2195	‼︎ 203C	¶ 00B6	§ 00A7	▬ 25AC	↨ 21A8	↑ 2191	↓ 2193	→ 2192	← 2190	∟ 221F	↔︎ 2194	▲ 25B2	▼ 25BC
2x 32	SP	!	"	#	$	%	&	'	(	)	*	+	,	-	.	/
3x 48	0	1	2	3	4	5	6	7	8	9	:	;	<	=	>	?
4x 64	@	A	B	C	D	E	F	G	H	I	J	K	L	M	N	O
5x 80	P	Q	R	S	T	U	V	W	X	Y	Z	[	\	]	^	_
6x 96	`	a	b	c	d	e	f	g	h	i	j	k	l	m	n	o
7x 112	p	q	r	s	t	u	v	w	x	y	z	{	\|	}	~	⌂ 2302
8x 128	Ç 00C7	ü 00FC	é 00E9	â 00E2	ä 00E4	à 00E0	å 00E5	ç 00E7	ê 00EA	ë 00EB	è 00E8	ï 00EF	î 00EE	ì 00EC	Ä 00C4	Å 00C5
9x 144	É 00C9	æ 00E6	Æ 00C6	ô 00F4	ö 00F6	ò 00F2	û 00FB	ù 00F9	ÿ 00FF	Ö 00D6	Ü 00DC	ø 00F8	£ 00A3	Ø 00D8	× 00D7	ƒ 0192
Ax 160	á 00E1	í 00ED	ó 00F3	ú 00FA	ñ 00F1	Ñ 00D1	ª 00AA	º 00BA	¿ 00BF	® 00AE	¬ 00AC	½ 00BD	¼ 00BC	¡ 00A1	« 00AB	» 00BB
Bx 176	░ 2591	▒ 2592	▓ 2593	│ 2502	┤ 2524	Á 00C1	Â 00C2	À 00C0	© 00A9	╣ 2563	║ 2551	╗ 2557	╝ 255D	¢ 00A2	¥ 00A5	┐ 2510
Cx 192	└ 2514	┴ 2534	┬ 252C	├ 251C	─ 2500	┼ 253C	ã 00E3	Ã 00C3	╚ 255A	╔ 2554	╩ 2569	╦ 2566	╠ 2560	═ 2550	╬ 256C	¤ 00A4
Dx 208	ð 00F0	Ð 00D0	Ê 00CA	Ë 00CB	È 00C8	ı 0131	Í 00CD	Î 00CE	Ï 00CF	┘ 2518	┌ 250C	█ 2588	▄ 2584	¦ 00A6	Ì 00CC	▀ 2580
Ex 224	Ó 00D3	ß 00DF	Ô 00D4	Ò 00D2	õ 00F5	Õ 00D5	µ 00B5	þ 00FE	Þ 00DE	Ú 00DA	Û 00DB	Ù 00D9	ý 00FD	Ý 00DD	¯ 00AF	´ 00B4
Fx 240	SHY 00AD	± 00B1	‗ 2017	¾ 00BE	¶ 00B6	§ 00A7	÷ 00F7	¸ 00B8	° 00B0	¨ 00A8	· 00B7	¹ 00B9	³ 00B3	² 00B2	■ 25A0	NBSP 00A0

Differences from code page 437

Code page 858

In 1998, code page 858 (CCSID 858)^[11] (also known as CP 858, IBM 00858, OEM 858^[3]) was derived from this code page by changing code point 213 (D5_hex) from a dotless i ⟨ı⟩ to the euro sign ⟨€⟩ U+20AC.^[12]^[13]^[14] Unlike most code pages modified to support the euro sign, the generic currency sign at CF_hex was not chosen as the character to replace (compare ISO-8859-15 (from ISO-8859-1), code pages 808 (from 866), 848 (from 1125), 849 (from 1131) and 872 (from 855), ISO-IR-205 (from ISO-8859-4), ISO-IR-206 (from ISO-8859-13), and the changes to MacRoman and MacCyrillic).

IBM's PC DOS 2000, also released in 1998, just changed the definition of 850 to match 858 and called it modified code page 850.^[15]^[16]^[17]^[18] This was done so programs that hard-coded 850 would be able to use the Euro sign. There may also have been a problem with Code Page Information (.CPI) files being limited to about six codepages maximum. More recent IBM/MS products implement codepage 858 under its own ID and have restored 850 to the original.^[19]

Notes

^ as well as Psion's EPOC16 operating system
^ akin to and not always well-distinguished from ISO-8859-1
^ The Windows NT line was natively Unicode from the start, but issues of development tool support and compatibility with Windows 9x kept most applications on the 8-bit code pages.

References

^ ^a ^b Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
^ "00850" (PDF). Code pages by CPGID. IBM. Archived (PDF) from the original on 2012-09-23. Retrieved 2020-02-24.
^ ^a ^b ^c "OEM 850". Go Global Developer Center. Microsoft. Archived from the original on 2016-06-06. Retrieved 2016-06-06.
^ "Code Page 850 MS-DOS Latin 1". Developing International Software. Microsoft. Archived from the original on 2016-06-06. Retrieved 2016-06-06.
^ "CCSID 850 information document". Archived from the original on 2016-03-27.
^ Paul, Matthias R. (1997-07-30). "II.16.iii. Landessprachliche Unterstützung - Landescodes und Keyboard-Kürzel" [II.16.iii. National language support - Country codes and keyboard layout IDs]. NWDOS-TIPs — Tips & Tricks rund um Novell DOS 7, mit Blick auf undokumentierte Details, Bugs und Workarounds [NWDOSTIPs — Tips & tricks for Novell DOS 7, with special focus on undocumented details, bugs and workarounds]. MPDOSTIP (in German) (3 ed.). Archived from the original on 2016-06-06. Retrieved 2016-06-06. (NB. NWDOSTIP.TXT is a comprehensive work on Novell DOS 7 and OpenDOS 7.01, including the description of many undocumented features and internals. It is part of the author's yet larger MPDOSTIP.ZIP collection maintained up to 2001 and distributed on many sites at the time. The provided link points to a HTML-converted older version of the NWDOSTIP.TXT file.)
^ "cp850_DOSLatin1 to Unicode table" (TXT). The Unicode Consortium. Archived from the original on 2016-06-06. Retrieved 2016-06-06.
^ Code Page CPGID 00850 (pdf), IBM, 1986
^ Code Page (CPGID) 00850 (txt), IBM, 1998
^ "International Components for Unicode (ICU), ibm-850_P100-1995.ucm". GitHub. 2002-12-03. Archived from the original on 2022-01-28. Retrieved 2022-01-28.
^ "CCSID 858 information document". IBM. Archived from the original on 2016-03-27.
^ Code Page (CPGID) 00858 (txt), IBM, 1998
^ "00858". Code pages by CPGID. IBM. Archived from the original on 2016-06-06. Retrieved 2016-06-06.
^ "Code page 858 information document". IBM. Archived from the original on 2016-08-20.
^ Paul, Matthias R. (2001-06-10) [1995]. "DOS COUNTRY.SYS file format" (COUNTRY.LST file) (1.44 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.
^ Starikov, Yuri (2005-04-11). "15-летию Russian MS-DOS 4.01 посвящается" [15 Years of Russian MS-DOS 4.01] (in Russian). Archived from the original on 2016-06-06. Retrieved 2014-05-07.
^ Paul, Matthias R. (2001-08-27). "Changing codepages in FreeDOS (follow-up)". Archived from the original on 2014-10-01. Retrieved 2013-05-08. […] one could also create custom .CPI files in the traditional FONT style without difficulties, but you could only store up to […] six codepages in such a file if it should be useable by MS-DOS/PC DOS (some OEM issues and NT can handle files larger than 64 Kb, but MS-DOS/PC DOS can not). (NB. Based on fd-dev post [1].)
^ Paul, Matthias R. (2001-06-10) [1995]. "Format description of DOS, OS/2, and Windows NT .CPI, and Linux .CP files" (CPI.LST file) (1.30 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.
^ Paul, Matthias R. (2001-08-15). "Changing codepages in FreeDOS" (Technical design specification). Archived from the original on 2016-06-06. Retrieved 2016-06-06. The new official ID for the Multilingual "codepage 850 with EURO SIGN" is 858, not 850. IBM will switch to use 858 instead of their 850 variant with future issues of their products. […] I can only guess why they didn't add 858 to their EGAx.CPI, COUNTRY.SYS, and KEYBOARD.SYS files in PC DOS 2000. Many third-party applications are designed to work with 850 and didn't know about 858 at the time PC DOS 2000 was released, so it's easier for everyone, but unfortunately it's not compatible. […] As explained above, COUNTRY.SYS and KEYBOARD.SYS contain only two codepage entries for a given country in Western issues of DOS. (In Arabic and Hebrew issues there can be up to 8 codepages for one country, in theory there is no limit below the range of allowed codepages 1..65534). […] The problem is that removing support for 850 might have caused compatibility problems with applications which are hard-wired to use 850. Adding 858 as a third choice to all the files would have increased the file and table sizes significantly. The COUNTRY.SYS file parser in MS-DOS/PC DOS IO.SYS/IBMBIO.COM sets aside a 6 Kb (for DOS 6) scratchpad to load all the info. This allows a maximum of 438 entries in a COUNTRY.SYS file to be accepted, otherwise you will get the message "COUNTRY.SYS too large.". The NLSFUNC parser does not have this limitation, and the file parsers in DR-DOS (kernel and NLSFUNC) also do not know of such a restriction. Older issues of MS-DOS/PC DOS even had a 2 Kb buffer for a maximum of 146 entries.

[5] s well as Psion's EPOC16 operating system

[8] to and not always well-distinguished from ISO-8859-1

[9] The Windows NT line was natively Unicode from the start, but issues of development tool support and compatibility with Windows 9x kept most applications on the 8-bit code pages.

[IANA_2018-1] Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12

[IBM-2] "00850" (PDF). Code pages by CPGID. IBM. Archived (PDF) from the original on 2012-09-23. Retrieved 2020-02-24.

[GGDC-3] "OEM 850". Go Global Developer Center. Microsoft. Archived from the original on 2016-06-06. Retrieved 2016-06-06.

[DIS-4] "Code Page 850 MS-DOS Latin 1". Developing International Software. Microsoft. Archived from the original on 2016-06-06. Retrieved 2016-06-06.

[IBM_CCSID850-6] "CCSID 850 information document". Archived from the original on 2016-03-27.

[Paul_1997_NWDOSTIP-7] Paul, Matthias R. (1997-07-30). "II.16.iii. Landessprachliche Unterstützung - Landescodes und Keyboard-Kürzel" [II.16.iii. National language support - Country codes and keyboard layout IDs]. NWDOS-TIPs — Tips & Tricks rund um Novell DOS 7, mit Blick auf undokumentierte Details, Bugs und Workarounds [NWDOSTIPs — Tips & tricks for Novell DOS 7, with special focus on undocumented details, bugs and workarounds]. MPDOSTIP (in German) (3 ed.). Archived from the original on 2016-06-06. Retrieved 2016-06-06. (NB. NWDOSTIP.TXT is a comprehensive work on Novell DOS 7 and OpenDOS 7.01, including the description of many undocumented features and internals. It is part of the author's yet larger MPDOSTIP.ZIP collection maintained up to 2001 and distributed on many sites at the time. The provided link points to a HTML-converted older version of the NWDOSTIP.TXT file.)

[UNICODE-10] "cp850_DOSLatin1 to Unicode table" (TXT). The Unicode Consortium. Archived from the original on 2016-06-06. Retrieved 2016-06-06.

[IBM_CPGID850_PDF-11] Code Page CPGID 00850 (pdf), IBM, 1986

[IBM_CPGID850_TXT-12] Code Page (CPGID) 00850 (txt), IBM, 1998

[ICU_1995-13] "International Components for Unicode (ICU), ibm-850_P100-1995.ucm". GitHub. 2002-12-03. Archived from the original on 2022-01-28. Retrieved 2022-01-28.

[IBM_CCSID858-14] "CCSID 858 information document". IBM. Archived from the original on 2016-03-27.

[IBM_CPGID858_TXT-15] Code Page (CPGID) 00858 (txt), IBM, 1998

[IBM2-16] "00858". Code pages by CPGID. IBM. Archived from the original on 2016-06-06. Retrieved 2016-06-06.

[IBMCP858INFO-17] "Code page 858 information document". IBM. Archived from the original on 2016-08-20.

[Paul_2001_COUNTRY-18] Paul, Matthias R. (2001-06-10) [1995]. "DOS COUNTRY.SYS file format" (COUNTRY.LST file) (1.44 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.

[Starikov_2005_CP866-19] Starikov, Yuri (2005-04-11). "15-летию Russian MS-DOS 4.01 посвящается" [15 Years of Russian MS-DOS 4.01] (in Russian). Archived from the original on 2016-06-06. Retrieved 2014-05-07.

[Paul_2001_CPSwitch-64KB-Size-20] Paul, Matthias R. (2001-08-27). "Changing codepages in FreeDOS (follow-up)". Archived from the original on 2014-10-01. Retrieved 2013-05-08. […] one could also create custom .CPI files in the traditional FONT style without difficulties, but you could only store up to […] six codepages in such a file if it should be useable by MS-DOS/PC DOS (some OEM issues and NT can handle files larger than 64 Kb, but MS-DOS/PC DOS can not). (NB. Based on fd-dev post [1].)

[Paul_2001_CPI-21] Paul, Matthias R. (2001-06-10) [1995]. "Format description of DOS, OS/2, and Windows NT .CPI, and Linux .CP files" (CPI.LST file) (1.30 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.

[Paul_2001_CPSwitch-438-Entries-22] Paul, Matthias R. (2001-08-15). "Changing codepages in FreeDOS" (Technical design specification). Archived from the original on 2016-06-06. Retrieved 2016-06-06. The new official ID for the Multilingual "codepage 850 with EURO SIGN" is 858, not 850. IBM will switch to use 858 instead of their 850 variant with future issues of their products. […] I can only guess why they didn't add 858 to their EGAx.CPI, COUNTRY.SYS, and KEYBOARD.SYS files in PC DOS 2000. Many third-party applications are designed to work with 850 and didn't know about 858 at the time PC DOS 2000 was released, so it's easier for everyone, but unfortunately it's not compatible. […] As explained above, COUNTRY.SYS and KEYBOARD.SYS contain only two codepage entries for a given country in Western issues of DOS. (In Arabic and Hebrew issues there can be up to 8 codepages for one country, in theory there is no limit below the range of allowed codepages 1..65534). […] The problem is that removing support for 850 might have caused compatibility problems with applications which are hard-wired to use 850. Adding 858 as a third choice to all the files would have increased the file and table sizes significantly. The COUNTRY.SYS file parser in MS-DOS/PC DOS IO.SYS/IBMBIO.COM sets aside a 6 Kb (for DOS 6) scratchpad to load all the info. This allows a maximum of 438 entries in a COUNTRY.SYS file to be accepted, otherwise you will get the message "COUNTRY.SYS too large.". The NLSFUNC parser does not have this limitation, and the file parsers in DR-DOS (kernel and NLSFUNC) also do not know of such a restriction. Older issues of MS-DOS/PC DOS even had a 2 Kb buffer for a maximum of 146 entries.

[1]

[2]

[3]

[4]

[a]

[5]

[6]

[b]

[c]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

v t e Character encodings
Early telecommunications	Telegraph code Needle Morse Non-Latin Wabun/Kana Chinese Cyrillic Baudot and Murray Fieldata ASCII ISO/IEC 646 BCDIC Teletex and Videotex/Teletext T.51/ISO/IEC 6937 ITU T.61 ITU T.101 World System Teletext background sets Transcode
ISO/IEC 8859	Approved parts -1 (Western Europe) -2 (Central Europe) -3 (Maltese/Esperanto) -4 (North Europe) -5 (Cyrillic) -6 (Arabic) -7 (Greek) -8 (Hebrew) -9 (Turkish) -10 (Nordic) -11 (Thai) -13 (Baltic) -14 (Celtic) -15 (New Western Europe) -16 (Romanian) Abandoned parts -12 (Devanagari) Proposed but not approved KOI-8 Cyrillic Sámi Adaptations Welsh Estonian Ukrainian Cyrillic
Bibliographic use	MARC-8 ANSEL CCCII/EACC ISO 5426 5426-2 5427 5428 6438 6862
National standards	ArmSCII Big5 BraSCII BSCII CNS 11643 DIN 66003 ELOT 927 GOST 10859 GB 2312 GB 12345 GB 12052 GB 18030 HKSCS ISCII JIS X 0201 JIS X 0208 JIS X 0212 JIS X 0213 KOI-7 KPS 9566 KS X 1001 KS X 1002 LST 1564 LST 1590-4 PASCII Shift JIS SI 960 TIS-620 TSCII VISCII VSCII YUSCII
ISO/IEC 2022	ISO/IEC 8859 ISO/IEC 10367 Extended Unix Code / EUC
Mac OS Code pages ("scripts")	Armenian Arabic Barents Cyrillic Celtic Central European Croatian Cyrillic Devanagari Farsi (Persian) Font X (Kermit) Gaelic Georgian Greek Gujarati Gurmukhi Hebrew Iceland Inuit Keyboard Latin (Kermit) Maltese/Esperanto Ogham Roman Romanian Sámi Turkish Turkic Cyrillic Ukrainian VT100
DOS code pages	437 737 850 858 861 862 863 864 865 866 867 868 869 899 904 932 936 942 949 950 951 1040 1043 1046 1098 1115 1116 1117 1118 1127 ABICOMP CS Indic CSX Indic CSX+ Indic CWI-2 Iran System Kamenický Mazovia MIK
IBM AIX code pages	895 896 912 915 921 922 1006 1008 1009 1010 1012 1013 1014 1015 1016 1017 1018 1019 1046 1133
Windows code pages	CER-GS 932 936 (GBK) 950 Extended Latin-8 1250 1251 1252 1253 1254 1255 1256 1257 1258 1270 Cyrillic + French Cyrillic + German Polytonic Greek
EBCDIC code pages	Japanese language in EBCDIC DKOI
DEC terminals (VTx)	Multinational (MCS) National Replacement (NRCS) French Canadian Swiss Spanish United Kingdom Dutch Finnish French Norwegian and Danish Swedish Norwegian and Danish (alternative) 8-bit Greek 8-bit Turkish SI 960 Hebrew Special Graphics Technical (TCS)
Platform specific	1052 1053 1054 1055 1058 Acorn RISC OS Amstrad CPC Apple II ATASCII Atari ST BICS Casio calculators CDC Compucolor 8001 Compucolor II CP/M+ DEC RADIX 50 DEC MCS/NRCS DG International Galaksija GEM GSM 03.38 HP Roman HP FOCAL HP RPL SQUOZE LICS LMBCS MSX NEC APC NeXT PETSCII PostScript Standard PostScript Latin 1 SAM Coupé Sega SC-3000 Sharp calculators Sharp MZ Sinclair QL Teletext TI calculators TRS-80 Ventura International WISCII XCCS ZX80 ZX81 ZX Spectrum
Unicode / ISO/IEC 10646	UTF-1 UTF-7 UTF-8 UTF-16 UTF-32 UTF-EBCDIC GB 18030 DIN 91379 BOCU-1 CESU-8 SCSU TACE16 Comparison of Unicode encodings
TeX typesetting system	Cork LY1 OML OMS OT1
Miscellaneous code pages	ABICOMP ASMO 449 Digital encoding of APL symbols ISO-IR-68 ARIB STD-B24 Fieldata HZ IEC-P27-1 INIS 7-bit 8-bit ISO-IR-169 ISO 2033 KOI KOI8-R KOI8-RU KOI8-U Mojikyō SEASCII Stanford/ITS Symbol TRON Unified Hangul Code
Control character	Morse prosigns C0 and C1 control codes ISO/IEC 6429 JIS X 0211 Unicode control, format and separator characters Whitespace characters
Related topics	CCSID Character encodings in HTML Charset detection Han unification Hardware code page MICR code Mojibake Variable-length encoding
Character sets

CP850 Byte (Hex)	Unicode Code Point	Description
0x80	U+00C7	LATIN CAPITAL LETTER C WITH CEDILLA (Ç)
0x84	U+00E4	LATIN SMALL LETTER A WITH DIAERESIS (ä)
0x9C	U+00A3	POUND SIGN (£)
0xA4	U+00F1	LATIN SMALL LETTER N WITH TILDE (ñ)
0xB0	U+2591	LIGHT SHADE
0xB3	U+2502	BOX DRAWINGS LIGHT VERTICAL (│)
0xC7	U+00C3	LATIN CAPITAL LETTER A WITH TILDE (Ã)
0xD0	U+00F0	LATIN SMALL LETTER ETH (ð)
0xFF	U+00A0	NO-BREAK SPACE

Code Point	Code Page 437 (Character, Unicode)	Code Page 850 (Character, Unicode)
0xB5	┡ (U+2561)	Á (U+00C1)
0xC6	╞ (U+255E)	ã (U+00E3)
0xD2	╥ (U+2565)	Ê (U+00CA)

History

Media collections

Code page 850

Recent from talks

Recent from talks

Contribute something

Contribute something

Media Pages

Timelines

Articles

Notes collections

Notes

Notes

Days in Chronicle

Code page 850

Character set

Code page 858

See also

Notes

References

Code page 850

Overview

Definition and Purpose

Historical Development

Technical Specifications

Character Encoding Details

Mapping to Unicode

Variants and Comparisons

Code Page 858

Comparison with Code Page 437

Usage and Compatibility

Adoption in DOS and Early PCs

Compatibility Issues and Solutions

Modern Context

Transition to Unicode and UTF-8

Remaining Legacy Applications

References

Add your contribution

Related Hubs

Contribute something