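A code page (codepage) is a mapping table between a national or regional character encoding and Unicode.

For example, the mapping table between GBK and Unicode is CP936, so cp936 is also commonly used to refer to GBK.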
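
As a small illustration of the cp936/GBK relationship, the sketch below uses Python's standard codecs. It only shows how Python's own codec registry treats the two names, which is an assumption about one implementation and not a statement about how Windows itself behaves in every edge case.

```python
import codecs

# Ask Python which codec the name "cp936" resolves to.
info = codecs.lookup("cp936")
print(info.name)

text = "中文"
encoded = text.encode("cp936")        # Unicode -> code page bytes
print(encoded.hex(" "))               # two bytes per character for this text

# In Python's standard library both names give the same bytes.
same_as_gbk = encoded == text.encode("gbk")
round_trip = encoded.decode("cp936")  # code page bytes -> Unicode
print(same_as_gbk, round_trip == text)
```

The round trip through `encode` and `decode` is the mapping table at work in both directions.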

Code Pages Supported by Windows

The lists below provide links to graphical representations and textual listings of each of the Windows codepages.

SBCS (Single Byte Character Set) Codepages

  • 1250 (Central Europe)
  • 1251 (Cyrillic)
  • 1252 (Latin I)
  • 1253 (Greek)
  • 1254 (Turkish)
  • 1255 (Hebrew)
  • 1256 (Arabic)
  • 1257 (Baltic)
  • 1258 (Vietnam)
  • 874 (Thai)
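
To make the idea of a single byte code page concrete, here is a minimal sketch that decodes the same byte under several of the code pages listed above. The helper name `decode_everywhere` is hypothetical, and the codec names are the ones Python's standard library happens to use.

```python
def decode_everywhere(raw, codec_names):
    """Decode the same bytes under several code pages."""
    return {name: raw.decode(name) for name in codec_names}

# One byte, value 0xE0, interpreted by four different SBCS code pages.
results = decode_everywhere(b"\xe0", ["cp1250", "cp1251", "cp1252", "cp1253"])
for name, ch in results.items():
    print(f"{name}: U+{ord(ch):04X}")
```

The same byte comes out as a different Unicode code point in each code page, which is why the code page has to be known before bytes can be read as text.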

DBCS (Double Byte Character Set) Codepages

In these graphical representations, leadbytes are indicated by light gray background shading.
Each of these leadbytes hyperlinks to a new page showing the 256 character block associated with that leadbyte.
Unused leadbytes are identified by a darker gray background.

  • 932 (Japanese Shift-JIS)
  • 936 (Simplified Chinese GBK)
  • 949 (Korean)
  • 950 (Traditional Chinese Big5)
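
The role of lead bytes can be sketched in a few lines. The example assumes the code page 932 lead byte ranges quoted in the bestfit readme later in this post (0x81-0x9f and 0xe0-0xfc); the helper names are hypothetical and the decoding relies on Python's own cp932 codec.

```python
# Lead byte ranges for code page 932, taken from the readme quoted in this post.
CP932_LEAD_RANGES = [(0x81, 0x9F), (0xE0, 0xFC)]

def is_lead_byte(b):
    """Return True if byte value b starts a two-byte character in cp932."""
    return any(lo <= b <= hi for lo, hi in CP932_LEAD_RANGES)

def split_units(data):
    """Split a byte string into 1-byte and 2-byte units using the lead byte ranges."""
    units, i = [], 0
    while i < len(data):
        step = 2 if is_lead_byte(data[i]) and i + 1 < len(data) else 1
        units.append(data[i:i + step])
        i += step
    return units

units = split_units(b"A\x81\x40B")
decoded = [u.decode("cp932") for u in units]
print(units, [f"U+{ord(c):04X}" for c in decoded])
```

A byte in a lead byte range is never a character on its own; it selects the 256 character block in which the following trail byte is looked up.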

Windows OEM Code Pages

The list below provides links to graphical representations, and textual listings, of each of the Windows OEM code pages:

  • 437 (US)
  • 720 (Arabic)
  • 737 (Greek)
  • 775 (Baltic)
  • 850 (Multilingual Latin I)
  • 852 (Latin II)
  • 855 (Cyrillic)
  • 857 (Turkish)
  • 858 (Multilingual Latin I + Euro)
  • 862 (Hebrew)
  • 866 (Russian)

Windows ANSI and OEM Code Pages

The following codepages are used as both Windows ANSI and OEM codepages:

  • 874 (Thai)
  • 932 (Japanese Shift-JIS)
  • 936 (Simplified Chinese GBK)
  • 949 (Korean)
  • 950 (Traditional Chinese Big5)
  • 1258 (Vietnam)

ISO Code Pages

The list below provides links to graphical representations, and textual listings, of each of the ISO 8859 character sets:

  • ISO-8859-1 (Latin 1)
  • ISO-8859-2 (Latin 2)
  • ISO-8859-3 (Latin 3)
  • ISO-8859-4 (Baltic)
  • ISO-8859-5 (Cyrillic)
  • ISO-8859-6 (Arabic)
  • ISO-8859-8 (Hebrew)
  • ISO-8859-9 (Turkish)
  • ISO-8859-15 (Latin 9)

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/

 CP874.TXT                        21-Nov-2006 18:58  8.2K  
 CP932.TXT                        21-Nov-2006 18:58  288K  
 CP936.TXT                        21-Nov-2006 18:59  798K  
 CP949.TXT                        21-Nov-2006 19:00  772K  
 CP950.TXT                        21-Nov-2006 19:01  497K  
 CP1250.TXT                       21-Nov-2006 18:56  9.3K  
 CP1251.TXT                       21-Nov-2006 18:57  9.0K  
 CP1252.TXT                       21-Nov-2006 18:57  9.1K  
 CP1253.TXT                       21-Nov-2006 18:57  8.7K  
 CP1254.TXT                       21-Nov-2006 18:57  9.1K  
 CP1255.TXT                       21-Nov-2006 18:57  8.1K  
 CP1256.TXT                       21-Nov-2006 18:58  8.5K  
 CP1257.TXT                       21-Nov-2006 18:58  9.0K  
 CP1258.TXT                       21-Nov-2006 18:58  9.0K  

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WindowsBestFit/readme.txt
The following files describe windows code page behavior for the "ansi" code pages provided by Microsoft.
File               Code Page       Description
bestfit874.txt     windows-874     ANSI/OEM Thai (same as 28605, ISO 8859-15); Thai (Windows)
bestfit932.txt     shift_jis       ANSI/OEM Japanese; Japanese (Shift-JIS)
bestfit936.txt     gb2312          ANSI/OEM Simplified Chinese (PRC, Singapore); Chinese Simplified (GB2312)
bestfit949.txt     ks_c_5601-1987  ANSI/OEM Korean (Unified Hangul Code)
bestfit950.txt     big5            ANSI/OEM Traditional Chinese (Taiwan; Hong Kong SAR, PRC); Chinese Traditional (Big5)
bestfit1250.txt    windows-1250    ANSI Central European; Central European (Windows)
bestfit1251.txt    windows-1251    ANSI Cyrillic; Cyrillic (Windows)
bestfit1252.txt    windows-1252    ANSI Latin 1; Western European (Windows)
bestfit1253.txt    windows-1253    ANSI Greek; Greek (Windows)
bestfit1254.txt    windows-1254    ANSI Turkish; Turkish (Windows)
bestfit1255.txt    windows-1255    ANSI Hebrew; Hebrew (Windows)
bestfit1256.txt    windows-1256    ANSI Arabic; Arabic (Windows)
bestfit1257.txt    windows-1257    ANSI Baltic; Baltic (Windows)
bestfit1258.txt    windows-1258    ANSI/OEM Vietnamese; Vietnamese (Windows)

These tables include "best fit" behavior which is not present in the other files. Examples of best fit
are converting fullwidth letters to their ASCII counterparts when converting to single byte code pages, and
mapping the Infinity character to the number 8.

932, 936, 949 and 950 are all double byte code pages. The remainder are single byte code pages. Each file
is encoded in the code page it describes, e.g. bestfit1252.txt is encoded in the windows-1252 encoding. The only
non-ASCII characters, however, are in the comments, so these files may be read by an ASCII parser if necessary.

Each file has sections of keyword tags and records. Any text after a ; is ignored, as are blank lines. Fields are
delimited by one or more space or tab characters. Each section begins with one of the following tags:

CODEPAGE
CPINFO
MBTABLE
WCTABLE
DBCSRANGE  (double byte code pages only)
DBCSTABLE  (double byte code pages only)

Descriptions of each tag are:

CODEPAGE 932            ; Japanese - ANSI, OEM

The CODEPAGE tag contains 1 field and marks the start of the code page file.

Field 1 -- The only field is the decimal Windows code page number for this code page.

CPINFO 2 0x3f 0x30fb    ; DBCS CP, Unic Default Char = Katakana Middle Dot

The CPINFO tag describes the code page with 3 fields:

Field 1 -- "1" for a single byte code page, "2" for a double byte code page.
Field 2 -- Replacement character for unassigned Unicode code points when written to this
           code page (currently always ?).
Field 3 -- Replacement character for illegal or unassigned code page values when converting to Unicode.
           This is Katakana middle dot for 932 and ? for all other code pages.

MBTABLE 256

The MBTABLE tag marks the start of the "Multibyte" code page to Unicode conversion table. It has 1 field.

Field 1 -- This field contains the number of following records of code page to Unicode mappings. Note that
           lead bytes don't have mappings, so this is not always 256. For 932, for example, it is 196.

MultiByte Mapping Records:

0x00    0x0000  ;Null
0x01    0x0001  ;Start Of Heading
...
0x30    0x0030  ;Digit 0
0x31    0x0031  ;Digit 1
...

Each record consists of two fields to map from the code page to Unicode.

Field 1 -- The code page byte that is being mapped to Unicode, e.g. "0x3f"
Field 2 -- The Unicode UTF-16 code point that this byte maps to, e.g. "0x003f"

DBCSRANGE  2            ;2 DBCS Lead Byte Ranges: 0x81-0x9f and 0xe0-0xfc

The DBCSRANGE tag describes the number of double byte ranges for a double byte code page. Ranges are consecutive
lead byte values such as 0x81-0x9f.

Field 1 -- This field contains the number of double byte ranges. The next record is the 1st lead byte range.

0x81  0x9f              ;Lead Byte Range

This record describes the first lead byte range. It is the first record after DBCSRANGE and is followed by one
DBCSTABLE record for each lead byte in the range. If there are additional ranges, another Lead Byte Range record
will follow the last DBCSTABLE in the previous range.

Field 1 -- This field is the first lead byte used in this range.
Field 2 -- This field is the last lead byte used in this range.

DBCSTABLE 147           ;LeadByte = 0x81

The DBCSTABLE record describes the mappings available for a particular lead byte. The comment is ignored but
descriptive. The lead byte of the first DBCSTABLE is the first lead byte of the previous Lead Byte Range
record. Each subsequent DBCSTABLE is for the next consecutive lead byte value.

Field 1 -- This field is the number of trail byte mappings following.

Double byte mapping records:

0x40    0x3000  ;   Ideographic Space
0x41    0x3001  ;   Ideographic Comma
...

Field 1 -- This field is the trail byte to map from.
Field 2 -- This field is the Unicode UTF-16 code point that this lead byte/trail byte combination maps to.

Example:

DBCSRANGE  2            ;2 DBCS Lead Byte Ranges: 0x81-0x9f and 0xe0-0xfc
0x81  0x9f              ;Lead Byte Range
DBCSTABLE 147           ;LeadByte = 0x81
0x40    0x3000  ;   Ideographic Space
0x41    0x3001  ;   Ideographic Comma
...

The preceding example would map the byte sequences 0x81 0x40 to U+3000 and 0x81 0x41 to U+3001.

WCTABLE 698

The WCTABLE tag marks the start of the Unicode UTF-16 (WideChar) to "MultiByte" bytes conversion table. It has 1 field.

Field 1 -- This field contains the number of records of Unicode to byte mappings. Note that this is often
           more than the number of round trip mappings supported by the code page due to Windows "Best Fit" behavior.

Unicode UTF-16 (WideChar) Mapping Records:

These take two forms, differing between single byte and double byte code pages. Both forms have 2 fields:

Single byte WCTABLE records:

0x0000  0x00    ;Null
0x0001  0x01    ;Start Of Heading
...
0x0061  0x61    ;Latin Small Letter A
0x0062  0x62    ;Latin Small Letter B
0x0063  0x63    ;Latin Small Letter C
...
0x221e  0x38    ;Infinity                        << Best Fit Mapping
...
0xff41  0x61    ;Fullwidth Latin Small Letter A  << Best Fit Mapping
0xff42  0x62    ;Fullwidth Latin Small Letter B  << Best Fit Mapping
0xff43  0x63    ;Fullwidth Latin Small Letter C  << Best Fit Mapping
...

Field 1 -- The Unicode UTF-16 code point for the character being converted.
Field 2 -- The single byte that this UTF-16 code point maps to. If a reverse mapping does not exist in the MBTABLE,
           then this is a Best Fit mapping.

Multibyte WCTABLE records:

0x0000  0x0000  ;   Null
0x0001  0x0001  ;   Start Of Heading
...
0x0061  0x0061  ;   a
0x0062  0x0062  ;   b
0x0063  0x0063  ;   c
...
0x221e  0x8187  ;   Infinity
...
0xff41  0x8281  ;   Fullwidth a
0xff42  0x8282  ;   Fullwidth b
0xff43  0x8283  ;   Fullwidth c
...

Field 1 -- The Unicode UTF-16 code point for the character being converted.
Field 2 -- The byte or bytes that this code point maps to as a 16 bit value. The high byte is the lead byte,
           and the low byte is the trail byte. If the high byte is 0, then this is a single byte code point with
           the value of the low byte, and no lead byte is emitted.

ENDCODEPAGE

This tag marks the end of the code page data. Anything after this marker is ignored.
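
The file layout described in the readme can be read with a short parser. The following is a minimal sketch, not Microsoft's tooling: the function names `parse_bestfit` and `wc_to_bytes` are hypothetical, and the sample text is a shortened, made-up excerpt in the readme's format whose record counts were adjusted so that they match the records actually present.

```python
SECTION_TAGS = ("CODEPAGE", "CPINFO", "MBTABLE", "WCTABLE", "DBCSRANGE")

def parse_bestfit(text):
    """Parse bestfit-style text into MBTABLE, DBCS and WCTABLE dictionaries."""
    mb, dbcs, wc = {}, {}, {}
    section = None
    lead = next_lead = None
    remaining = 0  # trail byte records still expected for the current DBCSTABLE
    for line in text.splitlines():
        fields = line.split(";", 1)[0].split()  # drop the comment, split on spaces/tabs
        if not fields:
            continue
        tag = fields[0]
        if tag == "ENDCODEPAGE":
            break  # anything after this marker is ignored
        if tag in SECTION_TAGS:
            section = tag
        elif tag == "DBCSTABLE":
            section = tag
            lead, next_lead = next_lead, next_lead + 1  # tables follow consecutive lead bytes
            remaining = int(fields[1])
        elif section == "MBTABLE":
            mb[int(fields[0], 16)] = int(fields[1], 16)
        elif section == "WCTABLE":
            wc[int(fields[0], 16)] = int(fields[1], 16)
        elif section == "DBCSTABLE" and remaining > 0:
            dbcs[(lead, int(fields[0], 16))] = int(fields[1], 16)
            remaining -= 1
        elif section in ("DBCSRANGE", "DBCSTABLE"):
            # A lead byte range record: the next DBCSTABLE starts at its first byte.
            next_lead = int(fields[0], 16)
    return mb, dbcs, wc

def wc_to_bytes(value):
    """Turn a WCTABLE value into bytes; a high byte of 0 means a single byte."""
    lead_byte, trail_byte = value >> 8, value & 0xFF
    return bytes([trail_byte]) if lead_byte == 0 else bytes([lead_byte, trail_byte])

# Shortened sample in the readme's format (counts adjusted, one lead byte only).
SAMPLE = """\
CODEPAGE 932            ; Japanese - ANSI, OEM
CPINFO 2 0x3f 0x30fb    ; DBCS CP
MBTABLE 2
0x30    0x0030  ;Digit 0
0x31    0x0031  ;Digit 1
DBCSRANGE 1             ; one range only in this sample
0x81  0x81              ;Lead Byte Range
DBCSTABLE 2             ;LeadByte = 0x81
0x40    0x3000  ;   Ideographic Space
0x41    0x3001  ;   Ideographic Comma
WCTABLE 3
0x0061  0x0061  ;   a
0x221e  0x8187  ;   Infinity
0xff41  0x8281  ;   Fullwidth a
ENDCODEPAGE
"""

mb, dbcs, wc = parse_bestfit(SAMPLE)
print(hex(dbcs[(0x81, 0x40)]), wc_to_bytes(wc[0x221E]).hex(" "))
```

The parser counts DBCSTABLE records because a Lead Byte Range record has the same two-field shape as a trail byte mapping, so position is the only way to tell them apart. Real files contain far more records; this sketch was only checked against the sample above.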
 bestfit874.txt                   04-Oct-2006 15:58   19K  
 bestfit932.txt                   04-Oct-2006 15:58  353K  
 bestfit936.txt                   04-Oct-2006 15:58  852K  
 bestfit949.txt                   04-Oct-2006 15:58  1.2M  
 bestfit950.txt                   04-Oct-2006 15:58  689K  
 bestfit1250.txt                  04-Oct-2006 15:58   36K  
 bestfit1251.txt                  04-Oct-2006 15:58   35K  
 bestfit1252.txt                  04-Oct-2006 15:58   36K  
 bestfit1253.txt                  04-Oct-2006 15:58   33K  
 bestfit1254.txt                  04-Oct-2006 15:58   36K  
 bestfit1255.txt                  04-Oct-2006 15:58   20K  
 bestfit1256.txt                  04-Oct-2006 15:58   29K  
 bestfit1257.txt                  04-Oct-2006 15:58   16K  
 bestfit1258.txt                  04-Oct-2006 15:58   22K  
 bestfit1361.txt                  13-Mar-2012 19:43  1.2M  
 readme.txt                       05-Oct-2006 16:22  7.2K  

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/

 CP437.TXT                        21-Nov-2006 18:54  9.6K  
 CP737.TXT                        21-Nov-2006 18:54  9.6K  
 CP775.TXT                        21-Nov-2006 18:54  9.5K  
 CP850.TXT                        21-Nov-2006 18:54  9.4K  
 CP852.TXT                        21-Nov-2006 18:54  9.7K  
 CP855.TXT                        21-Nov-2006 18:54  9.3K  
 CP857.TXT                        21-Nov-2006 18:54  9.4K  
 CP860.TXT                        21-Nov-2006 18:54  9.6K  
 CP861.TXT                        21-Nov-2006 18:54  9.6K  
 CP862.TXT                        21-Nov-2006 18:54  9.2K  
 CP863.TXT                        21-Nov-2006 18:54  9.5K  
 CP864.TXT                        21-Nov-2006 18:54  9.3K  
 CP865.TXT                        21-Nov-2006 18:55  9.6K  
 CP866.TXT                        21-Nov-2006 18:55  9.5K  
 CP869.TXT                        21-Nov-2006 18:55  9.1K  
 CP874.TXT                        21-Nov-2006 18:55  8.3K

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/EBCDIC/

 CP037.TXT                        21-Nov-2006 18:52  8.8K  
 CP500.TXT                        21-Nov-2006 18:53  8.8K  
 CP875.TXT                        21-Nov-2006 18:53  8.5K  
 CP1026.TXT                       21-Nov-2006 18:52  8.8K

http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/MAC/

 CYRILLIC.TXT                     21-Nov-2006 18:53  9.0K  
 GREEK.TXT                        21-Nov-2006 18:53  9.2K  
 ICELAND.TXT                      21-Nov-2006 18:53  9.0K  
 LATIN2.TXT                       21-Nov-2006 18:53  9.6K  
 ROMAN.TXT                        21-Nov-2006 18:53  9.0K  
 TURKISH.TXT                      21-Nov-2006 18:53  9.0K 


Reposted from: https://www.cnblogs.com/shangdawei/archive/2013/05/13/3074983.html
