Pages

2015-09-09

字串編碼Encoding CodePage/CodeName List

很多公家機關及銀行資料查詢仍是以big5為主,假如沒有歷史包袱,新的程序就儘量使用utf-8編碼吧!

// Converts big5 into utf-8 encoding.
byte[] big5Bytes = XXX();
byte[] utf8Bytes = Encoding.Convert(Encoding.GetEncoding("big5"), Encoding.UTF8, big5Bytes);
string result = Encoding.UTF8.GetString(utf8Bytes);


要產生指定的CodePage(int)及CodeName(string)的方式:

using System.Text;

void Main()
{
// For every encoding, get the property values.
foreach (EncodingInfo ei in Encoding.GetEncodings() )
{
Encoding e = ei.GetEncoding();
Console.WriteLine("{0,-6} {1}", ei.CodePage, ei.Name);
}
}

產生的列表如下:


Encoding CodePage List

37     IBM037
437    IBM437
500    IBM500
708    ASMO-708
720    DOS-720
737    ibm737
775    ibm775
850    ibm850
852    ibm852
855    IBM855
857    ibm857
858    IBM00858
860    IBM860
861    ibm861
862    DOS-862
863    IBM863
864    IBM864
865    IBM865
866    cp866
869    ibm869
870    IBM870
874    windows-874
875    cp875
932    shift_jis
936    gb2312
949    ks_c_5601-1987
950    big5
1026   IBM1026
1047   IBM01047
1140   IBM01140
1141   IBM01141
1142   IBM01142
1143   IBM01143
1144   IBM01144
1145   IBM01145
1146   IBM01146
1147   IBM01147
1148   IBM01148
1149   IBM01149
1200   utf-16
1201   utf-16BE
1250   windows-1250
1251   windows-1251
1252   Windows-1252
1253   windows-1253
1254   windows-1254
1255   windows-1255
1256   windows-1256
1257   windows-1257
1258   windows-1258
1361   Johab
10000  macintosh
10001  x-mac-japanese
10002  x-mac-chinesetrad
10003  x-mac-korean
10004  x-mac-arabic
10005  x-mac-hebrew
10006  x-mac-greek
10007  x-mac-cyrillic
10008  x-mac-chinesesimp
10010  x-mac-romanian
10017  x-mac-ukrainian
10021  x-mac-thai
10029  x-mac-ce
10079  x-mac-icelandic
10081  x-mac-turkish
10082  x-mac-croatian
12000  utf-32
12001  utf-32BE
20000  x-Chinese-CNS
20001  x-cp20001
20002  x-Chinese-Eten
20003  x-cp20003
20004  x-cp20004
20005  x-cp20005
20105  x-IA5
20106  x-IA5-German
20107  x-IA5-Swedish
20108  x-IA5-Norwegian
20127  us-ascii
20261  x-cp20261
20269  x-cp20269
20273  IBM273
20277  IBM277
20278  IBM278
20280  IBM280
20284  IBM284
20285  IBM285
20290  IBM290
20297  IBM297
20420  IBM420
20423  IBM423
20424  IBM424
20833  x-EBCDIC-KoreanExtended
20838  IBM-Thai
20866  koi8-r
20871  IBM871
20880  IBM880
20905  IBM905
20924  IBM00924
20932  EUC-JP
20936  x-cp20936
20949  x-cp20949
21025  cp1025
21866  koi8-u
28591  iso-8859-1
28592  iso-8859-2
28593  iso-8859-3
28594  iso-8859-4
28595  iso-8859-5
28596  iso-8859-6
28597  iso-8859-7
28598  iso-8859-8
28599  iso-8859-9
28603  iso-8859-13
28605  iso-8859-15
29001  x-Europa
38598  iso-8859-8-i
50220  iso-2022-jp
50221  csISO2022JP
50222  iso-2022-jp
50225  iso-2022-kr
50227  x-cp50227
51932  euc-jp
51936  EUC-CN
51949  euc-kr
52936  hz-gb-2312
54936  GB18030
57002  x-iscii-de
57003  x-iscii-be
57004  x-iscii-ta
57005  x-iscii-te
57006  x-iscii-as
57007  x-iscii-or
57008  x-iscii-ka
57009  x-iscii-ma
57010  x-iscii-gu
57011  x-iscii-pa
65000  utf-7
65001  utf-8

No comments: