MySQL: Exploring Character Set Collations

2013-09-26 10:31
1. MySQL version:
mysql> select version();
+-----------+
| version() |
+-----------+
| 5.0.95    |
+-----------+
1 row in set (0.00 sec)
2. Viewing the available character sets (with information_schema as the current database):
mysql> select * from character_sets;
+--------------------+----------------------+-----------------------------+--------+
| CHARACTER_SET_NAME | DEFAULT_COLLATE_NAME | DESCRIPTION                 | MAXLEN |
+--------------------+----------------------+-----------------------------+--------+
| big5               | big5_chinese_ci      | Big5 Traditional Chinese    |      2 |
| dec8               | dec8_swedish_ci      | DEC West European           |      1 |
| cp850              | cp850_general_ci     | DOS West European           |      1 |
| hp8                | hp8_english_ci       | HP West European            |      1 |
| koi8r              | koi8r_general_ci     | KOI8-R Relcom Russian       |      1 |
| latin1             | latin1_swedish_ci    | cp1252 West European        |      1 |
| latin2             | latin2_general_ci    | ISO 8859-2 Central European |      1 |
| swe7               | swe7_swedish_ci      | 7bit Swedish                |      1 |
| ascii              | ascii_general_ci     | US ASCII                    |      1 |
| ujis               | ujis_japanese_ci     | EUC-JP Japanese             |      3 |
| sjis               | sjis_japanese_ci     | Shift-JIS Japanese          |      2 |
| hebrew             | hebrew_general_ci    | ISO 8859-8 Hebrew           |      1 |
| tis620             | tis620_thai_ci       | TIS620 Thai                 |      1 |
| euckr              | euckr_korean_ci      | EUC-KR Korean               |      2 |
| koi8u              | koi8u_general_ci     | KOI8-U Ukrainian            |      1 |
| gb2312             | gb2312_chinese_ci    | GB2312 Simplified Chinese   |      2 |
| greek              | greek_general_ci     | ISO 8859-7 Greek            |      1 |
| cp1250             | cp1250_general_ci    | Windows Central European    |      1 |
| gbk                | gbk_chinese_ci       | GBK Simplified Chinese      |      2 |
| latin5             | latin5_turkish_ci    | ISO 8859-9 Turkish          |      1 |
| armscii8           | armscii8_general_ci  | ARMSCII-8 Armenian          |      1 |
| utf8               | utf8_general_ci      | UTF-8 Unicode               |      3 |
| ucs2               | ucs2_general_ci      | UCS-2 Unicode               |      2 |
| cp866              | cp866_general_ci     | DOS Russian                 |      1 |
| keybcs2            | keybcs2_general_ci   | DOS Kamenicky Czech-Slovak  |      1 |
| macce              | macce_general_ci     | Mac Central European        |      1 |
| macroman           | macroman_general_ci  | Mac West European           |      1 |
| cp852              | cp852_general_ci     | DOS Central European        |      1 |
| latin7             | latin7_general_ci    | ISO 8859-13 Baltic          |      1 |
| cp1251             | cp1251_general_ci    | Windows Cyrillic            |      1 |
| cp1256             | cp1256_general_ci    | Windows Arabic              |      1 |
| cp1257             | cp1257_general_ci    | Windows Baltic              |      1 |
| binary             | binary               | Binary pseudo charset       |      1 |
| geostd8            | geostd8_general_ci   | GEOSTD8 Georgian            |      1 |
| cp932              | cp932_japanese_ci    | SJIS for Windows Japanese   |      2 |
| eucjpms            | eucjpms_japanese_ci  | UJIS for Windows Japanese   |      3 |
+--------------------+----------------------+-----------------------------+--------+
36 rows in set (0.01 sec)
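The query above names the table without a schema prefix, so it only works when information_schema is the current database. As a rough sketch, the same data can be read from any database with a fully qualified name or with the SHOW shorthand; the MAXLEN filter and the 'utf8%' pattern are illustrative choices, not part of the original session.

```sql
-- Fully qualified name: works regardless of the current database.
SELECT CHARACTER_SET_NAME, DEFAULT_COLLATE_NAME, MAXLEN
FROM information_schema.CHARACTER_SETS
WHERE MAXLEN > 1                 -- illustrative filter: multi-byte character sets only
ORDER BY CHARACTER_SET_NAME;

-- Shorthand statement exposing the same information; LIKE filters on the name.
SHOW CHARACTER SET LIKE 'utf8%';
```

MAXLEN is the maximum number of bytes one character can occupy in that character set, which is why utf8 reports 3 on this server version.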
3. Collations and the default collation of each character set:
mysql> select * from collations;
+----------------------+--------------------+-----+------------+-------------+---------+
| COLLATION_NAME       | CHARACTER_SET_NAME | ID  | IS_DEFAULT | IS_COMPILED | SORTLEN |
+----------------------+--------------------+-----+------------+-------------+---------+
| big5_chinese_ci      | big5               |   1 | Yes        | Yes         |       1 |
| big5_bin             | big5               |  84 |            | Yes         |       1 |
| dec8_swedish_ci      | dec8               |   3 | Yes        | Yes         |       1 |
| dec8_bin             | dec8               |  69 |            | Yes         |       1 |
| cp850_general_ci     | cp850              |   4 | Yes        | Yes         |       1 |
| cp850_bin            | cp850              |  80 |            | Yes         |       1 |
| hp8_english_ci       | hp8                |   6 | Yes        | Yes         |       1 |
| hp8_bin              | hp8                |  72 |            | Yes         |       1 |
| koi8r_general_ci     | koi8r              |   7 | Yes        | Yes         |       1 |
| koi8r_bin            | koi8r              |  74 |            | Yes         |       1 |
| latin1_german1_ci    | latin1             |   5 |            | Yes         |       1 |
| latin1_swedish_ci    | latin1             |   8 | Yes        | Yes         |       1 |
| latin1_danish_ci     | latin1             |  15 |            | Yes         |       1 |
| latin1_german2_ci    | latin1             |  31 |            | Yes         |       2 |
| latin1_bin           | latin1             |  47 |            | Yes         |       1 |
| latin1_general_ci    | latin1             |  48 |            | Yes         |       1 |
| latin1_general_cs    | latin1             |  49 |            | Yes         |       1 |
| latin1_spanish_ci    | latin1             |  94 |            | Yes         |       1 |
| latin2_czech_cs      | latin2             |   2 |            | Yes         |       4 |
| latin2_general_ci    | latin2             |   9 | Yes        | Yes         |       1 |
| latin2_hungarian_ci  | latin2             |  21 |            | Yes         |       1 |
| latin2_croatian_ci   | latin2             |  27 |            | Yes         |       1 |
| latin2_bin           | latin2             |  77 |            | Yes         |       1 |
| swe7_swedish_ci      | swe7               |  10 | Yes        | Yes         |       1 |
| swe7_bin             | swe7               |  82 |            | Yes         |       1 |
| ascii_general_ci     | ascii              |  11 | Yes        | Yes         |       1 |
| ascii_bin            | ascii              |  65 |            | Yes         |       1 |
| ujis_japanese_ci     | ujis               |  12 | Yes        | Yes         |       1 |
| ujis_bin             | ujis               |  91 |            | Yes         |       1 |
| sjis_japanese_ci     | sjis               |  13 | Yes        | Yes         |       1 |
| sjis_bin             | sjis               |  88 |            | Yes         |       1 |
| hebrew_general_ci    | hebrew             |  16 | Yes        | Yes         |       1 |
| hebrew_bin           | hebrew             |  71 |            | Yes         |       1 |
| tis620_thai_ci       | tis620             |  18 | Yes        | Yes         |       4 |
| tis620_bin           | tis620             |  89 |            | Yes         |       1 |
| euckr_korean_ci      | euckr              |  19 | Yes        | Yes         |       1 |
| euckr_bin            | euckr              |  85 |            | Yes         |       1 |
| koi8u_general_ci     | koi8u              |  22 | Yes        | Yes         |       1 |
| koi8u_bin            | koi8u              |  75 |            | Yes         |       1 |
| gb2312_chinese_ci    | gb2312             |  24 | Yes        | Yes         |       1 |
| gb2312_bin           | gb2312             |  86 |            | Yes         |       1 |
| greek_general_ci     | greek              |  25 | Yes        | Yes         |       1 |
| greek_bin            | greek              |  70 |            | Yes         |       1 |
| cp1250_general_ci    | cp1250             |  26 | Yes        | Yes         |       1 |
| cp1250_czech_cs      | cp1250             |  34 |            | Yes         |       2 |
| cp1250_croatian_ci   | cp1250             |  44 |            | Yes         |       1 |
| cp1250_bin           | cp1250             |  66 |            | Yes         |       1 |
| gbk_chinese_ci       | gbk                |  28 | Yes        | Yes         |       1 |
| gbk_bin              | gbk                |  87 |            | Yes         |       1 |
| latin5_turkish_ci    | latin5             |  30 | Yes        | Yes         |       1 |
| latin5_bin           | latin5             |  78 |            | Yes         |       1 |
| armscii8_general_ci  | armscii8           |  32 | Yes        | Yes         |       1 |
| armscii8_bin         | armscii8           |  64 |            | Yes         |       1 |
| utf8_general_ci      | utf8               |  33 | Yes        | Yes         |       1 |
| utf8_bin             | utf8               |  83 |            | Yes         |       1 |
| utf8_unicode_ci      | utf8               | 192 |            | Yes         |       8 |
| utf8_icelandic_ci    | utf8               | 193 |            | Yes         |       8 |
| utf8_latvian_ci      | utf8               | 194 |            | Yes         |       8 |
| utf8_romanian_ci     | utf8               | 195 |            | Yes         |       8 |
| utf8_slovenian_ci    | utf8               | 196 |            | Yes         |       8 |
| utf8_polish_ci       | utf8               | 197 |            | Yes         |       8 |
| utf8_estonian_ci     | utf8               | 198 |            | Yes         |       8 |
| utf8_spanish_ci      | utf8               | 199 |            | Yes         |       8 |
| utf8_swedish_ci      | utf8               | 200 |            | Yes         |       8 |
| utf8_turkish_ci      | utf8               | 201 |            | Yes         |       8 |
| utf8_czech_ci        | utf8               | 202 |            | Yes         |       8 |
| utf8_danish_ci       | utf8               | 203 |            | Yes         |       8 |
| utf8_lithuanian_ci   | utf8               | 204 |            | Yes         |       8 |
| utf8_slovak_ci       | utf8               | 205 |            | Yes         |       8 |
| utf8_spanish2_ci     | utf8               | 206 |            | Yes         |       8 |
| utf8_roman_ci        | utf8               | 207 |            | Yes         |       8 |
| utf8_persian_ci      | utf8               | 208 |            | Yes         |       8 |
| utf8_esperanto_ci    | utf8               | 209 |            | Yes         |       8 |
| utf8_hungarian_ci    | utf8               | 210 |            | Yes         |       8 |
| ucs2_general_ci      | ucs2               |  35 | Yes        | Yes         |       1 |
| ucs2_bin             | ucs2               |  90 |            | Yes         |       1 |
| ucs2_unicode_ci      | ucs2               | 128 |            | Yes         |       8 |
| ucs2_icelandic_ci    | ucs2               | 129 |            | Yes         |       8 |
| ucs2_latvian_ci      | ucs2               | 130 |            | Yes         |       8 |
| ucs2_romanian_ci     | ucs2               | 131 |            | Yes         |       8 |
| ucs2_slovenian_ci    | ucs2               | 132 |            | Yes         |       8 |
| ucs2_polish_ci       | ucs2               | 133 |            | Yes         |       8 |
| ucs2_estonian_ci     | ucs2               | 134 |            | Yes         |       8 |
| ucs2_spanish_ci      | ucs2               | 135 |            | Yes         |       8 |
| ucs2_swedish_ci      | ucs2               | 136 |            | Yes         |       8 |
| ucs2_turkish_ci      | ucs2               | 137 |            | Yes         |       8 |
| ucs2_czech_ci        | ucs2               | 138 |            | Yes         |       8 |
| ucs2_danish_ci       | ucs2               | 139 |            | Yes         |       8 |
| ucs2_lithuanian_ci   | ucs2               | 140 |            | Yes         |       8 |
| ucs2_slovak_ci       | ucs2               | 141 |            | Yes         |       8 |
| ucs2_spanish2_ci     | ucs2               | 142 |            | Yes         |       8 |
| ucs2_roman_ci        | ucs2               | 143 |            | Yes         |       8 |
| ucs2_persian_ci      | ucs2               | 144 |            | Yes         |       8 |
| ucs2_esperanto_ci    | ucs2               | 145 |            | Yes         |       8 |
| ucs2_hungarian_ci    | ucs2               | 146 |            | Yes         |       8 |
| cp866_general_ci     | cp866              |  36 | Yes        | Yes         |       1 |
| cp866_bin            | cp866              |  68 |            | Yes         |       1 |
| keybcs2_general_ci   | keybcs2            |  37 | Yes        | Yes         |       1 |
| keybcs2_bin          | keybcs2            |  73 |            | Yes         |       1 |
| macce_general_ci     | macce              |  38 | Yes        | Yes         |       1 |
| macce_bin            | macce              |  43 |            | Yes         |       1 |
| macroman_general_ci  | macroman           |  39 | Yes        | Yes         |       1 |
| macroman_bin         | macroman           |  53 |            | Yes         |       1 |
| cp852_general_ci     | cp852              |  40 | Yes        | Yes         |       1 |
| cp852_bin            | cp852              |  81 |            | Yes         |       1 |
| latin7_estonian_cs   | latin7             |  20 |            | Yes         |       1 |
| latin7_general_ci    | latin7             |  41 | Yes        | Yes         |       1 |
| latin7_general_cs    | latin7             |  42 |            | Yes         |       1 |
| latin7_bin           | latin7             |  79 |            | Yes         |       1 |
| cp1251_bulgarian_ci  | cp1251             |  14 |            | Yes         |       1 |
| cp1251_ukrainian_ci  | cp1251             |  23 |            | Yes         |       1 |
| cp1251_bin           | cp1251             |  50 |            | Yes         |       1 |
| cp1251_general_ci    | cp1251             |  51 | Yes        | Yes         |       1 |
| cp1251_general_cs    | cp1251             |  52 |            | Yes         |       1 |
| cp1256_general_ci    | cp1256             |  57 | Yes        | Yes         |       1 |
| cp1256_bin           | cp1256             |  67 |            | Yes         |       1 |
| cp1257_lithuanian_ci | cp1257             |  29 |            | Yes         |       1 |
| cp1257_bin           | cp1257             |  58 |            | Yes         |       1 |
| cp1257_general_ci    | cp1257             |  59 | Yes        | Yes         |       1 |
| binary               | binary             |  63 | Yes        | Yes         |       1 |
| geostd8_general_ci   | geostd8            |  92 | Yes        | Yes         |       1 |
| geostd8_bin          | geostd8            |  93 |            | Yes         |       1 |
| cp932_japanese_ci    | cp932              |  95 | Yes        | Yes         |       1 |
| cp932_bin            | cp932              |  96 |            | Yes         |       1 |
| eucjpms_japanese_ci  | eucjpms            |  97 | Yes        | Yes         |       1 |
| eucjpms_bin          | eucjpms            |  98 |            | Yes         |       1 |
+----------------------+--------------------+-----+------------+-------------+---------+
126 rows in set (0.00 sec)
Note: a collation name ends with _ci (case-insensitive), _cs (case-sensitive), or _bin (binary, i.e. comparison by character code values).
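To make the suffix note concrete, here is a small sketch that compares the same two letters under collations with different suffixes. The character set introducers (_utf8, _latin1) are used so the result does not depend on the connection settings; the column aliases are made up for illustration.

```sql
-- _ci: letter case is ignored, so the comparison is expected to return 1.
SELECT _utf8'a' = _utf8'A' COLLATE utf8_general_ci AS ci_equal;

-- _bin: character codes are compared, so 'a' and 'A' differ (expected 0).
SELECT _utf8'a' = _utf8'A' COLLATE utf8_bin AS bin_equal;

-- _cs: case-sensitive language rules; latin1 ships such a collation (expected 0).
SELECT _latin1'a' = _latin1'A' COLLATE latin1_general_cs AS cs_equal;

-- List only the default collation of every character set.
SELECT CHARACTER_SET_NAME, COLLATION_NAME
FROM information_schema.COLLATIONS
WHERE IS_DEFAULT = 'Yes';
```

Note that in the listing above utf8 has no _cs collation, so utf8_bin is the usual choice when case must be significant for utf8 data.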
4. The character set each collation applies to:
mysql> select * from COLLATION_CHARACTER_SET_APPLICABILITY;
+----------------------+--------------------+
| COLLATION_NAME       | CHARACTER_SET_NAME |
+----------------------+--------------------+
| big5_chinese_ci      | big5               |
| big5_bin             | big5               |
| dec8_swedish_ci      | dec8               |
| dec8_bin             | dec8               |
| cp850_general_ci     | cp850              |
| cp850_bin            | cp850              |
| hp8_english_ci       | hp8                |
| hp8_bin              | hp8                |
| koi8r_general_ci     | koi8r              |
| koi8r_bin            | koi8r              |
| latin1_german1_ci    | latin1             |
| latin1_swedish_ci    | latin1             |
| latin1_danish_ci     | latin1             |
| latin1_german2_ci    | latin1             |
| latin1_bin           | latin1             |
| latin1_general_ci    | latin1             |
| latin1_general_cs    | latin1             |
| latin1_spanish_ci    | latin1             |
| latin2_czech_cs      | latin2             |
| latin2_general_ci    | latin2             |
| latin2_hungarian_ci  | latin2             |
| latin2_croatian_ci   | latin2             |
| latin2_bin           | latin2             |
| swe7_swedish_ci      | swe7               |
| swe7_bin             | swe7               |
| ascii_general_ci     | ascii              |
| ascii_bin            | ascii              |
| ujis_japanese_ci     | ujis               |
| ujis_bin             | ujis               |
| sjis_japanese_ci     | sjis               |
| sjis_bin             | sjis               |
| hebrew_general_ci    | hebrew             |
| hebrew_bin           | hebrew             |
| tis620_thai_ci       | tis620             |
| tis620_bin           | tis620             |
| euckr_korean_ci      | euckr              |
| euckr_bin            | euckr              |
| koi8u_general_ci     | koi8u              |
| koi8u_bin            | koi8u              |
| gb2312_chinese_ci    | gb2312             |
| gb2312_bin           | gb2312             |
| greek_general_ci     | greek              |
| greek_bin            | greek              |
| cp1250_general_ci    | cp1250             |
| cp1250_czech_cs      | cp1250             |
| cp1250_croatian_ci   | cp1250             |
| cp1250_bin           | cp1250             |
| gbk_chinese_ci       | gbk                |
| gbk_bin              | gbk                |
| latin5_turkish_ci    | latin5             |
| latin5_bin           | latin5             |
| armscii8_general_ci  | armscii8           |
| armscii8_bin         | armscii8           |
| utf8_general_ci      | utf8               |
| utf8_bin             | utf8               |
| utf8_unicode_ci      | utf8               |
| utf8_icelandic_ci    | utf8               |
| utf8_latvian_ci      | utf8               |
| utf8_romanian_ci     | utf8               |
| utf8_slovenian_ci    | utf8               |
| utf8_polish_ci       | utf8               |
| utf8_estonian_ci     | utf8               |
| utf8_spanish_ci      | utf8               |
| utf8_swedish_ci      | utf8               |
| utf8_turkish_ci      | utf8               |
| utf8_czech_ci        | utf8               |
| utf8_danish_ci       | utf8               |
| utf8_lithuanian_ci   | utf8               |
| utf8_slovak_ci       | utf8               |
| utf8_spanish2_ci     | utf8               |
| utf8_roman_ci        | utf8               |
| utf8_persian_ci      | utf8               |
| utf8_esperanto_ci    | utf8               |
| utf8_hungarian_ci    | utf8               |
| ucs2_general_ci      | ucs2               |
| ucs2_bin             | ucs2               |
| ucs2_unicode_ci      | ucs2               |
| ucs2_icelandic_ci    | ucs2               |
| ucs2_latvian_ci      | ucs2               |
| ucs2_romanian_ci     | ucs2               |
| ucs2_slovenian_ci    | ucs2               |
| ucs2_polish_ci       | ucs2               |
| ucs2_estonian_ci     | ucs2               |
| ucs2_spanish_ci      | ucs2               |
| ucs2_swedish_ci      | ucs2               |
| ucs2_turkish_ci      | ucs2               |
| ucs2_czech_ci        | ucs2               |
| ucs2_danish_ci       | ucs2               |
| ucs2_lithuanian_ci   | ucs2               |
| ucs2_slovak_ci       | ucs2               |
| ucs2_spanish2_ci     | ucs2               |
| ucs2_roman_ci        | ucs2               |
| ucs2_persian_ci      | ucs2               |
| ucs2_esperanto_ci    | ucs2               |
| ucs2_hungarian_ci    | ucs2               |
| cp866_general_ci     | cp866              |
| cp866_bin            | cp866              |
| keybcs2_general_ci   | keybcs2            |
| keybcs2_bin          | keybcs2            |
| macce_general_ci     | macce              |
| macce_bin            | macce              |
| macroman_general_ci  | macroman           |
| macroman_bin         | macroman           |
| cp852_general_ci     | cp852              |
| cp852_bin            | cp852              |
| latin7_estonian_cs   | latin7             |
| latin7_general_ci    | latin7             |
| latin7_general_cs    | latin7             |
| latin7_bin           | latin7             |
| cp1251_bulgarian_ci  | cp1251             |
| cp1251_ukrainian_ci  | cp1251             |
| cp1251_bin           | cp1251             |
| cp1251_general_ci    | cp1251             |
| cp1251_general_cs    | cp1251             |
| cp1256_general_ci    | cp1256             |
| cp1256_bin           | cp1256             |
| cp1257_lithuanian_ci | cp1257             |
| cp1257_bin           | cp1257             |
| cp1257_general_ci    | cp1257             |
| binary               | binary             |
| geostd8_general_ci   | geostd8            |
| geostd8_bin          | geostd8            |
| cp932_japanese_ci    | cp932              |
| cp932_bin            | cp932              |
| eucjpms_japanese_ci  | eucjpms            |
| eucjpms_bin          | eucjpms            |
+----------------------+--------------------+
126 rows in set (0.00 sec)
5. How to view a table's character set and collation:
mysql> show create table pre_ucenter_vars;
Table: pre_ucenter_vars
Create Table: CREATE TABLE `pre_ucenter_vars` (
  `name` char(32) NOT NULL default '',
  `value` char(255) NOT NULL default '',
  PRIMARY KEY (`name`)
) ENGINE=MEMORY DEFAULT CHARSET=gbk
1 row in set (0.00 sec)
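The output above shows DEFAULT CHARSET=gbk but no COLLATE clause, because SHOW CREATE TABLE leaves the collation out when the table uses the character set's default (gbk_chinese_ci for gbk, per the listing in section 3). A sketch of statements that report the collation explicitly, using the same table name as above:

```sql
-- The Collation column of the result holds the table's default collation.
SHOW TABLE STATUS LIKE 'pre_ucenter_vars';

-- The same value from information_schema; DATABASE() is the current schema.
SELECT TABLE_NAME, TABLE_COLLATION
FROM information_schema.TABLES
WHERE TABLE_SCHEMA = DATABASE()
  AND TABLE_NAME = 'pre_ucenter_vars';

-- Per-column collation, since a column may override the table default.
SHOW FULL COLUMNS FROM pre_ucenter_vars;
```

The table's character set is not stored separately in information_schema.TABLES; it follows from the collation, whose name starts with the character set name.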