UTF-8转换GB2312
2005-02-03 16:06
162 查看
Java则采用了平台无关的UNICODE,Java从一个byte流中读取一个字符串时,将把平台相关的byte转变为平台无关的Unicode字符串。在输出时Java将把Unicode字符串转变为平台相关的byte流,如果某个Unicode字符在某个平台上不存在,将会输出一个'?'。
GB2312编码中一个汉字是16位的,而UTF8中一个汉字却是24位的。
如果发出页面采用了UTF-8的编码,怎样实现转换到gb2312?
参照下例:
test1.htm
<html>
<head>
<title>Untitled Document</title>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p><a href="test2.jsp?abc=A打打B">啊啊啊啊啊啊</a></p>
</body>
</html>
test2.jsp
<%@ page contentType="text/html; charset=UTF-8" language="java" import="java.sql.*" errorPage="" %>
<html>
<head>
<title>Untitled Document</title>
</head>
<body>
<%
//String tofield = new String(request.getParameter("abc").getBytes("UTF-8"),"gb2312");
String s = request.getParameter("abc");
StringBuffer sb = new StringBuffer();
for(int i=0; i<s.length(); i++) {
char c = s.charAt(i);
switch (c) {
case '+':
sb.append(' ');
break;
case '%':
try {
sb.append((char)Integer.parseInt(
s.substring(i+1,i+3),16));
}
catch (NumberFormatException e) {
throw new IllegalArgumentException();
}
i += 2;
break;
default:
sb.append(c);
break;
}
}
// Undo conversion to external encoding
String result = sb.toString();
byte[] inputBytes = result.getBytes("8859_1");
out.println(new String(inputBytes,"UTF-8"));
%>
</body>
</html>
GB2312编码中一个汉字是16位的,而UTF8中一个汉字却是24位的。
如果发出页面采用了UTF-8的编码,怎样实现转换到gb2312?
参照下例:
test1.htm
<html>
<head>
<title>Untitled Document</title>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p><a href="test2.jsp?abc=A打打B">啊啊啊啊啊啊</a></p>
</body>
</html>
test2.jsp
<%@ page contentType="text/html; charset=UTF-8" language="java" import="java.sql.*" errorPage="" %>
<html>
<head>
<title>Untitled Document</title>
</head>
<body>
<%
//String tofield = new String(request.getParameter("abc").getBytes("UTF-8"),"gb2312");
String s = request.getParameter("abc");
StringBuffer sb = new StringBuffer();
for(int i=0; i<s.length(); i++) {
char c = s.charAt(i);
switch (c) {
case '+':
sb.append(' ');
break;
case '%':
try {
sb.append((char)Integer.parseInt(
s.substring(i+1,i+3),16));
}
catch (NumberFormatException e) {
throw new IllegalArgumentException();
}
i += 2;
break;
default:
sb.append(c);
break;
}
}
// Undo conversion to external encoding
String result = sb.toString();
byte[] inputBytes = result.getBytes("8859_1");
out.println(new String(inputBytes,"UTF-8"));
%>
</body>
</html>
相关文章推荐
- Asp.net把UTF-8编码转换为GB2312编码(转)
- 把UTF-8编码转换为GB2312编码[转]
- 字符集之间转换(UTF-8,UNICODE,Gb2312) c++
- asp.ent中把utf-8转换成gb2312格式
- utf-8编码转换成gb2312
- UTF-8,UNICODE,Gb2312他们3个之间的相互转换.
- linux C C++ 字符集转换,UTF-8,GB2312
- Xoops(MySql) gb2312 编码 转换至 utf-8 的 具体步骤 方法, by Emerald 绿色学院 - Green Institute.
- ASP写的汉字转换UTF-8及UTF-8转GB2312
- GB2312和Utf-8编码的转换[2]
- UTF-8转换成GB2312的全处理
- 把UTF-8编码转换为GB2312编码(转csdn)
- MFC中 GB2312、UTF-8、unicode 之间转换
- linux下UTF-8和GB2312之间转换
- form提交时的字符编码转换之utf-8转gb2312
- 字符编码知识:Unicode、UTF-8、ASCII、GB2312等编码之间是如何转换的?[转]
- 字符编码知识:Unicode、UTF-8、ASCII、GB2312等编码 及 转换
- C++将 URL转换成正常字符,支持 utf-8 和 gb2312
- gb2312 utf-8判断及转换
- UTF-8,UNICODE,ANSII,GBK,GB2312相互转换