
Java Web Crawler (UTF-8)

2016-07-17 13:39
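import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public class MyURL {
    public static void main(String[] args) throws IOException {
        URL url = new URL("http://inankai.cn/new/index.html");

        // Decode the response as UTF-8 and write the local copy as UTF-8 too,
        // so the text does not depend on the platform default charset.
        // try-with-resources closes both streams even if an exception is thrown.
        try (BufferedReader br = new BufferedReader(
                 new InputStreamReader(url.openStream(), StandardCharsets.UTF_8));
             BufferedWriter bw = new BufferedWriter(
                 new OutputStreamWriter(new FileOutputStream("D:/inankai.html"), StandardCharsets.UTF_8))) {
            String msg;
            while ((msg = br.readLine()) != null) {
                System.out.println(msg); // echo each line to the console
                bw.write(msg);           // and save it to the local file
                bw.newLine();
            }
        }

        /*
         * Alternative: read raw bytes. Note that new String(flush, 0, len) uses the
         * platform default charset, and a multi-byte character can be split across
         * two reads, so non-ASCII text may come out garbled.
         *
         * InputStream is = url.openStream();
         * byte[] flush = new byte[1024];
         * int len = 0;
         * while (-1 != (len = is.read(flush))) {
         *     String str = new String(flush, 0, len);
         *     System.out.println(str);
         * }
         * is.close();
         */
    }
}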
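The listing above keeps a commented-out byte loop next to the Reader-based version. The following is a minimal sketch, not part of the original post, of why the Reader version is the safer choice for UTF-8 pages. The class name `Utf8ChunkDemo` and both helper methods are hypothetical, and the deliberately tiny 4-byte buffer is an assumption chosen only to force a character to straddle two reads. No network access is needed; the input is an in-memory byte array.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

public class Utf8ChunkDemo {
    // Hypothetical helper: decodes the whole stream through a Reader, which
    // carries incomplete multi-byte sequences over from one read to the next.
    public static String decodeWithReader(InputStream in, int bufferSize) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (Reader reader = new InputStreamReader(in, StandardCharsets.UTF_8)) {
            char[] buf = new char[bufferSize];
            int len;
            while ((len = reader.read(buf)) != -1) {
                sb.append(buf, 0, len);
            }
        }
        return sb.toString();
    }

    // Hypothetical helper: decodes each byte chunk on its own, like the
    // commented-out loop (here with an explicit charset). A character whose
    // bytes are split across two chunks is replaced by U+FFFD.
    public static String decodePerChunk(InputStream in, int bufferSize) throws IOException {
        StringBuilder sb = new StringBuilder();
        byte[] buf = new byte[bufferSize];
        int len;
        while ((len = in.read(buf)) != -1) {
            sb.append(new String(buf, 0, len, StandardCharsets.UTF_8));
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // Four Chinese characters, three UTF-8 bytes each (12 bytes in total).
        String text = "\u5357\u5f00\u5927\u5b66";
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);

        // A 4-byte buffer guarantees that characters straddle chunk boundaries.
        System.out.println(decodeWithReader(new ByteArrayInputStream(bytes), 4).equals(text)); // true
        System.out.println(decodePerChunk(new ByteArrayInputStream(bytes), 4).equals(text));   // false
    }
}
```

With a 1024-byte buffer the per-chunk approach fails less often, but it can still corrupt a character whenever a read happens to end in the middle of a multi-byte sequence, which is why wrapping the stream in an `InputStreamReader` with an explicit charset is usually preferable.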