您的位置:首页 > 运维架构 > 网站架构

gzip 压缩格式的网站处理方法---sina.com

2013-07-16 17:40 281 查看

gzip 压缩格式的网站处理方法---sina.com

[code]#coding:utf-8

import urllib2

import re

import zlib



url_address = 'http://www.sina.com.cn'



f = urllib2.urlopen(url_address)



buf = f.read()

buf = zlib.decompress(buf, 16+zlib.MAX_WBITS);



#buf = buf.replace("\n","")

print buf

练习:

#coding:utf-8
import urllib2
import zlib

def main():
request = urllib2.urlopen('http://www.sina.com.cn')
print request.headers.get('Content-Encoding')#获得编码格式

buf = request.read()
buf = zlib.decompress(buf, 16+zlib.MAX_WBITS)
print buf

if __name__ == '__main__':
main()

方法二:
#coding:utf-8
import urllib2,StringIO,gzip

f = urllib2.urlopen('http://www.sina.com.cn')
print f.headers

compressdata = f.read()
compressdata = StringIO.StringIO(compressdata)

gzipper = gzip.GzipFile(fileobj=compressdata)

data = gzipper.read()
print data

[/code]
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: