您的位置:首页 > 编程语言 > Python开发

python 抽取信息

2013-08-22 19:55 169 查看
获取网页中的信息,用到了BeautifulSoup和tornado

#!/usr/bin/env python3
from bs4 import BeautifulSoup
#import tornado.httpclient
import tornado
from tornado import httpclient
cli=tornado.httpclient.HTTPClient()
link='http://www.iciba.com/'
search=raw_input('search: ')

link+=search
data=cli.fetch(link)
body=data.body.decode('utf8')

soup=BeautifulSoup(body)

group=soup.find_all(class_='group_pos')

group2=group[0].find_all('p')
for ele in group2:
print(ele.find(class_='fl').get_text())
result=ele.find_all('label')
for r in result:
print(r.get_text())
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: