Python_API_Structured Markup Processing Tools_sgmllib.SGMLParser.reset
2012-04-07 16:47
441 查看
API文档:
SGMLParser.reset()
Reset the instance. Loses all unprocessed data. This is called implicitly at instantiation time.
翻译文档:
reset
4000
有SGMLParser的__init__调用,在reset进行初始化的工作。
例子:
#! /usr/bin/env python
#coding=utf-8
from sgmllib import SGMLParser
class URLLister(SGMLParser):
def reset(self):
SGMLParser.reset(self)
self.urls=[]
def start_a(self,attrs):
href = [v for k,v in attrs if k=='href']
if href:
self.urls.extend(href)
import urllib
usock = urllib.urlopen('http://www.baidu.com')
parser = URLLister()
parser.feed(usock.read())
usock.close()
parser.close()
for url in parser.urls:
print url
SGMLParser.reset()
Reset the instance. Loses all unprocessed data. This is called implicitly at instantiation time.
翻译文档:
reset
4000
有SGMLParser的__init__调用,在reset进行初始化的工作。
例子:
#! /usr/bin/env python
#coding=utf-8
from sgmllib import SGMLParser
class URLLister(SGMLParser):
def reset(self):
SGMLParser.reset(self)
self.urls=[]
def start_a(self,attrs):
href = [v for k,v in attrs if k=='href']
if href:
self.urls.extend(href)
import urllib
usock = urllib.urlopen('http://www.baidu.com')
parser = URLLister()
parser.feed(usock.read())
usock.close()
parser.close()
for url in parser.urls:
print url
相关文章推荐
- Python_API_Structured Markup Processing Tools_sgmllib.SGMLParser
- Python_API_Structured Markup Processing Tools_sgmllib.SGMLParser.feed
- python sgmllib.SGMLParser的学习
- Python itertools 模块 api
- Python SGMLParser简介
- 用python解析html[SGMLParser]
- Processing binary structured data with Python
- Python SGMLParser 的1个BUG??
- 解析html的python库--SGMLParser
- [python]SGMLParser是如何工作的
- Python HTML Resolution Demo - SGMLParser & PyQuery
- 用python解析html[SGMLParser]
- python网页解析之SGMLParser
- 用python解析html--SGMLParser
- 用python解析html--SGMLParser
- SGMLParser (二) 分类: python 小练习 HTMLParser 2014-02-20 14:06 362人阅读 评论(0) 收藏
- python SGMLParser学习的一点心得
- 疑似BUG:Python SGMLParser处理html中的javascript失当
- pythonseleniumAPI
- TensorFlow 1.0更新python相关api更改