A survey paper: A brief survey of web data extraction tools
2009-01-07 14:54
A classic survey; scholar.google.cn shows that this paper has been cited more than 300 times.
Laender, A. H. F.; Ribeiro-Neto, B. A.; da Silva, A. S. & Teixeira, J. S. A brief survey of web data extraction tools. SIGMOD Rec., ACM, 2002, 31, 84-93
Abstract: In the last few years, several works in the literature have addressed
the problem of data extraction from Web pages. The importance of this
problem derives from the fact that, once extracted, the data can be
handled in a way similar to instances of a traditional database. The
approaches proposed in the literature to address the problem of Web
data extraction use techniques borrowed from areas such as natural
language processing, languages and grammars, machine learning,
information retrieval, databases, and ontologies. As a consequence,
they present very distinct features and capabilities which make a
direct comparison difficult to be done. In this paper, we propose a
taxonomy for characterizing Web data extraction tools, briefly survey
major Web data extraction tools described in the literature, and
provide a qualitative analysis of them. Hopefully, this work will
stimulate other studies aimed at a more comprehensive analysis of data
extraction approaches and tools for Web data.