Rendered/interactive javascript with gtk/webkit/jswebkit
2012-03-01 01:00
453 查看
Scrapy snippets: Rendered/interactive javascript with gtk/webkit/jswebkit
Rendered/interactive javascript with gtk/webkit/jswebkit
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 | from scrapy.http import Request, FormRequest, HtmlResponse import gtk import webkit import jswebkit class WebkitDownloader( object ): def process_request( self, request, spider ): if( type(request) is not FormRequest ): webview = webkit.WebView() webview.connect( 'load-finished', lambda v,f: gtk.main_quit() ) webview.load_uri( request.url ) gtk.main() js = jswebkit.JSContext( webview.get_main_frame().get_global_context() ) renderedBody = str( js.EvaluateScript( 'document.documentElement.innerHTML' ) ) return HtmlResponse( request.url, body=renderedBody ) |
相关文章推荐
- Manipulating Pages and Server Controls with JavaScript(使用JS处理页面和服务器控件)
- Scraping JavaScript webpages with webkit | WebScraping.com
- phantomjs-1.4.1-linux-x86_64-dynamic.tar.gz - phantomjs - PhantomJS 1.4.1 Linux x86_64 (Dynamic build) - headless WebKit with JavaScript API - Google Project Hosting
- Issue 6 - phantomjs - Debugging with Web Inspector - headless WebKit with JavaScript API - Google Project Hosting
- [Javascript + rxjs] Simple drag and drop with Observables
- JS(javaScript)的with用法
- JavaScriptCore, WebKit的JS实现简介
- Rainyday.js – Rendering Raindrops with JavaScript
- JS(javaScript)的with用法
- Issue 6 - phantomjs - Debugging with Web Inspector - headless WebKit with JavaScript API - Google Project Hosting
- Javascript Module pattern template. Shows a class with a constructor and public/private methods/properties. Also shows compatibility with CommonJS(eg Node.JS) and AMD (eg requireJS) as well as in a br
- CS001496 - Gather data from web page with JavaScript, WebKit, and Qt
- Troubleshooting - phantomjs - Troubleshooting - headless WebKit with JavaScript API - Google Project Hosting
- Creating Web Applications in JavaScript with Express.js and ejs
- Issue 132 - phantomjs - Passing variables to page for page.evaluate - headless WebKit with JavaScript API - Google Project Hosting
- [Poi] Use Poi to Build an Index.js with Modern JavaScript Features
- CS001496 - Gather data from web page with JavaScript, WebKit, and Qt
- [Node.js] Scraping Dynamic JavaScript Websites with Nightmare
- JavaScriptCore, WebKit的JS实现
- CS001497 - Add data to a web page with JavaScript, WebKit, and Qt