您的位置:首页
selenium_采集药品数据1_采集第一页表格
2016-05-08 09:51
211 查看
Python爬虫视频教程零基础小白到scrapy爬虫高手-轻松入门
https://item.taobao.com/item.htm?spm=a1z38n.10677092.0.0.482434a6EmUbbW&id=564564604865
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/5a5c3e183611015667b28f13117e26ce.png)
https://item.taobao.com/item.htm?spm=a1z38n.10677092.0.0.482434a6EmUbbW&id=564564604865
![](https://oscdn.geek-share.com/Uploads/Images/Content/201605/5a5c3e183611015667b28f13117e26ce.png)
# -*- coding: utf-8 -*- """ Created on Sat May 7 09:32:30 2016 @author: daxiong """ import requests,bs4,csv,time,selenium from selenium import webdriver site1="http://118.114.237.85:8081/searchbio.aspx" charset="gb2312" browser=webdriver.Firefox() browser.get(site1) elems=browser.find_elements_by_class_name("tb") elems1= elems[1:] content=[i.text for i in elems1] #功能:将list对象N等分 def div_list(ls,n): if not isinstance(ls,list) or not isinstance(n,int): return [] ls_len = len(ls) if n<=0 or 0==ls_len: return [] if n > ls_len: return [] elif n == ls_len: return [[i] for i in ls] else: j = int(ls_len/n) ls_return = [] for i in range(0,(n-1)*j,j): ls_return.append(ls[i:i+j]) #算上末尾的j+k ls_return.append(ls[(n-1)*j:]) return ls_return def Write_table_to_csv(fileName,list_tableContent): #对列表格式修改,字符串写入的格式不对 file=open(fileName,'w',newline='') writer1=csv.writer(file) writer1.writerows(list_tableContent) file.close() content1=div_list(content,20) Write_table_to_csv("1.csv",content1)
相关文章推荐
- (1)NSThread
- swift基础学习(04)[闭包、函数、枚举、类和结构体]
- zookeeper分布式锁
- 为sql server 增加 parseJSON 和 ToJSON 函数
- zookeeper分布式锁
- C#版 选择法、冒泡法、插入法和快速排序法分析与对比
- java 死锁产生原因及解锁
- azure 云上 oracle11.2.0.4里dataguard归档日志传输 1034 问题详细解决过程
- java或者jsp中修复会话标识未更新漏洞
- 工厂模式
- 10期4月份博客期刊
- [shell] 文本操作
- [转]javascript中基本类型和引用类型的区别分析
- linux grep
- linux grep
- Vim for Python & Normal Plugins & Vundle
- ios Sqlite 的基本使用
- Java千百问_06数据结构(012)_如何遍历数组
- java中public private protected default的区别
- python爬虫抓取51cto博客大牛的文章保存到MySQL数据库