您的位置:首页 > 编程语言 > PHP开发

最新抓取淘宝价格数据,图片标题的就更好抓了整合就好

2015-08-05 14:29 736 查看
淘宝现在变成https了。不过难不倒咱们这帮程序员的,不说废话

<?php
$url='https://mdskip.taobao.com/core/initItemDetail.htm?itemId=520048818994';
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
//设置来源链接,这里是商品详情页链接
curl_setopt($ch,CURLOPT_REFERER,"https://detail.tmall.com/item.htm?spm=a220m.1000858.1000725.16.e5qIwL&id=520048818994&skuId=3100219519273&areaId=110000&cat_id=2&rn=530571831843b5644f85de9cfa5ee2f3&user_id=2153724245&is_b=1");
curl_setopt($ch,CURLOPT_RETURNTRANSFER,true);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);//规避证书
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1); // 防止302 盗链
$result = curl_exec($ch);
curl_close($ch);
//去除回车、空格等
$result=str_replace(array("\r\n","\n","\r","\t",chr(9),chr(13)),'',$result);
$mode="#([0-9]+)\:#m";
preg_match_all($mode,$result,$s);
$s=$s[1];
if(count($s)>0){
foreach($s as $v){
$result=str_replace($v.':','"'.$v.'":',$result);
}
}
//将字符编码转为utf-8,并且将中文转译,否则json_decode会出现错误
$result=iconv('gb2312','utf-8',$result);
$str=array();
$mode='/([\x80-\xff]*)/i';
if(preg_match_all($mode,$result,$s)){
foreach($s[0] as $v){
if(!empty($v)){
$str[base64_encode($v)]=$v;
$result=str_replace('"'.$v.'"','"'.base64_encode($v).'"',$result);
}
}
}
$result=json_decode($result,true);

foreach($result['defaultModel']['itemPriceResultDO']['priceInfo'] as $k=>$v) {
if (empty($v['promotionList'])) {
$arrprice = $v['price'];
} else {
$arrprice = $v['promotionList'][0]['price'];
}

}
print_r($arrprice);

?>
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签:  php 抓取天猫