文章詳情頁

網頁爬蟲 - python爬蟲翻頁問題，請問各位大神我這段代碼怎樣翻頁，還有價格要登陸后才能看到，應該怎么解決

瀏覽：166日期：2022-08-06 14:43:40

問題描述

import urllib.requestimport reweb=urllib.request.urlopen(’https://www.gpyh.com/pricebuy/index?pageNum=1&hasStock=&goodsStandardId=1931&materialDictCode=&materialGroupCode=037001&diameter=&length=&brandId=&merchantId=’)neirong=web.read()def getPage(self,pageIndex): url = self.siteURL + '?pageNum=' + str(pageIndex) request = urllib2.Request(url) response = urllib2.urlopen(request) return response.read().decode(’gbk’)jiangrenhua=neirong.decode(’UTF-8’)RegularExpression=’<td>(.*)</td>’Valuable=re.findall(RegularExpression,jiangrenhua)information=[]for i in range(173): print(Valuable[i]

問題解答

回答1：

?pageNum=' + str(pageIndex)

這一個不就是你的頁碼控制嗎？登錄后才看到那就用cookie或者用戶名密碼模擬登錄后獲取

回答2：

httplib2基本應該是所有http請求的終結者了吧。

import httplib2import urllibhttp = httplib2.Http()url=’要獲取的地址’header={’Accept’:’text/html’, ’Accept-Encoding’:’gzip, deflate, sdch’, ’Accept-Language’:’zh-CN,zh;q=0.8’, ’Cache-Control’:’max-age=0’, ’Connection’:’keep-alive’, ’Cookie’:’cookie內容’, ’Upgrade-Insecure-Requests’:’1’, ’User-Agent’:’Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/45.0.2454.101 Safari/537.36’} #要有登陸狀態才能翻頁就要模擬登陸后把cookie放進去body_value={’username’:’test’,’password’:’123456’} #表單的所有內容body_value=urllib.urlencode(body_value) #utf8編碼response, content = http.request(url, ’GET’, headers=header,body=body_value) #GET或者POST方法response.encoding = ’utf-8’#content就是返回內容

Python 編程

上一條：python - angular route 與 django urls 沖突怎么解決？下一條：python - pyspider爬pdf爬了一小段時間后就不動了

排行榜

					
					css3 - [CSS] 動畫效果 3D翻轉bug
主從備份 - 跪求mysql 高可用主從方案
mysql優化 - mysql count(id)查詢速度如何優化?
angular.js - 不適用其他構建工具，怎么搭建angular1項目
python - django 里自定義的  login 方法，如何使用 login_required()
angular.js - angularjs 用ng-reapt渲染的dom  怎么獲取上面的屬性
node.js - node_moduls太多了
angular.js - Angular路由和express路由的組合使用問題
python如何不改動文件的情況下修改文件的 修改日期
mysql主從 - 請教下mysql 主動-被動模式的雙主配置 和 主從配置在應用上有什么區別？
java - 計算機圖像表示方法？
				

熱門標簽

国产成人精品久久免费动漫-国产成人精品天堂-国产成人精品区在线观看-国产成人精品日本-a级毛片无码免费真人-a级毛片毛片免费观看久潮喷

網頁爬蟲 - python爬蟲翻頁問題，請問各位大神我這段代碼怎樣翻頁，還有價格要登陸后才能看到，應該怎么解決