文章詳情頁

python 制作網(wǎng)站小說下載器

瀏覽：11日期：2022-06-27 14:16:50

基本開發(fā)環(huán)境

· Python 3.6

· Pycharm

相關(guān)模塊使用

python 制作網(wǎng)站小說下載器

目標(biāo)網(wǎng)頁分析

python 制作網(wǎng)站小說下載器

輸入想看的小說內(nèi)容，點擊搜索

python 制作網(wǎng)站小說下載器

這里會返回很多結(jié)果，我只選擇第一個

網(wǎng)頁數(shù)據(jù)是靜態(tài)數(shù)據(jù)，但是要搜索，是post請求，需要提價data參數(shù)，如下圖所示：

python 制作網(wǎng)站小說下載器

然后通過解析網(wǎng)站數(shù)據(jù)，獲取第一個小說i的詳情頁url即可

靜態(tài)網(wǎng)頁的獲取，難度是不大的。

def search(): search_url = ’http://www.xbiquge.la/modules/article/waps.php’ data = {’searchkey’: name } response = requests.post(url=search_url, data=data, headers=headers) selector = get_parsing(response.text) novel_url = selector.css(’.even a::attr(href)’).extract_first()1、獲取每本小說的章節(jié)名以及url地址

所有的章節(jié)名以及url地址，都包含在dd標(biāo)簽里面

python 制作網(wǎng)站小說下載器

2、獲取url后，需要拼接

’/23/23019/11409705.html’ # 這是網(wǎng)頁獲取到的url’http://www.xbiquge.la/23/23019/11409705.html’ # 這是真實的小說章節(jié)內(nèi)容url地址3、小說名字，直接獲取即可。

def download_one_book(index_url): response = get_response(index_url) response.encoding = response.apparent_encoding sel = get_parsing(response.text) book_name = sel.css(’#info h1::text’).get() # 提取了所有章節(jié)的下載地址 urls = sel.css(’#list dd a::attr(href)’).getall() # 不要最新的 12 章放在最前main for url in urls:chapter_url = ’http://www.xbiquge.la’ + urlprint(chapter_url)

保存下載每章小說內(nèi)容

def download_one_chapter(chapter_url, book_name): response = get_response(chapter_url) response.encoding = response.apparent_encoding html = response.text selector = get_parsing(html) h1 = selector.css(’.bookname h1::text’).get() content = selector.css(’#content::text’).getall() lines = [] for c in content:lines.append(c.strip()) print(h1) text = ’n’.join(lines) file = open(book_name + ’.txt’, mode=’a’, encoding=’utf-8’) file.write(h1) file.write(’n’) file.write(text) file.write(’n’) file.close()小說軟件界面

root = Tk()root.title(’小說下載器’)root.geometry(’560x450+400+200’) label = Label(root, text=’請輸入下載小說名字:’, font=(’華文行楷’, 20))label.grid() entry = Entry(root, font=(’隸書’, 20))entry.grid(row=0, column=1) text = Listbox(root, font=(’隸書’, 16), width=50, heigh=15)text.grid(row=2, columnspan=2) button1 = Button(root, text=’開始下載’, font=(’隸書’, 15), command=search)button1.grid(row=3, column=0) button2 = Button(root, text=’退出程序’, font=(’隸書’, 15), command=root.quit)button2.grid(row=3, column=1) root.mainloop()顯示下載內(nèi)容

def novel_load(title): text.insert(END, ’正在保存：{}’.format(title)) # 文本框滾動 text.see(END) # 更新 text.update()實現(xiàn)效果

python 制作網(wǎng)站小說下載器

以上就是python 制作網(wǎng)站小說下載器的詳細(xì)內(nèi)容，更多關(guān)于python 小說下載器的資料請關(guān)注好吧啦網(wǎng)其它相關(guān)文章！

Python 編程

上一條：python 多線程爬取壁紙網(wǎng)站的示例下一條：python 統(tǒng)計list中各個元素出現(xiàn)的次數(shù)的幾種方法

相關(guān)文章：

1. JSP之表單提交get和post的區(qū)別詳解及實例2. 詳解瀏覽器的緩存機(jī)制3. 存儲于xml中需要的HTML轉(zhuǎn)義代碼4. WML語言的基本情況5. ASP動態(tài)網(wǎng)頁制作技術(shù)經(jīng)驗分享6. .Net加密神器Eazfuscator.NET?2023.2?最新版使用教程7. python多線程和多進(jìn)程關(guān)系詳解8. Python xlrd/xlwt 創(chuàng)建excel文件及常用操作9. Xml簡介_動力節(jié)點Java學(xué)院整理10. Python 實現(xiàn)勞拉游戲的實例代碼（四連環(huán)、重力四子棋）

排行榜

					
					Spring Cloud Alibaba整合Sentinel的實現(xiàn)步驟
.Net加密神器Eazfuscator.NET?2023.2?最新版使用教程
Docker容器如何更新打包并上傳到阿里云
PHP利用COM對象訪問SQLServer、Access
Python xlrd/xlwt 創(chuàng)建excel文件及常用操作
python多線程和多進(jìn)程關(guān)系詳解
python 寫函數(shù)在一定條件下需要調(diào)用自身時的寫法說明
JSP之表單提交get和post的區(qū)別詳解及實例
ASP動態(tài)網(wǎng)頁制作技術(shù)經(jīng)驗分享
WML語言的基本情況
Xml簡介_動力節(jié)點Java學(xué)院整理