8comic download problem report #391
Comments
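Which episode is it?
https://www.8comic.com/html/13736.html, 305 episodes in total.
I experimented with a Chrome incognito window, but if I copy the URL, I suspect the HTML that comiccrawler gets is the second case, the one that is not the comic page.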
Try enabling errorlog:
If you can edit the code, find eight.py and make the following change:
diff --git a/comiccrawler/mods/eight.py b/comiccrawler/mods/eight.py
index 815e10a..ffc57ef 100644
--- a/comiccrawler/mods/eight.py
+++ b/comiccrawler/mods/eight.py
@@ -71,6 +71,9 @@ j_js = ""
 lazy_js = ""
 
 def get_images(html, url):
+	import pathlib
+	pathlib.Path("8comic.html").write_text(html, encoding="utf-8")
+
 	global j_js
 	if not j_js:
 		j_js = re.search(r'src="([^"]*/j\.js[^"]*)"', html).group(1)

This way, when the error occurs, the HTML source will be written to 8comic.html.
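
The patch above writes the page on every call to `get_images`. A variant that only writes the file when the `j.js` script tag is missing might look like the sketch below. `find_j_js` and its `dump_path` parameter are hypothetical names for illustration and are not part of ComicCrawler; the regex is the one from the diff.

```python
import pathlib
import re

# Regex copied from the diff above.
J_JS_PATTERN = re.compile(r'src="([^"]*/j\.js[^"]*)"')


def find_j_js(html, dump_path="8comic.html"):
    """Return the j.js URL, or save the HTML and raise if it is missing.

    Hypothetical helper for illustration; not part of ComicCrawler.
    """
    match = J_JS_PATTERN.search(html)
    if match is None:
        # Keep the unexpected page so it can be inspected or attached here.
        pathlib.Path(dump_path).write_text(html, encoding="utf-8")
        raise ValueError(f"j.js script tag not found; HTML saved to {dump_path}")
    return match.group(1)
```

With this shape the error message names the saved file, instead of failing with an `AttributeError` on `None`.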
I also added some extra prints:
Start downloading 炎炎之消防隊-無限-8comic
[crawler.py][get_images]self.html
[eight.py][get_images]html
I downloaded the latest version of ComicCrawler today,
but downloading comics from 8comic still fails.
Comic URL:
https://www.8comic.com/html/13736.html
Error:
Traceback (most recent call last):
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\crawler.py", line 393, in error_loop
    process()
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\crawler.py", line 342, in download
    crawler.init()
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\crawler.py", line 58, in init
    self.init_images(self.ep.current_page - 1)
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\crawler.py", line 65, in init_images
    self.get_images()
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\crawler.py", line 235, in get_images
    images = self.mod.get_images(
             ^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\LIMIT\AppData\Local\Programs\Python\Python311\Lib\site-packages\comiccrawler\mods\eight.py", line 76, in get_images
    j_js = re.search(r'src="([^"]*/j\.js[^"]*)"', html).group(1)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'group'
wait 10 seconds...
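
The traceback is consistent with `re.search` returning `None`: if the fetched page contains no `j.js` script tag, there is no match object to call `.group(1)` on. A minimal reproduction, using a made-up HTML string as a stand-in because the real response is not shown in this thread:

```python
import re

# Stand-in for a response that is not the comic reader page (an assumption;
# the actual HTML returned by the site is not included in this thread).
html = "<html><body>not the reader page</body></html>"

# Same pattern as in eight.py.
match = re.search(r'src="([^"]*/j\.js[^"]*)"', html)
print(match)  # None, because the pattern does not occur in html

try:
    match.group(1)
except AttributeError as err:
    error_message = str(err)
    print(error_message)  # 'NoneType' object has no attribute 'group'
```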