Commit Graph

175 Commits

Author SHA1 Message Date
lededev
aae4df73fa javbus.py: clean up obsolete code 2021-10-19 01:00:50 +08:00
lededev
249884a27e javbus.py: optimize for speed 2021-10-19 00:58:28 +08:00
lededev
5da134986a storyline.py: bug fix 2021-10-19 00:17:45 +08:00
lededev
d80b2eeb7d javbus.py: optimize; fix the director, series, and other fields for uncensored titles 2021-10-19 00:14:26 +08:00
lededev
dd106453f7 clean up tags marked for deletion 2021-10-19 00:03:51 +08:00
lededev
4428971135 javdb.py: optimize; fix getActorPhoto() 2021-10-18 19:52:42 +08:00
lededev
5ef16e3a6d storyline (plot synopsis): add run mode run_mode, 0: sequential execution, 1: thread pool, 2: process pool 2021-10-18 18:09:36 +08:00
lededev
f553927913 speed up: temporarily disable the unimplemented actor photo feature in javdb and javbus 2021-10-18 17:58:21 +08:00
lededev
24b4f9f5e2 log the source website of the metadata for later evaluation 2021-10-18 10:51:32 +08:00
lededev
56bbfe6f24 storyline.py: skip SequenceMatcher when number match 2021-10-17 23:25:19 +08:00
lededev
3420f918f5 fix ratio.txt log lost newline 2021-10-17 22:53:53 +08:00
lededev
6624ed7224 clean up 2021-10-17 22:47:49 +08:00
lededev
bc3cda953d fix 2021-10-17 22:29:57 +08:00
lededev
a546c4e83e Parallel query on storyline data 2021-10-17 21:59:08 +08:00
lededev
189f4db616 javdb:get faster benefit from http keep-alive 2021-10-15 21:16:48 +08:00
lededev
f26987ddf9 move into try block 2021-10-12 11:42:30 +08:00
lededev
f8dc05a38b improve javbus and javdb outline source 2021-10-12 11:28:17 +08:00
lededev
0933e87944 fix outline of javbus and javdb which caused by airav down 2021-10-10 17:41:33 +08:00
lededev
b0959d1b18 javdb: pick a site at random when no unexpired cookies file is available 2021-10-09 20:29:17 +08:00
lededev
d010ea6d51 remove all conf parameters that were passed through from function to function 2021-10-09 19:42:11 +08:00
lededev
bd3504f3b5 javdb:only accept one login site after javdb site update 2021-10-09 19:32:00 +08:00
lededev
f601669229 javdb:change to site 31 and 32 2021-10-09 12:23:00 +08:00
lededev
a405c5c41b WebCrawler: switch everything to getInstance(); untangle the love-hate interdependence among airav.py, javbus.py, and javdb.py 2021-10-08 11:46:35 +08:00
lededev
0aa4c7d76c javdb.py:javdbx.json bugfix, find path before check days 2021-09-30 06:29:29 +08:00
lededev
3e1d951af8 drop tags whose returned value is empty 2021-09-28 18:36:20 +08:00
lededev
b5b2e7f0d8 temporarily disable the actor photo feature to improve speed, since the program does not implement it yet 2021-09-28 18:31:20 +08:00
lededev
75b71888d9 javdb*.json path search order 2021-09-27 22:20:37 +08:00
Yoshiko2
8f99f4b939 Merge pull request #597 from lededev/fc2-m
fc2.py: update
2021-09-27 22:02:07 +08:00
Yoshiko2
30bc6a59c6 Merge pull request #591 from lededev/xcity-f1
xcity.py: get detail page by form query
2021-09-27 22:00:43 +08:00
lededev
161f4063b9 fc2.py: update 2021-09-26 11:38:48 +08:00
lededev
c6efec91dd add a failed-file list to avoid re-scraping; applies to mode 3 and soft links 2021-09-26 04:25:25 +08:00
lededev
4ffc34a5cf xcity on top when number similar ABP321 2021-09-25 20:54:07 +08:00
Yoshiko2
8cfefc60ef Merge pull request #594 from lededev/schar-2
replace special characters after translate, null str do not write back
2021-09-25 16:43:41 +08:00
lededev
43bb64d7d0 xcity.py: Strictly limit the number 2021-09-25 06:53:40 +08:00
lededev
6c990e8482 xcity.py: Mode 3 requires the file name to remain unchanged 2021-09-25 06:45:08 +08:00
lededev
c41df40e9f replace special characters after translate, null str do not write back 2021-09-24 01:11:25 +08:00
lededev
50574a705b carib.py: add outline/series/actor_photo 2021-09-23 15:45:00 +08:00
lededev
5e0e8b9cea WebCrawler site list in default config.ini larger than 60 2021-09-23 15:43:00 +08:00
lededev
54ed626294 remove abs_url(), just urljoin() is enough 2021-09-23 08:21:01 +08:00
lededev
c599463409 rewrite getActorPhoto() to get real photo 2021-09-23 07:58:53 +08:00
lededev
c32a4a12ac speed up by reusing stateful browser 2021-09-23 07:01:24 +08:00
lededev
b59b4938d6 xcity.py: get detail page by form query 2021-09-22 06:03:58 +08:00
lededev
d368d061f2 replace special characters in outline json node 2021-09-19 17:53:18 +08:00
lededev
d4f6abe1be special characters replacement in all json text nodes 2021-09-15 17:29:05 +08:00
Mathhew
1204076366 Check sources 2021-08-09 11:09:42 +08:00
yoshiko2
91c7b016a2 Adjust order for sources 2021-08-03 02:52:38 +08:00
Mathhew
08df7383a5 Make webcrawler clear 2021-07-29 10:28:25 +08:00
Mathhew
2c41487a4e Move get_data_from_json to WebCrawler 2021-07-28 16:12:08 +08:00
tojito
1cb54fb956 fix fc2 exception of getTrail() 2021-07-20 23:54:36 +08:00
tojito
9ed239105a add fc2club 2021-07-19 01:04:50 +08:00