Script Summary: Generate g.e-hentai.org picture URLs for batch downloading.
I now post updates on my blog; you can find it with Google. The latest version is working.
I am tired of g.e's bans.
Lofi and exhentai versions are also included.
key: keating42
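Hi everyone, I want to move updates for this script to a new place.
I have a new version ready. Can someone suggest a place to post it?
V4: New version. Changed the number 80 to 120. I also have a more rigorous method, but I will only release it the next time g.e updates; fixing this over and over is tiring...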
^_^
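V3.1: Fixed the XPath selection condition and removed the dependency on 'style', so the script is less likely to be blocked.
V3: I have completely reworked element retrieval and XML parsing. I can now say that, as long as the site uses HTML, the regex parsing should not need major changes, unless g.e changes the length of the source image's src. Even that does not matter much: you can fix it yourself by counting the length of the source URL and changing the number "80" (120).
V2: Even the alt number is no longer needed. I have finished a simpler and more efficient parser. Unless g.e is willing to change a lot of code or even the server architecture, this is the best I can do without violating HTML. It should also be the least likely to break when the page source changes.
V1: The regex matching is actually simple; you can adjust it yourself against the page source. The main change is the matching of next_url: I use the numeric identifier in alt to match it, so there should be no big problems now. If g.e wants to change this again, they need to change a lot of code or remove the identifier. If they remove the numeric identifier, I probably have no good way to fix it...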
update2.4 FINAL V4: The site changed the src length, so the number 80 is now 120. In fact I have a better way to parse, and I will release it when g.e encrypts the code again ^_^
I changed getimg() and parse(). If the code changes again, you can fix it yourself by changing the number '120', which is the length of the main img src.
update2.4 FINAL V3 (3.1: XPath no longer depends on 'style'; 3.2: '120'): g.e-hentai encrypted the code again, and the script could not get the first main img with the old XPath. This is fixed now: getimg() uses a more effective way to get the img.
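The V3 and V4 notes say the main picture is recognised by the length of its src, and that the number (80, later 120) can be adjusted. The real getimg() is not shown here, so the following is only a minimal sketch of that idea. The function name pickMainImageSrc, the "at least this long" comparison, and the example URLs are all assumptions.

```javascript
// Assumption: the main picture's src is much longer than the src of icons
// and navigation images, so a length threshold can tell them apart.
const MIN_MAIN_SRC_LENGTH = 120; // the number the notes say to adjust (was 80)

// Hypothetical helper: return the first src that is at least minLength
// characters long, or null when no src qualifies.
function pickMainImageSrc(srcList, minLength = MIN_MAIN_SRC_LENGTH) {
  for (const src of srcList) {
    if (typeof src === 'string' && src.length >= minLength) {
      return src;
    }
  }
  return null;
}
```

If the site changes the src length again, only the constant would need to change in a sketch like this.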
=====================================================================
update2.4 FINAL V2: I now use a simple and effective way to parse, and I think it is perfect. There are no parse errors any more, only xmlhttp errors.
On an xml error you can click continue to carry on.
=====================================================================
update2.4 FINAL: g.e-hentai encrypted the code again, and I fixed it. g.e now gives the nextpage_img a random name such as "W.ffs", but I now parse the URL by the alt number, so it should keep working for a long time. I think this version is the last one: I will be away on business for at least 3 months. If I have time I may work on this script. If you need a fix, please send me a message. Have fun everyone!!!!!!!
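Parsing the next-page URL by the alt number, instead of by the image file name, could look roughly like the sketch below. The page markup is not shown in these notes, so the shape `<a href="..."><img alt="N"></a>`, the function name findNextUrl, and the rule "smallest alt number greater than the current one" are assumptions for illustration only.

```javascript
// Hypothetical helper: find the link whose image carries the next number in
// its alt attribute. The markup shape matched here is an assumption.
function findNextUrl(html, currentNumber) {
  // Matches <a ... href="URL" ...><img ... alt="DIGITS" ...>
  const linkPattern = /<a\s[^>]*href="([^"]+)"[^>]*>\s*<img\s[^>]*alt="(\d+)"[^>]*>/gi;
  let best = null;
  for (const match of html.matchAll(linkPattern)) {
    const number = parseInt(match[2], 10);
    // Keep the smallest number that is still greater than the current one.
    if (number > currentNumber && (best === null || number < best.number)) {
      best = { url: match[1], number };
    }
  }
  return best === null ? null : best.url;
}
```

Because the match depends only on the number in alt, a random image name such as "W.ffs" would not affect it.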
---------------------------------------------------
update2.3: g.e-hentai encrypted the code again, and I fixed it.
In fact, I am tired of the G.E site. I have set a time limit. I do not know why G.E always changes the code; maybe because of ads?
Maybe this is the last version. In fact you can fix it yourself by changing some regexes. I will be away on business for at least 3 months.
------------------------------------------------------
update2.2: G.E-H.ORG changed their system.
Many images now have an ID, so I changed the parse function to handle it.
-------------------------------------------
update2.1: G.E-H.ORG changed their system.
The image URL is now the direct image URL, so I changed the parse function to handle it.
-------------------------------------------
Hi, I am back and have fixed this script again.
Most important, Happy Chinese New Year!!!!!!!!
Happy Spring Festival, everyone!!!!!!!!!
update2.0: I now have time to fix this script. I only changed some regular expressions, and it works better now. I am sorry I did not fix this script for so long; I was too busy.
------------------------------------------
update1.9: I tried to fix it. Because of a bug in the standard XMLHttpRequest, the referer did not change; it was always the first page. When I completed the standard XMLHttpRequest code it worked fine, but they may have changed something to block it, so I now use GM_xmlhttpRequest again. It does not send a referer, so it may work fine now, and the cookie also seems to work. (If there are still problems, please use the old version from Oct 19, 2009 14:40.)
I will be away on business this week, so I may have no time to update.
-------------------------------------------
update 1.8: I completed the cookie code, so the script now looks like a normal browser.
I do not know how effective this is, but I did my best.
----------------------------------------
update 1.7: I replaced GM_xmlhttpRequest with the standard XMLHttpRequest, so the referer is now sent correctly.
With this version, if you set a suitable timeout (3500+, depending on your network conditions), I am sure it will not cause an IP ban, but the picture server also has a traffic limit. You can use a proxy.
--------------------------------------------------
update 1.6: I tried changing the header. It now seems able to parse more than 200 pictures, and I have never seen the ban message, so it may work now.
It may avoid the ban or it may not, but it really does parse the image URLs, and it is working fine.
You can change the '2500' in 'setTimeout(do_next,2500);'. '2500' means 2.5 s.
A longer value lets the script keep working longer.
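The delay between page requests can be sketched as a chain of setTimeout calls, as below. This is not the script's real do_current()/do_next() code, which is not shown here. The names createPacedRunner, handlePage, and schedule are hypothetical, and the scheduler is injectable only so that the pacing can be checked without waiting.

```javascript
const DELAY_MS = 2500; // 2.5 s, the value quoted above; raise it to be gentler

// Hypothetical helper: returns a doNext function that handles one page per
// call and schedules the following page after delayMs.
function createPacedRunner(pageUrls, handlePage, schedule = setTimeout, delayMs = DELAY_MS) {
  let index = 0;
  function doNext() {
    if (index >= pageUrls.length) {
      return; // nothing left to fetch
    }
    handlePage(pageUrls[index]);
    index += 1;
    if (index < pageUrls.length) {
      schedule(doNext, delayMs); // wait before asking for the next page
    }
  }
  return doNext;
}
```

Calling `createPacedRunner(urls, fetchPage)()` would start the chain with the default 2.5 s gap, where `fetchPage` stands for whatever function requests and parses one page.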
______________________________________________________________
update 1.5: I tried setting the correct referer and a long timeout, but even though you can get the picture URLs, g.e may still ban your IP.
-----------------------------------------
update 1.4: - - ..g.e-hentai encrypted the code yet again on 10/17/2009... and I fixed it yet again.........
-----------------------------------------------------------------------
update 1.3: - - ..g.e-hentai encrypted the code again on 10/17/2009... and I fixed it again.........
-----------------------------------------------------------------------
update 1.2: In do_current() I now call do_next() through 'setTimeout(do_next,1500);', a 1.5-second delay that helps avoid the IP ban problem. Thanks to neozone for the advice.
------------------------------------------
update 1.1: g.e-hentai downloader fixed* stopped working on 10/15/2009 (the imgid and the div id were changed). I changed the parsed id and the img regex, and it is working again.
-----------------------------------------------------------------------
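version 1.0: jpeg's original g.e-hentai downloader fixed stopped working on 8/16/2009, because g.e-hentai.org changed the img ID so that it changes on every refresh.
Based on the versions by heroboy and jpeg, I changed the matching in parse_html and the element lookup in build_ui(), and added an XPath lookup.
The changes are small. Thanks to heroboy and jpeg for their contributions; I only changed a small piece of the code and added a bit of XPath.
Introduction: (copied directly from heroboy, ^_^)
Batch-download pictures from g.e-hentai.org.
This script does not download the pictures directly; it generates the picture URLs on the page. Use the "download all links on this page" feature of a download tool to download them.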
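Listing the collected URLs as plain links, so that a download tool can pick them up, could be sketched as follows. The real build_ui() output is not shown here; buildLinkList and escapeHtml are hypothetical helpers, and the numbered label is only an assumption about what might be useful to display.

```javascript
// Hypothetical helper: escape characters that are special in HTML.
function escapeHtml(text) {
  return text
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// Hypothetical helper: turn a list of picture URLs into one anchor per line,
// numbered in the order the URLs were collected.
function buildLinkList(imageUrls) {
  return imageUrls
    .map((url, i) => {
      const safe = escapeHtml(url);
      return `<a href="${safe}">${i + 1}: ${safe}</a><br>`;
    })
    .join('\n');
}
```

In a userscript the resulting string could be assigned to the innerHTML of a container element added to the page.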
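Usage:
On a page that shows a picture, for example: http://g.e-hentai.org/s/8b49e26ada17ef47492234f...
there will be a download button. Click it and the script will keep generating picture URLs (starting from the current picture and repeatedly fetching the next page).
If an error occurs during downloading, a continue button will appear; click it to continue.
If URL generation stops for a long time and no continue button appears, an unhandled error may have occurred. Please first save the URLs generated so far, then go to the page where it failed and click download to continue.
In my own tests, when batch-downloading with a download tool, the picture URLs after roughly the 100th picture stop working (the downloaded pictures are all the same 9k gif). Just regenerate the download URLs.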
quote(by jpeg)
Instructions:
Open an image in a gallery and you'll notice a 'Download' button above the picture. Click on it and it should start grabbing links from the gallery. After grabbing 50 pictures or so the server would temporarily block you from grabbing more links. Wait for about 10 seconds and click on 'Continue' to resume. You may have to repeat this a few times on a big gallery.
After grabbing all the links use addons like DownThemAll! or FlashGot to download all the links. The name of the pictures you download will be random and out of order, so you need to find a way to rename them. For example, use *text*.*ext* mask in DownThemAll!
If you have a better way to sort them, please post them in Discussions. Also if you have any small improvements you want to add to the code, please post it in the forum and I'll add it to the script.
PS:
If all the images you downloaded turned out to be "Enable your cookies" images then you need to enable 3rd party cookies in your browser or try a different program to download the links.



