最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

javascript - How to crawl through a website and take screenshots of each page? - Stack Overflow

programmeradmin3浏览0评论

My task is to take screenshots of a set of webpages for legal purposes. I'm currently acplishing this by using Webshot and running through its batch operation using a list of URLs I piled myself. I also have to take screenshots of a rotating Flash banner which is done by using IECapt and its delay function.

Each page has a series of ads that require a screenshot taken of the page they lead to. How can I automate this so that I'm not spending hours clicking on each link after link?

In case that was not clear, the homepage has 4 secondary ads linking to 4 different product pages. I need a screenshot of the homepage (done) and then four different screenshots of the pages that the ads link to.

Oh, and to make things interesting, the homepages use Javascript redirects a la:

javascript:dcsOnsiteAdLink('url', '/whyjoin', 'ad', 'S1', 'popup', 'toolbar=yes,location=yes,directories=yes,status=yes,menubar=yes,scrollbars=yes,resizable=yes,width=780,height=500');

My task is to take screenshots of a set of webpages for legal purposes. I'm currently acplishing this by using Webshot and running through its batch operation using a list of URLs I piled myself. I also have to take screenshots of a rotating Flash banner which is done by using IECapt and its delay function.

Each page has a series of ads that require a screenshot taken of the page they lead to. How can I automate this so that I'm not spending hours clicking on each link after link?

In case that was not clear, the homepage has 4 secondary ads linking to 4 different product pages. I need a screenshot of the homepage (done) and then four different screenshots of the pages that the ads link to.

Oh, and to make things interesting, the homepages use Javascript redirects a la:

javascript:dcsOnsiteAdLink('url', '/whyjoin', 'ad', 'S1', 'popup', 'toolbar=yes,location=yes,directories=yes,status=yes,menubar=yes,scrollbars=yes,resizable=yes,width=780,height=500');
Share Improve this question asked Jun 10, 2009 at 14:32 tridiumtridium 3211 gold badge5 silver badges12 bronze badges
Add a ment  | 

3 Answers 3

Reset to default 6

Pay someone 3 USD / hour to do it for you through Elance or Mechanical Turk. Even if you'd spend only 4 hours on hacking together a script, that'd cost more than paying someone a few bucks for it.

gnome-web-photo uses the Geko engine to render and take a snap of a web page. It is pretty nifty and scriptable.

Adobe Acrobat Pro is pretty good at this. It can grab an entire site and make it into a single indexed document. Screen shots will only get you what's on the screen, Acrobat Pro can get you the whole page.

发布评论

评论列表(0)

  1. 暂无评论