option to stop crawling
Reported by rb2k | May 6th, 2010 @ 10:56 AM | in 0.5.0
It would be nice if there was a anemone.stop_crawl method that
could be called from e.g. within the on_every_page block.
In my case, I simply want to stop crawling after I have found
something on a page (in my case: more than x links to other
domains)
Comments and changes to this ticket
-
chris (at chriskite) May 25th, 2010 @ 09:06 PM
- Milestone set to 0.5.0
- Assigned user set to chris (at chriskite)
Please Sign in or create a free account to add a new ticket.
With your very own profile, you can contribute to projects, track your activity, watch tickets, receive and update tickets through your email and much more.
Create your profile
Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป
Anemone is a Ruby library that makes it quick and painless to write programs that spider a website. It provides a simple DSL for performing actions on every page of a site, skipping certain URLs, and calculating the shortest path to a given page on a site.