#9 ✓resolved
hayato

limit-depth crawling

Reported by hayato | September 6th, 2009 @ 12:31 AM

I suggest adding a new feature to Anemone.

Problem
Anemone 0.1.2 follows every link within the same domain. For some root URLs, Anemone crawls too many pages and takes a long time.

Suggestion
Add an option to Anemone that limits how many links deep it follows from the root URL.

I have attached a proof-of-concept implementation and RSpec specs.
Please consider my suggestion.
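
To illustrate the idea, here is a minimal sketch of a depth-limited breadth-first crawl. It is not the attached implementation and does not use Anemone's API: the `LINKS` table, the `crawl` method, and the `depth_limit` parameter are all hypothetical names, and an in-memory hash stands in for fetching real pages. It assumes the root URL is depth 0 and that pages at the limit are still visited, but their links are not followed.

```ruby
require 'set'

# Hypothetical in-memory site: each URL maps to the links found on that page.
LINKS = {
  '/'      => ['/a', '/b'],
  '/a'     => ['/a/1'],
  '/b'     => ['/'],
  '/a/1'   => ['/a/1/x'],
  '/a/1/x' => []
}

# Breadth-first crawl starting at root (depth 0).
# Links on a page are followed only while that page's depth is below depth_limit.
# Returns an array of [url, depth] pairs in visit order.
def crawl(root, depth_limit, links = LINKS)
  visited = Set.new([root])
  queue   = [[root, 0]]
  pages   = []

  until queue.empty?
    url, depth = queue.shift
    pages << [url, depth]

    # Pages at the limit are recorded, but their links are not followed.
    next if depth >= depth_limit

    links.fetch(url, []).each do |link|
      next if visited.include?(link)
      visited << link
      queue << [link, depth + 1]
    end
  end

  pages
end

crawl('/', 1).each { |url, depth| puts "#{depth} #{url}" }
```

With a limit of 1, the sketch visits only the root and the pages it links to directly. A limit of 0 visits the root alone.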



Anemone is a Ruby library that makes it quick and painless to write programs that spider a website. It provides a simple DSL for performing actions on every page of a site, skipping certain URLs, and calculating the shortest path to a given page on a site.
