Welcome to Django-Magellan’s documentation!
Contents:
What is Magellan?
Magellan is a 100% Python search engine and spider app for Django.
Think of it as a mini intranet search appliance, but for the internet.
How does it work?
Magellan spiders the sites you specify in the Django admin and indexes their page content with Whoosh.
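To make the spider-then-index flow concrete, here is a minimal sketch that uses only the standard library. It is not Magellan's code: the `PAGES` dictionary stands in for real HTTP fetches, the dict-based inverted index stands in for Whoosh, and the names `LinkAndTextParser` and `crawl_and_index` are hypothetical.

```python
import re
from collections import defaultdict, deque
from html.parser import HTMLParser

# Hypothetical in-memory "site" so the sketch runs without network access.
PAGES = {
    "/": '<a href="/about">About</a> <p>Welcome to the intranet</p>',
    "/about": '<a href="/">Home</a> <p>About the search team</p>',
}


class LinkAndTextParser(HTMLParser):
    """Collects href targets and text nodes from one page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.text.append(data)


def crawl_and_index(start, pages):
    """Breadth-first crawl from `start`; returns a word -> set-of-URLs index."""
    index = defaultdict(set)  # toy stand-in for a Whoosh index
    seen = {start}
    queue = deque([start])
    while queue:
        url = queue.popleft()
        parser = LinkAndTextParser()
        parser.feed(pages[url])
        # Index every lowercased word found on the page.
        for word in re.findall(r"\w+", " ".join(parser.text).lower()):
            index[word].add(url)
        # Queue links that we know about and have not visited yet.
        for link in parser.links:
            if link in pages and link not in seen:
                seen.add(link)
                queue.append(link)
    return index


index = crawl_and_index("/", PAGES)
print(sorted(index["about"]))  # ['/', '/about']
```

A real spider would also have to fetch pages over HTTP, resolve relative URLs, and stay within the configured sites; those parts are left out here.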
Features
- Application agnostic. Magellan will spider anything you have access to.
- Pure Python. No dependencies on external services like Solr.
- Portable. Load Magellan into a relocatable virtualenv with SQLite, and you can carry your search engine on a USB drive.
- Multithreaded spidering, for speed.
- Naive and extensible. Have a site you want to index? Write your own content extractor to scrape just the parts you care about.
- Authenticates. Currently supports form-based authentication. OAuth and HTTP auth to follow.
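As an illustration of the "write your own content extractor" idea, the sketch below keeps only the text inside one element of a page and ignores navigation and footer markup. It is an assumption-laden example, not Magellan's extractor interface: `ElementTextExtractor`, `extract_content`, and the `content` element id are hypothetical, and only the standard library is used.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class ElementTextExtractor(HTMLParser):
    """Collects text found inside the element with a given id attribute."""

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0  # 0 means "outside the target element"
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


def extract_content(html, target_id="content"):
    """Return whitespace-normalized text from the element with `target_id`."""
    parser = ElementTextExtractor(target_id)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


html = """
<html><body>
  <div id="nav"><a href="/">Home</a></div>
  <div id="content"><h1>Release notes</h1><p>Spidering is now faster.</p></div>
  <div id="footer">Copyright</div>
</body></html>
"""
print(extract_content(html))  # Release notes Spidering is now faster.
```

Tracking nesting depth rather than matching tag names keeps the extractor small, at the cost of assuming reasonably well-formed HTML.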