Indexing Web Content with PHP and SWISH-E

In a previous article, I demonstrated how to build a simple email search system in PHP using two indexing tools, Sphinx and the Zend Framework's Lucene implementation. Now, while these are undoubtedly two of the most popular tools available, open source is all about choice... and so it should come as no surprise that there exist a number of alternatives, many of them equaling Sphinx and Zend_Lucene in sophistication and speed.

This article deals with one such alternative, SWISH-E, aka the Simple Web Indexing System for Humans - Enhanced. As the name suggests, SWISH-E is particularly good at indexing Web content, be it in text, HTML, XML, PDF or DOC format. If you're trying to add a full-text search engine to your Web site, but don't really want to spend too much time on configuration and data processing, this might just be the thing you're looking for. Come on in, and find out more!