ProFusion: Intelligent Fusion from Multiple, Distributed Search Engines
Susan Gauch (Department of Electrical Engineering and Computer Science, The University of Kansas, USA)
Guijun Wang (Department of Electrical Engineering and Computer Science, The University of Kansas, USA)
Mario Gomez (Department of Electrical Engineering and Computer Science, The University of Kansas, USA)
Abstract: The explosive growth of the World Wide Web, and the resulting information overload, has led to a mini-explosion in World Wide Web search engines. This mini-explosion, in turn, led to the development of ProFusion, a meta search engine. Educators, like other users, do not have the time to evaluate multiple search engines to knowledgeably select the best for their uses. Nor do they have the time to submit each query to multiple search engines and wade through the resulting flood of good information, duplicated information, irrelevant information, and missing documents. ProFusion sends user queries to multiple underlying search engines in parallel, retrieves and merges the resulting URLs. It identifies and removes duplicates and creates one relevance-ranked list. If desired, the actual documents can be pre-fetched to remove yet more duplicates and broken links. ProFusion s performance has been compared to the individual search engines and other meta searchers, demonstrating its ability to retrieve more relevant information and present fewer duplicates pages. The system can automatically analyze queries to identify its topic(s) and, based on that analysis, select the most appropriate search engines for the query.
1.) This paper is an extension of "Information Fusion with ProFusion", Susan Gauch and Guijun Wang, WebNet '96: The First World Conference of the Web Society, San Francisco, CA, October 1996. http://www.designlab.ukans.edu/ProFusion.html