Woogle4MediaWiki/Decider manual

From TeamWeaverWiki

(Difference between revisions)
Jump to: navigation, search
(Created page with ' == Capabilities == === Features === * Searching the Wiki and external content ** Keyword-search with wildcards, phrases etc. (Lucene syntax) ** Search external data from within…')
Line 1: Line 1:
-
 
== Capabilities ==
== Capabilities ==
Line 21: Line 20:
* Accesses all pages in all namespaces - currently no configuration possible (but can be relatively easy changed in the code)
* Accesses all pages in all namespaces - currently no configuration possible (but can be relatively easy changed in the code)
* Does not respect any read restrictions/access rights on MediaWiki content (certain access control is possible in WoogleRemote using an [[Integrated Search|Integrated search backend]])
* Does not respect any read restrictions/access rights on MediaWiki content (certain access control is possible in WoogleRemote using an [[Integrated Search|Integrated search backend]])
 +
* For minor limitions of specific releace also consult the [[Woogle4MediaWiki/Release Notes]]
 +
 +
=== Comparisons ===
 +
Options for search in MediaWiki are described on [http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki#See_also Woogle's MediaWiki.org page] and at http://www.wiki4enterprise.org/index.php/Suche_in_Wikis (in German).
== Performance ==
== Performance ==

Revision as of 10:58, 18 March 2010

Contents

Capabilities

Features

  • Searching the Wiki and external content
    • Keyword-search with wildcards, phrases etc. (Lucene syntax)
    • Search external data from within MediaWiki (in WoogleRemote using Integrated search backend)
    • Autocomplete of search results
    • Seamless integration in MediaWiki (replaces built-in search; can be disabled by individual users via MediaWiki preferences)
    • Social search (Describe and discuss queries, (Social ranking))
    • Searches all namespaces
  • Need guidance
    • Popups for "Red links" provide meta-information on need
    • Statistics of unsatisfied queries
    • ...
  • Convenience features
    • Supports different languages (en and de supplied by default, further languages can easily be supported)
    • Native PHP extension (in WoogleNative mode)

Limitations

  • Accesses all pages in all namespaces - currently no configuration possible (but can be relatively easy changed in the code)
  • Does not respect any read restrictions/access rights on MediaWiki content (certain access control is possible in WoogleRemote using an Integrated search backend)
  • For minor limitions of specific releace also consult the Woogle4MediaWiki/Release Notes

Comparisons

Options for search in MediaWiki are described on Woogle's MediaWiki.org page and at http://www.wiki4enterprise.org/index.php/Suche_in_Wikis (in German).

Performance

  • WoogleRemote
    • Integrated Search is the core determinant for indexing and query performance - it can typically handle large amounts of data
    • Communication and even network latency between the PHP/MediaWiki frontend and the Java/TeamWeaverIS backend is not an issue to our experience
  • WoogleNative
    • On state-of-the-art hardware, we were able to work with a 50,000 pages subset of Wikipedia, serving queries well below one second
    • However, we did not yet test this scale with massive requests
    • We recommend to restrict search-term autocomplete to title matches only for Wikis larger than 10,000 pages for performance reasons