|
Integrated Search/How to write a custom crawlerFrom TeamWeaverWiki(Difference between revisions)
ChristianRoehr (Talk | contribs)
(Created page with 'This short guide describes how to write crawlers to make additional data sources available for searching with TeamWeaver Integrated Search. A list of existing crawlers can b…') Newer edit → Revision as of 09:24, 17 June 2009This short guide describes how to write crawlers to make additional data sources available for searching with TeamWeaver Integrated Search. A list of existing crawlers can be found at: supported data sources resp. repo_config.xml. There are two general strategies for crawling data sources with TeamWeaverIS:
Writing a "Pull"-CrawlerAt the basic level, creating a pull-crawler requires to implement two Java classes and changing two XML-files - a Crawler and a Processor each. Crawlers are classes which access a data source and extract/create single data items. Processors act upon these items to prepare them for feeding into the index. Crawler
Processor
Advanced topics
Writing a "Push"-Crawler
|