Class Zend_Search_Lucene_Analysis_TokenFilter_StopWords

Description

Token filter that removes stop words. The words must be provided as an array used as a set (words as keys), for example: $stopwords = array('the' => 1, 'an' => 1);

We recommend providing all words in lowercase and chaining this filter after the lowercase filter, so that terms are already lowercased when they are compared against the set.
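The set format and the recommended ordering can be illustrated with a minimal, self-contained sketch. It does not use the Zend classes: `filterTerms()` is a hypothetical helper that works on plain strings rather than token objects, and it only assumes that stop words are stored as array keys, as in the example above.

```php
<?php
// Hypothetical stand-in for the filter's lookup; not the Zend implementation.
// Stop words are stored as array keys, so membership is a key lookup.
$stopwords = array('the' => 1, 'an' => 1);

/**
 * Returns the terms that survive: lowercase first, then drop stop words.
 */
function filterTerms(array $terms, array $stopwords)
{
    $kept = array();
    foreach ($terms as $term) {
        $term = strtolower($term);                    // lowercase step runs first
        if (!array_key_exists($term, $stopwords)) {   // key lookup, not a value search
            $kept[] = $term;
        }
    }
    return $kept;
}

$result = filterTerms(array('The', 'quick', 'fox', 'an'), $stopwords);
// $result is array('quick', 'fox')
```

If the lowercase step ran after the stop word check instead, 'The' would not match the key 'the' and would stay in the output, which is why the ordering matters.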

  • license: New BSD License
  • copyright: Copyright (c) 2005-2007 Zend Technologies USA Inc. (http://www.zend.com)

Located in /Zend/Search/Lucene/Analysis/TokenFilter/StopWords.php (line 41)

Zend_Search_Lucene_Analysis_TokenFilter
   |
   --Zend_Search_Lucene_Analysis_TokenFilter_StopWords
Method Summary
 Zend_Search_Lucene_Analysis_TokenFilter_StopWords __construct ([array $stopwords = array()])
 void loadFromFile ([string $filepath = null])
Methods
Constructor __construct (line 54)

Constructs a new instance of this filter.

  • access: public
Zend_Search_Lucene_Analysis_TokenFilter_StopWords __construct ([array $stopwords = array()])
  • array $stopwords: array (set) of words that will be filtered out
loadFromFile (line 82)

Fills the stopwords set from a text file. Each line contains one stopword; lines with '#' in the first column are ignored as comments.

You can call this method one or more times. New stopwords are always added to the current set.

  • access: public
  • throws: Zend_Search_Exception When the file doesn't exist or is not readable.
void loadFromFile ([string $filepath = null])
  • string $filepath: full path to the text file with stopwords
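The file format described above can be sketched as follows. This is not the Zend implementation: `addStopwordsFromFile()` is a hypothetical helper, `RuntimeException` stands in for Zend_Search_Exception, and skipping blank lines is an assumption the description does not state.

```php
<?php
/**
 * Hypothetical helper, not the Zend implementation: merges stop words
 * from a text file into an existing set (words stored as array keys).
 */
function addStopwordsFromFile($filepath, array $stopSet)
{
    if (!is_file($filepath) || !is_readable($filepath)) {
        throw new RuntimeException("Cannot read stop word file: $filepath");
    }
    foreach (file($filepath, FILE_IGNORE_NEW_LINES) as $line) {
        // '#' in the first column marks a comment line.
        if ($line !== '' && $line[0] === '#') {
            continue;
        }
        $word = trim($line);
        if ($word !== '') {          // skipping blank lines is an assumption
            $stopSet[$word] = 1;     // new words are added to the existing set
        }
    }
    return $stopSet;
}

// Demo with a temporary file holding one comment line and two stop words.
$path = tempnam(sys_get_temp_dir(), 'stop');
file_put_contents($path, "# articles\nthe\nan\n");
$set = addStopwordsFromFile($path, array('of' => 1));
unlink($path);
// array_keys($set) is array('of', 'the', 'an')
```

Calling the helper again with another file and the returned set would keep accumulating words, mirroring the documented behaviour that repeated calls add to the current set.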
normalize (line 64)

Normalizes the token or removes it: returning null removes the token from the stream.

  • access: public

Redefinition of:
Zend_Search_Lucene_Analysis_TokenFilter::normalize()
Normalize Token or remove it (if null is returned)
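The "null means remove" contract can be sketched with a small filter chain. The names here (`SketchStopWordsFilter`, `applyFilters()`) are hypothetical and are not part of Zend_Search_Lucene; for brevity the sketch passes plain strings instead of token objects.

```php
<?php
// Hypothetical stand-in, not a Zend class: a filter whose normalize()
// returns null to signal that the term should be dropped.
class SketchStopWordsFilter
{
    private $stopSet;

    public function __construct(array $stopSet)
    {
        $this->stopSet = $stopSet;
    }

    public function normalize($term)
    {
        return array_key_exists($term, $this->stopSet) ? null : $term;
    }
}

/**
 * Runs each term through the filters in order; a null result removes it.
 */
function applyFilters(array $terms, array $filters)
{
    $out = array();
    foreach ($terms as $term) {
        foreach ($filters as $filter) {
            $term = $filter->normalize($term);
            if ($term === null) {
                continue 2;   // term removed, skip the remaining filters
            }
        }
        $out[] = $term;
    }
    return $out;
}

$filter = new SketchStopWordsFilter(array('the' => 1));
$kept = applyFilters(array('the', 'cat'), array($filter));
// $kept is array('cat')
```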

Inherited Methods

Inherited From Zend_Search_Lucene_Analysis_TokenFilter

 Zend_Search_Lucene_Analysis_TokenFilter::normalize()

Documentation generated on Wed, 21 Feb 2007 12:00:26 -0800 by phpDocumentor 1.3.1