An Analyzer is used to analyze text.
It thus represents a policy for extracting index terms from text.
Note: The Lucene Java implementation is stream-oriented, which lets it work efficiently with huge documents (more than 20 MB). The engine itself, however, is not oriented toward such documents, so the Zend_Search_Lucene analysis API works with data strings and sets (arrays).
Located in /Zend/Search/Lucene/Analysis/Analyzer.php (line 68)
Class | Description |
---|---|
 | Common implementation of the Zend_Search_Lucene_Analysis_Analyzer interface. |
Return the default Analyzer implementation used by indexing code.
Set the default Analyzer implementation used by indexing code.
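The get/set pair described above amounts to a process-wide default that indexing code can look up and callers can swap. The following is only a minimal sketch of that pattern, not the Zend source: the class names `SketchAnalyzer`, `SketchLowerCaseAnalyzer`, and `SketchCaseSensitiveAnalyzer`, the method names `getDefault()` and `setDefault()`, and the lazy fallback are all assumptions made for illustration.

```php
<?php
// Hypothetical stand-in for an abstract analyzer class; not the Zend source.
abstract class SketchAnalyzer
{
    /** @var SketchAnalyzer|null Process-wide default, created lazily. */
    private static $_default = null;

    /** Tokenize a string into an array of term strings. */
    abstract public function tokenize($data);

    /** Return the default analyzer, creating a fallback on first use. */
    public static function getDefault()
    {
        if (self::$_default === null) {
            self::$_default = new SketchLowerCaseAnalyzer();
        }
        return self::$_default;
    }

    /** Replace the default analyzer used by later callers. */
    public static function setDefault(SketchAnalyzer $analyzer)
    {
        self::$_default = $analyzer;
    }
}

// Assumed policy 1: ASCII letters only, folded to lower case.
class SketchLowerCaseAnalyzer extends SketchAnalyzer
{
    public function tokenize($data)
    {
        preg_match_all('/[a-z]+/', strtolower($data), $matches);
        return $matches[0];
    }
}

// Assumed policy 2: ASCII letters only, case preserved.
class SketchCaseSensitiveAnalyzer extends SketchAnalyzer
{
    public function tokenize($data)
    {
        preg_match_all('/[a-zA-Z]+/', $data, $matches);
        return $matches[0];
    }
}

// Terms produced by the fallback default, then by a replacement policy.
$before = SketchAnalyzer::getDefault()->tokenize('Hello World');
SketchAnalyzer::setDefault(new SketchCaseSensitiveAnalyzer());
$after = SketchAnalyzer::getDefault()->tokenize('Hello World');
```

Because the default is shared state, whichever policy is set at indexing time would normally also need to be in effect at query time, otherwise the terms extracted from queries will not match the terms stored in the index.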
Tokenization stream API. Get the next token. Returns null at the end of the stream.
Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)
Reset token stream
Tokenization stream API. Set the input.
Tokenize text into terms. Returns an array of Zend_Search_Lucene_Analysis_Token objects.
Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)
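The tokenization stream operations described above (set input, reset, get next token, tokenize) fit together roughly as in the sketch below. It is an illustration under stated assumptions rather than the library's code: `SketchToken` and `SketchTextAnalyzer` are hypothetical classes, the method names and signatures are chosen for the example, and the tokenizer only recognizes ASCII letters, so it sidesteps the UTF-8 conversion the real API performs.

```php
<?php
// Hypothetical token value object: term text plus byte offsets in the input.
class SketchToken
{
    public $text;
    public $start;
    public $end;

    public function __construct($text, $start, $end)
    {
        $this->text  = $text;
        $this->start = $start;
        $this->end   = $end;
    }
}

// Hypothetical analyzer; works on a whole string, not on a stream resource.
class SketchTextAnalyzer
{
    private $_input = '';
    private $_position = 0;

    /** Set the input string and rewind to its beginning. */
    public function setInput($data)
    {
        $this->_input = $data;
        $this->reset();
    }

    /** Rewind so that the next call to nextToken() starts over. */
    public function reset()
    {
        $this->_position = 0;
    }

    /** Return the next token, or null at the end of the input. */
    public function nextToken()
    {
        $found = preg_match('/[a-zA-Z]+/', $this->_input, $match,
                            PREG_OFFSET_CAPTURE, $this->_position);
        if ($found !== 1) {
            return null;
        }
        $text  = $match[0][0];
        $start = $match[0][1];
        $this->_position = $start + strlen($text);
        return new SketchToken(strtolower($text), $start, $this->_position);
    }

    /** Convenience wrapper: set the input, then drain the token stream. */
    public function tokenize($data)
    {
        $this->setInput($data);
        $tokens = array();
        while (($token = $this->nextToken()) !== null) {
            $tokens[] = $token;
        }
        return $tokens;
    }
}

$analyzer = new SketchTextAnalyzer();
$tokens = $analyzer->tokenize('Index THIS text');

// Collect just the term text from each token object.
$terms = array();
foreach ($tokens as $token) {
    $terms[] = $token->text;
}
```

In this sketch `tokenize()` is built entirely on the stream calls, so a subclass would only need to supply its own `nextToken()` and `reset()` to change the extraction policy.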
Documentation generated on Wed, 21 Feb 2007 11:45:43 -0800 by phpDocumentor 1.3.1