org.crosswire.jsword.index.lucene.analysis
Class ChineseLuceneAnalyzer

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
          extended by org.crosswire.jsword.index.lucene.analysis.ChineseLuceneAnalyzer

public class ChineseLuceneAnalyzer
extends AbstractBookAnalyzer

Uses org.apache.lucene.analysis.cn.ChineseAnalyzer.
Analysis: ChineseTokenizer, ChineseFilter StopFilter; stemming is not implemented yet.
Note: org.apache.lucene.analysis.cn.CJKAnalyzer takes an overlapping two-character tokenization approach, which leads to a larger index size.
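
The index-size note above can be illustrated with a small sketch. It does not use Lucene: the class TokenizationSketch and its methods are hypothetical stand-ins that imitate the two strategies on text made up only of Han characters (one token per character versus overlapping two-character tokens). The real tokenizers also handle Latin letters, digits and punctuation, which this sketch ignores.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;

// Hypothetical illustration only; not part of JSword or Lucene.
public class TokenizationSketch {

    // One token per code point: a simplified stand-in for
    // single-character tokenization of Han text.
    static List<String> singleCharTokens(String text) {
        List<String> tokens = new ArrayList<>();
        text.codePoints().forEach(cp -> tokens.add(new String(Character.toChars(cp))));
        return tokens;
    }

    // Every adjacent pair of code points becomes a token: a simplified
    // stand-in for overlapping two-character (bigram) tokenization.
    static List<String> overlappingBigrams(String text) {
        int[] cps = text.codePoints().toArray();
        List<String> tokens = new ArrayList<>();
        for (int i = 0; i + 1 < cps.length; i++) {
            tokens.add(new String(cps, i, 2));
        }
        return tokens;
    }

    // Number of distinct terms, i.e. the size of the term dictionary
    // these tokens would contribute to an index.
    static int distinctTerms(List<String> tokens) {
        return new HashSet<>(tokens).size();
    }

    public static void main(String[] args) {
        // Nine Han characters, written as Unicode escapes so the source
        // compiles regardless of file encoding. Two characters repeat.
        String sample = "\u795E\u8AAA\u8981\u6709\u5149\u5C31\u6709\u4E86\u5149";

        List<String> single = singleCharTokens(sample);
        List<String> bigrams = overlappingBigrams(sample);

        System.out.println("single-char tokens: " + single.size()
                + ", distinct terms: " + distinctTerms(single));
        System.out.println("bigram tokens: " + bigrams.size()
                + ", distinct terms: " + distinctTerms(bigrams));
    }
}
```

For this nine-character sample the single-character approach yields 9 tokens but only 7 distinct terms, while the bigram approach yields 8 tokens that are all distinct. On a real corpus the gap in distinct terms is typically much wider, because the number of possible character pairs grows far faster than the number of characters; the actual index sizes depend on the text being indexed.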

Author:
Sijo Cherian [sijocherian at yahoo dot com]
See Also:
for license details.
The copyright to this program is held by its authors.

Field Summary
private  org.apache.lucene.analysis.cn.ChineseAnalyzer myAnalyzer
           
 
Fields inherited from class org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
book, doStemming, doStopWords, naturalLanguage, stopSet
 
Constructor Summary
ChineseLuceneAnalyzer()
           
 
Method Summary
 org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
           
 
Methods inherited from class org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
getBook, getDoStopWords, getNaturalLanguage, setBook, setDoStemming, setDoStopWords, setNaturalLanguage, setStopWords
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

myAnalyzer

private org.apache.lucene.analysis.cn.ChineseAnalyzer myAnalyzer
Constructor Detail

ChineseLuceneAnalyzer

public ChineseLuceneAnalyzer()
Method Detail

tokenStream

public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName,
                                                                Reader reader)
Specified by:
tokenStream in class org.apache.lucene.analysis.Analyzer

Copyright © 2003-2007