org.crosswire.jsword.index.lucene.analysis
Class ConfigurableSnowballAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
org.crosswire.jsword.index.lucene.analysis.ConfigurableSnowballAnalyzer
public class ConfigurableSnowballAnalyzer
- extends AbstractBookAnalyzer
An Analyzer that applies Snowball stemming.
Default behavior: stemming is applied; stop words are not removed.
The Snowball stemmer is selected by passing the stemmer name to the setNaturalLanguage() method.
The following stemmer names are currently accepted (the stemmers available in the Lucene Snowball package net.sf.snowball.ext):
Danish
Dutch
English
Finnish
French
German2
German
Italian
Kp
Lovins
Norwegian
Porter
Portuguese
Russian
Spanish
Swedish
This list is expected to expand as the Snowball project adds support for more languages.
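As an illustration of how a stemmer name could be checked against the list above, here is a minimal sketch using only the JDK. The class holds a private static Pattern named allowedStemmers, but its actual expression is not shown on this page, so the pattern below, the class name StemmerNameCheck, and the method isAllowed are all assumptions made for the example, not the JSword source.

```java
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration; not part of JSword or Lucene.
final class StemmerNameCheck {
    // Assumed pattern: one alternative per stemmer name documented above.
    private static final Pattern ALLOWED = Pattern.compile(
        "Danish|Dutch|English|Finnish|French|German2|German|Italian|Kp|Lovins|"
        + "Norwegian|Porter|Portuguese|Russian|Spanish|Swedish");

    // Returns true only when the whole string is one of the listed names.
    // matches() anchors at both ends, so "Germanic" or "english" are rejected.
    static boolean isAllowed(String name) {
        return name != null && ALLOWED.matcher(name).matches();
    }

    public static void main(String[] args) {
        System.out.println(isAllowed("English"));
        System.out.println(isAllowed("Klingon"));
    }
}
```

In this sketch the comparison is case-sensitive, which mirrors the capitalized names in the list; whether the real analyzer is equally strict is not stated here.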
- Author:
- sijo cherian [sijocherian at yahoo dot com]
- See Also:
for license details.
The copyright to this program is held by its authors.
Method Summary
void setNaturalLanguage(String name)
org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
Filters StandardTokenizer with StandardFilter, LowerCaseFilter, StopFilter (if enabled), and SnowballFilter.
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
allowedStemmers
private static Pattern allowedStemmers
defaultStopWordMap
private static HashMap defaultStopWordMap
stemmerName
private String stemmerName
ConfigurableSnowballAnalyzer
public ConfigurableSnowballAnalyzer()
tokenStream
public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
- Filters StandardTokenizer with StandardFilter, LowerCaseFilter, StopFilter (if enabled), and SnowballFilter.
- Specified by:
tokenStream
in class org.apache.lucene.analysis.Analyzer
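The order of the stages described for tokenStream can be sketched without Lucene on the classpath. The example below is a JDK-only stand-in: it splits text into tokens, lowercases them, drops stop words only when that option is enabled, and stems last. The class FilterChainSketch, the method analyze, and especially toyStem are hypothetical; toyStem is a deliberately crude suffix stripper and is not a Snowball algorithm. The StandardFilter stage has no counterpart in this sketch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration of the stage order only; not JSword or Lucene code.
final class FilterChainSketch {
    static List<String> analyze(String text, boolean removeStopWords, Set<String> stopWords) {
        List<String> out = new ArrayList<>();
        // Tokenizer stand-in: split on anything that is not a letter or digit.
        for (String raw : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield a leading empty string
            }
            String token = raw.toLowerCase(Locale.ROOT);      // lowercase stage
            if (removeStopWords && stopWords.contains(token)) {
                continue;                                      // stop stage, only if enabled
            }
            out.add(toyStem(token));                           // stemming stage, always last
        }
        return out;
    }

    // Toy suffix stripper standing in for a real Snowball stemmer.
    static String toyStem(String token) {
        if (token.length() > 4 && token.endsWith("ing")) {
            return token.substring(0, token.length() - 3);
        }
        if (token.length() > 3 && token.endsWith("s")) {
            return token.substring(0, token.length() - 1);
        }
        return token;
    }

    public static void main(String[] args) {
        Set<String> stop = new HashSet<>(Arrays.asList("the"));
        System.out.println(analyze("The Kings walking", false, stop)); // stop words kept (the documented default)
        System.out.println(analyze("The Kings walking", true, stop));  // stop words removed
    }
}
```

Lowercasing before the stop-word check lets a lowercase stop list match capitalized input, and stemming last keeps the stop list from having to contain stemmed forms.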
setNaturalLanguage
public void setNaturalLanguage(String name)
- Overrides:
setNaturalLanguage
in class AbstractBookAnalyzer