com.atlassian.confluence.search.lucene.tokenizers
Class GenericCharacterTokenizer

java.lang.Object
  extended by org.apache.lucene.analysis.TokenStream
      extended by org.apache.lucene.analysis.Tokenizer
          extended by org.apache.lucene.analysis.CharTokenizer
              extended by com.atlassian.confluence.search.lucene.tokenizers.GenericCharacterTokenizer

public class GenericCharacterTokenizer
extends org.apache.lucene.analysis.CharTokenizer

Character tokenizers that tokenizes on a specified list of delimiters.


Field Summary
 
Fields inherited from class org.apache.lucene.analysis.Tokenizer
input
 
Constructor Summary
GenericCharacterTokenizer(Reader input, char[] delimiters)
           
 
Method Summary
protected  boolean isTokenChar(char c)
           
 
Methods inherited from class org.apache.lucene.analysis.CharTokenizer
next, normalize
 
Methods inherited from class org.apache.lucene.analysis.Tokenizer
close
 
Methods inherited from class org.apache.lucene.analysis.TokenStream
reset
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

GenericCharacterTokenizer

public GenericCharacterTokenizer(Reader input,
                                 char[] delimiters)
Parameters:
input - what to tokenize passed in as a reader
delimiters - array of delimiters to tokenize on (you must specify at least one delimiter)
Throws:
IllegalArgumentException - if no delimiters are specified
Method Detail

isTokenChar

protected boolean isTokenChar(char c)
Specified by:
isTokenChar in class org.apache.lucene.analysis.CharTokenizer


Copyright © 2003-2010 Atlassian. All Rights Reserved.