Class CompoundWordTokenFilterDescriptor

  • All Implemented Interfaces:
    TokenFilterDescriptor

    public final class CompoundWordTokenFilterDescriptor
    extends Object
    implements TokenFilterDescriptor
    A token filter that decomposes compound words found in many Germanic languages based on dictionary.
    Since:
    7.0
    • Constructor Detail

      • CompoundWordTokenFilterDescriptor

        public CompoundWordTokenFilterDescriptor​(Collection<String> dictionary)
        Parameters:
        dictionary - list of common words