Package org.apache.lucene.analysis.hunspell
package org.apache.lucene.analysis.hunspell
Stemming TokenFilter using a Java implementation of the
Hunspell stemming algorithm.
Dictionaries can be found on OpenOffice's wiki
-
ClassesClassDescriptionIn-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.Implementation of
Dictionary.FlagParsingStrategythat assumes each flag is encoded as two ASCII characters whose codes must be combined into a single character.Abstraction of the process of parsing flags taken from the affix and dic filesImplementation ofDictionary.FlagParsingStrategythat assumes each flag is encoded in its numerical form.Simple implementation ofDictionary.FlagParsingStrategythat treats the chars in each String as a individual flags.TokenFilter that uses hunspell affix rules and words to stem tokens.TokenFilterFactory that creates instances ofHunspellStemFilter.Stemmer uses the affix rules declared in the Dictionary to generate one or more stems for a word.