Package org.apache.lucene.analysis.util
Utility functions for text analysis.
Class
    Description

AbstractAnalysisFactory
    Abstract parent class for analysis factories TokenizerFactory, TokenFilterFactory and CharFilterFactory.
AnalysisSPILoader<S extends AbstractAnalysisFactory>
    Helper class for loading named SPIs from classpath (e.g.
CharArrayIterator
    A CharacterIterator used internally for use with BreakIterator.
CharFilterFactory
    Abstract parent class for analysis factories that create CharFilter instances.
CharTokenizer
    An abstract base class for simple, character-oriented tokenizers.
ClasspathResourceLoader
    Simple ResourceLoader that uses ClassLoader.getResourceAsStream(String) and Class.forName(String,boolean,ClassLoader) to open resources and classes, respectively.
ElisionFilter
    Removes elisions from a TokenStream.
ElisionFilterFactory
    Factory for ElisionFilter.
FilesystemResourceLoader
    Simple ResourceLoader that opens resource files from the local file system, optionally resolving against a base directory.
OpenStringBuilder
    A StringBuilder that allows one to access the array.
ResourceLoader
    Abstraction for loading resources (streams, files, and classes).
ResourceLoaderAware
    Interface for a component that needs to be initialized by an implementation of ResourceLoader.
RollingCharBuffer
    Acts like a forever-growing char[] as you read characters into it from the provided reader, but internally it uses a circular buffer to hold only the characters that haven't been freed yet.
SegmentingTokenizerBase
    Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words.
StemmerUtil
    Some commonly used stemming functions.
TokenFilterFactory
    Abstract parent class for analysis factories that create TokenFilter instances.
TokenizerFactory
    Abstract parent class for analysis factories that create Tokenizer instances.
UnicodeProps
    This file contains Unicode properties used by various CharTokenizers.
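
The circular-buffer idea described above (a char[] that appears to grow forever while only the characters not yet freed are retained) can be illustrated with a small standalone sketch. This is not Lucene's implementation: the class name RollingBufferSketch and its methods get, freeBefore and capacity are hypothetical names chosen for illustration, and the code uses only the Java standard library. It assumes positions are absolute offsets into the input and that the buffer doubles when the retained window outgrows it.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

/** Hypothetical sketch of a rolling character buffer; not Lucene's class. */
final class RollingBufferSketch {
    private final Reader reader;
    private char[] buffer = new char[4]; // deliberately small so wrap-around and growth occur
    private int start;                   // absolute position of the oldest retained character
    private int end;                     // absolute position one past the newest character read
    private boolean eof;

    RollingBufferSketch(Reader reader) {
        this.reader = reader;
    }

    /** Returns the character at absolute position pos, or -1 at end of input. */
    int get(int pos) {
        if (pos < start) {
            throw new IllegalArgumentException("position " + pos + " was already freed");
        }
        while (pos >= end) {
            if (eof) {
                return -1;
            }
            int ch;
            try {
                ch = reader.read();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            if (ch == -1) {
                eof = true;
                return -1;
            }
            if (end - start == buffer.length) {
                grow(); // retained window is full, so double the storage
            }
            buffer[end % buffer.length] = (char) ch; // slot wraps around the array
            end++;
        }
        return buffer[pos % buffer.length];
    }

    /** Releases every character before absolute position pos so its slot can be reused. */
    void freeBefore(int pos) {
        if (pos < start || pos > end) {
            throw new IllegalArgumentException("position " + pos + " is outside the retained window");
        }
        start = pos;
    }

    /** Current size of the internal array, exposed only to make the behaviour observable. */
    int capacity() {
        return buffer.length;
    }

    private void grow() {
        char[] bigger = new char[buffer.length * 2];
        for (int i = start; i < end; i++) {
            bigger[i % bigger.length] = buffer[i % buffer.length]; // re-slot by absolute position
        }
        buffer = bigger;
    }

    public static void main(String[] args) {
        RollingBufferSketch buf = new RollingBufferSketch(new StringReader("rolling buffer"));
        StringBuilder seen = new StringBuilder();
        for (int pos = 0; buf.get(pos) != -1; pos++) {
            seen.append((char) buf.get(pos));
            buf.freeBefore(pos); // retain only the current character
        }
        System.out.println(seen + " / capacity " + buf.capacity());
    }
}
```

Because the demo frees characters as it goes, the internal array never needs to grow even though the input is longer than the array. If the caller holds on to old positions instead, the sketch doubles its storage, which is the trade-off a rolling buffer makes between lookback and memory.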