Class FuzzyLikeThisQuery
java.lang.Object
org.apache.lucene.search.Query
org.apache.lucene.sandbox.queries.FuzzyLikeThisQuery
Fuzzifies ALL terms provided as strings and then picks the best n differentiating terms.
In effect this mixes the behaviour of FuzzyQuery and MoreLikeThis but with special consideration
of fuzzy scoring factors.
This generally produces good results for queries where users may provide details in a number of
fields and have no knowledge of boolean query syntax and also want a degree of fuzzy matching and
a fast query.
For each source term the fuzzy variants are held in a BooleanQuery with no coord factor (because
we are not looking for matches on multiple variants in any one doc). Additionally, a specialized
TermQuery is used for variants and does not use that variant term's IDF because this would favour rarer
terms eg misspellings. Instead, all variants use the same IDF ranking (the one for the source query
term) and this is factored into the variant's boost. If the source query term does not exist in the
index the average IDF of the variants is used.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescription(package private) static classprivate static classprivate static class -
Field Summary
FieldsModifier and TypeFieldDescription(package private) Analyzer(package private) ArrayList<FuzzyLikeThisQuery.FieldVals> (package private) boolean(package private) intprivate int(package private) static TFIDFSimilarity -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidAdds user input for "fuzzification"private voidbooleanOverride and implement query instance equivalence properly in a subclass.private booleanequalsTo(FuzzyLikeThisQuery other) inthashCode()Override and implement query hash code properly in a subclass.booleanprivate QuerynewTermQuery(IndexReader reader, Term term) rewrite(IndexReader reader) Expert: called to re-write queries into primitive queries.voidsetIgnoreTF(boolean ignoreTF) Prints a query to a string, withfieldassumed to be the default field and omitted.voidvisit(QueryVisitor visitor) Recurse through the query tree, visiting any child queriesMethods inherited from class org.apache.lucene.search.Query
classHash, createWeight, sameClassAs, toString
-
Field Details
-
sim
-
fieldVals
ArrayList<FuzzyLikeThisQuery.FieldVals> fieldVals -
analyzer
Analyzer analyzer -
MAX_VARIANTS_PER_TERM
int MAX_VARIANTS_PER_TERM -
ignoreTF
boolean ignoreTF -
maxNumTerms
private int maxNumTerms
-
-
Constructor Details
-
FuzzyLikeThisQuery
- Parameters:
maxNumTerms- The total number of terms clauses that will appear once rewritten as a BooleanQuery
-
-
Method Details
-
hashCode
public int hashCode()Description copied from class:QueryOverride and implement query hash code properly in a subclass. This is required so thatQueryCacheworks properly. -
equals
Description copied from class:QueryOverride and implement query instance equivalence properly in a subclass. This is required so thatQueryCacheworks properly. Typically a query will be equal to another only if it's an instance of the same class and its document-filtering properties are identical that other instance. Utility methods are provided for certain repetitive code. -
equalsTo
-
addTerms
Adds user input for "fuzzification"- Parameters:
queryString- The string which will be parsed by the analyzer and for which fuzzy variants will be parsedminSimilarity- The minimum similarity of the term variants; must be 0, 1 or 2 (see FuzzyTermsEnum)prefixLength- Length of required common prefix on variant terms (see FuzzyTermsEnum)
-
addTerms
private void addTerms(IndexReader reader, FuzzyLikeThisQuery.FieldVals f, FuzzyLikeThisQuery.ScoreTermQueue q) throws IOException - Throws:
IOException
-
newTermQuery
- Throws:
IOException
-
visit
Description copied from class:QueryRecurse through the query tree, visiting any child queries -
rewrite
Description copied from class:QueryExpert: called to re-write queries into primitive queries. For example, a PrefixQuery will be rewritten into a BooleanQuery that consists of TermQuerys.- Overrides:
rewritein classQuery- Throws:
IOException
-
toString
Description copied from class:QueryPrints a query to a string, withfieldassumed to be the default field and omitted. -
isIgnoreTF
public boolean isIgnoreTF() -
setIgnoreTF
public void setIgnoreTF(boolean ignoreTF)
-