diff --git a/README.md b/README.md index 89c18b5..14fd8e4 100644 --- a/README.md +++ b/README.md @@ -36,8 +36,9 @@ pip install searcharray * Search w/ a [phrase w/ edit-distance](https://lucene.apache.org/core/9_6_0/core/org/apache/lucene/search/PhraseQuery.html) by passing slop=N. * Access raw stats arrays in termfreqs / docfreqs methods on the array * Tokenize any pandas series, list of strings, by using your own tokenizer of str -> list[str] (`def tokenize(value: str) -> List[str]`) -* Accepts any python function to compute similarity. Here's [one similarity]([https://github.com/softwaredoug/searcharray/blob/main/searcharray/similarity.py](https://github.com/softwaredoug/searcharray/blob/main/searcharray/similarity.py#L103)) -* Scores the entire dataframe, allowing combination w/ other ranking attributes (recency, +* Accepts any python function to compute similarity. Here's [one similarity(https://github.com/softwaredoug/searcharray/blob/main/searcharray/similarity.py#L103) +* Scores the entire dataframe, allowing combination w/ other ranking attributes (recency, popularity, etc) or scores from other fields (ie boolean queries) +* Implement's Solr's [edismax query parser](https://github.com/softwaredoug/searcharray/blob/main/searcharray/solr.py) ## Motivation