"Fossies" - the Fresh Open Source Software Archive

Member "elasticsearch-6.8.15/docs/reference/analysis/tokenfilters/elision-tokenfilter.asciidoc" (3 Mar 2021, 987 Bytes) of package /linux/www/elasticsearch-6.8.15-src.tar.gz:

Elision Token Filter

A token filter which removes elisions. For example, "l’avion" (the plane) will be tokenized as "avion" (plane).

Accepts an articles parameter, which is the set of elided articles to strip from the start of a token. Also accepts articles_case, which indicates whether the filter treats those articles as case insensitive (true means the case of the article is ignored when matching).

For example:

PUT /elision_example
{
    "settings" : {
        "analysis" : {
            "analyzer" : {
                "default" : {
                    "tokenizer" : "standard",
                    "filter" : ["elision"]
                }
            },
            "filter" : {
                "elision" : {
                    "type" : "elision",
                    "articles_case" : true,
                    "articles" : ["l", "m", "t", "qu", "n", "s", "j"]
                }
            }
        }
    }
}
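
To illustrate what the articles and articles_case parameters control, here is a rough Python sketch of the elision logic. It is not Elasticsearch's implementation: the function name strip_elision, the ignore_case argument (standing in for articles_case) and the APOSTROPHES constant are hypothetical names chosen for this example, and the sketch assumes the filter looks at the text before the first apostrophe in each token.

```python
# Illustrative sketch only; all names here are hypothetical, not an Elasticsearch API.

# Assumption: both the straight and the typographic apostrophe mark an elision.
APOSTROPHES = ("'", "\u2019")

# Same article list as in the index settings above.
ARTICLES = ["l", "m", "t", "qu", "n", "s", "j"]


def strip_elision(token, articles, ignore_case=False):
    """Return the token without a leading elided article, if one is present."""
    # Lowercase the article set once when matching should ignore case.
    known = {a.lower() for a in articles} if ignore_case else set(articles)

    for i, ch in enumerate(token):
        if ch in APOSTROPHES:
            prefix = token[:i]
            if ignore_case:
                prefix = prefix.lower()
            # Strip the article and the apostrophe only if the prefix is a known article.
            if prefix in known:
                return token[i + 1:]
            # Only the first apostrophe is considered.
            break
    return token


if __name__ == "__main__":
    for word in ["l\u2019avion", "L'avion", "aujourd'hui"]:
        print(word, "->", strip_elision(word, ARTICLES, ignore_case=True))
```

With ignore_case=True, both "l’avion" and "L'avion" reduce to "avion", while "aujourd'hui" is left alone because "aujourd" is not in the article list. With ignore_case=False, "L'avion" would be left unchanged because the uppercase "L" does not match the lowercase "l" in the list.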