Whitespace analyzer

The whitespace analyzer breaks text into terms whenever it encounters a whitespace character.

Example output

POST _analyze
{
  "analyzer": "whitespace",
  "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
}

The above sentence would produce the following terms:

[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]
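
As a rough illustration of the splitting behavior shown above, the following Python sketch approximates the whitespace analyzer with str.split(). It is an approximation written for this page, not Elasticsearch's implementation: the function name whitespace_terms is hypothetical, Python's definition of whitespace may differ slightly from the tokenizer's, and the sketch ignores details such as the tokenizer's maximum token length and the offsets and positions that the _analyze API also returns.

```python
def whitespace_terms(text):
    """Split text on runs of whitespace, leaving case and punctuation untouched."""
    # str.split() with no arguments splits on any run of whitespace
    # and discards empty strings at the start and end.
    return text.split()


sentence = "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
terms = whitespace_terms(sentence)
print(terms)
```

Note that the terms keep their original case and punctuation (QUICK, Brown-Foxes, bone.), because no token filters run after the split.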

Configuration

The whitespace analyzer is not configurable.

Definition

It consists of:

Tokenizer

Whitespace Tokenizer

If you need to customize the whitespace analyzer, recreate it as a custom analyzer and modify it, usually by adding token filters. The following request recreates the built-in whitespace analyzer, and you can use it as a starting point for further customization:

PUT /whitespace_example
{
  "settings": {
    "analysis": {
      "analyzer": {
        "rebuilt_whitespace": {
          "tokenizer": "whitespace",
          "filter": []
        }
      }
    }
  }
}

Add any token filters to the empty filter array.
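
To make the customization step concrete, here is a minimal Python sketch that builds the same settings body with one token filter added. It assumes the built-in lowercase token filter as the example customization. The helper name build_whitespace_settings is hypothetical, and the script only prints the JSON body rather than sending it to a cluster.

```python
import json


def build_whitespace_settings(filters, analyzer_name="rebuilt_whitespace"):
    """Return a settings body for a custom analyzer built on the whitespace tokenizer."""
    return {
        "settings": {
            "analysis": {
                "analyzer": {
                    analyzer_name: {
                        "tokenizer": "whitespace",
                        # Token filters are applied in the order listed.
                        "filter": list(filters),
                    }
                }
            }
        }
    }


# Hypothetical customization: lowercase every term after splitting.
body = build_whitespace_settings(["lowercase"])
print(json.dumps(body, indent=2))
```

The printed body could be used as the payload of the PUT request above when creating the index.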
