Multiple Capture Groups in RegEx Tokenizer
Currently, the RegEx tokenizer can only create one token per match: either from the whole match or from a single capture group. This makes it impossible to reuse parts of the text for multiple tokens, for example when you want to tokenize "13.3" into "13" as well as "13.3". Our use case is product search, where we want to find a 13.3" device by searching for either 13" or 13.3".
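To illustrate the requested behavior, here is a minimal sketch in plain Python. It is not the tokenizer's actual API: the pattern, the `tokenize` function, and the per-match de-duplication are assumptions made for this example. The idea is that every capture group of a match yields its own token.

```python
import re

# Hypothetical pattern: group 1 is the full number, group 2 is its integer part.
PATTERN = re.compile(r"((\d+)(?:\.\d+)?)")

def tokenize(text):
    """Emit one token per non-empty capture group of every match."""
    tokens = []
    for match in PATTERN.finditer(text):
        seen = set()  # avoid emitting the same token twice for one match
        for group in match.groups():
            if group is not None and group not in seen:
                seen.add(group)
                tokens.append(group)
    return tokens

print(tokenize('13.3" laptop'))
```

With a tokenizer that behaves like this, a single index would contain both "13.3" and "13" for the same product, so either query would find it.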
Marces Engel
shared this idea
Marces Engel commented
This can currently still be done using the multi analyzer feature. However, having only one index would be preferable, as it also makes searching a lot easier.