Tell us how to improve Atlas Search!

Multiple Capture Groups in RegEx Tokenizer

Currently the RegEx tokenizer only supports to either create a token from each match or for one capture group per match. This makes it impossible to reuse parts of the text for multiple tokens, for example when you want to tokenize "13.3" into "13" as well as "13.3" (our use case is searching products where we want to find a 13.3" device by searching for 13" as well as 13.3").

1 vote

Marces Engel shared this idea · May 25, 2023 · Report… · Admin →

An error occurred while saving the comment

Marces Engel commented · May 25, 2023 9:38 AM · Report

Using the multi analyzer feature this can currently still be done, however only having one index would be preferable, as this also makes searching a lot easier.

Submitting...

Signed in as (Sign out)

Close

Atlas Search

Searching…

No results.
Clear search results

Give feedback
- Atlas 1,369 ideas
- Atlas App Services 240 ideas
- Atlas CLI 27 ideas
- Atlas Search 160 ideas
- Atlas Stream Processing 5 ideas
- Charts 252 ideas
- Compass 460 ideas
- Connectors (BI, Kafka, Spark) 42 ideas
- Data Federation and Data Lake 58 ideas
- Database 307 ideas
- Database Tools 1 idea
- Documentation 12 ideas
- Drivers 79 ideas
- Education 40 ideas
- FLE and Queryable Encryption 3 ideas
- MongoDB Analyzer (for .NET) 2 ideas
- MongoDB for VS Code 79 ideas
- MongoDB Shell 38 ideas
- Ops Tools 469 ideas
- Realm 96 ideas
- Relational Migrator 3 ideas
- Support Website 18 ideas
- UI 117 ideas
- Vector Search 8 ideas

Tell us how to improve Atlas Search!

Multiple Capture Groups in RegEx Tokenizer

Feedback

Atlas Search

Feedback and Knowledge Base

Searching…

Give feedback

Multiple Capture Groups in RegEx Tokenizer

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

Atlas Search

Categories

Searching…

Give feedback