This repository was archived by the owner on Sep 21, 2021. It is now read-only.
 <1> Emits the tokens: `i'm`, `happi`, `about`, `fox`

 We can't tell if the document mentions one `fox` or many `foxes`; the word
 `not` is a stopword and is removed, so we can't tell whether the document is
@@ -41,7 +48,7 @@ recall as we can match more loosely, but we have reduced our ability to rank
 documents accurately.

 To get the best of both worlds, we can use <<multi-fields,multifields>> to
-index the `title` field twice: once((("multifields", "using to index a field with two different analyzers"))) with the `english` analyzer and once with
+index the `title` field twice: once with the `english` analyzer and once with
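
The information loss described in this hunk can be illustrated with a toy analyzer. This is only a sketch: it is not the real `english` analyzer, the stopword set is a small subset of the default English list, the stem table is hand-written, and the two input sentences are assumed for illustration.

```python
# Toy illustration only: this is NOT the real `english` analyzer.
# STOPWORDS is a small subset of the default English stopword list, and
# STEMS is a hand-written lookup table standing in for a real stemmer.
STOPWORDS = {"a", "an", "and", "not", "the"}
STEMS = {"foxes": "fox", "happy": "happi"}


def toy_analyze(text):
    """Lowercase, split on whitespace, drop stopwords, apply the toy stem table."""
    tokens = []
    for word in text.lower().split():
        if word in STOPWORDS:
            continue
        tokens.append(STEMS.get(word, word))
    return tokens


# Hypothetical inputs: one negative and plural, one positive and singular.
negative_plural = toy_analyze("I'm not happy about the foxes")
positive_singular = toy_analyze("I'm happy about the fox")

print(negative_plural)
print(positive_singular)
print(negative_plural == positive_singular)  # both sentences yield the same tokens
```

Because both sentences reduce to the same token list, a query cannot distinguish them on this field alone, which is the motivation for also indexing the text with a second analyzer.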
200_Language_intro/20_Configuring.asciidoc (+11 -7)
@@ -2,13 +2,13 @@
 === Configuring Language Analyzers

 While the language analyzers can be used out of the box without any
-configuration, most of them ((("english analyzer", "configuring")))((("language analyzers", "configuring")))do allow you to control aspects of their
+configuration, most of them do allow you to control aspects of their
 behavior, specifically:

 [[stem-exclusion]]
 Stem-word exclusion::
 +
-Imagine, for instance, that users searching for((("language analyzers", "configuring", "stem word exclusion")))((("stemming words", "stem word exclusion, configuring"))) the ``World Health
+Imagine, for instance, that users searching for the ``World Health
 Organization'' are instead getting results for ``organ health.'' The reason
 for this confusion is that both ``organ'' and ``organization'' are stemmed to
 the same root word: `organ`. Often this isn't a problem, but in this
@@ -18,7 +18,7 @@ stemmed.

 Custom stopwords::

-The default list of stopwords((("stopwords", "configuring for language analyzers"))) used in English are as follows:
+The default list of stopwords used in English are as follows:
 +
 a, an, and, are, as, at, be, but, by, for, if, in, into, is, it,
 no, not, of, on, or, such, that, the, their, then, there, these,
@@ -54,13 +54,17 @@ PUT /my_index
 }
 }

-GET /my_index/_analyze?analyzer=my_english <3>
-The World Health Organization does not sell organs.
+GET /my_index/_analyze
+{
+"analyzer": "my_english", <3>
+"text": "The World Health Organization does not sell organs."
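
For readers following the last hunk, here is a minimal sketch of the two request bodies involved, built as plain Python dictionaries and printed as JSON. The index name `my_index`, the analyzer name `my_english`, and the sample text come from the diff. The analyzer definition is not visible in the diff, so the `stem_exclusion` and `stopwords` values below are assumptions chosen for illustration. Nothing is sent to a server.

```python
import json

# Assumed values for illustration: the diff shows only the analyzer's name
# (`my_english`) and the index (`my_index`), not the analyzer definition.
index_settings = {
    "settings": {
        "analysis": {
            "analyzer": {
                "my_english": {
                    "type": "english",
                    # Hypothetical exclusion list: these words are left unstemmed.
                    "stem_exclusion": ["organization", "organizations"],
                    # Hypothetical shortened stopword list that keeps `no` and `not`.
                    "stopwords": ["a", "an", "and", "the"],
                }
            }
        }
    }
}

# Request body in the form the updated hunk uses: the analyzer name moves out
# of the query string and into the JSON body, next to the text to analyze.
analyze_request = {
    "analyzer": "my_english",
    "text": "The World Health Organization does not sell organs.",
}

print("PUT /my_index")
print(json.dumps(index_settings, indent=2))
print("GET /my_index/_analyze")
print(json.dumps(analyze_request, indent=2))
```

Keeping `not` out of the stopword list and excluding `organization` from stemming addresses the two problems the section names: lost negation and the `organ` / `organization` collision.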