1 <h2><a name="query">Query</a></h2>
3 <h3><a name="query-ignored">Long messages and words are ignored</a></h3>
5 Messages longer than 100,000 letters or 500,000 bytes are ignored. Words
6 longer than 40 characters are ignored. Attachments are ignored.
9 <h3><a name="query-term">Single term query</a></h3>
A single term query specifies one term and retrieves all
documents which contain that term. e.g.,
19 <h3><a name="query-and">AND query</a></h3>
An AND query specifies two or more terms and retrieves all
documents which contain all of the terms. You can insert the
<code class="operator">and</code> operator between the terms. e.g.,
You can omit the <code class="operator">and</code> operator. Terms which are
separated by one or more spaces are treated as an AND query.
36 <h3><a name="query-or">OR query</a></h3>
An OR query specifies two or more terms and retrieves all
documents which contain any of the terms. You can insert the
<code class="operator">or</code> operator between the terms.
48 <h3><a name="query-not">NOT query</a></h3>
A NOT query specifies two or more terms and retrieves all
documents which contain the first term but do not contain the
following terms. You can insert the <code class="operator">not</code>
operator between the terms. e.g.,
61 <h3><a name="query-grouping">Grouping</a></h3>
You can group queries by surrounding them with
parentheses. The parentheses should be separated by one or
69 ( Linux or FreeBSD ) and Netscape not Windows
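<p>
The boolean operators and grouping described above can be read as set
operations on the documents that contain each term. The following is a
minimal sketch in Python, not Namazu's implementation: the
<code>index</code> data and the <code>docs</code> helper are hypothetical
and exist only for illustration.
</p>

```python
# Toy inverted index: term -> set of document ids (hypothetical data).
index = {
    "linux": {1, 2, 3},
    "freebsd": {3, 4},
    "netscape": {2, 3, 4, 5},
    "windows": {4, 5},
}


def docs(term):
    """Return the ids of documents containing the term, ignoring case."""
    return index.get(term.lower(), set())


# Linux and Netscape: documents containing both terms.
and_result = docs("Linux") & docs("Netscape")

# Linux or FreeBSD: documents containing either term.
or_result = docs("Linux") | docs("FreeBSD")

# Linux not Windows: documents containing Linux but not Windows.
not_result = docs("Linux") - docs("Windows")

# ( Linux or FreeBSD ) and Netscape not Windows
grouped = ((docs("Linux") | docs("FreeBSD")) & docs("Netscape")) - docs("Windows")

print(sorted(grouped))  # prints [2, 3]
```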
72 <h3><a name="query-phrase">Phrase searching</a></h3>
74 You can search for a phrase which consists of two or more terms
75 by surrounding them with double quotes like
76 <code class="operator">"..."</code> or with braces like <code class="operator">{...}</code>.
Phrase searching in Namazu is not 100% precise,
so it occasionally returns wrong results. e.g.,
You must use the brace form with Tknamazu or namazu.el.
91 <h3><a name="query-substring">Substring matching</a></h3>
There are three types of substring matching.
98 <dd><code class="example">inter*</code> (terms which begin with <code>inter</code>)
100 <dd><code class="example">*text*</code> (terms which contain <code>text</code>)
<dd><code class="example">*net</code> (terms which end
with <code>net</code>)
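<p>
The three pattern forms can be sketched as follows. This is only an
illustration of the matching rules, under the assumption that the
<code>*</code> wildcard appears only at the start or end of a pattern;
the <code>matches</code> function is hypothetical and is not part of
Namazu.
</p>

```python
def matches(pattern, term):
    """Check a term against a prefix, inside, or suffix pattern, ignoring case."""
    pattern = pattern.lower()
    term = term.lower()
    leading = pattern.startswith("*")
    trailing = pattern.endswith("*")
    core = pattern.strip("*")
    if leading and trailing:
        return core in term           # *text* : term contains core
    if trailing:
        return term.startswith(core)  # inter* : term begins with core
    if leading:
        return term.endswith(core)    # *net   : term ends with core
    return term == core               # no wildcard: exact match


terms = ["Internet", "interface", "context", "textbook", "Usenet"]

prefix_hits = [t for t in terms if matches("inter*", t)]
inside_hits = [t for t in terms if matches("*text*", t)]
suffix_hits = [t for t in terms if matches("*net", t)]
```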
107 <h3><a name="query-regex">Regular expressions</a></h3>
You can use regular expressions for pattern matching. The
regular expressions must be surrounded by slashes like <code
class="operator">/.../</code>. Namazu uses <a
href="http://www.ruby-lang.org/">Ruby</a>'s
regular expression engine, which offers a generally <a
href="http://www.perl.com/">Perl</a>-compatible flavor.
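<p>
As a rough illustration of the slash-delimited form, the sketch below
uses Python's <code>re</code> module as a stand-in for the Ruby engine,
so details of the regular expression flavor may differ. The
<code>regex_query</code> function is hypothetical, and matching the
pattern anywhere within each indexed term is an assumption.
</p>

```python
import re


def regex_query(query, terms):
    """Return the terms matched by a /.../ query, ignoring case."""
    if len(query) < 2 or not (query.startswith("/") and query.endswith("/")):
        raise ValueError("regular expression queries must look like /.../")
    # Drop the surrounding slashes and compile the remaining pattern.
    pattern = re.compile(query[1:-1], re.IGNORECASE)
    return [t for t in terms if pattern.search(t)]


terms = ["program", "programs", "progress", "pogrom"]

alternation_hits = regex_query("/pro(gram|gress)/", terms)
anchored_hits = regex_query("/^pro.*s$/", terms)
```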
124 <h3><a name="query-field">Field-specified searching</a></h3>
126 You can limit your search to specific fields such as
127 <code>Subject:</code>, <code>From:</code>,
128 <code>Message-Id:</code>. It's especially convenient for
129 Mail/News documents. e.g.,
133 <li><code class="example">+subject:Linux</code><br>
134 (Retrieving all documents which contain <code>Linux</code>
135 in a <code>Subject:</code> field)
137 <li><code class="example">+subject:"GNU Emacs"</code><br>
138 (Retrieving all documents which contain <code>GNU Emacs</code>
139 in a <code>Subject:</code> field)
141 <li><code class="example">+from:foo@bar.jp</code><br>
142 (Retrieving all documents which contain <code>foo@bar.jp</code>
143 in a <code>From:</code> field)
<li><code class="example">+message-id:&lt;199801240555.OAA18737@foo.bar.jp&gt;</code><br>
(Retrieving the document which contains the specified
<code>Message-Id:</code>)
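<p>
The structure of a field-specified query (a leading <code>+</code>, a
field name, a colon, then a term or quoted phrase) can be sketched as
below. The helper names and the sample <code>headers</code> are
hypothetical, and treating a match as a case-insensitive substring test
on the field value is an assumption rather than a description of
Namazu's internals.
</p>

```python
def parse_field_query(query):
    """Split a query such as '+subject:Linux' into ('subject', 'Linux')."""
    if not query.startswith("+") or ":" not in query:
        raise ValueError("expected a query of the form +field:term")
    field, _, value = query[1:].partition(":")
    # Remove phrase delimiters: "..." or {...}
    if len(value) >= 2 and (value[0] + value[-1]) in ('""', "{}"):
        value = value[1:-1]
    return field.lower(), value


def field_matches(query, headers):
    """Check whether the named header field contains the queried text."""
    field, value = parse_field_query(query)
    return value.lower() in headers.get(field, "").lower()


# Hypothetical message headers, keyed by lower-case field name.
headers = {
    "subject": "Installing GNU Emacs on Linux",
    "from": "foo@bar.jp",
    "message-id": "<199801240555.OAA18737@foo.bar.jp>",
}

print(field_matches('+subject:"GNU Emacs"', headers))  # prints True
```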
151 <h3><a name="query-notes">Notes</a></h3>
<li>In all queries, Namazu ignores the case of alphabetic
characters. In other words, Namazu always performs
case-insensitive matching.
<li>Japanese phrases are automatically segmented into
morphemes and handled as <a
href="#query-phrase">phrase searches</a>. The segmentation
is occasionally incorrect.
<li>Letters, digits, and some symbols defined in JIS X 0208
(Japanese Industrial Standards) that duplicate ASCII
characters are handled as ASCII characters.
<li>Namazu can handle a term which contains symbols, such as
<code>TCP/IP</code>. Because this handling is incomplete,
you can write <code>TCP and IP</code> instead of
<code>TCP/IP</code>, though this may produce noisy results.
<li>Substring matching and field-specified searching take
more time than other methods.
<li>If you want to use <code class="operator">and</code>,
<code class="operator">or</code> or <code
class="operator">not</code> simply as terms, you can
surround each of them with double quotes like <code
class="operator">"..."</code> or braces like <code
class="operator">{...}</code>.
You must use the brace form with Tknamazu or namazu.el.