1
| From svn.tartarus.org/snowball/trunk/website/algorithms/english/stop.txt
2
| This file is distributed under the BSD License.
3
| See http://snowball.tartarus.org/license.php
4
| Also see http://www.opensource.org/licenses/bsd-license.html
5
| - Encoding was converted to UTF-8.
6
| - This notice was added.
8
| An English stop word list. Comments begin with vertical bar. Each stop
9
| word is at the start of a line.
11
| Many of the forms below are quite rare (e.g. "yourselves") but included for
17
i | subject, always in upper case of course
20
my | possessive adjective
21
| the possessive pronoun `mine' is best suppressed, because of the
22
| sense of coal-mine etc.
28
| care is required here because US = United States. It is usually
29
| safe to remove it if it is in lower case.
30
our | possessive adjective
31
ours | possessive pronoun
33
| second person (archaic `thou' forms not included)
34
you | subject and object
35
your | possessive adjective
36
yours | possessive pronoun
37
yourself | reflexive (singular)
38
yourselves | reflexive (plural)
39
| third person singular
42
his | possessive adjective and pronoun
46
her | object and possessive adjective
47
hers | possessive pronoun
50
it | subject and object
51
its | possessive adjective
56
their | possessive adjective
57
theirs | possessive pronoun
58
themselves | reflexive
59
| other forms (demonstratives, interrogatives)
69
| VERB FORMS (using F.R. Palmer's nomenclature)
71
am | 1st person, present
72
is | -s form (3rd person, present)
74
was | 1st person, past
77
been | past participle
90
| The forms below are, I believe, best omitted, because of the significant
97
| fight the good fight with all thy MIGHT
99
| would, could, should, ought might however be included
126
| COMPOUND FORMS, increasingly encountered nowadays in 'formal' writing
166
| auxiliary + negation
177
| miscellaneous forms
203
| THE REST (Overlap among prepositions, conjunctions, adverbs etc is so
204
| high, that classification is pointless.)
274
| Just for the record, the following words are among the commonest in English