Snowball: Quick introduction |
Links |
You can use this site at a number of levels: - You can look at the stemming algorithm definitions themselves, and use them as templates for coding your own versions of stemmers in the computer language of your choice. - You can use the various ANSI C stemmers, without bothering yourself with the Snowball system that generated them. To do that, download the Snowball system, and extract the *.h and *.c files from directory q/. They provide the library and api connections to the stemmers. For each language stemmer you want to use, download the corresponding .c and .h file. Then follow the instructions for use. - You can get involved in Snowball itself. This is particularly worthwhile if you want to adjust the stemmers or develop new stemmers. A typical reason for adjusting the stemmers is that you are working with a different encoding of accented letters from the ISO Latin I encoding assumed in most of the scripts here. Then you need to make your own version of the Snowball compiler and work with the Snowball scripts.
|