# COMPONENT_NAME: austext
# (C) COPYRIGHT International Business Machines Corp. 1993,1996
# Licensed Materials - Property of IBM
# US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
#***************** ENG.SFX *******************
# $XConsortium: eng.sfx /main/3 1996/10/29 20:12:24 cde-ibm $
# Paice Stemmer Suffix Removal Rules, Ascii English
# Empty lines and lines beginning with punctuation are comments.
# Lines must be sorted lexicographically by FIRST CHAR only ('A' - 'Z').
# Within a char section, rules are listed in the order they are applied.
# Token #1: Required, UPPERCASE suffix string, spelled backwards.
# Token #2: Optional, single asterisk (*). Rule is applied only
#           if original word "is intact", i.e. this is the first rule applied.
# Token #3: Required, 'remove' count. How much of suffix to remove.
#           Zero is permissible and terminates stemming.
# Token #4: Optional, append string, spelled forwards. Applied
#           after suffix is removed.
# Token #5: Required, continuation symbol '>' or '$'.
#           If '$', stemming terminates, else continues.
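#
# Example (illustrative only, not necessarily rules in this file;
# assumes the append string is written in uppercase like the suffix):
#     SEI 3 Y >
#           Suffix "IES" (spelled backwards as SEI), no asterisk so the
#           rule applies whether or not the word is intact, remove 3
#           chars, append "Y", continue. "PONIES" would become "PONY".
#     AI * 2 $
#           Suffix "IA", applied only if the word is still intact,
#           remove 2 chars, append nothing, terminate stemming.
#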
# Revision 2.3 1996/02/01 19:02:05 miker
# Restored some rules inadvertently deleted.
# Revision 2.2 1996/02/01 18:50:18 miker
# AusText 2.1.11, DtSearch 0.3: Changed .sfx format so certain
# values are not hardcoded in lang.c.