README

Mkkey - A group of programs to generate a list of key words
and their related files from text files.

This group of programs relies on certain (I assume) standard
*NIX text processing programs:
   hyphen - a program to extract and join hyphenated words. Could
            be done with sed.
   comm - Finds common words in two files.  Probably awk could
          accomplish this, but would be slower.
   sort & uniq - should be available anywhere.
   vi - Use your favorite editor here.

The following is used in the program:
    tolower - converts all upper case to lower.  Tr can do the same
              thing.
    bkey.c - The program that actually does the KEY file generation.
             Makeit is the cc command. This one is for SCO XENIX.

Not many comments are in the files - as they should be fairly self-
explanitory.
Two files, apart from the text file(s), are needed;  ignore and
include.  These should be placed in the directory described by $KPATH.
KPATH will also need to be changed to your preference.  You will
probably want to add changes to mkkey to remove the temporary files
after each key update.  For the ignore file, /usr/lib/eign would be a
good place to start as it contains the common words; the, he, she,
that...  etc.  Include builds itself.  The program does copy KEY to
KEY.O - just in case.  I keep KEY.O around until I'm sure KEY is OK.

There are some sed lines that may confuse some people.
These are there to try to pear down the size of the ignore file.

I'd like to know of changes and improvements.  Especially in the area
of stripping ignore words to their base form.  There are no
restrictions on this collection of files.

Enjoy....

Scotty
------
AIDS INFORMATION EXCHANGE BBS      (214) 247-2367/247-5609
               "Education is the best weapon"
{mic,void,egsner}!ozdaltx!sysop || {uunet,smu,ames}!sulaco!ozdaltx!sysop 
-
