free software

Dan Melamed (melamed@unagi.cis.upenn.edu)
Mon, 27 Jan 1997 15:46:57 -0500 (EST)

Dear Colleagues,

Since I began doing research in NLP, I have written hundreds of
utility scripts. It occurred to me that these may be useful to
others. So I've decided to make much of my personal library public.

You might find my scripts useful if

a) you prefer to reuse existing code instead of writing and debugging
your own,

b) you are just getting acquainted with empirical methods in NLP, and
want to reduce your "start-up" time,

c) you want some ideas about how common NLP programming languages,
like Perl and awk, can be used to construct useful tools, or

d) you want to replicate some of my published results (most of which
are on my home page).

The library has 3 main sections:

170 general text processing tools (mostly in Perl5)
75 text statistics tools (mostly in Perl5)
40 bitext geometry tools (mostly in Perl5)

You will also find:

Simulated Annealing Program
English Stemming post-processors for the XTAG morpholyzer
Good-Turing Smoothing Software

Of course, there is no guarantee that any of this is useful, or even
that it works. And I have not gone to great lengths to make the code
more comprehensible to others.

You can access the library from my home page at
http://www.cis.upenn.edu/~melamed/home.html

While you're there, you might want to check out the other PennTools.

Comments, suggestions and bug reports are more than welcome.

Dan Melamed
U. of Penn.