Linguistics 538
Computational Methods in Linguistic Research
Spring 2002-2003

This class meets on Mondays and Wednesdays from 15:00-16:30 in the Phonetics Lab (Williams 623).

Instructors

Bill Poser Web Page: http://www.ling.upenn.edu/~wjposer/ Email: wjposer@unagi.cis.upenn.edu
Tony Kroch Web Page: http://www.ling.upenn.edu/~kroch/ Email: kroch@change.ling.upenn.edu

Why Unix?

The work for this course is done on Unix systems. Indeed, a substantial part of the course is devoted specifically to how to use Unix for linguistic research. Here is why.

Lecture Notes

Miscellaneous Notes

Reading (includes both assigned and optional reference materials)

Bird, Steven (1997)
"A Lexical Database Tool for Quantitative Phonological Research," Proceedings of the Third Meeting of the ACL Special Interest Group in Computational Phonology pp. 33-39. Madrid, July 1997.

Bird, Steven and Mark Liberman (2001)
"A Formal Framework for Linguistic Annotation," Speech Communication 33(1,2), pp 23-60, 2001.

Chan, Marjorie K. (2001)
"Concordancers and concordances: Tools for Chinese language teaching and research" Journal of the Chinese Language Teachers Association 37.2 (2002):1-58. (Link is to revised version with color illustrations).

Karttunen, Lauri, J.-P. Chanod, G. Grefenstette, A. Schiller (1997)
"Regular Expressions for Language Engineering," Natural Language Engineering 2.4.305-328.

Poser, William J. (2002)
"Lexical Databases for Carrier," ms. University of Pennsylvania.

Poser, William J. (2002)
"Making Athabaskan Dictionaries Usable," Proceedings of the Athabaskan Languages Conference - 2002. Fairbanks: Alaska Native Language Center, University of Alaska.

Exercises

Solutions to Exercises

Data for Exercises

Software Links

awk
Programming language well suited to text processing.
msort
Sophisticated sorting program.
R
R is a free reimplementation of the statistics language whose proprietary version goes by the name of "Splus". A page containing lots of useful information about R, especially useful as a local Penn reference, is: http://finzi.psych.upenn.edu. A Windows version can be downloaded from here.
wget
Program for downloading web sites.

Documentation for Software


HTML Links


Character Encoding


Other Links

Back to Linguistics course list