cs0204023/cs0204023
1: \documentclass[12pt]{article}
2: \usepackage{times,apalike,url}
3: \title{Computational Phonology}
4: \author{Steven Bird}
5: \date{University of Pennsylvania}
6: \bibliographystyle{apalike}
7: 
8: \begin{document}
9: \maketitle
10: 
11: Phonology, as it is practiced, is deeply computational.
12: Phonological analysis is data-intensive and the resulting
13: models are nothing other than specialized data structures and algorithms.
14: In the past, phonological computation -- managing data and developing
15: analyses -- was done manually with pencil and paper.  Increasingly,
16: with the proliferation of affordable computers, IPA fonts and drawing software,
17: phonologists are seeking to move their computation work online.
18: {\em Computational Phonology} provides the theoretical and technological
19: framework for this migration, building on methodologies and tools from
20: computational linguistics.  This piece consists of an {\em apology} for
21: computational phonology, a history, and an overview of current research.
22: 
23: {\bf Documentation and Description.}
24: Phonological data is of essentially three types:
25: texts, wordlists and paradigms.
26: A text is any phonetically transcribed narrative or conversation.
27: A wordlist is any compilation of linguistic forms which can be uttered
28: in isolation, with information about pronunciation and meaning.
29: A paradigm is broadly construed to mean any
30: tabulation of words or phrases which illustrates
31: contrasts and systematic variation.
32: Any of these data types may be {\em annotated} with more abstract
33: information originating from a phonological theory, such
34: as syllable boundaries, stress marks and prosodic structure.
35: Additionally, any of these data types may be associated with
36: recordings of audio, video or physiological signals.
37: Digitizing this documentation and description brings all the
38: different media types together, makes the cross-links
39: navigable, and opens up many new possibilities for
40: management, access and preservation.
41: 
42: {\bf Exploration and Analysis.}
43: The data types described above are closely interconnected in
44: phonological practice.
45: For instance, the discovery of a new word in a text
46: may require an update to the lexicon and
47: the construction of a new paradigm (e.g. to correctly
48: classify the word).  Fresh insights may lead to new annotations
49: and further elicitation, closing the loop in this perpetual,
50: exploratory process.
51: Phonological analysis typically involves defining a formal model,
52: systematically testing it against data, and comparing it with other models.
53: (In some cases, the model may be incorporated into a software system, e.g. for
54: generating natural intonation in a text-to-speech system.)
55: In this exploration and analysis --
56: sorting, searching, tabulating, defining, testing and comparing --
57: the principal task is computational.
58: 
59: Perhaps the earliest work in computational phonology was Bobrow and
60: Fraser's {\em Phonological Rule Tester} \cite{BobrowFraser68}, an
61: implementation of SPE designed to ``alleviate the problem of rule
62: evaluation.''  Shortly afterwards Johnson showed that, while SPE rules
63: resemble general rewriting systems at the top of the
64: Chomsky hierarchy, the way SPE rules are used in practice only requires
65: finite state power \cite{Johnson72}.
66: Independently, Kaplan and Kay discovered the connections between SPE
67: grammars and finite state transducers in the 70's and 80's, and laid down
68: a complete algebraic foundation (ultimately reported in
69: \cite{KaplanKay94}).  Significant implementations followed, including
70: \cite{Koskenniemi83b,BeesleyKarttunen02}.  Attempts to apply finite
71: state devices to Autosegmental Phonology have largely foundered,
72: but applications to Optimality Theory are thriving.
73: 
74: While finite-state phonology fixated on SPE, generative phonology continued
75: its rapid evolution.  The discovery of rule ``conspiracies''
76: \cite{Kisseberth70} and the abstractness controversy \cite{Koutsoudas74},
77: lead to calls for the reintroduction of {\em surface structure
78: constraints}.  Many theories arose from the fallout; most notable
79: for its computational ramifications was
80: Montague Phonology \cite{Wheeler81}.  This model adapted new lexicalist
81: formalisms from syntax and semantics, providing a {\em declarative}
82: (as opposed to {\em procedural}) account of phonological well-formedness,
83: and providing the first computational account of underspecification (where the
84: phonological content of a lexical entry is incompletely specified,
85: to be filled in during a derivation).  From these beginnings, {\em Declarative
86: Phonology} was born, and subsequent work provided a mathematical foundation
87: in first-order logic \cite{Bird95} and phonetic interpretation with links
88: to Firthian prosodic analysis and speech synthesis \cite{Coleman97},
89: with implementations generally in the Prolog programming language.
90: 
91: A third major strand of development, complementing the finite state and
92: declarative models, is best characterized as statistical.  It seeks to
93: apply neural networks, information theory, and weighted automata
94: in the automatic discovery of phonological information.
95: Gasser trained a recurrent neural network to recognize syllables and to repair
96: ill-formed syllables \cite{Gasser92a}.
97: Ellison showed how a technique from information theory called
98: MDL -- {\em minimum description length} -- could be applied to
99: automatically identify syllable boundaries in phonemically transcribed
100: texts \cite{Ellison92b}.
101: Many researchers apply Markov models (a kind of weighted automata) in
102: speech recognition, mapping speech recordings to phonetic transcriptions
103: and thence to orthographic words, using large, phonetically annotated
104: corpora as training data (e.g. TIMIT \cite{TIMIT86}).
105: 
106: Four key areas of ongoing research in computational phonology are in
107: Optimality Theory, automatic learning, interfaces to grammar and
108: phonetics, and supporting phonological description in the field.
109: Comprehensive references to online research papers in this areas may
110: be found on the SIGPHON website.
111: 
112: Computational phonology is generating sophisticated and rigorous ways for
113: creating, exploring and disseminating multidimensional phonological
114: information, encompassing primary recordings, texts, wordlists, paradigms,
115: theories and analyses.  As phonologists adopt the computational methods
116: described above, extending and adapting them as needed, the consequences
117: for the discipline will be increased accessibility, accountability, and
118: stability of empirical research.
119: 
120: {\bf Resources.}
121: The Association for Computational Linguistics
122: (ACL) has a special interest group in computational phonology (SIGPHON) with
123: a homepage at \url{http://www.cogsci.ed.ac.uk/sigphon/}.
124: The website contains online proceedings for SIGPHON workshops
125: and information about relevant books, dissertations and articles.
126: A special issue of {\it Computational Linguistics} devoted to
127: computational phonology was published in 1994 \cite{Bird94}.
128: 
129: \raggedright
130: 
131: \begin{thebibliography}{}
132: 
133: \bibitem[Beesley and Karttunen, 2002]{BeesleyKarttunen02}
134: Beesley, K.~R. and Karttunen, L. (2002).
135: \newblock {\em Finite-State Morphology: Xerox Tools and Techniques}.
136: \newblock Studies in Natural Language Processing. Cambridge University Press.
137: 
138: \bibitem[Bird, 1994]{Bird94}
139: Bird, S., editor (1994).
140: \newblock {\em Computational Linguistics: Special Issue on Computational
141:   Phonology}, volume 20(3).
142: \newblock MIT Press.
143: 
144: \bibitem[Bird, 1995]{Bird95}
145: Bird, S. (1995).
146: \newblock {\em Computational Phonology: A Constraint-Based Approach}.
147: \newblock Studies in Natural Language Processing. Cambridge University Press.
148: 
149: \bibitem[Bobrow and Fraser, 1968]{BobrowFraser68}
150: Bobrow, D.~G. and Fraser, J.~B. (1968).
151: \newblock A phonological rule tester.
152: \newblock {\em Communications of the ACM}, 11:766--72.
153: 
154: \bibitem[Coleman, 1997]{Coleman97}
155: Coleman, J.~S. (1997).
156: \newblock {\em Phonological Representations --- their names, forms and powers}.
157: \newblock Cambridge Studies in Linguistics. Cambridge University Press.
158: 
159: \bibitem[Ellison, 1992]{Ellison92b}
160: Ellison, T.~M. (1992).
161: \newblock {\em Machine Learning of Phonological Structure}.
162: \newblock PhD thesis, University of Western Australia.
163: 
164: \bibitem[Garofolo et~al., 1986]{TIMIT86}
165: Garofolo, J.~S., Lamel, L.~F., Fisher, W.~M., Fiscus, J.~G., Pallett, D.~S.,
166:   and Dahlgren, N.~L. (1986).
167: \newblock {\em The {DARPA TIMIT} Acoustic-Phonetic Continuous Speech Corpus
168:   {CDROM}}.
169: \newblock NIST.
170: \newblock \url{http://www.ldc.upenn.edu/Catalog/LDC93S1.html}.
171: 
172: \bibitem[Gasser, 1992]{Gasser92a}
173: Gasser, M. (1992).
174: \newblock Learning distributed representations for syllables.
175: \newblock In {\em Proceedings of the Fourteenth Annual Conference of the
176:   Cognitive Science Society}, pages 396--401. Hillsdale NJ: Lawrence Erlbaum
177:   Associates.
178: 
179: \bibitem[Johnson, 1972]{Johnson72}
180: Johnson, C.~D. (1972).
181: \newblock {\em Formal Aspects of Phonological Description}.
182: \newblock The Hague: Mouton.
183: 
184: \bibitem[Kaplan and Kay, 1994]{KaplanKay94}
185: Kaplan, R.~M. and Kay, M. (1994).
186: \newblock Regular models of phonological rule systems.
187: \newblock {\em Computational Linguistics}, 20:331--78.
188: 
189: \bibitem[Kisseberth, 1970]{Kisseberth70}
190: Kisseberth, C.~W. (1970).
191: \newblock On the functional unity of phonological rules.
192: \newblock {\em Linguistic Inquiry}, 1:291--306.
193: 
194: \bibitem[Koskenniemi, 1983]{Koskenniemi83b}
195: Koskenniemi, K. (1983).
196: \newblock {\em Two-Level Morphology: A General Computational Model for
197:   Word-Form Recognition and Production}.
198: \newblock PhD thesis, University of Helsinki.
199: 
200: \bibitem[Koutsoudas et~al., 1974]{Koutsoudas74}
201: Koutsoudas, A., Sanders, G., and Noll, C. (1974).
202: \newblock The application of phonological rules.
203: \newblock {\em Language}, 50:1--28.
204: 
205: \bibitem[Wheeler, 1981]{Wheeler81}
206: Wheeler, D.~W. (1981).
207: \newblock {\em Aspects of a Categorial Theory of Phonology}.
208: \newblock PhD thesis, University of Massachusetts at Amherst.
209: 
210: \end{thebibliography}
211: 
212: 
213: 
214: \end{document}
215: