CSC2540S Machine Learning and Universal Grammar
Department of Computer Science,
Spring Semester, 2009
Shalom Lappin
Department of Philosophy
King's College
shalom@cs.toronto.edu
References
Special issue of the Linguistic Review (2002), volume 19, pp. 1-223.
Abney, S. (1996), "Statistical Methods and Linguistics" in J. Klavans and P. Resnik (eds.), The Balancing Act:
Combining Symbolic and Statistical Approaches to Language, MIT Press,
Archibald, L. and
International Journal of Language and Communication Disorders 41, pp. 675–693.
Arikawa, S., T. Shinohara, and A. Yamamoto (1989), “Elementary Formal Systems as a Unifying Framework for
Language Learning”, Proceedings of the 2nd Workshop on Computational Learning Theory, pp. 312-327.
Baker, M. (2001), The Atoms of Language: The Mind's Hidden Rules of Grammar, Basic Books,
Banko, M. and E. Brill (2001), “Scaling to Very Very Large Corpora for Natural Language Disambiguation”,
Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics,
Berwick, R. and N. Chomsky (2008), “‘Poverty of the Stimulus’ Revisited: Recent Challenges Reconsidered”,
Proceedings of the 30th Annual Conference of the Cognitive Science Society,
Bishop, D., C. Adams, and C. Norbury (2006), “Distinct Genetic Influences on Grammar and Phonological
Short-Term Memory Deficits: Evidence from 6-Year-Old Twins”, Genes, Brain and Behavior 5, pp. 159–169.
Bod, R. (2006), “An All-Subtrees Approach to Unsupervised Parsing”, Proceedings of ACL-COLING 2006,
Bod, R. (2007a), “Is the End of Supervised Parsing in Sight?”, Proceedings of the 45th Annual Meeting of
the Association for Computational Linguistics,
Bod, R. (2007b), “A Linguistic Investigation into Unsupervised DOP”, Proceedings of the Workshop on
Cognitive Aspects of Computational Language Acquisition,
Boeckx, C. (2008), Approaching Parameters from Below, ms.,
Borer, H. (1984), Parametric Syntax: Case Studies in Semitic and Romance Languages, Foris,
Braine, M.D.S. (1971), “On Two Types of Models of the Internalization of Grammars” in D.I. Slobin (ed.),
The Ontogenesis of Grammar, Academic Press,
Bresnan, J. (2001), Lexical-Functional Syntax, Blackwell,
Bresnan, J. & R. Kaplan (1982), “Lexical-Functional Grammar: A Formal System for Grammatical
Representation”, in J. Bresnan (ed.), The Mental Representation of Grammatical Relations, MIT Press,
Bresnan, J., A. Cueni, T. Nikitina, and H. Baayen (2005), "Predicting the Dative Alternation", to appear in
Carroll, G. and E. Charniak (1992), “Two Experiments on Learning Probabilistic Dependency Grammars
from Corpora”, in C. Weir, S. Abney, R. Grishman, and R. Weischedel (eds.), Working notes of the Workshop
on Statistically-based NLP Techniques, AAAI Press,
Cartling, B. (2008), “On the Implicit Acquisition of a Context-Free Grammar by a Simple Recurrent Neural
Network”, Neurocomputing 71, pp. 1527-1537.
Charniak, E. (1997), "Statistical Parsing with Context-Free Grammar and Word Statistics", Proceedings
of the Fourteenth National Conference on Artificial Intelligence, pp. 598-603.
Charniak, E. and M. Johnson (2005), “Coarse-to-Fine N-Best Parsing and Maxent Discriminative Reranking”,
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL ’05),
Chomsky, N. (1957), Syntactic Structures, Mouton,
Chomsky, N. (1965), Aspects of the Theory of Syntax, MIT Press,
Chomsky, N. (1971), Problems of Knowledge and Freedom,
Chomsky, N. (1975), The Logical Structure of Linguistic Theory, Plenum Press,
Chomsky, N. (1980), Rules and Representations,
Chomsky, N. (1981), Lectures on Government and Binding, Foris,
Chomsky, N. (1986), Knowledge of Language,
Chomsky, N. (1995), The Minimalist Program, MIT Press,
Chomsky, N. (2001), “Derivation by Phase”, in M. Kenstowicz (ed.), Ken Hale: A Life in Language, MIT Press,
Chomsky, N. (2005), "Three Factors in Language Design", Linguistic Inquiry 36, pp. 1-21.
Chomsky, N. (2007), “Approaching UG from Below”, in U. Sauerland & M. Gaertner (eds.), Interfaces +
Recursion = Language? Chomsky's Minimalism and the View from Syntax-Semantics, Mouton, de
Christiansen, M. & N. Chater (2008), “Language as Shaped by the Brain”, Behavioral and Brain Sciences 31,
pp. 489-558.
Chouinard, M. and E. Clark (2003), "Adult Reformulations of Child Errors as Negative Evidence", Journal of
Child Language 30, pp. 637-669.
Clark, A. (2003), "Combining Distributional and Morphological Information for Part of Speech Induction",
Proceedings of the 10th Annual Meeting of the European Association of Computational Linguistics,
pp. 59-66.
Clark, A. (2004), "Grammatical Inference and First Language Acquisition", Workshop on Psychocomputational
Models of Human Language Acquisition,
Clark, A. (2006), “PAC-Learning Unambiguous NTS Languages”, 8th International Colloquium on Grammatical
Inference, Springer,
Clark, A. and F. Thollard (2004), “Partially Distribution-Free Learning of Regular Languages from Positive
Samples”, Proceedings of COLING 2004,
Clark, A. and R. Eyraud (2006), “Learning Auxiliary Fronting with Grammatical Inference”, in Proceedings of
the 10th Conference on Computational Language Learning (CoNLL-X),
Clark, A. and S. Lappin (2009), “Another Look at Indirect Negative Evidence”, Proceedings of the EACL
Workshop on Cognitive Aspects of Computational Language Acquisition,
(http://www.dcs.kcl.ac.uk/staff/lappin/papers/negativeClarkLappin.pdf).
Collins, M. (1999), Head-Driven Statistical Models for Natural Language Parsing, Ph.D. dissertation,
Conti-Ramsden, G., N. Botting, and B. Faragher (2001), “Psycholinguistic Markers for Specific Language
Impairment (SLI)”, Journal of Child Psychology and Psychiatry 42, pp. 741–748.
Cowie, F. (1999), What's Within? Nativism Reconsidered,
Crain, S. (1991), “Language Acquisition in the Absence of Experience”, Behavioral and Brain Sciences
14, pp. 597-612.
Crain, S. & M. Nakayama (1987), “Structure Dependence in Grammar Formation”, Language 63, pp. 522-543.
Diessel, H. and M. Tomasello (2005), "A New Look at the Acquisition of Relative Clauses", Language 81,
pp. 882-906.
Dodwell, K. and
Narratives and Memory”, International Journal of Language and Communication Disorders 43,
pp. 201–218.
Elman, J. (1990), “Finding Structure in Time”, Cognitive Science 14, pp. 179-211.
Elman, J. (1991), “Distributed Representations, Simple Recurrent Networks, and Grammatical Structure”,
Machine Learning 7, pp. 195-225.
Elman, J. (1998), “Generalization, Simple Recurrent Networks, and the
Emergence of Structure”, in
M. Gernsbacher &
Society,
Elman, J., E. Bates, M. Johnson, A. Karmiloff-Smith, D. Parisi, & K. Plunkett (1996), Rethinking
Innateness: A Connectionist Perspective on Development, MIT Press,
Fernandez, R., J. Ginzburg, and S. Lappin (2007), "Classifying Non-Sentential Utterances in Dialogue: A
Machine Learning Approach", Computational Linguistics 33(3), pp. 397-427.
Fisher, S. (2006), “Tangled Webs: Tracing the Connections between Genes and Cognition”, Cognition 101,
pp. 270–297.
Fisher, S. (2008), A Molecular Window into Speech and Language, Francis Crick Lecture, Royal Society,
Fitch, W.T., N. Chomsky, & M. Hauser (2005), “The Evolution of the Language Faculty: Clarifications and
Implications”, Cognition 97, pp. 179-210.
Fodor, J. (1983), The Modularity of Mind, MIT Press,
Fodor, J. (2000), The Mind Doesn't Work that Way, MIT Press,
Fodor, J. & Z. Pylyshyn (1988), “Connectionism and Cognitive Architecture: A Critical Analysis”,
Cognition 28, pp. 3-71.
Fodor, J.D. and C. Crowther (2002), “Understanding Stimulus Poverty Arguments”, The Linguistic Review
19, pp. 105-145.
Gazdar, G. and C. Mellish (1989), Natural Language Processing in Prolog, Addison-Wesley,
Gibson, E. and K. Wexler (1994), "Triggers", Linguistic Inquiry 25, pp. 407-454.
Ginzburg, J. & I.A. Sag (2000), Interrogative Investigations, CSLI,
Gleitman, L. (1990), “The Structural Sources of Verb Meanings”, Language Acquisition 1, pp. 3-55.
Gold, E. M. (1967), “Language Identification in the Limit”, Information and Control 10(5), pp. 447-474.
Goldsmith, J. (2001), "Unsupervised Learning of the Morphology of a Natural Language", Computational
Linguistics 27, pp. 153-198.
Gould, S. & R. Lewontin (1979), “The Spandrels of San Marco and the Panglossian Paradigm:
A Critique of the Adaptationist Programme”, Proceedings of the Royal Society, B205, Royal Society,
Groszer, M., D. Keays, R. Deacon, J. de Bono, S. Prasad-Mulcare, S. Gaub, M. Baum, C. French, J. Nicod,
J. Coventry, W. Enard, M. Fray, S. Brown, P. Nolan, S. Paabo, K. Channon, R. Costa, J. Eilers, G. Ehret,
J.N.P. Rawlins, and S. Fisher (2008), “Impaired Synaptic Plasticity and Motor Learning in Mice with a Point
Mutation Implicated in Human Speech Deficits”, Current Biology 18, pp. 354–362.
Harris, Z. (1951), Structural Linguistics,
Harris, Z. (1954), “Distributional Structure”, Word 10, pp. 146-162.
Hauser, M., N. Chomsky, & W.T. Fitch (2002), “The Faculty of Language: What Is It, Who Has It, and How
Did It Evolve?”, Science 298, pp. 1569-1579.
Hawkins, J. (1994), A Performance Theory of Order and Constituency,
Hawkins, J. (2004), Efficiency and Complexity in Grammars,
Hornstein, N. and D. Lightfoot (eds.) (1981), Explanation in Linguistics: The Logical Problem of Language
Acquisition, Longman,
Hurst,
J.A., M. Baraitser, E. Auger, F. Graham, and
Dominantly Inherited Speech Disorder”, Developmental Medicine and Child Neurology 32, pp. 352-355.
Jackendoff, R. (1997), The Architecture of the Language Faculty, MIT Press,
Jackendoff, R. and S. Pinker (2005), “The Nature of the Language Faculty and Its Implications for
Evolution of Language (Reply to Fitch, Hauser, and Chomsky)”, Cognition 97, pp. 211-225.
Johnson, D. & S. Lappin (1999), Local Constraints vs Economy, Monographs in Linguistics Series, CSLI,
Johnson, D.E. & P. Postal (1980), Arc Pair Grammar,
Joshi, A. & Y. Schabes (1997), “Tree Adjoining Grammars”, in G. Rozenberg and A. Salomaa (eds.), Handbook
of Formal Languages Volume 3: Beyond Words, Springer,
Jurafsky, D. and J. Martin (2000), Speech and Language Processing, Prentice Hall,
Kirby, S. (2001), “Spontaneous Evolution of Linguistic Structure: An Iterated Learning Model of the Emergence
of Regularity and Irregularity”, IEEE Transactions on Evolutionary Computation 5, pp. 102-110.
Kirby, S. (2007), “The Evolution of Meaning-Space Structure through Iterated Learning”, in C. Lyon, C. Nehaniv,
and A. Cangelosi (eds.), Emergence of Communication and Language, Springer Verlag,
Kirby, S. & J. Hurford (2002), “The Emergence of Linguistic Structure: An Overview of the Iterated Learning
Model”, in Angelo Cangelosi and Domenico Parisi (eds.), Simulating the Evolution of Language, Springer
Verlag,
Klein, D. and C. Manning (2002), "A Generative Constituent-Context Model for Improved Grammar Induction",
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 128-135.
Klein, D. and C. Manning (2004), "Corpus-Based Induction of Syntactic Structure: Models of Dependency and
Constituency", Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics,
Lai, C.S., S. Fisher, J. Hurst, F. Vargha-Khadem, and A. Monaco (2001), “A Forkhead-Domain Gene is Mutated
In Severe Speech and Language Disorder”, Nature 413, pp. 519-523.
Lappin, S. (2005), "Machine Learning and the Cognitive Basis of Natural Language", Proceedings of
Computational Linguistics in the Netherlands 2004,
Lappin, S. and S. Shieber (2007), “Machine Learning Theory and Practice as a Source of Insight into Universal
Grammar”, Journal of Linguistics 43, pp. 393-427.
Lasnik, H. (1989), “On Certain Substitutes for Negative Data” in R. Matthews
and
Learnability and Linguistic Theory,
Kluwer/Academic Press,
Lasnik, H. and J. Uriagereka (2002), "On the Poverty of the Challenge", The Linguistic Review 19, pp. 147-150.
Laurence, S. and E. Margolis (2001), “The Poverty of the Stimulus Argument”, British Journal for the Philosophy
of Science 52, pp. 217-276.
Legate, J. and C. Yang (2002), “Empirical Re-Assessment of Stimulus Poverty Arguments”, The Linguistic Review
19, pp. 151-162.
van der Lely, H. (2004), “Evidence for and Implications of a Domain-Specific Grammatical Deficit”, in L. Jenkins (ed.),
The Genetics of Language, Elsevier,
van der Lely, H. (2005), “Domain-Specific Cognitive Systems: Insight from Grammatical SLI”, Trends in Cognitive
Science 9, pp. 53–59.
MacWhinney, B. (1995), The CHILDES Project: Tools for Analyzing Talk, 2nd edn., Lawrence Erlbaum,
MacWhinney, B. (2004), "Multiple Process Solution to the Logical Problem of Language Acquisition", Journal
of Child Language 31, pp. 883-914.
MacWhinney, B. (2005), "Item-Based Constructions and the Logical Problem", Proceedings of the Second
Workshop on Psychocomputational Models of Human Language Acquisition, Association for Computational
Linguistics, pp. 53-68.
MacWhinney, B., & Snow, C. (1990), “The Child Language Data Exchange System: An Update”, Journal of Child
Language 17, pp. 457-472.
Manning, C. & H. Schütze (1999), Foundations of Statistical Natural Language Processing, MIT Press,
Marcus, G. (1993), “Negative Evidence in Language Acquisition”, Cognition 46, pp. 53-85.
Marcus, G. (2001), The Algebraic Mind: Integrating Connectionism and Cognitive Science, MIT Press,
Marcus, G. (2006), “Cognitive Architecture and Descent with Modification”, Cognition 101, pp. 443-465.
Marcus, G. (2008), Kluge, Houghton Mifflin Co.,
Marcus, M. (1993), “Building a Large Annotated Corpus of English: The Penn Treebank”, Computational
Linguistics 19, pp. 313–330.
Marton, K. (2008), “Visuo-Spatial Processing and Executive Functions in Children with Specific Language
Impairment”, International Journal of Language and Communication Disorders 43, pp. 181–200.
Marton, K., R. Schwartz, L. Farkas, and V. Katsnelson (2006), “Effect of Sentence Length and Complexity on
Working Memory Performance in Hungarian Children with Specific Language Impairment (SLI): A Cross
Linguistic Comparison”, International Journal of Language and Communication Disorders 41,
pp. 653–673.
McClelland, J., D. Rumelhart, & the PDP research group (1986), Parallel Distributed Processing: Explorations
in the Microstructure of Cognition, Volume II, MIT Press,
Montague, R. (1974), Formal Philosophy: Selected Papers of Richard Montague,
Morrill, G. (1994), Type Logical Grammar: Categorial Logic of Signs, Kluwer,
Morris, W., G. Cottrell, & J. Elman (1998), “A Connectionist Simulation of the Empirical Acquisition of
Grammatical Relations”, in S. Wermter and R. Sun (eds.), Hybrid Neural Systems, Lecture Notes in Computer
Science, Springer,
Manzini, R. and K. Wexler (1987), “Parameters, Binding, and Learning Theory”, Linguistic Inquiry 18,
pp. 413-444.
Newmeyer, F. (2004), "Against a Parameter-Setting Approach to Typological Variation", Linguistic Variation
Yearbook 4, pp. 182-234.
Newmeyer, F. (2005), Possible and Probable Languages,
Niyogi, P. (2006), The Computational Nature of Language Learning and Evolution, MIT Press,
Norbury, C., D. Bishop, and J. Briscoe (2001), “English Finite Verb Morphology: A Comparison of SLI and
Mild-Moderate Hearing Impairment”, Journal of Speech, Language, and Hearing Research 44, pp. 165–178.
Nowak, M., N. L. Komarova, and P. Niyogi (2001), “Evolution of Universal Grammar”, Science 291, pp. 114-118.
Nowak, M., N. L. Komarova, and P. Niyogi (2002), "Computational and Evolutionary Aspects of Language",
Nature 417, pp. 611-617.
Pereira, F. (2000), “Formal Grammar and Information Theory: Together Again?”, Philosophical Transactions of
the Royal Society, Royal Society,
Perfors, A., J. Tenenbaum, and T. Regier (2006), "Poverty of the Stimulus? A Rational Approach", Proceedings
of Cognitive Science 2006.
Pinker, S. (1979), "Formal Models of Language Learning", Cognition 7, pp. 217-282.
Pinker, S. (1989), Learnability and Cognition: The Acquisition of Argument Structure, MIT Press,
Pinker, S. (1984), Language Learnability and Language Development,
Pinker, S. (1997), How the Mind Works,
Pinker, S. & P. Bloom (1990), “Natural Language and Natural Selection”, Behavioral and Brain Sciences 13,
pp. 707-727.
Pinker, S. & R. Jackendoff (2005), “The Faculty of Language: What's Special about It?”, Cognition 95,
pp. 201-236.
Pollard, C. & I.A. Sag (1994), Head-Driven Phrase Structure Grammar, CSLI, Chicago and Stanford.
Pullum, G. and B. Scholz (2002), “Empirical Assessment of Stimulus Poverty Arguments”, The Linguistic
Review 19, pp. 9-50.
Rumelhart, D., J. McClelland, & the PDP research group (1986), Parallel Distributed Processing: Explorations
in the Microstructure of Cognition, Volume I, MIT Press,
Saffran, J., R. Aslin, and
pp. 1926-1928.
Saxton, M. (1997), "The Contrast Theory of Negative Input", Journal of Child Language 24, pp. 139-161.
Saxton, M. (2000), “Negative Evidence and Negative Feedback”, First Language 20, pp. 221-252.
Saxton,
M., C. Houston-Rice, and
Corrective Input for Grammatical Errors", Applied Psycholinguistics 26, pp. 393-414.
Scholz, B. and G. Pullum (2006), "Irrational Nativist Exuberance" in Robert Stainton (ed.), Debates in Cognitive
Science, Blackwell,
Schone, P. and D. Jurafsky (2001), “Knowledge-free Induction of Inflectional Morphologies”, Proceedings of
the Conference of the North American Chapter of the Association for Computational Linguistics
(NAACL-2001),
Schütze, H. (1995), “Distributional Part-of-Speech Tagging”, Proceedings of the Conference of the European
Chapter of the Association for Computational Linguistics (EACL 7),
Shinohara, T. (1994), “Rich Classes Inferable from Positive Data: Length-Bounded Elementary Formal Systems”,
Information and Computation 108, pp. 175-186.
Steedman, M. (2000), The Syntactic Process, MIT Press,
Thompson, S. and
Language Learning and Development 3, pp. 1-42.
Tomasello, M. (2003), Constructing a Language: A Usage-Based Theory of Language Acquisition, Harvard
University Press,
Vapnik, V. (1998), Statistical Learning Theory, Wiley,
Watkins, K., N. Dronkers, and F. Vargha-Khadem (2002), “Behavioural Analysis of an Inherited Speech
and Language Disorder: Comparison with Acquired Aphasia”, Brain 125, pp. 452-464.
Webelhuth, G. (1992), Principles and Parameters of Syntactic Saturation,
Weiss, D. and
Approach”, Infancy 2, pp. 241-257.
Wexler, Ken (1991), "On the Argument from the Poverty of the Stimulus", in A. Kasher (ed.), The Chomskyan
Turn, Blackwell,
White, S., S. Fisher, D. Geschwind, C. Scharff, and T. Holy (2006), “Singing Mice, Songbirds, and More:
Models for FOXP2 Function and Dysfunction in Human Speech and Language”, The Journal of Neuroscience
26, pp. 10376–10379.
Wonnacott, E.,
Distributional Learning in a Miniature Language”, Cognitive Psychology, 56, pp. 165-209.
Yang, C. (2002), Knowledge and Learning in Natural Language,
Yang, C. (2004), "Universal Grammar, Statistics, or Both?", Trends in Cognitive Sciences 10, pp. 451-456.
Zwicky, A. (1970), “A Double Regularity in the Acquisition of English Verb Morphology”, Working Papers in
Linguistics 4,