1 Chomsky, N. Aspects of the Theory of Syntax. Cambridge, MA: MIT Press
2 Pinker, S. Language Learnability and Language Development. Cambridge, MA: Harvard University Press
3 Yang, C. Knowledge and Learning in Natural Language. Oxford: Oxford University Press
4 Gold, EM. Language identification in the limit. Inform Control 1967, 10:447–474.
5 Angluin, D. Inductive inference of formal languages from positive data. Inform Control 1980, 45:117–135.
6 Valiant, L. A theory of the learnable. Commun ACM 1984, 27:1134–1142.
7 Vapnik, V. The Nature of Statistical Learning Theory. New York: Springer
8 Blumer, A, Ehrenfeucht, A, Haussler, D, Warmuth, MK. Learnability and the Vapnik‐Chervonenkis dimension. J ACM 929–965.
9 Kearns, M, Valiant, L. Cryptographic limitations on learning Boolean formulae and finite automata. J ACM 1994, 41:67–95.
10 Osherson, D, Stob, M, Weinstein, S. Systems that Learn. Cambridge, MA: MIT Press
11 Niyogi, P. The Computational Nature of Language Learning and Evolution. Cambridge, MA: MIT Press
12 Angluin, D. Queries and concept learning. Mach Learn 1988, 2:319–342.
13 Chomsky, N. Lectures on Government and Binding. Dordrecht: Foris
14 Angluin, D. Inference of reversible languages. J ACM 1982, 29:741–765.
15 Clark, A, Eyraud, R. Polynomial identification in the limit of substitutable context free languages. J Mach Learn Res 2007, 8:1725–1745.
16 Berwick, R, Pilato, S. Learning syntax by automata induction. Mach Learn 1987, 2:9–38.
17 Chomsky, N. Syntactic Structures. The Hague: Mouton & Co
18 Wexler, K, Culicover, P. Formal Principles of Language Acquisition. Cambridge, MA: MIT Press
19 Kanazawa, M. Learnable Classes of Categorial Grammars. Stanford University: CSLI
20 Stabler, E. Acquiring languages with movement. Syntax 1998, 1:72–97.
21 Horning, J. A study of grammatical inference, Doctoral dissertation, Department of Computer Science, Stanford University, Stanford, CA, 1969.
22 Angluin, D. Identifying languages from stochastic examples. Technical Report 614. New Haven, CT: Yale University
23 Pitt, L. Probabilistic inductive inference. J ACM 1989, 36:383–433.
24 Shieber, S. Evidence against the context‐freeness of natural language. Ling Phil 1985, 8:333–343.
25 Perfors, A, Tenenbaum, J, Regier, T. Poverty of the stimulus? A rational approach, In Proceedings of the 28th annual conference of the Cognitive Science Society. Vancouver, Canada, 2006.
26 Chater, N, Vitányi, P. ‘Ideal learning’ of natural language: positive results about learning from positive evidence. J Math Psychol 2007, 51:135–163.
27 Harris, Z. Methods in Structural Linguistics. Chicago: University of Chicago Press
28 Chomsky, N. The Logical Structure of Linguistic Theory, Manuscript, Harvard/MIT. Published in 1975 by New York: Plenum; 1955/1975.
29 Redington, M, Chater, N, Finch, S. Distributional information: a powerful cue for acquiring syntactic categories. Cogn Sci 1998, 22:425–469.
30 Pereira, F. Formal grammar and information theory: together again? Phil Trans Royal Soc 2000, 358:1239–1253.
31 Goldsmith, J. Unsupervised learning of the morphology of a natural language. Comp Ling 2001, 153–198.
32 Collins, M. Head‐driven statistical models for natural language processing, Ph.D. dissertation, University of Pennsylvania, 1999.
33 Charniak, E. A maximum‐entropy‐inspired parser. Proc NAACL 2000, 1:132–139.
34 Chomsky, N. Reflections on Language. New York: Pantheon
35 Legate, JA, Yang, C. Empirical re‐assessment of stimulus poverty arguments. Ling Rev 2002, 19:151–162.
36 Lewis, J, Elman, J. Learnability and the statistical structure of language: poverty of stimulus arguments revisited, In Proceedings of the 26th annual Boston University conference on language development, Somerville, MA: Cascadilla, 2001, 359–370.
37 Reali, F, Christiansen, MH. Uncovering the richness of the stimulus: structure dependence and indirect statistical evidence. Cogn Sci 2005, 29:1007–1028.
38 Kam, X, Stoyneshka, I, Tornyova, L, Fodor, JD, Sakas, W. Bigrams and the richness of the stimulus. Cogn Sci 2008, 32:771–787.
39 Heckerman, D, Geiger, D, Chickering, D. Learning Bayesian networks: the combination of knowledge and statistical data. Mach Learn 1995, 20:197–243.
40 McClelland, J. The place of modeling in cognitive science. Topics Cogn Sci 2009, 1:11–38.
41 Yang, C. Universal grammar, statistics, or both. Trends Cogn Sci 2004, 451–456.
42 Saffran, J. The use of predictive dependencies in language learning. J Memory Lang 2001, 44:493–515.
43 Thompson, S, Newport, E. Statistical learning of syntax: The role of transitional probability. Lang Learn Dev 2007, 3:1–42.
44 Magerman, D, Marcus, M. Parsing a natural language using mutual information statistics, In Proceedings of the AAAI, 1990, 984–989.
45 de Marcken, C. On the unsupervised induction of phrase‐structure grammar, In Proceedings of the Third Workshop on Very Large Corpora
46 Bikel, D. Intricacies of Collins' parsing model. Comp Ling 2004, 30:479–511.
47 Jelinek, F. Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press
48 Yang, C. A statistical test for grammar, In Proceedings of the ACL. Portland, OR, 2011.
49 Gibson, E, Wexler, K. Triggers. Ling Inq 1994, 25:407–454.
50 Berwick, R. The Acquisition of Syntactic Knowledge. Cambridge, MA: MIT Press
51 Berwick, R, Niyogi, P. Learning from triggers. Ling Inq 1996, 27:605–622.
52 Dresher, E. Charting the learning path: cues to parameter setting. Ling Inq 1999, 30:27–67.
53 Fodor, JD. Unambiguous triggers. Ling Inq 1998, 29:1–36.
54 Bush, R, Mosteller, F. A mathematical model for simple learning. Psychol Rev 1951, 58:313–323.
55 Sutton, R, Barto, A. Reinforcement Learning. Cambridge, MA: MIT Press
56 Straus, K. Validations of a probabilistic model of language acquisition, Ph.D. dissertation. Department of Mathematics, Northeastern University, 2008.
57 Sakas, W, Fodor, JD. Disambiguating syntactic triggers. Lang Acquisit (in press).
58 Culicover, P. Syntactic Nuts. New York: Oxford University Press
59 MacWhinney, B. A multiple process solution to the logical problem of language acquisition. J Child Lang 2004, 31:883–914.
60 Fodor, JD, Sakas, W. The subset principle in syntax. J Ling 2005, 41:513–569.
61 Valian, V. Syntactic subjects in the early speech of American and Italian children. Cognition 1991, 40:21–82.
62 Wang, Q, Lillo‐Martin, D, Best, C, Levitt, A. Null subject vs. object: some evidence from the acquisition of Chinese and English. Lang Acquisit 1992, 2:221–254.
63 Suppes, P. Probabilistic grammars for natural languages. Synthese 1970, 22:95–116.
64 Newell, A, Simon, H, Shaw, JC. Chess‐playing programs and the problem of complexity. IBM J Res Dev 1958, 2:320–335.
65 Kasparov, G. The chess master and the computer. New York Rev Books 2010, 57:2.