Syntactic Learning over Tree Tiers
Abstract
1. Introduction
2. Preliminaries
2.1. Trees, Strings, and Substructures
- 1.
- (unique root);
- 2.
- (mother-of closure);
- 3.
- (left-sibling closure).
- (immediate dominance);
- (proper dominance);
- (immediate left sibling);
- (precedence).
- ;
- For every pair of elements such that , there is a dominance 2-path in t where f is a dominance 2-factor and V is the set of symbols which prevent u and v from forming this 2-factor: .
- For every pair of elements such that , there is a sibling 2-path in t where f is a sibling 2-factor and V is the set of symbols which prevent u and v from forming this 2-factor: .
2.2. Languages, Grammars, and Learning
- For any 2-factor , there is exactly one tier such that (constraint uniqueness).
- For each pair it must be the case that i ) each symbol which is not part of is attested in an intervener set for in , and ii each smallest intervener set (for in ) in which σ appears contains only other elements in T, which do not themselves appear in any smaller intervener set (for in ) (tier-element independence).
3. CUTIA
| Algorithm 1 CUTIA |
| Data: Positive sample I Result: Grammar G, a set of 〈2-factor, forbidding tier〉 pairs. G := {} B := foreach : for cardinality c in : foreach : if: then: return G |
3.1. Algorithm
- (1)
- Cé a/*go dúradh léithi a/*go cheannódh é?Who C-wh/*C was-said with-her C-wh/*C would-buy it‘Who was she told would buy it?’
3.2. Identification in Polynomial Time and Data
- 1.
- ;
- 2.
- ;
- 3.
- .
3.3. Limitations
- 1.
- *Who did you say that likes Sally?
- 2.
- Who did you say that John likes?
4. MTSL-BUFIA
4.1. Canonical Form
- 1.
- (incomparable tier constraints);
- 2.
- (exhaustivity).
4.2. BUFIA
4.3. Algorithm
| Algorithm 2 MTSL-BUFIA |
| Data: Positive sample I Result: Grammar G, a set of 〈2-factor, forbidding tier〉 pairs. G := {} B := for each : Q := [{ }] while Q ≠ []: Q. if : continue if : Q. else: G = G return G |
4.4. Identification in Polynomial Data
- 1.
- ;
- 2.
- ;
- 3.
- ;
- 4.
- .
5. Discussion
6. Conclusions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| TSL2 | Tier-based Strictly 2-Local |
| MTSL2 | Multi-Tier-based Strictly 2-Local |
| CUTI-MTSL | Constraint Unique Multi Tier-based Strictly Local |
| BUFIA | Bottom-Up Factor Inference Algorithm |
| MTSL-BUFIA | Multi-Tier-based Strictly 2-Local Bottom-Up Factor Inference Algorithm |
| CUTIA | Constraint Unique Tier Inference Algorithm |
References
- Graf, T. Subregular linguistics: Bridging theoretical linguistics and formal grammar. Theor. Linguist. 2022, 48, 145–184. [Google Scholar] [CrossRef]
- Heinz, J. The computational nature of phonological generalizations. In Phonological Typology, Phonetics and Phonology; Walter de Gruyter GmbH & Co KG: Berlin, Germany, 2018; pp. 126–195. [Google Scholar]
- Lambert, D. Relativized Adjacency. J. Log. Lang. Inf. 2023, 32, 707–731. [Google Scholar] [CrossRef]
- McMullin, K.; Hansson, G.Ó. Long-distance phonotactics as tier-based strictly 2-local languages. In Proceedings of the Annual Meeting on Phonology, Cambridge, MA, USA, 19–21 September 2014. [Google Scholar]
- Hanson, K. Tier-based strict locality and the typology of agreement. J. Lang. Model. 2025, 13, 43–97. [Google Scholar] [CrossRef]
- Hanson, K. A TSL Analysis of Japanese Case; Association for Computational Linguistics: Stroudsburg, PA, USA, 2023; Volume 6, pp. 15–24. [Google Scholar]
- Graf, T. Typological implications of tier-based strictly local movement. In Proceedings of the Society for Computation in Linguistics, Online, 7–9 February 2022; pp. 184–193. [Google Scholar]
- Jardine, A.; McMullin, K. Efficient learning of tier-based strictly k-local languages. In Proceedings of the International Conference on Language and Automata Theory and Applications; Springer: Berlin/Heidelberg, Germany, 2017; pp. 64–76. [Google Scholar]
- Lambert, D. Grammar interpretations and learning TSL online. In Proceedings of the International Conference on Grammatical Inference; PMLR: London, UK, 2021; pp. 81–91. [Google Scholar]
- de la Higuera, C. Grammatical Inference: Learning Automata and Grammars; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
- Heinz, J.; de la Higuera, C.; van Zaanen, M. Grammatical Inference for Computational Linguistics; Synthesis Lectures on Human Language Technologies; Morgan and Claypool: San Rafael, CA, USA, 2015. [Google Scholar]
- Wieczorek, W. Grammatical Inference: Algorithms, Routines and Applications; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Jurafsky, D.; Martin, J. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition with Language Models, 3rd ed.; Online manuscript released January 6, 2026; Available online: https://web.stanford.edu/~jurafsky/slp3 (accessed on 13 April 2026).
- Strobl, L.; Merrill, W.; Weiss, G.; Chiang, D.; Angluin, D. What Formal Languages Can Transformers Express? A Survey. Trans. Assoc. Comput. Linguist. 2024, 12, 543–561. [Google Scholar] [CrossRef]
- Li, T.; Precup, D.; Rabusseau, G. Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning. Mach. Learn. 2024, 113, 2619–2653. [Google Scholar] [CrossRef]
- Van Der Poel, S.; Lambert, D.; Kostyszyn, K.; Gao, T.; Verma, R.; Andersen, D.; Chau, J.; Peterson, E.; Clair, C.S.; Fodor, P.; et al. Mlregtest: A benchmark for the machine learning of regular languages. J. Mach. Learn. Res. 2024, 25, 1–45. [Google Scholar]
- Bod, R. From exemplar to grammar: Integrating analogy and probability in language learning. Cogn. Sci. 2008, 33, 752–793. [Google Scholar] [CrossRef] [PubMed]
- Clark, A.; Eyraud, R. Polynomial Identification in the Limit of Substitutable Context-free Languages. J. Mach. Learn. Res. 2007, 8, 1725–1745. [Google Scholar]
- Clark, A. Learning trees from strings: A strong learning algorithm for some context-free grammars. J. Mach. Learn. Res. 2013, 14, 3537–3559. [Google Scholar]
- Clark, A. Distributional learning of some context-free languages with a minimally adequate teacher. In Proceedings of the International Colloquium on Grammatical Inference; Springer: Berlin/Heidelberg, Germany, 2010; pp. 24–37. [Google Scholar]
- Yoshinaka, R. Towards dual approaches for learning context-free grammars based on syntactic concept lattices. In Proceedings of the International Conference on Developments in Language Theory; Springer: Berlin/Heidelberg, Germany, 2011; pp. 429–440. [Google Scholar]
- Stabler, E. Derivational minimalism. In Proceedings of the International Conference on Logical Aspects of Computational Linguistics; Springer: Berlin/Heidelberg, Germany, 1996; pp. 68–95. [Google Scholar]
- McMullin, K.; Aksënova, A.; De Santo, A. Learning phonotactic restrictions on multiple tiers. Soc. Comput. Linguist. 2019, 2, 377–378. [Google Scholar]
- Gorn, S. Explicit definitions and linguistic dominoes. In Systems and Computer Science; University of Toronto Press: Toronto, ON, Canada, 1967. [Google Scholar] [CrossRef]
- Jardine, A.; Heinz, J. Learning Tier-based Strictly 2-Local Languages. Trans. Assoc. Comput. Linguist. 2016, 4, 87–98. [Google Scholar] [CrossRef]
- Gold, E.M. Language identification in the limit. Inf. Control 1967, 10, 447–474. [Google Scholar] [CrossRef]
- De La Higuera, C. Characteristic sets for polynomial grammatical inference. Mach. Learn. 1997, 27, 125–138. [Google Scholar] [CrossRef]
- Heinz, J.; Rawal, C.; Tanner, H.G. Tier-based strictly local constraints for phonology. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, USA, 19–24 June 2011; pp. 58–64. [Google Scholar]
- Graf, T.; Kostyszyn, K. Multiple wh-movement is not special: The subregular complexity of persistent features in Minimalist grammars. Soc. Comput. Linguist. 2021, 4, 275–285. [Google Scholar]
- Graf, T. Diving deeper into subregular syntax. Theor. Linguist. 2022, 48, 245–278. [Google Scholar] [CrossRef]
- McCloskey, J. The morphosyntax of WH-extraction in Irish. J. Linguist. 2001, 37, 67–100. [Google Scholar] [CrossRef]
- Chomsky, N. The Minimalist Program; MIT Press: Cambridge, MA, USA, 1995. [Google Scholar]
- Chandlee, J.; Eyraud, R.; Heinz, J.; Jardine, A.; Rawski, J. Learning with partially ordered representations. arXiv 2019, arXiv:1906.07886. [Google Scholar] [CrossRef]
- Rawski, J. Structure and Learning in Natural Language. Ph.D. Thesis, Stony Brook University, Stony Brook, NY, USA, 2021. [Google Scholar]
- Swanson, L.; Heinz, J.; Rawski, J. Phonotactic Learning with Structure, Not Statistics; Linguistic Inquiry Remarks & Replies; Cambridge, MA, USA, 2025; To Appear. [Google Scholar]







| () | () | ||||||
| () | () | () | () | ||||
| () | () | () | |||||
| 2-Factor | Interveners | Tier | ||||
|---|---|---|---|---|---|---|
| 2-Factor | Interveners | Tier | |||||
|---|---|---|---|---|---|---|---|
| 2-Factor | Interveners | Tier(s) |
|---|---|---|
| , , , | , | |
| , , | ||
| , , , | , , | |
| , , | , | |
| , , , , , , | ||
| , , | ||
| , , , , , , | ||
| , , | ||
| , , , , , , | ||
| , , | ||
| , , , | , | |
| , , , | , | |
| , , , , , , | ||
| , , | ||
| , , , | , | |
| , , , | , | |
| , , | , | |
| , , , | , , | |
| , , | ||
| , , , | , |
| 2-Factor | Interveners | Tier(s) | |||
|---|---|---|---|---|---|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Swanson, L. Syntactic Learning over Tree Tiers. Logics 2026, 4, 5. https://doi.org/10.3390/logics4020005
Swanson L. Syntactic Learning over Tree Tiers. Logics. 2026; 4(2):5. https://doi.org/10.3390/logics4020005
Chicago/Turabian StyleSwanson, Logan. 2026. "Syntactic Learning over Tree Tiers" Logics 4, no. 2: 5. https://doi.org/10.3390/logics4020005
APA StyleSwanson, L. (2026). Syntactic Learning over Tree Tiers. Logics, 4(2), 5. https://doi.org/10.3390/logics4020005

