Evaluating the phylogenetic signal of morphosyntax

Abstract

Computational linguistic phylogenetics has so far relied heavily on cognate data. In contrast, the potential of morphosyntactic characters as a valuable source for phylogenetic analysis has been largely overlooked. We argue that morphosyntactic characters may conflate historical signal with the results of homoplasies, horizontal transfer, and universal tendencies, and must be scrutinized in terms of their propensity for change and borrowing, analogously to the curation of lexical data that produced the Swadesh lists. In this paper we make a start by evaluating a set of morphosyntactic characters based on the World Atlas of Language Structures using three methods: we (1) calculated Pearson correlation coefficients for each character against different language groupings, reflecting either shared ancestry (genera) or contact (geographical proximity); (2) counted the minimum number of mutations needed to explain the distribution of a character's states on a cognate-based reference tree (parsimony score), testing whether the characters correctly reflect language change known from historical linguistics; and (3) ran a classic hill-climbing algorithm to determine which random subsets of characters produced a phylogeny closest to a reference tree. We conclude that these are useful tools, but expect that making the definitions of the characters more theoretically informed will produce a stronger historical signal.
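
To make the parsimony score in method (2) concrete, the following is a minimal sketch using Fitch's small-parsimony algorithm on a rooted binary tree. It is an illustration only, not the paper's implementation: the abstract does not say which parsimony variant or software was used, and the function name `fitch_score`, the nested-tuple tree encoding, the leaf labels, and the character states are all hypothetical toy values rather than data from the World Atlas of Language Structures.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character (Fitch parsimony).

    tree   -- rooted binary tree as nested 2-tuples of leaf names (assumed encoding)
    states -- dict mapping each leaf name to its observed character state
    """
    changes = 0

    def visit(node):
        nonlocal changes
        if isinstance(node, str):
            # Leaf: the only possible state is the observed one.
            return {states[node]}
        left = visit(node[0])
        right = visit(node[1])
        shared = left & right
        if shared:
            # Children agree on at least one state: no change needed here.
            return shared
        # Children disagree: one change is required on this part of the tree.
        changes += 1
        return left | right

    visit(tree)
    return changes


# Hypothetical reference tree over five languages A..E.
toy_tree = ((("A", "B"), "C"), ("D", "E"))

# A character whose states follow the tree: a single change suffices.
tree_like = {"A": "x", "B": "x", "C": "x", "D": "y", "E": "y"}

# A character whose states cut across the tree: more changes are needed.
scattered = {"A": "x", "B": "y", "C": "x", "D": "y", "E": "x"}

print(fitch_score(toy_tree, tree_like))
print(fitch_score(toy_tree, scattered))
```

Under this sketch, a character with a low score relative to its number of states fits the reference tree well, while a high score suggests homoplasy or borrowing.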
