### Лингвистическая таксономия: компактность языковых подгрупп, групп и семей

#### Anotacija

**LINGUISTIC TAXONOMY: DENSITY OF LANGUAGE SUBGROUPS, GROUPS AND FAMILIES**

*Summary*

Language subgroups, groups, families and unities were investigated from the point of view of their dispersion in the way it was first proposed in (Тамбовцев 1986). We consider how compact this or that language taxon (i.e. subgroup, group, family or unity) on the basis of distribution of certain consonantal groups in the speech sound chain. Therefore, one can speak about a compact or disperse language family. If language taxon is compact, then its internal connections are shorter than its outer connections. Actually, the same notion of compact object is accepted in pattern recognition. The more compact the family, the more correctly its languages are chosen. If we put in the family a language, which does not belong to the family, then the dispersion of the family rises, thus it becomes less compact. If the language has a similar sound chain, then the dispersion of the group remains the same or becomes less. It means that the family became more compact. In this case we speak about the typological properties of the families. We measure the dispersion of a family by the sum of dispersions of 8 phonostatistical features: frequency of occurrence of labial, front, palatal, velar, sonorant, occlusive, fricative and voiced consonants. It is important that the features do not intersect.

The values of the coefficient of variance (V) and T coefficient show the degree of dispersion. The principle is the greater the dispersion, the less compact the family.

We have chosen the coefficient of variance and T coefficient since they both keep to the law of commensurability. Comparing different languages of different language families and different morphological structures was possible since all of them have the same 8 phonetic features mentioned above. We have considered Indo-European, Turkic, Mongolian, Tungus-Manchurian, Samoyedic, Finno-Ugric, Paleo-Asiatic, Austronesian, Australian and American Indian language families. The most compact is Mongolian (V = 10.78%; T = 0.08), the least compact is Austronesian (V = 46.21%; T = 0.90). Tungus-Manchurian (V = 17.41%; T = 0.20) is more compact than Samoyedic (18.29%; T = 0.16), Turkic (18.77%; T - 0.21), Indo-European (V = 28.00%; T = 0.61) or Paleo-Asiatic (V = 33.89%; T = 0.46). Language groups are more compact than language families. Iranian group (V = 13.21%; T = 0.09) of Indo-European language family is the most compact, the least compact is Romanian (V = 26.25%; T= 0.33). Slavonic group (V = 15.21%; T = 0.17) is more compact than Indo-Arien (V = 20.40%; T = 0.23) or Germanic (V = 24.51%; T = 0.29). Volga group of the Finno-Ugric language family (V = 17.90%; T = 0.13) is more compact than Ugric (V = 27.66%; T = 0.47) or Finnic (V = 29.24%; T = 0.35) group. Altaic super-family is rather compact (V - 25.97%; T - 0.45). It is more compact than Indo-European (V = 28.00%; T = 0.61) or Paleo-Asiatic family (V = 33.89%; T = 0.46). This fact supports those linguists who consider Altaic as a family, not a super family. Uralic super family is less compact than that (V = 28.31%; T = 0.47), though it is more compact than Balkan (V = 29.74%; T = 0.36) language unity (Sprachbund). Ural-Altaic language unity (V = 30.98%; T = 0.88) is more (much more) compact than American Indian (43.37% 1.07) language unity. Measuring the typological density of language taxons (subgroups, groups, families, unities) may help to understand how natural (or correct) these taxons are.Svetainės turinį galima naudoti nekomerciniais tikslais, vadovaujantis CC-BY-NC-4.0 tarptautinės licencijos nuostatomis.