RAS History & PhilologyНовая и Новейшая история Novaia i noveishaia istoriia

  • ISSN (Print) 01303864
  • ISSN (Online) 3034-6002

A QUANTITATIVE STUDY OF MONGOLIAN

PII
S0373-658X0000392-4-1
DOI
10.7868/SX0000392-4-1
Publication type
Article
Status
Published
Authors
Volume/ Edition
Volume / Issue 5
Pages
46-57
Abstract
The paper describes a General Corpus of the Modern Mongolian language (GCML), which contains 966 texts, 1 155 583 words. We also report a morphological analyzer for the Modern Mongolian language (MML), a grammatical dictionary for 63 071 lexemes, a general table of morphological homonymy. The processor analyzes effectively 95% of textual word forms which correspond to 76% word forms from the inputs of the concordance to the GCML. MML can be described in its quantitative aspect, according to a structural-probabilistic model (SPM) of MML. SPM contains frequency dictionaries (FDs) of MML of different types: FDs of word forms, lexemes, grammatemes, root morphemes and allomorphemes, affixal morphemes and allomorphemes, flexionemes, grammemes. SPM allows to describe behavior of various language units in the written text from the quantitative point of view: their frequency, distribution in texts, compatibility with other units etc. It is possible to transform the usual structural model into an SPM, which is based on statistical analysis of texts (in this model units of language are considered as possessing «the weight», the language oppositions and relations are being measured)...
Keywords
quantitative linguistics, corpus linguistics, Mongolian languages, halha Mongolian language, frequency dictionaries
Date of publication
20.09.2025
Year of publication
2025
Number of purchasers
1
Views
551

References

QR
Translate

Indexing

Scopus

Scopus

Scopus

Crossref

Scopus

Higher Attestation Commission

At the Ministry of Education and Science of the Russian Federation

Scopus

Scientific Electronic Library