Authors
Madibragimov N.S. – Assistant, Department of Mathematics, Physics and Medical Informatics, Ryazan State Medical University, Ryazan, Russia, navruzmadibragimov@gmail.com
Prutzkow A.V. – Doctor of Technical Sciences, Professor, Department of Computational and Applied Mathematics, Ryazan State Radiotechnical University, Ryazan, Russia, mail@prutzkow.com
Annotation
The article presents the results of the classification of words in the Tajik language for subsequent generation and definition of word forms. Words are classified to formalize the formation of word forms in the Tajik language in terms of a universal model of formation. The universal model of form formation belongs to verbal paradigmatic models. Words are classified according to the types of formation. Each type has certain chains of transformations of the stem into a word form. Classification consists in obtaining the word forms of the analyzed word and assigning it to one of the types that have the same method of obtaining word forms, i.e. chaining conversions, or creating a new type. Articles are divided into types of Tajik words, the following parts of speech: noun – 5 types and 12 subtypes, verb – 9 types and 2 subtypes, adjective – 5 types and 2 subtypes and pronoun – 5 types. At present, the words of the remaining parts of speech of the Tajik language are classified. The classification is carried out on the basis of scientific results obtained by Z.D. Usmanov and G.M. Dovudov. The significant contribution of Z.D. Usmanov in the formation of the fundamental foundations of automatic text processing in the Tajik language, the introduction into scientific circulation of such new concepts as αβ-coding and γ-classifier, which increase the results of solving text processing problems. He brought up talented students who develop automatic word processing.
Key words
automatic text processing, machine morphological analysis and synthesis, model of formation, classification of words, generation of formation, word forms
Language english |
Type technical |
Year 2022 |
Page 13 |
References
- Madibragimov N. Sh., Prutzkow A. V. Study of the types of word formation in the Tajik language // Applied information systems: modeling problems, applications in developing countries: materials of the 3rd republics. scientific-practical. conf. – Khujand: Khujand. polytechnic in-t Tajik. tech. un-ta, 2022. – S. 41-45.
- Hockett, C.F. Two Models of Grammatical Description. In Word, 1954, 10(210–31):386-399.
- Prutzkow A.V. Algebraic representation of a natural language shaping model // Cloud of Science. 2014. T. 1. No. 1. S. 88-97.
- Prutskov, A.V. Algorithmic Provision of a Universal Method for Word-Form Generation and Recognition. In Automatic Documentation and Mathematical Linguistics, 2011, 45(5):232-238.
- Prutzkow A.V. Mathematical-algorithmic formalization of models of morphological analysis and synthesis of word forms of natural languages // Cloud of Science. 2018. V. 5. No. 4. S. 729-748.
- Madibragimov N.Sh., Prutzkow A.V. Classification of nouns of the Tajik language for automatic text processing // Caspian journal: management and high technologies. 2020. No. 4 (52). pp. 39-52.
- Madibragimov N.Sh., Prutzkow A.V. Types of adjectives and pronouns in the Tajik language and their use for the generation and definition of word forms // International Journal of Open Information Technologies. 2021. V. 9. No. 11. S. 85-89.
- Madibragimov N.Sh. Features of machine morphological analysis and synthesis of Tajik verbs // International Journal of Open Information Technologies. 2022.
- Arzumanov S.D., Sanginov A. Tajik language. Dushanbe: Maorif, 1988. 416 p.
- Dovudov G.M., Usmanov Z.D. Morphological analysis of word forms of the Tajik language: monograph. Dushanbe: Donish, 2015. 132 p.
- Dovudov G.M. Computer morphological analysis of Tajik word forms. [Text]: diss…..cand. tech. Sciences: 05.13.11: defended 06.04.18 / Dovudov Gulshan Mirbakhoevich. – Dushanbe, 2018. – 161 p.
- Usmanov Z.D. On the ordered alphabetic coding of words in natural languages // Reports of the Academy of Sciences of the Republic of Tajikistan. 2012. V. 55. No. 7. S. 545-548.
- Usmanov Z.D. Evaluation of the effectiveness of the use of the γ-classifier // Reports of the Academy of Sciences of the Republic of Tajikistan. 2020. V. 63. No. 3-4. pp. 172-179.
- Kosimov A.A. Determination of the specialty cipher using symbolic unigrams // Information exchange in interdisciplinary research: Sat. tr. Vseros. scientific-practical. conf. with international participation. Ryazan: Ryazan. state radio engineering un-t, 2022.
- Kosimov A.A. Formation of computational linguistics in Tajikistan. Dushanbe: Irfon, 2021. 102 p.
Publication date
09/22/2023