TAJIK LANGUAGE AUTOMATIC SPELL CHECKING SYSTEM – TAJSPELL

Authors

       Soliev O.M.candidate of physical and mathematical sciences, senior lecturer, Department  of Programming and Information Systems, Polytechnic Institute of the Tajik Technical  University, Khujand, Republic of Tajikistan, osoliev@gmail.com

      Nazarov A.A.senior lecturer, Department of Programming and Information Systems,  Polytechnic Institute of the Tajik Technical University, Khujand, Republic of Tajikistan, n.abdusamad@hotmail.com

      Ashurova Sh.N.senior lecturer, Department of Programming and Information Systems, Polytechnic Institute of the Tajik Technical University, Khujand, Republic of Tajikistan, sh.nurulloevna@gmail.com

Annotation

          The Tajik language belongs to one of the Indo-European family of languages. This language is spoken by the peoples of Tajikistan, Iran, Afghanistan and Uzbekistan. However, only in the Republic of Tajikistan are the letters written on the basis of the Cyrillic writing system. The Unicode standard of the Tajik alphabet letters is defined by the Government of the Republic of Tajikistan Decree No. 330, August 2, 2004. For countries where the number of users of computer systems in the local language is small, localization and spell-checking programs have not been developed by Microsoft. The article describes the methods and modules of spell checking of text documents, on the basis of which has been developed an automatic spell-checking system of the Tajik language named TajSpell which uses in MS Office software package. It is demonstrating a logical structure which shows the modules of the MS Office package and the self-contained automatic TajSpell system. Also, it discusses the schema of morphs and postfixes database, which forms new formats of words and provide users the ability of real time check spelling.

Key words

   automatic system, methods, spell checking, dictionary, morphemes, text documents, TajSpell.

Language

english

Type

technical

Year

2021

Page

21

References

      1. Usmanov Z.D., Soliev O.M., Khudoiberdiev Kh.A., Dovudov G.M. Automatic system TajSpell-2.0. to check the spelling of the Tajik language in the office package of MS Office 2010-2019 applications // Certificate of state registration of an information resource, Republic of Tajikistan 07/30/2020, № 4202000456.
      2. Usmanov Z.D., Soliev O.M., Khudoiberdiev Kh.A., Dovudov G.M. Tajik language package for spellchecking in Microsoft Office // Certificate of registration of intellectual product No. 4201200235, dated 04.10.2012. National Patent Information Center of the Ministry of Economic Development and Trade of the Republic of Tatarstan.
      3. Usmanov Z.D., Dovudov G.M. Formation of the base of morphs of the Tajik language: monograph / – Dushanbe: Donish, 2014. – 109 p.
      4. Usmanov Z.D., Dovudov, G.M. Morphological analysis of word forms of the Tajik language: monograph. – Dushanbe: Donish, 2015. – 132 p.
      5. Khudoiberdiev Kh.A., Soliev O.M. Linguistic thesaurus of the Tajik language. New information technologies in automated systems. MIEM HSE. Moscow, 2017, 268p. (103-106).
      6. Khudoiberdiev Kh.A. On automatic conversion of Tajik text to standard graphics. Reports of the Academy of Sciences of the Republic of Tajikistan, Volume 57, 2014. № 3. 210-214s.

Publication date

2023-10-02