Authors
Soliev O.M. – Candidate of Physical and Mathematical Sciences, Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, osoliev@gmail.com.
Kosimov O.A. – assistant, Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, oqosimov9293@gmail.com.
Annotation
The article established that the distribution of the frequency of trigrams in the economic and political works of the Russian language is the identifier of authorship. The Z.D. Usmanov classifier’s capabilities to recognize the author of a text by the frequency of alphabetic trigrams are investigated. A digital portrait and a metric space of works are constructed. Assuming the uniqueness of the author’s creativity, the threshold values of the metrics are established, on the basis of which the classes of “homogeneous” works are determined. The classifier of discrete random variables, which confirmed high efficiency in identifying authorship of text fragments in works of classical and modern poetry, as well as in modern Tajik prose, is tested for adaptability to recognition of authorship in economic and political works. It is concluded that symbol trigrams are acceptable quantitative characteristics for identifying text authors. Including spaces in trigrams improves classification accuracy. The γ-classifier of discrete random variables, which confirmed high efficiency in identifying the authorship of text fragments in works of classical and modern poetry, as well as in modern prose of the Tajik language, is tested for adaptability to the recognition of authorship in economical and political text.
Key words
Russian language, economical and political text, trigram, classifier, frequency, statistics, efficiency.
Language english |
Type technical |
Year 2019 |
Page 28 |
References
-
-
- Burkhanova N.M. The budget system of the Russian Federation. – M.: Eksmo, 2007, 32 p.
- Burkhanova N.M. Economical geography. Cheat sheets. – M.: Eksmo, 2008, 32 p.
- Katasonov V.Yu. America v. Russia. – M.: Book World, 2015, 449 p.
- Katasonov V.Yu. Anti-crisis. Survive and conquer. – M.: Algorithm, 2015, 149 p.
- Katasonov V.Yu. The battle for the ruble. – M.: Book World, 2015, 288 p.
- Klimova M.A. Wage. – M.: Tax Herald, 2008, 320 p.
- Klimova M.A. Income tax. – M.: Tax Herald, 2008, 98 p.
- Nikanorov P.S. Cooperative activity. – M.: Tax Bulletin, 2008, 320 p.
- Nikanorov P.S. Mediation activities. – M.: Tax Bulletin, 2008, 320 p.
- Panchenko T.M. Loans and loans. – M.: Tax Bulletin, 2008, 158 p.
- Panchenko T.M. Vacation and social benefits. – M.: Tax Bulletin, 2008, 340 p.
- Starikov N.V. Geopolitics. How it’s done. – St. Petersburg: Peter, 2014, 368 p.
- Starikov N.V. The nationalization of the ruble. – St. Petersburg: Peter, 2011, 169 p.
- Usmanov Z.D. N-grams in the recognition of homogeneous texts. – Materials of 20 scientific-practical seminar “New information technologies in automated systems”. – M.: 2017, P. 52 – 54.
- Usmanov Z.D. Algorithm for tuning the clustering of discrete random variables. – Reports of the Academy of Sciences of the Republic of Tajikistan, 2017, vol. 60, № 9, P. 392 – 397.
- Usmanov Z.D. Classifier of discrete random variables. – Reports of the Academy of Sciences of the Republic of Tajikistan. 2017, vol. 60, № 7 – 8, P. 291 – 300.
- Shevchuk D.A. The history of economics. – M.: Author, 2009, 305 p.
- Shevchuk D.A. World economy. Lecture notes. – Rostov-on-Don: Phoenix, 2007, 417 p.
-
Publication date
09/22/2023