EVALUATION OF UNIGRAMM USE EFFICIENCY FOR A TEXT IDENTIFICATION

Authors

      Khudoyberdiev Kh. A. – Candidate of Physical and Mathematical Sciences, Head of the Chair of Programming and Information Technologies, PITTU.

        Qosimov A. A. senior teacher of the Chair of Programming and Information Technologies, PITTU.

 Annotation

 Efficiency of N.V. Smirnov’s uniformity criterion and his modifier for identification of the author of a text by means of letter unigram frequencies are investigated. It is substantiated that this criterion and its modifier allow to identify the works of poets of classical Tajik-Persian literature, as well as various authors of modern Tajik poetry and prose, by the frequency of the symbols of the Tajik alphabet.

Key words

tajik language, unigram, frequency, statistics, efficiency.

Language

english

Type

technical

Year

2017

Page

13

Publication date

2023-09-25