EVALUATION OF THE EFFICIENCY OF UNIGRAM USE FOR TEXT IDENTIFICATION

Authors

 

      Khudoyberdiev Kh. A. – Candidate of Physical and Mathematical Sciences, Head of the Chair of Programming and Information Technologies, PITTU

      Qosimov A. A. – Senior Teacher of the Chair of Programming and Information Technologies, PITTU

 

Abstract

 

     The efficiency of N. V. Smirnov's homogeneity criterion and its modification for identifying the author of a text by means of letter unigram frequencies is investigated. It is substantiated that this criterion and its modification make it possible to identify the works of poets of classical Tajik-Persian literature, as well as of various authors of modern Tajik poetry and prose, by the frequencies of the symbols of the Tajik alphabet.
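
     As a rough illustration of the idea, and not the authors' actual procedure, the following Python sketch compares two texts by their letter unigram counts using the classical two-sample Smirnov statistic. The function names, the fixed alphabet ordering, and the use of cumulative frequencies over that ordering are all assumptions made for this example; the modification of the criterion mentioned above is not described here and is not reproduced.

```python
from collections import Counter
from math import sqrt


def unigram_counts(text, alphabet):
    """Count occurrences of each alphabet letter in text (case-insensitive).

    `alphabet` is a string of lowercase letters in a fixed order
    (an assumption of this sketch; e.g. the Tajik alphabet).
    """
    counts = Counter(ch for ch in text.lower() if ch in alphabet)
    return [counts[ch] for ch in alphabet]


def smirnov_statistic(counts_a, counts_b):
    """Return (D, scaled statistic) for two letter-count vectors.

    D is the largest absolute gap between the two cumulative relative
    frequency curves taken over the fixed alphabet order; the scaled
    value is sqrt(n*m / (n + m)) * D, where n and m are letter totals.
    """
    n, m = sum(counts_a), sum(counts_b)
    if n == 0 or m == 0:
        raise ValueError("both texts must contain letters from the alphabet")
    cum_a = cum_b = 0.0
    d_max = 0.0
    for a, b in zip(counts_a, counts_b):
        cum_a += a / n
        cum_b += b / m
        d_max = max(d_max, abs(cum_a - cum_b))
    return d_max, sqrt(n * m / (n + m)) * d_max
```

     Under these assumptions, a small scaled statistic suggests that the two texts have similar letter distributions, while a large one suggests different sources. The threshold used to decide between the two is left to the analyst and is not taken from the paper.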

 

Key words

 

  Tajik language, unigram, frequency, statistics, efficiency.

 

Publication date

2023-10-25