DETERMINING THE MINIMUM AMOUNT OF WORD SAMPLING FOR TEXT IDENTIFICATION

Authors

         Khudoyberdiev Kh.A. Candidate of Physical and Mathematical Sciences, Head Chair of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University

          Qosimov A.A. Senior teacher, Department of Programming and Information  Technologies, Polytechnic Institute of Tajik Technical University

 Annotation

  The minimum volume of word samples for authorship identification of a text written in tajik is determined. The results of experiments with a minimal amount of word sampling for identification author of a text are described.

Key words

 tajik language, words, trigram, frequency, statistics, efficiency.

Language

english

Type

technical

Year

2017

Page

22

 

Publication date

2023-09-25