{"id":2271,"date":"2023-09-25T15:47:41","date_gmt":"2023-09-25T10:47:41","guid":{"rendered":"http:\/\/vestnik.polytech.tj\/?p=2271"},"modified":"2023-09-25T15:50:34","modified_gmt":"2023-09-25T10:50:34","slug":"determining-the-minimum-amount-of-word-sampling-for-text-identification","status":"publish","type":"post","link":"https:\/\/vestnik.polytech.tj\/?p=2271&lang=en","title":{"rendered":"DETERMINING THE MINIMUM AMOUNT OF WORD SAMPLING  FOR TEXT IDENTIFICATION"},"content":{"rendered":"<p><!--vcv no format--><!-- vcwb\/dynamicElementComment:f7ffdd1b --><\/p>\n<div class=\"vce-row-container\" data-vce-boxed-width=\"true\">\n<div class=\"vce-row vce-row--col-gap-30 vce-row-equal-height vce-row-content--top\" id=\"el-f7ffdd1b\" data-vce-do-apply=\"all el-f7ffdd1b\">\n<div class=\"vce-row-content\" data-vce-element-content=\"true\"><!-- vcwb\/dynamicElementComment:88772d1d --><\/p>\n<div class=\"vce-col vce-col--md-78p vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-first vce-col--lg-first vce-col--xl-first\" id=\"el-88772d1d\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-88772d1d\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-88772d1d\"><!-- vcwb\/dynamicElementComment:a4d64558 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-a4d64558\" data-vce-do-apply=\"all el-a4d64558\">\n<p><strong><span style=\"font-size: 14pt;\">Authors<\/span><\/strong><\/p>\n<p><strong>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Khudoyberdiev Kh.A. <\/strong>\u2013 <em>Candidate of Physical and Mathematical Sciences<\/em><em>, <\/em><em>Head Chair of Programming and Information Technologies,<\/em><em> Polytechnic <\/em><em>Institute of Tajik Technical University<\/em><\/p>\n<p><strong>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Qosimov A.A. <\/strong>\u2013 <em>Senior teacher, Department of Programming and Information&nbsp; <\/em><em>Technologies<strong>,<\/strong><\/em><em> Polytechnic Institute of Tajik Technical University<\/em><\/p>\n<p><strong><span style=\"font-size: 14pt;\">&nbsp;Annotation<\/span><\/strong><\/p>\n<p>&nbsp; <em>The minimum volume of word samples for authorship identification of a text written in tajik is determined. The results of experiments with a minimal amount of word sampling for identification author of a text are described.<\/em><\/p>\n<p><span style=\"font-size: 14pt;\"><strong>Key words<\/strong><\/span><\/p>\n<p><em>&nbsp;tajik language, words, trigram, frequency, statistics, efficiency.<\/em><\/p>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:a4d64558 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:88772d1d --><!-- vcwb\/dynamicElementComment:2a64a1af --><\/p>\n<div class=\"vce-col vce-col--md-22p vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-last vce-col--lg-last vce-col--xl-last\" id=\"el-2a64a1af\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-2a64a1af\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-2a64a1af\"><!-- vcwb\/dynamicElementComment:fdb494f1 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-fdb494f1\" data-vce-do-apply=\"all el-fdb494f1\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\"><strong><span style=\"font-size: 14pt;\">Language<\/span><\/strong><\/p>\n<p style=\"line-height: 1;\"><span style=\"font-size: 12pt; letter-spacing: 1px; font-weight: 400; font-style: normal;\">english<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:fdb494f1 --><!-- vcwb\/dynamicElementComment:30ca3f28 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-30ca3f28\" data-vce-do-apply=\"all el-30ca3f28\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\"><strong><span style=\"font-size: 14pt;\">Type<\/span><\/strong><\/p>\n<p style=\"line-height: 1;\"><span style=\"font-size: 12pt; letter-spacing: 1px; font-weight: 400; font-style: normal;\">technical<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:30ca3f28 --><!-- vcwb\/dynamicElementComment:bf795935 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-bf795935\" data-vce-do-apply=\"all el-bf795935\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\"><strong><span style=\"font-size: 14pt;\">Year<\/span><\/strong><\/p>\n<p style=\"line-height: 1;\"><span style=\"font-size: 12pt; letter-spacing: 1px; font-weight: 400; font-style: normal;\">2017<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:bf795935 --><!-- vcwb\/dynamicElementComment:aa3d0f20 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-aa3d0f20\" data-vce-do-apply=\"all el-aa3d0f20\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\"><strong><span style=\"font-size: 14pt;\">Page<\/span><\/strong><\/p>\n<p style=\"line-height: 1;\"><span style=\"font-size: 12pt; letter-spacing: 1px; font-weight: 400; font-style: normal;\">22<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:aa3d0f20 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:2a64a1af --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:f7ffdd1b --><!-- vcwb\/dynamicElementComment:088773ed --><\/p>\n<div class=\"vce-row-container\" data-vce-boxed-width=\"true\">\n<div class=\"vce-row vce-row--col-gap-30 vce-row-equal-height vce-row-content--top\" id=\"el-088773ed\" data-vce-do-apply=\"all el-088773ed\">\n<div class=\"vce-row-content\" data-vce-element-content=\"true\"><!-- vcwb\/dynamicElementComment:fcace6f3 --><\/p>\n<div class=\"vce-col vce-col--md-auto vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-last vce-col--lg-last vce-col--xl-last vce-col--md-first vce-col--lg-first vce-col--xl-first\" id=\"el-fcace6f3\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-fcace6f3\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-fcace6f3\"><!-- vcwb\/dynamicElementComment:f726293f --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-f726293f\" data-vce-do-apply=\"all el-f726293f\">\n<h2><strong><span style=\"font-size: 14pt;\">Publication date<\/span><\/strong><\/h2>\n<p>2023-09-25<\/p>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:f726293f --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:fcace6f3 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:088773ed --><!--vcv no format--><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Authors &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Khudoyberdiev Kh.A. \u2013 Candidate of Physical and Mathematical Sciences, Head Chair of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Qosimov A.A. \u2013 Senior teacher, Department of Programming and Information&nbsp; Technologies, Polytechnic Institute of Tajik Technical University &nbsp;Annotation &nbsp; The minimum volume of word samples for authorship identification of a text written in tajik is determined. The results of experiments with a minimal amount of word sampling for identification author of a text are described. Key words &nbsp;tajik&hellip;<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[135],"tags":[257],"class_list":["post-2271","post","type-post","status-publish","format-standard","hentry","category-bulletin_of_pittu_2017","tag-bulletin-of-pittu-2017-3"],"acf":[],"featured_image_src":null,"author_info":{"display_name":"Ilhomjon Qodirov","author_link":"https:\/\/vestnik.polytech.tj\/?author=3"},"_links":{"self":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2271","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2271"}],"version-history":[{"count":3,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2271\/revisions"}],"predecessor-version":[{"id":2274,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2271\/revisions\/2274"}],"wp:attachment":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2271"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2271"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2271"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}