{"id":994,"date":"2023-09-04T14:18:36","date_gmt":"2023-09-04T09:18:36","guid":{"rendered":"http:\/\/vestnik.polytech.tj\/?p=994"},"modified":"2023-09-12T15:49:53","modified_gmt":"2023-09-12T10:49:53","slug":"machine-learning-algorithms-in-text-classification","status":"publish","type":"post","link":"https:\/\/vestnik.polytech.tj\/?p=994&lang=en","title":{"rendered":"MACHINE LEARNING ALGORITHMS IN TEXT CLASSIFICATION"},"content":{"rendered":"\n<p>Authors: Nizamitdinov A.I., Inomov B.B.<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-9d6595d7 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:70%\">\n<h4 class=\"wp-block-heading\" id=\"0-%D0%BC%D1%83%D0%B0%D0%BB%D0%BB%D0%B8%D1%84%D0%BE%D0%BD-\"><strong>Authors<\/strong><\/h4>\n\n\n\n<p>               <strong><em>Nizamitdinov A.I<\/em><\/strong>. \u2013 Doctor of Philosophy (PhD), Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, <a href=\"mailto:ahlidin@gmail.com\">ahlidin@gmail.com<\/a>.<\/p>\n\n\n\n<p>                 <strong><em>Inomov B.B.<\/em><\/strong> \u2013 PhD student, specialty 6D070300 \u2013 Information systems, Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, <a href=\"mailto:behruzinomov@gmail.com\">behruzinomov@gmail.com<\/a>.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"1-%D1%87%D0%B0%D0%BA%D0%B8%D0%B4%D0%B0-\">Annotation<\/h4>\n\n\n\n<p>           <em>This article gives an overview of the machine learning algorithms available for classification problems, in particular for classifying texts in different language contexts. Text classification is one of the main tasks of computational linguistics. 
The field covers several core tasks, such as determining the topic of a text, identifying its author, and detecting the emotional coloring of statements. Analyzing content that contains illegal information is of great importance for ensuring information security and public safety on social networks, information sites, and telecommunication networks in general. Machine learning algorithms are now widely used to solve text classification problems, since software systems based on them achieve fairly high performance scores compared with other classification approaches. Applying and comparing classification algorithms is rather difficult, because different input data can give different results. The algorithms must therefore be trained and tested on the same data sets.<\/em><\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Key words<\/h4>\n\n\n\n<p>       algorithm, machine learning, text classification, data analysis<\/p>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:33.33%\">\n<div class='rightBoxFullTheme'>\n<p>Language<\/p>\n<p>English<\/p>\n<\/div>\n\n\n\n<div class='rightBoxFullTheme'>\n<p>Type of article<\/p>\n<p>scientific<\/p>\n<\/div>\n\n\n\n<div class='rightBoxFullTheme'>\n<p>Year<\/p>\n<p>2020<\/p>\n<\/div>\n\n\n\n<div class='rightBoxFullTheme'>\n<p>Page<\/p>\n<p>34-46<\/p>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<h4 class=\"wp-block-heading\">References<\/h4>\n\n\n\n<ol class=\"wp-block-list\">\n<li style=\"font-size:14px\">Aggarwal C. and Zhai C. (2012) A survey of text classification algorithms. Springer, P. 163\u2014222.<\/li>\n\n\n\n<li style=\"font-size:14px\">Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani. (2013). An introduction to statistical learning: with applications in R. New York: Springer.<\/li>\n\n\n\n<li style=\"font-size:14px\">Jimenez, S. 
(2014) Text Classification and Clustering with WEKA.<\/li>\n\n\n\n<li style=\"font-size:14px\">Kayumov M.M. (2019) On the effectiveness of using digital portraits based on high-frequency punctuation marks for recognizing authors of works, News of Polytechnic. Serie: Intellect, Innovation, Investments ,4 (48), \u0420. 23-26.<\/li>\n\n\n\n<li style=\"font-size:14px\">Korde V. and Mahender C. (2012) Text classification and classifiers: A survey. International Journal of Artificial Intelligence &amp; Applications (IJAIA), 3 (2), P. 85\u201499.<\/li>\n\n\n\n<li style=\"font-size:14px\">Maksudov Kh.T., Inomov B.B (2019) The comparison of classification algorithms by machine learning methods: case study of scientific texts by specialties, News of Polytechnic. Serie: Intellect, Innovation, Investments, 4 (48), \u0420. 34-38.<\/li>\n\n\n\n<li style=\"font-size:14px\">Mukhsinzoda M. Y., Soliev O. M. (2019) Generating new Tajik national names using artificial neural networks, News of Polytechnic. Serie: Intellect, Innovation, Investments, 4 (48), \u0420. 18-23.<\/li>\n\n\n\n<li style=\"font-size:14px\">Nazarov A.A. (2019) An automatic synthesis of Tajik word forms of adjective, News of Polytechnic. Serie: Intellect, Innovation, Investments,4(48), 16-18.<\/li>\n\n\n\n<li style=\"font-size:14px\">Niharika S., Latha V. and Lavanya, D. (2012). A Survey on Text Categorization. International Journal of Computer Trends and Technology, volume 3, Issue 1.<\/li>\n\n\n\n<li style=\"font-size:14px\">Pandey U. and Chakraverty S.A (2011) Review of Text Classification Approaches for E-mail Management. IACSIT International Journal of Engineering and Technology, 3 (2).<\/li>\n\n\n\n<li style=\"font-size:14px\">Patra A. and Singh D. (2013). A Survey Report on Text Classification with Different Term Weighing Methods and Comparison between Classification Algorithms. International Journal of Computer Applications, Volume 75, \u2116 7, \u0420. 
14 &#8212; 18.<\/li>\n\n\n\n<li style=\"font-size:14px\">Wilcox A. and Hripcsak G. (1999) Classification algorithms applied to narrative reports. P. 455.<\/li>\n<\/ol>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-9d6595d7 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:33.33%\">\n<h4 class=\"wp-block-heading\"><br>Publication date<\/h4>\n\n\n\n<p class=\"has-cyan-bluish-gray-color has-text-color\">05 Jun 2023<\/p>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:66.66%\"><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Authors: Nizamitdinov A.I., Inomov B.B. Authors Nizamitdinov A.I. \u2013 Doctor of Philosophy (PhD), Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, ahlidin@gmail.com. Inomov B.B. \u2013 PhD student, specialty 6D070300 \u2013 Information systems, Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, behruzinomov@gmail.com. Annotation This article gives an overview of the machine learning algorithms available for classification problems, in particular for classifying texts in different language contexts. 
Text classification is one of the&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[65],"tags":[71],"class_list":["post-994","post","type-post","status-publish","format-standard","hentry","category-bulletin_of_pittu_2020","tag-bulletin-of-pittu-2020-1"],"acf":[],"featured_image_src":null,"author_info":{"display_name":"ilhomjonqodirov02","author_link":"https:\/\/vestnik.polytech.tj\/?author=1"},"_links":{"self":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/994","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=994"}],"version-history":[{"count":4,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/994\/revisions"}],"predecessor-version":[{"id":1409,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/994\/revisions\/1409"}],"wp:attachment":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=994"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=994"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=994"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}