{"id":2620,"date":"2023-10-02T11:22:34","date_gmt":"2023-10-02T06:22:34","guid":{"rendered":"http:\/\/vestnik.polytech.tj\/?p=2620"},"modified":"2023-10-02T11:37:20","modified_gmt":"2023-10-02T06:37:20","slug":"comparative-analysis-of-the-recognition-systems-sphinx-and-mozilla-deepspeech","status":"publish","type":"post","link":"https:\/\/vestnik.polytech.tj\/?p=2620&lang=en","title":{"rendered":"COMPARATIVE ANALYSIS OF THE RECOGNITION SYSTEMS  SPHINX AND MOZILLA DEEPSPEECH"},"content":{"rendered":"<p><!--vcv no format--><!-- vcwb\/dynamicElementComment:bc0e1e95 --><\/p>\n<div class=\"vce-row-container\" data-vce-boxed-width=\"true\">\n<div class=\"vce-row vce-row--col-gap-30 vce-row-equal-height vce-row-content--top\" id=\"el-bc0e1e95\" data-vce-do-apply=\"all el-bc0e1e95\">\n<div class=\"vce-row-content\" data-vce-element-content=\"true\"><!-- vcwb\/dynamicElementComment:742638ae --><\/p>\n<div class=\"vce-col vce-col--md-78p vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-first vce-col--lg-first vce-col--xl-first\" id=\"el-742638ae\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-742638ae\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-742638ae\"><!-- vcwb\/dynamicElementComment:94f9893b --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-94f9893b\" data-vce-do-apply=\"all el-94f9893b\">\n<p><strong><span style=\"font-size: 14pt;\">Authors<\/span><\/strong><\/p>\n<p><strong>&nbsp; &nbsp; &nbsp; &nbsp;<\/strong><strong>Khudoiberdiev H.A.<\/strong><em> \u2013 Candidate of Physical and Mathematical Sciences, Head of the&nbsp; <\/em><em>Department of Programming and Information Technologies, Polytechnic Institute of Tajik <\/em><em>Technical University, <\/em><em>Khujand, Republic of Tajikistan<\/em><em>, <a href=\"mailto:tajlingvo@gmail.com\">tajlingvo@gmail.com<\/a><\/em><\/p>\n<p><strong>&nbsp; &nbsp; Vositov R.M.<\/strong><em> &#8212; Teacher at the Department of Programming and Information Technologies <\/em><em>Polytechnic Institute of Tajik Technical University, <\/em><em>Khujand, Republic of Tajikistan, <a href=\"mailto:ravshan488889@gmail.com\">ravshan488889@gmail.com<\/a> <\/em><\/p>\n<p><span style=\"font-size: 14pt;\"><strong>Annotation<\/strong><\/span><\/p>\n<p><em>&nbsp; &nbsp; &nbsp; &nbsp; The article provides a comparative analysis of CMU Sphinx and Mozilla speech recognition, created on the basis of Deep Speech 0.6. Nowadays a lot of speech recognition systems and software products are available to users of computer systems. ach of them are based on existing technologies. The most commonly used technologies are artificial intelligence and machine learning. Recognition of human speech is realized on the basis of the study of grammar, syntax, the structure of sound elements. CMU Sphinx can be used in commercial projects. Thus, the proposed system in the form of API can be used in stand-alone software products. The system supports many platforms, including the Android operating system. Mozilla&#8217;s speech recognition system is based on the DeepSpeech engine, which uses machine learning technology. The Mozilla system can be used as an additional platform for their software products. Both systems are popular and open source. The comparison used many criteria, including system structures, availability of detailed documentation, supported recognition languages, and license restrictions. Experiments were also conducted on several speech cases to determine the speed and accuracy of recognition. As a result, for each of the considered systems, recommendations for use were developed with an additional indication of the scope of activity.<\/em><\/p>\n<p><span style=\"font-size: 14pt;\"><strong><em>Key words<\/em><\/strong><\/span><\/p>\n<p><em>speech recognition, metric, deep speech, Word Recognition Rate (WRR), Word Error Rate (WER), Speed Factor (SF), open source, machine learning<\/em><em>.<\/em><\/p>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:94f9893b --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:742638ae --><!-- vcwb\/dynamicElementComment:de781739 --><\/p>\n<div class=\"vce-col vce-col--md-22p vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-last vce-col--lg-last vce-col--xl-last\" id=\"el-de781739\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-de781739\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-de781739\"><!-- vcwb\/dynamicElementComment:c8b6dc86 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-c8b6dc86\" data-vce-do-apply=\"all el-c8b6dc86\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\">Language<\/p>\n<p style=\"line-height: 1;\"><span style=\"font-weight: 400; font-style: normal;\">english<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:c8b6dc86 --><!-- vcwb\/dynamicElementComment:5727ecb7 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-5727ecb7\" data-vce-do-apply=\"all el-5727ecb7\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\">Type<\/p>\n<p style=\"line-height: 1;\"><span style=\"font-weight: 400; font-style: normal;\">technical<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:5727ecb7 --><!-- vcwb\/dynamicElementComment:616f20e9 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-616f20e9\" data-vce-do-apply=\"all el-616f20e9\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\">Year<\/p>\n<p style=\"line-height: 1;\"><span style=\"font-weight: 400; font-style: normal;\">2021<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:616f20e9 --><!-- vcwb\/dynamicElementComment:925ca160 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-925ca160\" data-vce-do-apply=\"all el-925ca160\">\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\n<tbody>\n<tr>\n<td style=\"width: 100%;\">\n<p style=\"line-height: 1;\">Page<\/p>\n<p style=\"line-height: 1;\"><span style=\"font-weight: 400;\">12-13<\/span><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:925ca160 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:de781739 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:bc0e1e95 --><!-- vcwb\/dynamicElementComment:6ee3b4f8 --><\/p>\n<div class=\"vce-row-container\" data-vce-boxed-width=\"true\">\n<div class=\"vce-row vce-row--col-gap-30 vce-row-equal-height vce-row-content--top\" id=\"el-6ee3b4f8\" data-vce-do-apply=\"all el-6ee3b4f8\">\n<div class=\"vce-row-content\" data-vce-element-content=\"true\"><!-- vcwb\/dynamicElementComment:e72a503a --><\/p>\n<div class=\"vce-col vce-col--md-auto vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-last vce-col--lg-last vce-col--xl-last vce-col--md-first vce-col--lg-first vce-col--xl-first\" id=\"el-e72a503a\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-e72a503a\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-e72a503a\"><!-- vcwb\/dynamicElementComment:674f2248 --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-674f2248\" data-vce-do-apply=\"all el-674f2248\">\n<p><span style=\"font-size: 14pt;\"><strong>References<\/strong><\/span><\/p>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li><em>Burkhanova N.M. The budget system of the Russian Federation. \u2013 M.: Eksmo, 2007, 32 p.<\/em><\/li>\n<li><em>Burkhanova N.M. Economical geography. Cheat sheets. \u2013 M.: Eksmo, 2008, 32 p.<\/em><\/li>\n<li><em>Katasonov V.Yu. America v. Russia. \u2013 M.: Book World, 2015, 449 p.<\/em><\/li>\n<li><em>Katasonov V.Yu. Anti-crisis. Survive and conquer. \u2013 M.: Algorithm, 2015, 149 p.<\/em><\/li>\n<li><em>Katasonov V.Yu. The battle for the ruble. \u2013 M.: Book World, 2015, 288 p.<\/em><\/li>\n<li><em>Klimova M.A. Wage. \u2013 M.: Tax Herald, 2008, 320 p.<\/em><\/li>\n<li><em>Klimova M.A. Income tax. \u2013 M.: Tax Herald, 2008, 98 p.<\/em><\/li>\n<li><em>Nikanorov P.S. Cooperative activity. \u2013 M.: Tax Bulletin, 2008, 320 p.<\/em><\/li>\n<li><em>Nikanorov P.S. Mediation activities. \u2013 M.: Tax Bulletin, 2008, 320 p.<\/em><\/li>\n<li><em>Panchenko T.M. Loans and loans. \u2013 M.: Tax Bulletin, 2008, 158 p.<\/em><\/li>\n<li><em>Panchenko T.M. Vacation and social benefits. \u2013 M.: Tax Bulletin, 2008, 340 p.<\/em><\/li>\n<li><em>Starikov N.V. Geopolitics. How it&#8217;s done. \u2013 St. Petersburg: Peter, 2014, 368 p.<\/em><\/li>\n<li><em>Starikov N.V. The nationalization of the ruble. \u2013 St. Petersburg: Peter, 2011, 169 p.<\/em><\/li>\n<li><em>Usmanov Z.D. N-grams in the recognition of homogeneous texts. &#8212; Materials of 20 scientific-practical seminar &#171;New information technologies in automated systems&#187;. \u2013 M.: 2017, P. 52 \u2013 54.<\/em><\/li>\n<li><em>Usmanov Z.D. Algorithm for tuning the clustering of discrete random variables. &#8212; Reports of the Academy of Sciences of the Republic of Tajikistan, 2017, vol. 60, \u2116 9, P. 392 \u2013 397.<\/em><\/li>\n<li><em>Usmanov Z.D. Classifier of discrete random variables. \u2013 Reports of the Academy of Sciences of the Republic of Tajikistan. 2017, vol. 60, \u2116 7 \u2013 8, P. 291 \u2013 300.<\/em><\/li>\n<li><em>Shevchuk D.A. The history of economics. \u2013 M.: Author, 2009, 305 p.<\/em><\/li>\n<li><em>Shevchuk D.A. World economy. Lecture notes. \u2013 Rostov-on-Don: Phoenix, 2007, 417 p.<\/em><\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:674f2248 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:e72a503a --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:6ee3b4f8 --><!-- vcwb\/dynamicElementComment:e470a6c6 --><\/p>\n<div class=\"vce-row-container\" data-vce-boxed-width=\"true\">\n<div class=\"vce-row vce-row--col-gap-30 vce-row-equal-height vce-row-content--top\" id=\"el-e470a6c6\" data-vce-do-apply=\"all el-e470a6c6\">\n<div class=\"vce-row-content\" data-vce-element-content=\"true\"><!-- vcwb\/dynamicElementComment:f53bb834 --><\/p>\n<div class=\"vce-col vce-col--md-auto vce-col--xs-1 vce-col--xs-last vce-col--xs-first vce-col--sm-last vce-col--sm-first vce-col--md-last vce-col--lg-last vce-col--xl-last vce-col--md-first vce-col--lg-first vce-col--xl-first\" id=\"el-f53bb834\">\n<div class=\"vce-col-inner\" data-vce-do-apply=\"border margin background  el-f53bb834\">\n<div class=\"vce-col-content\" data-vce-element-content=\"true\" data-vce-do-apply=\"padding el-f53bb834\"><!-- vcwb\/dynamicElementComment:4df50fea --><\/p>\n<div class=\"vce-text-block\">\n<div class=\"vce-text-block-wrapper vce\" id=\"el-4df50fea\" data-vce-do-apply=\"all el-4df50fea\">\n<h2><strong><span style=\"font-size: 14pt;\">Publication date<\/span><\/strong><\/h2>\n<p>2023-10-02<\/p>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:4df50fea --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:f53bb834 --><\/div>\n<\/div>\n<\/div>\n<p><!-- \/vcwb\/dynamicElementComment:e470a6c6 --><!--vcv no format--><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Authors &nbsp; &nbsp; &nbsp; &nbsp;Khudoiberdiev H.A. \u2013 Candidate of Physical and Mathematical Sciences, Head of the&nbsp; Department of Programming and Information Technologies, Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, tajlingvo@gmail.com &nbsp; &nbsp; Vositov R.M. &#8212; Teacher at the Department of Programming and Information Technologies Polytechnic Institute of Tajik Technical University, Khujand, Republic of Tajikistan, ravshan488889@gmail.com Annotation &nbsp; &nbsp; &nbsp; &nbsp; The article provides a comparative analysis of CMU Sphinx and Mozilla speech recognition, created on the basis of Deep Speech 0.6. Nowadays a lot of speech recognition&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[159],"tags":[342],"class_list":["post-2620","post","type-post","status-publish","format-standard","hentry","category-bulletin-of-pittu-2021","tag-bulletin-of-pittu-2021-1"],"acf":[],"featured_image_src":null,"author_info":{"display_name":"ilhomjonqodirov02","author_link":"https:\/\/vestnik.polytech.tj\/?author=1"},"_links":{"self":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2620","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2620"}],"version-history":[{"count":3,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2620\/revisions"}],"predecessor-version":[{"id":2626,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=\/wp\/v2\/posts\/2620\/revisions\/2626"}],"wp:attachment":[{"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2620"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2620"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/vestnik.polytech.tj\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2620"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}