|
新冠病毒DNA序列基于熵值的分布可视化
|
Abstract:
新型冠状病毒于2020年1月正式命名为2019-nCoV,时至今日该病毒的传播仍然没有得到良好的控制。病毒的DNA碱基序列在病毒的性状中起着决定性的作用,本文分析了四个国家(中国,美国,澳大利亚,德国)新冠病毒DNA之间的差异。选择各国新冠病毒DNA序列,以信息熵,相对熵,交叉熵的形式给出四国病毒之间差异的可视化分析。
Novel coronavirus is officially named 2019-ncov in January 2020, and has not been well controlled until now. The DNA base sequence of avirus plays a decisive role in the character of avirus. This pa-per analyzes the differences between novel coronavirus DNA from four countries (China, America, Australia and Germany). The novel coronavirus DNA sequences from different countries were se-lected, and the visual analysis of the differences between four viruses was given in the form of in-formation entropy, relative entropy and cross entropy.
[1] | Kullback, S. and Leibler, R.A. (1951) On Information and Sufficiency. The Annals of Mathematical Statistics, 22, 79-86. https://doi.org/10.1214/aoms/1177729694 |
[2] | Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning (Vol. 1). MIT Press, Cambridge, 71-73. |
[3] | 百度百科. 交叉熵.
https://baike.baidu.com/item/%E4%BA%A4%E5%8F%89%E7%86%B5/8983241?fr=aladdin |