%0 Journal Article
%T 基于拟南芥DNA序列的变值统计可视化分析
The Visual Analysis of Variables Statistic Based on Arabidopsis DNA Sequences
%A 周影平
%A 罗佳
%A 袁鑫
%A 邱国宸
%A 曲恒熠
%A 郑智捷
%J Hans Journal of Computational Biology
%P 39-47
%@ 2164-5434
%D 2019
%I Hans Publishing
%R 10.12677/HJCB.2019.93006
%X
随着植物基因组测序的展开,拟南芥基因测序因其巨大的科研价值得到了世界各国的重视。拟南芥基因序列的研究得到了极大的进展。由于传统的基因图示非常特殊和复杂,我们试图利用一种更为直观和普适的基因变值图示系统。利用该类模式,对从基因库获取到的DNA序列进行预处理后,统计碱基A,C,G,T的数量,计算AG,AT的数量,然后将数量投影到二维或者三维图像中,从而观察拟南芥的DNA序列的特征。从给出的系列图示可以看到,新的统计分布结果对后续为基因序列的相似性分析和更高维的特征研究提供基础信息。
With the spread of plant genome sequencing, Arabidopsis gene sequencing has drawn worldwide attention for its considerable research value, and research on Arabidopsis gene sequences has advanced greatly. Because traditional gene representations are highly specialized and complex, we attempt to use a more intuitive and general gene variable diagram system. With this approach, DNA sequences obtained from a gene bank are preprocessed, the numbers of the bases A, C, G, and T are counted, and the AG and AT counts are computed. These counts are then projected onto two-dimensional or three-dimensional images, in which the characteristics of Arabidopsis DNA sequences can be observed. The resulting series of diagrams shows that the new statistical distributions provide basic information for subsequent similarity analysis of gene sequences and for higher-dimensional feature studies.
%K 拟南芥,DNA序列,变值图示,可视化
Arabidopsis
%K DNA Sequence
%K Variable Diagram
%K Visualization
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=32018