欢迎访问林业科学,今天是

林业科学 ›› 2013, Vol. 49 ›› Issue (8): 43-50.doi: 10.11707/j.1001-7488.20130807

• 论文与研究报告 • 上一篇    下一篇

基于454测序的油茶DNA序列微卫星观察与分析

温强1,2, 徐林初2, 江香梅2, 李江2, 顾胤聪1, 徐立安1, 黄敏仁1   

  1. 1. 南京林业大学林木遗传与生物技术省部共建教育部重点实验室 南京 210037;
    2. 江西省林业科学院 南昌 330032
  • 收稿日期:2013-03-11 修回日期:2013-04-23 出版日期:2013-08-25 发布日期:2013-08-17
  • 通讯作者: 徐立安
  • 基金资助:

    国家自然科学基金项目(31260184); 江西省自然科学基金项目(2009GQN0047); 江西省科技支撑项目(20122BBF60125)。

Survey and Analysis of Microsatellites from DNA Sequences in Camellia Species Using 454 Pyrosequencing

Wen Qiang1,2, Xu Linchu2, Jiang Xiangmei2, Li Jiang2, Gu Yincong1, Xu Li'an1, Huang Minren1   

  1. 1. Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education Nanjing Forestry University Nanjing 210037;
    2. Jiangxi Forestry Academy Nanchang 330032
  • Received:2013-03-11 Revised:2013-04-23 Online:2013-08-25 Published:2013-08-17

摘要:

采用MISA软件挖掘比较各1/4个454高通量测序反应获得的普通油茶、浙江红山茶和短柱茶EST序列及普通油茶基因组序列中的微卫星信息。结果显示: 3个种的EST序列微卫星出现频率大小相近,普通油茶EST序列的微卫星出现频率高于其基因组序列的; 在所有被检索序列的二至六碱基微卫星中,均以二碱基微卫星最多(>55%),并以(AG)n类型为主,但三碱基微卫星在所有EST序列中以(AAG)n类型最多,而在普通油茶基因组序列中则以(AAT)n最多; 除六碱基微卫星外,二至五碱基微卫星均表现为不同微卫星重复单元的丰度随着微卫星碱基长度增加而减少。在转录组序列中,除六碱基微卫星之外,不同微卫星单元重复数的变异与重复单元长度呈负相关,推测二碱基微卫星理论多态性最高,而五碱基微卫星理论多态性最低; 对在无冗余独立基因各区域中的微卫星进行统计,显示分布比例依次为3'UTR >CDS>5'UTR,在相同区域内比较,UTR区域中二碱基微卫星占总数量比例最大,CDS区域内三碱基微卫星的比例最高,此外5'UTR区域的三碱基微卫星的分布比例均高于3'UTR,初步推断5'UTR区域的微卫星相对3'UTR要保守些。

关键词: 山茶属, 454高通量测序, 微卫星

Abstract:

Abstract: By using 454 pyrosequencing, the simple sequence repeats (SSRs) from DNA sequences in three Camellia species, including C. oleifera, C. brevistyla and C. chekiangoleosa, were tapped with repeats of 2 to 6 bp by MISA. In all cases, the frequency of repeat motifs in the EST was similar, while the proportion in the EST of C. oleifera was higher than that of its genome. Comparisons of all SSRs, the dinucleotide repeat microsatellites (DNRs) were most dominant (>55%) and the most common DNRs was (AG)n. The most common trinucleotide repeat microsatellites (TNRs) was (AAG)n in EST, while the most common TNRs was (AAT)n in the genome of C. oleifera. Apart from the hexaucleotide repeat microsatellites (HXNRs), the abundance of the other four classes of microsatellites was negatively correlated with their repeat motif lengths. In the EST, apart from the HXNRs, diversification of microsatellite lengths (repeat number variation) was also negatively correlated with their repeat motif lengths, and we predicted that DNRs was the highest polymorphism and pentaucleotide repeat microsatellites (PTNRs) was the lowest. Subsequently, statistical analysis on SSRs within coding regions (CDS) and untranslated regions (UTR) in the Unigene showed that the ratio of motifs distribution was ranked as the following: 3'UTR>CDS>5'UTR. The DNRs were the dominant microsatellite in UTR, while the TNRs were dominant in CDS. The proportion of TNRs in the 5'UTR was higher than that in the 3'UTR, indicating that the SSRs in the 5'UTR were more conservative.

Key words: Camellia, 454 pyrosequencing, microsatellites

中图分类号: