欢迎访问林业科学,今天是

林业科学 ›› 2026, Vol. 62 ›› Issue (3): 133-145.doi: 10.11707/j.1001-7488.LYKX20250509

• 研究论文 • 上一篇    下一篇

林木基因组高杂合度对等位基因表达及功能的影响——以银腺杨84K中IQD基因家族为例

李书媛1,易刚1,樊育彤1,崔祺禛1,李雨霏1,饶国栋1,2,*()   

  1. 1. 林木遗传育种全国重点实验室 中国林业科学研究院林业研究所 北京 100091
    2. 南京林业大学南方现代林业协同创新中心 南京 210037
  • 收稿日期:2025-08-14 修回日期:2025-11-27 出版日期:2026-03-15 发布日期:2026-03-12
  • 通讯作者: 饶国栋 E-mail:rgd@caf.ac.cn
  • 基金资助:
    林木遗传育种全国重点实验室(中国林业科学研究院)基本科研业务费资助项目(TGBFRF202510);中央级公益性科研院所专项资金项目(CAFYBB2023PA005-2);国家自然科学基金项目(32371837)。

Impact of High Genomic Heterozygosity on the Expression and Function of Allelic Genes in Forest Trees: a Case Study of the IQD Gene Family in Populus alba × P. glandulosa ‘84K’

Shuyuan Li1,Gang Yi1,Yutong Fan1,Qizhen Cui1,Yufei Li1,Guodong Rao1,2,*()   

  1. 1. State Key Laboratory of Tree Genetics and Breeding Research Institute of Forestry, Chinese Academy of Forestry Beijing 100091
    2. Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University Nanjing 210037
  • Received:2025-08-14 Revised:2025-11-27 Online:2026-03-15 Published:2026-03-12
  • Contact: Guodong Rao E-mail:rgd@caf.ac.cn

摘要:

目的: 针对林木基因组高杂合度的典型特征,以银腺杨84K为例,从系统进化、蛋白结构、表达调控3个维度全面解析IQD(IQ67-domain)基因家族等位基因的分化模式,为理解林木复杂性状的遗传基础和等位基因间的协同互补提供新视角。方法: 统计PagIQD等位基因的序列杂合度,结合系统进化分析与染色体定位确定基因分布和进化关系。预测蛋白理化性质、跨膜结构和亚细胞定位,开展Motif特征分析、三维结构建模和钙调蛋白对接,比较等位基因蛋白特性差异。利用顺式作用元件分析与转录因子结合位点预测,结合表达量数据,解析等位基因特异性表达(ASE)的组织和胁迫响应模式,并评估其表达协同性。应用基因编辑靶点筛选评估等位基因特异性编辑效率。结果: 在系统发育上,PagIQDs成员数量出现轻度收缩,编码区、基因全长、蛋白质和启动子序列杂合度均值高于基因组整体水平(2.1%)。84K中部分IQD基因与参考银白杨基因组的位置对应并不完全一致,且等位基因对在系统发育树中呈现出6种不同的分支模式,提示这些等位基因可能来源于不同的祖先或古老多态性的保留。在蛋白特性上,部分等位基因在蛋白理化性质上表现出显著差异,尤其在稳定性和疏水性方面,个别等位基因在跨膜预测区域和亚细胞定位上也出现分歧。约14%的等位基因对在Motif数量或排列上呈现多样化,部分等位基因在二级结构比例和IQ67核心结构域空间构型上有所不同,可能影响其与钙调蛋白(CaM)的结合模式及结合能。在表达调控上,等位基因间顺式作用元件及转录因子结合位点存在明显差异,导致普遍的等位基因特异性表达:约88.5%的基因对在至少一个组织中表达存在偏倚,且具有显著组织特异性。胁迫条件下等位基因对在响应程度上存在差异甚至呈现相反的表达趋势,进一步对等位基因对的整体分析揭示出协同响应、表达缓冲和稳态维持3种表达互作模式。基因编辑靶点分析显示,等位基因间的序列差异会显著影响编辑策略的设计和效率。结论: 通过对IQD基因家族的深入解析,证明高杂合度基因组背景下等位基因在多维度的分化是一种普遍现象,提示在科学研究中需将等位基因视作具有潜在分化的功能单元,在高杂合林木功能解析、表达分析及基因编辑设计中同时考虑2个等位基因,这对精准解析林木性状遗传机制及指导遗传改良具有重要意义。

关键词: 高杂合基因组, IQD基因家族, 等位基因特异性表达, 功能分化, 基因编辑

Abstract:

Objective: With the highly heterozygous hybrid poplar ‘84K’ as a model, this study aims to characterize the divergence of alleles in the IQD gene family from three dimensions of systematic evolution, protein structure, and expression regulation, in order to provide new insights into the genetic basis of complex traits in forest trees and the potential functional complementarity between alleles. Method: The sequence heterozygosity of PagIQD alleles was calculated, combined with systematic evolutionary analysis and chromosome localization, to determine the gene distribution and evolutionary relationship. Protein physicochemical properties, transmembrane structure, and subcellular localization were predicted. Motifs feature analysis, 3D structure modeling, and calmodulin-binding were conducted. The differences in allele protein characteristics were compared. The tissue and stress response patterns of allele-specific expression (ASE) were analyzed and its expression synergy was assessed by integrating cis-element analysis with RNA-seq data. Allele-dependent genome editing efficiency was further evaluated based on sequence divergence. Result: Phylogenetically, the number of PagIQD gene family members in 84K showed a slight contraction, and the average heterozygosity of coding regions, full-length genes, proteins, and promoters sequences were all higher than the genome-wide level (2.1%). Several IQD genes in 84K did not align perfectly with their chromosomal positions in the reference Populus alba genome. Allelic pairs displayed six distinct phylogenetic branching patterns, suggesting these alleles may originate from different ancestors origins or represent retention of ancient polymorphisms. At the protein level, some allele pairs differed markedly in physicochemical properties, especially in stability and hydrophobicity, and individual alleles showed discrepancies in predicted transmembrane regions and subcellular localization. Approximately 14% of allele pairs varied in motif number or arrangement, and some exhibited differences in secondary structure proportions and IQ67-domain spatial conformation, which may influence their calmodulin (CaM) binding mode and binding energy. In terms of expression regulation, there were significant differences in cis-element and transcription factor binding sites between alleles, resulting in widespread allele-specific expression, with about 88.5% of gene pairs showing biased expression in at least one tissue, and strong tissue specificity. Under stress, allelic pairs often showed different or even opposite responses in terms of responsiveness. Integrated analysis revealed three interaction patterns between alleles: synergistic response, expression buffering, and homeostasis maintenance. Sequence divergence also strongly affected the design and predicted efficiency of allele-specific genome-editing targets. Conclusion: This study demonstrates that multi-dimensional allelic divergence is a pervasive feature in the highly heterozygous genome of 84K. The results highlight the necessity of treating alleles as potentially distinct functional units, and two alleles should be considered simultaneously in functional analysis, expression analysis, and gene editing design of high heterozygous forest trees, which is of great significance for accurately analyzing the genetic mechanism of forest traits and guiding genetic improvement.

Key words: high genomic heterozygosity, IQD gene family, allele-specific expression, functional divergence, genome editing

中图分类号: