直系同源(orthology)和旁系同源(paralogy)
标签:
杂谈 |
分类: 概念解析 |
同源序列可分为两种:直系同源(orthology)和旁系同源(paralogy)。直系同源的序列因物种形成(speciation)而被区分开(separated):若一个基因原先存在于某个物种,而该物种分化为了两个物种,那么新物种中的基因是直系同源的;旁系同源的序列因基因繁殖(gene duplication)而被区分开(separated):若生物体中的某个基因被複製了,那么两个副本序列就是旁系同源的。直系同源的一对序列称为直系同源体(orthologs),旁系同源的一对序列称为旁系同源体(paralogs)。
直系同源体通常有相同或相似的功能,但对旁系同源体则不一定:由于缺乏原始的自然选择的力量,繁殖出的基因副本可以自由的变异并获得新的功能。
肌红蛋白(myoglobin)和血红蛋白(hemoglobin)被认为是古老的旁系同源体(ancient paralogs)。类似的,已知的四种血红蛋白(血红蛋白A,血红蛋白A2,血红蛋白S,和胎儿血红蛋白)均互为旁系同源体。它们均能够运输氧气,但在功能上又有细微的分化:胎儿血红蛋白(血红蛋白F)比成年血红蛋白对氧气有更高的亲和力。
另一个例子是啮齿动物(如老鼠)体内的胰岛素基因。啮齿动物有一对旁系同源的胰岛素基因,尚不清楚它们的功能是否分化。
旁系同源体常见于同一物种,但也不是绝对如此:人类的血红蛋白和非洲黑猩猩的肌红蛋白就是旁系同源体。这是使用生物信息学方法预测基因功能的一大困难:即使不同物种的基因同源,我们也不能立刻推断它们具有相同或相似的功能,因为它们可能为具有不同功能的旁系同源体。
http://s11/middle/4b849cf4498fe55538d2a&690
原图见http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/Orthology.html
且看文字说明:
Homologous sequences. Orthologs and Paralogs are two types of homologous sequences. Orthology describes genes in different species that derive from a common ancestor. Orthologous genes may or may not have the same function. Paralogy describes homologous genes within a single species that diverged by gene duplication.
简单翻译一下
NCBI的Glossary中对paralog的定义是: A paralog is one of a set of homologous genes that have diverged from each other as a consequence of gene duplication. For example, the mouse a-globin and b-globin genes are paralogs. The relationship between mouse a-globin and chick b-globin is also considered paralogous (see the figure).
对ortholog的定义是: Orthology describes genes in different species that derive from a single ancestral gene in the last common ancestor of the respective species.
TRENDS in Genetics Vol.18 No.12 December 2002 的一篇文章:《Orthology, paralogy and proposed classification for paralog subtypes》对两者的定义做了进一步的探究。作者指出:Paralogs are defined as genes that derive from a single gene that was duplicated within a genome.The latter definition does not specify that paralogs can only be found in a single organism, and hence genes in different organisms that arose from gene duplication in an ancestral genome are also paralogs according to the definition。

加载中…