Genomic and structural aspects of protein evolution

Chothia, Cyrus; Gough, Julian

doi:10.1042/BJ20090122

Article navigation

Review Article| March 13 2009

Genomic and structural aspects of protein evolution

Cyrus Chothia;

Cyrus Chothia ¹

*MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, U.K.

¹Correspondence may be addressed to either author (email chc1@mrc-lmb.cam.ac.uk or gough@cs.bris.ac.uk).

Search for other works by this author on:

This Site

PubMed

Google Scholar

Julian Gough

Julian Gough ¹

†Computer Science Department, University of Bristol, Merchant Venturers Building, Woodland Road, Bristol BS8 1UB, U.K.

¹Correspondence may be addressed to either author (email chc1@mrc-lmb.cam.ac.uk or gough@cs.bris.ac.uk).

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author and article information

Publisher: Portland Press Ltd

Received: January 19 2009

Accepted: January 23 2009

Online ISSN: 1470-8728

Print ISSN: 0264-6021

2009

Biochem J (2009) 419 (1): 15–28.

https://doi.org/10.1042/BJ20090122

It has been known for more than 35 years that, during evolution, new proteins are formed by gene duplications, sequence and structural divergence and, in many cases, gene combinations. The genome projects have produced complete, or almost complete, descriptions of the protein repertoires of over 600 distinct organisms. Analyses of these data have dramatically increased our understanding of the formation of new proteins. At the present time, we can accurately trace the evolutionary relationships of about half the proteins found in most genomes, and it is these proteins that we discuss in the present review. Usually, the units of evolution are protein domains that are duplicated, diverge and form combinations. Small proteins contain one domain, and large proteins contain combinations of two or more domains. Domains descended from a common ancestor are clustered into superfamilies. In most genomes, the net growth of superfamily members means that more than 90% of domains are duplicates. In a section on domain duplications, we discuss the number of currently known superfamilies, their size and distribution, and superfamily expansions related to biological complexity and to specific lineages. In a section on divergence, we describe how sequences and structures diverge, the changes in stability produced by acceptable mutations, and the nature of functional divergence and selection. In a section on domain combinations, we discuss their general nature, the sequential order of domains, how combinations modify function, and the extraordinary variety of the domain combinations found in different genomes. We conclude with a brief note on other forms of protein evolution and speculations of the origins of the duplication, divergence and combination processes.

2009

You do not currently have access to this content.

Don't already have an account? Register

You could not be signed in. Please check your email address / username and password and try again.

Genomic and structural aspects of protein evolution

Get Access To This Article

Buy This Article

Cited By

Get Email Alerts

CONNECT

EXPLORE

Cover Image

Genomic and structural aspects of protein evolution

Sign in

Sign in to your personal account

Biochemical Society Member Sign in

Sign in via your Institution

Get Access To This Article

Buy This Article

Cited By

Get Email Alerts

CONNECT

EXPLORE

This Feature Is Available To Subscribers Only