Towards functional characterization of archaeal genomic dark matter

2 Adam, P.S., Borrel, G., Brochier-Armanet, C. and Gribaldo, S. (2017) The growing tree of Archaea: new perspectives on their diversity, evolution and ecology. ISME J. 11, 2407–2425 https://doi.org/10.1038/ismej.2017.122

3 Zaremba-Niedzwiedzka, K., Caceres, E.F., Saw, J.H., Bäckström, D., Juzokaite, L., Vancaester, E. et al. (2017) Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 https://doi.org/10.1038/nature21031

4 Sorokin, D.Y., Makarova, K.S., Abbas, B., Ferrer, M., Golyshin, P.N., Galinski, E.A. et al. (2017) Discovery of extremely halophilic, methyl-reducing euryarchaea provides insights into the evolutionary origin of methanogenesis. Nat. Microbiol. 2, 17081 https://doi.org/10.1038/nmicrobiol.2017.81

5 Bork, P. (2000) Powers and pitfalls in sequence analysis: the 70% hurdle. Genome Res. 10, 398–400 https://doi.org/10.1101/gr.10.4.398

6 Storz, G., Wolf, Y.I. and Ramamurthi, K.S. (2014) Small proteins can no longer be ignored. Annu. Rev. Biochem. 83, 753–777 https://doi.org/10.1146/annurev-biochem-070611-102400

7 Márquez, V., Fröhlich, T., Armache, J.P., Sohmen, D., Dönhöfer, A., Mikolajka, A. et al. (2011) Proteomic characterization of archaeal ribosomes reveals the presence of novel archaeal-specific ribosomal proteins. J. Mol. Biol. 405, 1215–1232 https://doi.org/10.1016/j.jmb.2010.11.055

8 Wu, P., Brockenbrough, J.S., Paddy, M.R. and Aris, J.P. (1998) NCL1, a novel gene for a non-essential nuclear protein in Saccharomyces cerevisiae. Gene 220, 109–117 https://doi.org/10.1016/S0378-1119(98)00330-8

9 Makarova, K.S., Galperin, M.Y. and Koonin, E.V. (2015) Comparative genomic analysis of evolutionarily conserved but functionally uncharacterized membrane proteins in archaea: prediction of novel components of secretion, membrane remodeling and glycosylation systems. Biochimie 118, 302–312 https://doi.org/10.1016/j.biochi.2015.01.004

10 Makarova, K.S., Koonin, E.V. and Albers, S.V. (2016) Diversity and evolution of type IV pili systems in Archaea. Front. Microbiol. 7, 667 https://doi.org/10.3389/fmicb.2016.00667

11 Makarova, K.S., Galperin, M.Y. and Koonin, E.V. (2017) Proposed role for KaiC-like ATPases as major signal transduction hubs in Archaea. mBio 8, e01959-17 https://doi.org/10.1128/mBio.01959-17

12 Galperin, M.Y., Makarova, K.S., Wolf, Y.I. and Koonin, E.V. (2018) Phyletic distribution and lineage-specific domain architectures of archaeal two-component signal transduction systems. J. Bacteriol. 200, e00681-17 https://doi.org/10.1128/JB.00681-17

13 Zhang, D., de Souza, R.F., Anantharaman, V., Iyer, L.M. and Aravind, L. (2012) Polymorphic toxin systems: comprehensive characterization of trafficking modes, processing, mechanisms of action, immunity and ecology using comparative genomics. Biol. Direct 7, 18 https://doi.org/10.1186/1745-6150-7-18

14 Forterre, P., Krupovic, M., Raymann, K. and Soler, N. (2014) Plasmids from Euryarchaeota. Microbiol. Spectr. 2, PLAS-0027-2014 https://doi.org/10.1128/microbiolspec.PLAS-0027-2014

15 Yutin, N., Bäckström, D., Ettema, T.J.G., Krupovic, M. and Koonin, E.V. (2018) Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis. Virol. J. 15, 67 https://doi.org/10.1186/s12985-018-0974-y

16 Makarova, K.S., Wolf, Y.I. and Koonin, E.V. (2015) Archaeal clusters of orthologous genes (arCOGs): an update and application for analysis of shared features between thermococcales, methanococcales, and methanobacteriales. Life 5, 818–840 https://doi.org/10.3390/life5010818

17 Hanson, A.D., Pribat, A., Waller, J.C. and de Crécy-Lagard, V. (2010) ‘Unknown’ proteins and ‘orphan’ enzymes: the missing half of the engineering parts list–and how to find it. Biochem. J. 425, 1–11 https://doi.org/10.1042/BJ20091328

18 Ellens, K.W., Christian, N., Singh, C., Satagopam, V.P., May, P. and Linster, C.L. (2017) Confronting the catalytic dark matter encoded by sequenced genomes. Nucleic Acids Res. 45, 11495–11514 https://doi.org/10.1093/nar/gkx937

19 Galperin, M.Y. and Koonin, E.V. (2010) From complete genome sequence to ‘complete’ understanding? Trends Biotechnol. 28, 398–406 https://doi.org/10.1016/j.tibtech.2010.05.006

20 Galperin, M.Y. and Koonin, E.V. (2000) Who's your neighbor? New computational approaches for functional genomics. Nat. Biotechnol. 18, 609–613 https://doi.org/10.1038/76443

21 Niehaus, T.D., Thamm, A.M., de Crécy-Lagard, V. and Hanson, A.D. (2015) Proteins of unknown biochemical function: a persistent problem and a roadmap to help overcome it. Plant Physiol. 169, 1436–1442 https://doi.org/10.1104/pp.15.00959

22 Chang, Y.C., Hu, Z., Rachlin, J., Anton, B.P., Kasif, S., Roberts, R.J. et al. (2016) COMBREX-DB: an experiment centered database of protein function: knowledge, predictions and knowledge gaps. Nucleic Acids Res. 44, D330–D335 https://doi.org/10.1093/nar/gkv1324

23 Vetting, M.W., Al-Obaidi, N., Zhao, S., San Francisco, B., Kim, J., Wichelecki, D.J. et al. (2015) Experimental strategies for functional annotation and metabolism discovery: targeted screening of solute binding proteins and unbiased panning of metabolomes. Biochemistry 54, 909–931 https://doi.org/10.1021/bi501388y

24 Gerlt, J.A., Allen, K.N., Almo, S.C., Armstrong, R.N., Babbitt, P.C., Cronan, J.E. et al. (2011) The enzyme function initiative. Biochemistry 50, 9950–9962 https://doi.org/10.1021/bi201312u

25 Doron, S., Melamed, S., Ofir, G., Leavitt, A., Lopatina, A., Keren, M. et al. (2018) Systematic discovery of antiphage defense systems in the microbial pangenome. Science 359, eaar4120 https://doi.org/10.1126/science.aar4120

26 Makarova, K.S., Wolf, Y.I., Forterre, P., Prangishvili, D., Krupovic, M. and Koonin, E.V. (2014) Dark matter in archaeal genomes: a rich source of novel mobile elements, defense systems and secretory complexes. Extremophiles 18, 877–893 https://doi.org/10.1007/s00792-014-0672-7

27 Wolf, Y.I., Makarova, K.S., Yutin, N. and Koonin, E.V. (2012) Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer. Biol. Direct 7, 46 https://doi.org/10.1186/1745-6150-7-46

28 Jia, X., Yao, J., Gao, Z., Liu, G., Dong, Y.H., Wang, X. et al. (2018) Structure-function analyses reveal the molecular architecture and neutralization mechanism of a bacterial HEPN-MNT toxin-antitoxin system. J. Biol. Chem. 293, 6812–6823 https://doi.org/10.1074/jbc.RA118.002421

29 Makarova, K.S., Wolf, Y.I. and Koonin, E.V. (2009) Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes. Biol. Direct 4, 19 https://doi.org/10.1186/1745-6150-4-19

30 Gao, X., Wang, J., Yu, D.Q., Bian, F., Xie, B.B., Chen, X.L. et al. (2010) Structural basis for the autoprocessing of zinc metalloproteases in the thermolysin family. Proc. Natl Acad. Sci. U.S.A. 107, 17569–17574 https://doi.org/10.1073/pnas.1005681107

31 Yeats, C., Rawlings, N.D. and Bateman, A. (2004) The PepSY domain: a regulator of peptidase activity in the microbial environment? Trends Biochem. Sci. 29, 169–172 https://doi.org/10.1016/j.tibs.2004.02.004

32 Touchon, M. and Rocha, E.P. (2016) Coevolution of the organization and structure of prokaryotic genomes. Cold Spring Harb. Perspect. Biol. 8, a018168 https://doi.org/10.1101/cshperspect.a018168

33 Koonin, E.V. (2009) Evolution of genome architecture. Int. J. Biochem. Cell Biol. 41, 298–306 https://doi.org/10.1016/j.biocel.2008.09.015

34 Rogozin, I.B., Makarova, K.S., Murvai, J., Czabarka, E., Wolf, Y.I., Tatusov, R.L. et al. (2002) Connected gene neighborhoods in prokaryotic genomes. Nucleic Acids Res. 30, 2212–2223 https://doi.org/10.1093/nar/30.10.2212

35 Makarova, K.S., Wolf, Y.I., Snir, S. and Koonin, E.V. (2011) Defense islands in bacterial and archaeal genomes and prediction of novel defense systems. J. Bacteriol. 193, 6039–6056 https://doi.org/10.1128/JB.05535-11

36 Makarova, K.S., Wolf, Y.I. and Koonin, E.V. (2013) Comparative genomics of defense systems in archaea and bacteria. Nucleic Acids Res. 41, 4360–4377 https://doi.org/10.1093/nar/gkt157

37 Hurwitz, B.L., Ponsero, A., Thornton, Jr, J. and U'Ren, J.M. (2018) Phage hunters: Computational strategies for finding phages in large-scale ‘omics datasets. Virus Res. 244, 110–115 https://doi.org/10.1016/j.virusres.2017.10.019

38 Johnson, C.M. and Grossman, A.D. (2015) Integrative and conjugative elements (ICEs): what they do and how they work. Annu. Rev. Genet. 49, 577–601 https://doi.org/10.1146/annurev-genet-112414-055018

39 Grazziotin, A.L., Koonin, E.V. and Kristensen, D.M. (2017) Prokaryotic virus orthologous groups (pVOGs): a resource for comparative genomics and protein family annotation. Nucleic Acids Res. 45, D491–D498 https://doi.org/10.1093/nar/gkw975

40 Pallen, M.J. and Wren, B.W. (2007) Bacterial pathogenomics. Nature 449, 835–842 https://doi.org/10.1038/nature06248

41 Langille, M.G., Hsiao, W.W. and Brinkman, F.S. (2010) Detecting genomic islands using bioinformatics approaches. Nat. Rev. Microbiol. 8, 373–382 https://doi.org/10.1038/nrmicro2350

42 Makarova, K.S. and Koonin, E.V. (2013) Archaeology of eukaryotic DNA replication. Cold Spring Harb. Perspect. Biol. 5, a012963 https://doi.org/10.1101/cshperspect.a012963

43 Krupovic, M., Makarova, K.S., Forterre, P., Prangishvili, D. and Koonin, E.V. (2014) Casposons: a new superfamily of self-synthesizing DNA transposons at the origin of prokaryotic CRISPR-Cas immunity. BMC Biol. 12, 36 https://doi.org/10.1186/1741-7007-12-36

44 Makarova, K.S., Wolf, Y.I., Alkhnbashi, O.S., Costa, F., Shah, S.A., Saunders, S.J. et al. (2015) An updated evolutionary classification of CRISPR-Cas systems. Nat. Rev. Microbiol. 13, 722–736 https://doi.org/10.1038/nrmicro3569

45 Krupovic, M., Cvirkaite-Krupovic, V., Iranzo, J., Prangishvili, D. and Koonin, E.V. (2018) Viruses of archaea: structural, functional, environmental and evolutionary genomics. Virus Res. 244, 181–193 https://doi.org/10.1016/j.virusres.2017.11.025

46 Shmakov, S.A., Makarova, K.S., Wolf, Y.I., Severinov, K.V. and Koonin, E.V. (2018) Systematic prediction of genes functionally linked to CRISPR-Cas systems by gene neighborhood analysis. Proc. Natl Acad. Sci. U.S.A. 115, E5307–E5316 https://doi.org/10.1073/pnas.1803440115

47 Havarstein, L.S., Diep, D.B. and Nes, I.F. (1995) A family of bacteriocin ABC transporters carry out proteolytic processing of their substrates concomitant with export. Mol. Microbiol. 16, 229–240 https://doi.org/10.1111/j.1365-2958.1995.tb02295.x

48 Vitelli, F., Piccini, M., Caroli, F., Franco, B., Malandrini, A., Pober, B. et al. (1999) Identification and characterization of a highly conserved protein absent in the Alport syndrome (A), mental retardation (M), midface hypoplasia (M), and elliptocytosis (E) contiguous gene deletion syndrome (AMME). Genomics 55, 335–340 https://doi.org/10.1006/geno.1998.5666

49 Dermoun, Z., Foulon, A., Miller, M.D., Harrington, D.J., Deacon, A.M., Sebban-Kreuzer, C. et al. (2010) TM0486 from the hyperthermophilic anaerobe Thermotoga maritima is a thiamin-binding protein involved in response of the cell to oxidative conditions. J. Mol. Biol. 400, 463–476 https://doi.org/10.1016/j.jmb.2010.05.014