Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium...

48

Transcript of Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium...

Page 1: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 2: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Serial Endosymbiosis Hypothesis: two-step

Gram- Bacterium α -Proteobacterium

Archaebacterium Archezoan (mito-) Eukaryote (mito+)

Page 3: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Hydrogen Hypothesis: one step

α- proteobacterium

Archaebacterium(methanogen) Eukaryote (mito+)

Page 4: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Loss of genes from mitochondrial genome

Gain of new genes from nuclear genome

NEW AVENUES IN BIOLOGY THROUGH

COMPARATIVE GENOMICS

Page 5: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Gene transfer to the nucleus

This process achieved completion, only in the case of some Archaeozoa (eukaryotes without mitochondria)

It is still going on in plants

It is frozen in metazoa

Page 6: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Whatever its origin, the mitochondrial genome has followed different pathways in the evolution of eukaryotic lineages, so that we are now dealing with a great variety of mt DNA structures present in the organelles of extant organisms.

Page 7: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Arabidopsis

PlasmodiumHomoC. reinhardtii

S. pombe

C. eugametos

Chondrus

Phytophthora

Ochromonas

Acanthamoeba

Tetrahymena

Prototheca

Allomyces

Reclinomonas

Jakoba

Marchantia

Rickettsia

Size, Shape and Coding Capacity (in red) of Mitochondrial Genomes

Page 8: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 9: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 10: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 11: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Frozen features (with some exceptions)

Shape (circular) Size (about 15 kb) Gene content

Consistent between phyla Usually modest within phylum

(with some exceptions)

mtDNA in Metazoa(800-1000 million years)

Gene arrangement (mediated by tRNAs?)

Page 12: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Metazoan mtDNA

Introduction to Comparative Genomics - 2002

Drosophila yakuba

Apis mellifera

Meloidogyne javanica

C. elegans

Artemia franciscana

Page 13: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

D-loopF T

P

Cyt b

E

LS

ND

4L

ND

4

R

ND

3

G

CO III

ATP 6

ATP 8

K

D

S

C O II

CO I

W AN

CY

O L

M

Q

I

ND

1L

ND

2

V

rRNA

rR NA

16S

12S

O H

H

ND

5

VertebratemtDNA

Page 14: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Gene order usually frozen with a few transpositions tolerated

Which are the other possible sources of variation?

Base composition

mtDNA in Vertebrata(about 350 million years)

Page 15: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

GC Content of Metazoan mtDNA

0 10 20 30 40 50 60

Mammalia

Aves

Reptilia

Amphibia

Osteichthyes

Chondrichthyes

Agnatha

Cephalochordata

Hemichordata

Urochordata

Echinodermata

Arthropoda

Mollusca

Anellida

Nematoda

Platyhelminthes

Cnidaria

Page 16: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Asymmetry indexes

GC skew =G – C

G + C

AT skew =A – T

A + T

Page 17: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Gene order usually frozen with a few transpositions tolerated

Which are the other possible sources of variation?

Base composition

Base changes

mtDNA in Vertebrata(about 350 million years)

Page 18: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Gene order usually frozen with a few transpositions tolerated

Which are the other possible sources of variation?

Base composition Base changes

Regulatory region

mtDNA in Vertebrata(about 350 million years)

Page 19: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

H S P

L S P

S RL R

E TA S d o m a in

5 'OH 3 '

L R

3 2 1

C S B d o m ain C en tra l d o m a in

E TA S s12

C S B s

R N A D N A

5 ' 3 'tR N APhe tR N A

P ro

D-loopF T

P

Cyt b

E

ND

5L

S

ND

4L

ND

4

R

ND 3

G

CO III

ATP 6

ATP8

K

D

S

CO II

CO IW A

NC

Y

OL

M

Q

I

ND

1L

ND

2

V

rRNA

rRNA16

S

12S

O H

H

Page 20: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

VARIABILITY OF THE MITOCHONDRIAL GENOME

Taxonomic level:

Order

Species

Intraspecies

Page 21: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

J Mol Evol (1999) 48:427–434

Nucleotide Substitution Rate of Mammalian Mitochondrial Genomes

Graziano Pesole,1 Carmela Gissi,2 Anna De Chirico,2 Cecilia Saccone2,3

Page 22: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Stationary Markov model of Saccone et al. (1990), also called "general time reversible"

(GTR) The GTR model has been applied to estimate the average genetic distances between the sequences of each locus within each species. The method involves the assessment of the stationarity of bases composition, an essential condition for the correct estimate of distances. At present, one assumption commonly made in molecular evolutionary studies, and especially in estimating distances, is that of stationarity, i.e., the nucleotide frequencies in sequences do not change with time (Gu and Li, 1996). If stationarity is verified, base compositions should remain similar among sequences. However, analyses of many data sets revealed that this assumption is often unrealistic and that compositional changes occur in the evolutionary process (Tourasse et al., 1999; Lockhart et al., 1994; Galtier and Gouy, 1995). For that reason, ignoring nonstationarity could mislead distances evaluation.

Page 23: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Nucleotide substitution rate

Central domain

0 5 10 15 20 25 30 35

Nonsyn

Syn

tRNAs

12S rRNA

16S rRNA

ETAS domain

CSB domain Subs/Site*y*10-9

Mammalian mtDNA substitution rateMammalian mtDNA substitution rate

Page 24: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

a) High intragenomic variability in the evolutionary dynamic of mtDNA;

b) Slow- evolving regions: nonsynonymous sites (the rate of nonsynonymous sites depends on the functional constraints and therefore differs considerably between genes) the D-loop central domain, and tRNA and rRNA genes;

c) Fast-evolving regions: synonymous sites (the synonymous rate is fairly uniform over the genome), and the two peripheral D-loop region domains.

Page 25: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Mitochondrial vs Nuclear Rate in Mitochondrial vs Nuclear Rate in MammalsMammals

Mitochondrial vs Nuclear Rate in Mitochondrial vs Nuclear Rate in MammalsMammals

Type of sites mtDNAa nDNAa Ratio(Mt/Nuc)

Nonsynonymousb

Synonymousb

Small rRNAc

0.8 0.2

1.6 0.9

0.4 0.3

3

22

19

±

±

±

34.6tRNAsc 0.1 0.1 97±

2.6 0.4

3.9

7.7 2.4

±

±9.7 2.4±

±

Large rRNAc 4.1 0.8 4±17.2 3.8±

A Sequence divergence, expressed in % substitutions/site, has been calculated using the Stationary Markov Model (Saccone et al. 1990).

B Referred to the human-common chimpanzee pair.

C Referred the rat-mouse pair.

Page 26: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

The commonly accepted statement that mtDNA evolves more rapidly than nuclear DNA is valid only for some regions, thus it should be referred to specific mitochondrial components. In particular:

a) Nonsynonymous sites show comparable rates in mitochondrial and nuclear genes; synonymous sites

b) Small rRNA evolve about 20 times more rapidly and tRNAs about 100 times more rapidly in mitochondria than in their nuclear counterpart.

c) A species- specific evolution is particularly evident in the Dloop region.

Page 27: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Mol. Biol. Evol. 17(7):1022–1031. 2000

Lineage-Specific Evolutionary Rate in Mammalian mtDNA

Carmela Gissi,* Aurelio Reyes,† Graziano Pesole,‡ and Cecilia Saccone*†

Page 28: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

35 complete mitochondrial genomes from 14 mammalian orders have been investigated by using both first and second codon position (P12) of H-strand protein-coding genes and ribosomal RNA genes.

C.Gissi et al. (2000) Mol. Biol. Evol. 17:1022-10310

Page 29: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Relative Rate TestRelative Rate Test

RAB=dA/dB=(dAB+dAO-dBO)/(dAB-dAO+dBO)

ingroup A ingroup B outgroup

root

dA dB

Page 30: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

MtDNA rate heterogeneity in MammaliaMtDNA rate heterogeneity in Mammalia

Page 31: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Significant rate variations have been observed among orders and among species of the same order with both markers. It was observed, however that rate variation does not exceed 1.8 between the fastest (Proboscidea and Primates) and the slowest (Perissodactyla) evolving orders. We have moreover shown that apparently the evolutionary rate differences.are not dependent on body size, generation time, and metabolic rate.

Page 32: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

A Novel Method for Estimating Substitution Rate Variation Among Sites in a Large Dataset of Homologous DNA Sequences

Graziano Pesole* and Cecilia Saccone†

Genetics 157: 859–865 (February 2001)

Page 33: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

It is based on the simple idea that the more closely related are the compared sequences, the higher is the probability of observing nucleotide changes at rapidly evolving sites. This analysis is crucial not only to define more realistic models of sequence evolution for phylogenetic inferences, but it may be of great practical relevance for the assessment of the implications of a given specific mutation pattern.

Page 34: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Site-variability mitochondrial algorithm

ij= 0 if the site is unchanged in the j-th pair

Kj= genetic distance measured with GTR method

νi =

γγii = ν = νi(norm)i(norm) / ν / νmaxmax

N=n.of multialigned sequences

νi(norm)= νi/[N(N-1)/2]

Pesole G., Saccone C. Genetics 2001 Feb;157(2):859-65

Page 35: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Intron distribution along mtDNA gene in Ascomycota

Santamaria Monica1, Vicario Saverio1,3 and Saccone Cecilia1,2*1Istituto di Tecnologie Biomediche, Sede di Bari, CNR, 70126 Bari, Italy2Dipartimento di Biochimica e Biologia Molecolare, Università di Bari, 70126 Bari, Italy3 Dept. Ecology and EvolutionaryBiology, Yale University, New Haven, CT, USA

Inter- versus intra- species molecular biodiversity in eukaryotic mitochondrial genome.

Saverio Vicario 1,3, Monica Santamaria1, Cecilia Lanave1, and Cecilia Saccone1,2

1Istituto di Tecnologie Biomediche, Sede di Bari, CNR, 70126 Bari, Italy2Dipartimento di Biochimica e Biologia Molecolare, Università di Bari, 70126 Bari, Italy3 Dept. Ecology and EvolutionaryBiology, Yale University, New Haven, CT, USA

See the posters…

Page 36: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Interspecific Genetic Divergence (GTR):five data sets with 35 individuals

Page 37: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Scaling divergence across species

Page 38: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Intra/Inter species variation

Page 39: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 40: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 41: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )
Page 42: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Species concept

The exact meaning of what a species marker is tracking can only be clarified by taking into account

the still hot discussion on species concept. As a matter of fact, both the discussion of what a species

is and how to define the boundaries are still open questions.

Page 43: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Mitochondrial gene as a good marker

a) the presence of multiple copies in each cell facilitates the retrieval even from small tissue samples;

b) the aforementioned evolutionary stability of mitochondrial coding genes makes the assessment of the homology, both at the level of loci and the level of a single site, quite simple. (Comparative studies on mitochondrial coding genes are not plagued by doubt on the orthology of the locus, as for nuclear genes although nuclear mitochondrial (NUMTs) pseudogenes could be at times problematic.

c) the evolutionary stability of those markers reduces the importance of indels as factors of molecular evolution, so that the alignment and analysis are greatly eased.

Page 44: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Mitochondrial gene as a good marker

Theoretical considerations from population genetics.

The mitochondrial markers have a smaller effective population size than autosomal nuclear markers, because of their matrilineal heredity. This make the markers more subject to genetic drift and, consequentially, decrease intra-specific variability by increasing the fixation rate.The lack of recombination increases the probability of selective hitchhiking, which is an event of positive selection on a single site causing fixation and cancellation of intra-specific variability on the whole mitochondrion in a given population or species. “Genetic draft. (Gillespie 2000): the net effect is an increased level of fixation rate, independent of the effective population size. (Bazin et al., 2006).

Page 45: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Mitochondrial variation

In the literature there is not consensus yet on what are the prevalent forces shaping the patterns of mitochondrial variation: drift, positive selection, co-adaptation to nuclear genes, parasitic meiotic drive. Clearly more data, which could give a better representation of biodiversity, need to be collected.

Page 46: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Data integration

A more systematic effort to integrate and access data, annotation, and taxonomy from different database sets is needed to tackle more complex problems such as the measure of co-evolution between mitochondria and nucleus and the comparison of inter-specific and intra-specific variation. In collaboration with the IBM we are implementing an integration of databases based on a federated approach so that the crossing of database queries could be implemented without the need to mirror and update the data on the local server. We think that a greater comprehension of the process directing the mitochondrial variation could be reached only through a wider integration of data.

Page 47: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )

Saverio Vicario

Cecilia Lanave

Monica Santamaria

Comparative Genomics and evolutionary analysis of gene families

Setup of protocols for production and analysis of sequence data with high throughput 454 technology

Development of methodology for molecular species assignment within the framework of “Barcode of Life”

ITB: Biodiversity Team

Domenica D’Elia

Molecular Biodiversity databases

Barcode Databases

Page 48: Serial Endosymbiosis Hypothesis: two-step Gram- Bacterium α -Proteobacterium Archaebacterium Archezoan (mito - )Eukaryote (mito + )