Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in Drosophila...

20
Volume 16 Number 8 1988 Nucleic Acids Research Two genes encode related cytoplasmk elongation factors la (EF-la) in Drosophila melanogaster with continuous and stage specific expression Bernd Hoveraann, Sabine Richter, Uwe Walldorf' and Celina Cziepluch Zentrum fur Molekulare Biologie Heidelberg (ZMBH), Universital Heidelberg, Im Neuenheimer Feld 282, D 6900 Heidelberg, FRG and 'Biozentrum, Klingelbergstrasse 70, CH 4056 Basel, Switzerland Received February 4, 1988; Accepted March 18, 1988 Accession nos: Fl=X06869, F2=X06870 ABSTRACT We have characterized two previously cloned genes, Fl and F2 (l)that code for elongation factor EF - la of Drosophila melanoaaster. Genomic Southern blot hybridization revealed that they are the only gene copies present. We isolated cDNA clones of both transcripts from embryonal and pupal stage of development that cover the entire transcription unit. The 5' ends of both genes have been determined by primer extension and for Fl also by RNA sequencing. These start sites have been shown to be used consistently during development. Comparison of cDNA and genomic sequences revealed that EF - la,Fl consists of two and EF - la,F2 of five exons. The two described elongation factor genes exhibit several regions of strong sequence conservation when compared to five recently cloned eucaryotic elongation factors. INTRODUCTION The process of translation of genetic information from mRNA to protein follows a distinct pathway. One step, the elongation of the amino acid chain, involves a series of protein components that have been classified according to their function as elongation factor 1 (EF - 1) and elongation factor 2 (EF - 2). The cytoplasmic EF - 1 complex is thought to consist of three proteins: EF - la, 6 and 7(2). EF - la facilitates the GTP dependent binding of charged tRNAs to the acceptor site of the ribosomes. This process includes the binding and hydrolysis of GTP, the binding of aminoacyl tRNA and the recognition and interaction with the 80S ribosome. The multifunctional demand on EF - la obviously leaves little room for an evolution of divergent protein structures. It is therefore not surprising that the eucaryotic cytoplasmic la elongation factors isolated to date (3-10) show a high degree of homology that is indicated by physical properties like pi and molecular weight. Their © IRL Press Limited, Oxford, England. 3175 at Washington University at St Louis on June 18, 2014 http://nar.oxfordjournals.org/ Downloaded from

Transcript of Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in Drosophila...

Page 1: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Volume 16 Number 8 1988 Nucleic Acids Research

Two genes encode related cytoplasmk elongation factors la (EF-la) in Drosophila melanogasterwith continuous and stage specific expression

Bernd Hoveraann, Sabine Richter, Uwe Walldorf' and Celina Cziepluch

Zentrum fur Molekulare Biologie Heidelberg (ZMBH), Universital Heidelberg, Im Neuenheimer Feld282, D 6900 Heidelberg, FRG and 'Biozentrum, Klingelbergstrasse 70, CH 4056 Basel, Switzerland

Received February 4, 1988; Accepted March 18, 1988 Accession nos: Fl=X06869, F2=X06870

ABSTRACTWe have characterized two previously cloned genes, Fl and F2(l)that code for elongation factor EF - la of Drosophilamelanoaaster. Genomic Southern blot hybridization revealed thatthey are the only gene copies present. We isolated cDNA clonesof both transcripts from embryonal and pupal stage ofdevelopment that cover the entire transcription unit. The 5'ends of both genes have been determined by primer extension andfor Fl also by RNA sequencing. These start sites have beenshown to be used consistently during development. Comparisonof cDNA and genomic sequences revealed that EF - la,Fl consistsof two and EF - la,F2 of five exons. The two describedelongation factor genes exhibit several regions of strongsequence conservation when compared to five recently clonedeucaryotic elongation factors.

INTRODUCTION

The process of translation of genetic information from mRNA to

protein follows a distinct pathway. One step, the elongation of

the amino acid chain, involves a series of protein components

that have been classified according to their function as

elongation factor 1 (EF - 1) and elongation factor 2 (EF - 2).

The cytoplasmic EF - 1 complex is thought to consist of three

proteins: EF - la, 6 and 7(2). EF - la facilitates the GTP

dependent binding of charged tRNAs to the acceptor site of the

ribosomes. This process includes the binding and hydrolysis of

GTP, the binding of aminoacyl tRNA and the recognition and

interaction with the 80S ribosome. The multifunctional demand

on EF - la obviously leaves little room for an evolution of

divergent protein structures. It is therefore not surprising

that the eucaryotic cytoplasmic la elongation factors isolated

to date (3-10) show a high degree of homology that is indicated

by physical properties like pi and molecular weight. Their

© IRL Press Limited, Oxford, England. 3 1 7 5

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 2: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

close relationship also becomes evident by comparing the gene

structure of various recently cloned and seguenced EF - la

genes. Saccharomvces cerevisiae for example contains two genes

for the cytoplasmic factor (11-13), either one is sufficient

for cell viability. The fungus Mucor racemosus has three

closely related EF - la genes (14), and the brine shrimp

Artemia salina may even contain four copies of this gene (15).

The number of genes detected in mouse (our unpublished

observation) might exceed ten, including possible pseudogenes.

The one human (16) and part of a mouse EF - la gene sequence

known to date (17-19) all share a very high degree of homology

within the protein coding portion with all cytoplasmic EF - la

genes analysed so far.

Here, we present evidence that Drosophila melanoaaster contains

only two different gene copies of the cytoplasmic factor. We

report the isolation and sequence of cDNA clones covering the

entire mRNA for both EF - la,Fl and F2. Transcription start

and processing sites are determined by comparison of cDNA and

genomic sequences and primer extended RNA sequencing.

MATERIALS AND METHODS

DNA sequencing was performed according to the chemical cleavage

method of Maxam and Gilbert (20) and the chain termination

method of Sanger (21). For RNA sequencing we followed the

protocol of Geliebter (22) using a 20mer oligonucleotide primer

and AMV reverse transcriptase. 10 pq poly (A)* RNA and 0.5

pmol 32P labelled oligonucleotide primer were coprecipitated

and taken up in annealing buffer (250 mM KC1, 10 mM Tris/Cl pH

8.3). After heating to 80"C for 3 min the mixture was kept at

45*C for 30 min. The individual reactions were carried out at

43'C for 15 min.

For primer extended cDNA synthesis the reaction was set up as

described previously (23). For Southern analysis of chromosomal

DNA and for the screening of cDNA libraries we followed the

standard protocols as summarized by Maniatis (24). Poly (A)*

RNA was prepared as described by Hovemann et al. (23).

Oligonucleotide primers were obtained from the service unit of

the ZHBH. cDNA libraries were used that were originally

3176

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 3: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

constructed by the laboratory of T. Kornberg. We screened batch

E6 and Q4. Drosophila melanoaaster strains Canton S (CS) and

Oregon R (OR) were grown as mass population on standard

cornmeal agar medium in fly cages. (25).

RESULTS AND DISCUSSION

In an approach to clone sex dependently regulated genes of

Drosophila melanoctaster we had previously described the

isolation of two closely related genes that were arbitrarily

named Fl and F2. According to Northern blot analysis, Fl RNA

was found in all stages of development whereas F2 RNA was not

present in early embryos and was peaking in pupae. We now

identified the function of their putative protein products by

comparison of their gene sequences with those of a series of

recently published elongation factor (EF - la) genes. The

genomic sequences of both genes (the Fl sequence has partly

been published by us (1)) have now been determined in full

length and are shown in Figure 1 and 2. In order to determine

the length of the 51 untranslated leader sequences preceeding

the translational reading frame of Fl and F2 we used two

independent approaches. We determined the 51 end of the RNA

indirectly by oligonucleotide primed cDNA synthesis and, in

addition, we isolated and sequenced cDNA clones in an attempt

to obtain full length RNA copies (Fig. 3, 4).

Gene structure of Dm EF - la.Fl

For indirect RNA sequencing and primer extension we used a

synthetic oligonucleotide complementary to positions -14 to +6

relative to the translational start site (Fig. 1 and 3). The

sequence determined by reverse transcription of the Fl RNA

deviated from the corresponding strand of the genomic DNA after

the C at position -20 (Fig. 1). Indirect RNA sequencing as well

as the primer extension experiments suggested that the 5' mRNA

leader sequence was encoded by at least one additional miniexon

of together 60 nucleotides (Fig.3). This was corroborated by

the fact that the sequence difference started with an AG

dinucleotide flanked by the splicing acceptor site consensus

sequence observed in Drosophila (26). At the same time we

3177

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 4: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

1 CAATTCCACA AGAAAAAGTA TTTAAACCTT AT1TAAATTT AGTTATAAAT TTTTACAATA TTGTATAAAT TGATACAATA CCAAATTATT

• 1 TCTACTTTTA CTTOTAGCAT ATTTTAATTO TAGCAACGCA CGCAAAAAAA TCTATAAATA AATOAAACGT ATTACATAAA TTCAAACOAT

l t l CTCAAOCTTC CATTCTTATT TAAAOTTCTA TTACGTTAGO OTTCACATAC AAATTAAAGT GGCAGOTTCT ATCTCAAAAC ATTCCTTCAA

2 7 1 AATGCGGACT ACTAATGCAA TTOTTATTOT TTTTACATAT TAAAAOATAT OTCTTCCAAT ATTACCTATA OAAATTATAC ACATCOTTTT

3 6 1 GTAGAAAATA CTTTTGOAAT CACTGATTAT TTACTTTTTC ATATAAAAAC AATCTCOAOC AAACAAGCTT TTTTAAATTC CTCAATCTTT

4 51 AOOTTATTGT ATTTTOCCAC TTTCAATCAC TT1AATTTCA ATAAAATOAA OTOCTTCATT CCCCCOTAOT GGAAACACCG CAGTGOOAAC

3 4 1 ACOGTTTCTG CTCTTTTGAC AOTTOCOTAO CTTCOCTCAC ACCATGTGTC AAACGACGCT TCCTCTGCTC AOCTCTOCCO AACOCTCCTT

6 3 1 CACTTTGTTC GAATCCGTCG CCGCTTAGAC TTCGTGATTT CTCATTCAGC TTATTAOAOA OTAAGTTTTA CCTOCOAOOC TATAATTAAO

TOATTTCTOC AAAAAAACTO CAOQGGOCAA AC1ATTIATA AACAAATATG CACCTOAOAC OCCOAATTTG TCCATATTTC CAGTGTTTTT

CCTUTVTUH. TGTAATAAAC CCOOAOATAA CCTCTAACTO CGCTTTTCCA AAGTGAAAGG TGOCCATAOA AOCAAACACG TOOCAAOTCT

GCAAAOOCAA AAATTTTAAC TOOCOTTCCC AOTTAAAOTT CCCAOCATTC TCAAAATAAT TTTCCGOCTT TTCCGCCCGC A1II1COCCC

TOCAATATOO TGCACTTAOC OTOTAATTAC TTTOCCACGC CCACGCCGGA CACAGAOGTC ATCCACCAOA TGTOCTCATT AACCOAOAAA

AAAAAACGTG CTTTCTCTCT TOCCTTTOTC ATCGCCTATA OATATTCCTT A l H . l l l l . i l TTTGCGGCAT GCAATTCTAA AATOGCGACC

CASTOGCOTO AOTCAAOTOC OCGAAAAAAT TCOCCTOOCA ACAAGCGAAA AAATGTOCTT TTTTOOOTTT CCAOCCCATT AOCATATCTO

OTOTAATCGC ACTCOCATCA OCTATTTCGC C1TTTCCAAC COACTCAATA ATTGGTTTTO OTAAAATOOC TOCCGCTOCA CTACOTTCTT

OATTAATTCO TTOTOTOCCC L l U L I l l l l CATTTCTTTC CAATTACCAA TTGTGCCACC SCOGCOGACA C5CTT0CATT TGTACAAGTC

ACACACGCAC ACTAATCCAC ATCCOCCATT TTGGTCTCTC TCTCTTCCTC TCTTACTTTT TCCOOCCOOC AACAGCGTCA CACAAATACA

CAGGCATAGA TATACACACO CATAGOCAGA TAAGCACATO TGTATTTOCO AATTAAATTT GCTGCAATTT TCCTTTOOAC TCTTCOATTT

AACATOATOA TGATTTTTCA OTTCTGCTAC TOAAGAOAOT TGACAOAAAO CAAAAATACC AAAATCACTG AAACAAAATC OAGTTTCCAT

ATCOAATTTT ATTTOCACGC TCTTTTCTOT AGTTOCOCCC CACTCGTTTT ACCCACACCC CTACATOCOO OCACTOaTCC TAACCTCAAA

AAACACGTTT TGTACOGCTG CAAGAGTTTO AOGTTAGGTr OTOCTCOCOC ATGCAAACAA AAGTCOAACG TACOCTAOGG AAATGAGAAA

OTCTTATACC CACTAATAAT rGTAOTTOTA ATCCCACCGA ATTCTTTTAC CCTTTGTTTA TTCCAACCTC TCTTOCTCOC CAACCCGCCG

AACCCTOCAA CCTTCCAATG TTCCAACGTT CCOTTAATCC AACACTCOAA TACACACAAC AO :CATAGTO TAATCATCCA AC

3 0 6 3 ATO HOC AAG GAA AAO ATT CAC ATT AAC ATT OTC OTO ATC GGA CAC OTC CAT TCC OOT AAG TCO ACC ACC ACC GOAH « t GLr L r « G l u Lys 1 1 « H i s z l « A«n I I * V a l V a l I I * G I T H i * V a l Aap 3 * r G l r Lys 8 * r Ttar T h r Tbr G l r

2 1 3 1 CAC TTO ATC TAC AAO TGC GGT OOT ATC OAC AAO CCT ACC ATC GAG AAG TTC OAG AAO OAO OCC CAG CAO ATG OOAB i * L*u I I * T r r Lr* C y s G l y G l r I I * Aap Lya A x o Tbr I l a G l u Lya Ph* G l u Lya O l u A l a G l a O l u H*t G l r

2 3 1 3 AAG GGA TCC TTC AAG TAC OCC TOO OTT TTG OAT AAO TTG AAO OCT OAG CCC OAO COT OOT ATC ACC ATC OAT ATCLya O l y S a r Pha Lya T y r A l a T r p v a l L « u i A a p L r a L«a Lya Ala Olu Arg Gla Arg Oly I l a Thr I l a Aap I l a

2311 GCC CTO TOG AAO TTC GAA ACT GCC AAO TAC TAC OTG ACC ATC ATT OAT GCC CCC GOA CAC AGO OAT TTC ATC AACAla L«u Trp Lya Pba Olu Tbr Ala Lra Tyr Tyr Val Thr I l a I l a Aap Ala Pro Gly Hi* Arg Aap Ph* I l a Lya

2163 AAC ATO ATC ACT OOT ACC TCO CAO OCC OAT TGC GCC CTO CAG ATT OAC OCC OCC OGA ACC OOA OAA TTC OAO GCCAan H*t I I * Thr Glr Thr f a r OlD Ala Aap Cya Ala Val Gin I I * Aap Ala Ala Gly Thr Oly Olu Pba Gla Ala

2431 OOT ATC TCO AAO AAC CAC CAO ACC COC OAO CAC CCC CTO CTC CCC TTC ACC CTO OCT OTO AAO CAO CTO ATC OTTOly I I * » . r Lya Aaji Aap Gin Thr Arg Olu I l a Ala L*u L*u Ala Pha Thr Lam Oly Tal Lya Cln Lau I I * Val

2913 CGT OTG AAC AAO ATO OAC TCC TCC GAS CCA CCA TAC AOC GAG GCC COT TAT OAG OAA ATC AAO AAO OAA GTG TCCOly Val Aan Lya H«t Aap f*r l « r Ola Pro Pro Tyr l * r Gla Ala Arg Trr Ola Glu I I * Lya Lya Olu Val f a r

2 J I I TCT TAC ATC AAO AW GTC COC TAC AAC CCA OCC CCC GTT OCC TTC OTO CCC ATT TCC OOA TOO CAC OGC CAC AACf a r Tyr I I * Lra Lya Val Gly Tyr Ajn Pro Ala Ala Val Ala Pha Tal Pro I I * f a r Oly Trp HI* Oly Aap Aan

3178

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 5: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

1 6 ( 3 ATC TTO O U CCC TCT ICC U C 1TO CCC TOO TTC U O OOA TOO O U OTO OGA COC U S GAG SGT U C OCT OAC COC• a t Lau O l s f r o I . r Thr U l Hat Pro Trp Ph . Lra Olr Trp Olu V«l O l r Are ! • ! • Olu S i r U n Ala U p O i l

2731 U O ACC CTO OTC S i T SCC CTC OAT OCC ATC CTT CCC CCA OCC COT CCC ACC OAC AAO SCC CTO COT CTO CCC CTOLra Thr Lau Val Asp Ala Lau Aap Ala I l a Lau r r o r r o Ala Arg Pro Ttar A l p Lia Ala Lau Arg Lau f r o ban

2113 CAC BAT OTO TAC AAA ATT OOC OOT ATT OOA ACA OTA CCC OTO SOT COT OTO OAO ACT OOT CT6 CTO AAO CCC COTOln Aap Val TTT Lra I l a Olr Olr I l a S i r Thr Val Pro Val S i r Are Val S l u Thr O l r Val Lau Lra r r o C l r

2 M I ACC CTT OTO OTC TTC CCC CCT OCT AAC ATC ACC ACT CAS OTC AAO TCC OTO OAO ATO CAC CAC OAO OCC CTO CAOThr Val Val Val Pba Ala Pro Ala Aan I l a Thr Thr Olu Val Lra f a r Val Olu Hat I l a I l a S l u Ala M i Oln

1 H ] SAO SCC OTT CCC OOA OAC AAC OTT SCC TTC AAC GTC AAO U C OTO TCC OTO AAO OAO CTO COT COT CCC TAC GTTOla Ala Val Pro S i r Aap Aan Val S i r Pha Aan Val Lra Aan Val Sar Val Lra S l u Lau Arg Arg S i r Trr Val

3 t 3 l SCC SOT SAC TCC AAO SCT AAC CCC CCC AAC CSA SCC OCC SAC TTC ACC CCC CAS GTC ATC STG CTS AAC CAC CCCAla Olr Aap Sar Lra Ala Aan Pro Pro Lra Olr Ala Ala Aap Pha Thr Ala Cln Val I l a Val Lau Aan I l a Pro

3113 SOT CAO ATT CCC AAC SOC TAC ACC CCA OTC TTO SAT TSC CAC ACC SCT CAC ATT SCT TCC AAO TTC SCT CAS ATCS i r Gin I l a Ala Aan Olr Trr Thr Pro Val Lau Aap Cra l i a Thr Ala I l a Zla Ala Cra Lra Pha Ala Olu I l a

31SS TTO OAO AAO OTC OAC COT COT TCC OOC AAO ACC ACC CAC SAC AAC CCC AAG TTC ATC AAG TCT OCC OAT OCT SCCLau Ola Lra Val Aap Arg Arg Sar Olr Lra Thr Thr Olu S l u Aan Pro Lra Pha I l a Lra Sar S i r Aap Ala Ala

32C3 ATC OTC AAC CTO OTO CCC TCT AAO CCC CTO TOC STG GAG CCC TTC CAC GAG TTC CCC CCT CTC SST CSC TTC SCTI l a Val Aan Lau Val Pro Sar Lra Pro Lau Cra Val Olu Ala Pha Oln Ola Pha Pro Pro Lau Olr U g Pha Ala

333S GTO COT OAC ATC ACS CAS ACC STO SCT STC SGT STC ATT AAS SCT OTC AAC TTC AAO SAT SCC TCC OST COC AASVal Arg Aap Hat Arg S i n Thr Val Ala Val Olr Val I l a Lra Ala Val Aan Pha Lra Aap Ala t a r Olr Olr Lra

3413 GTC ACC AAO OCC GCC SAS AAS SCC ACC AAG CCC AAO AAO TAO CT00TTT0CT TCCACTCAAC ' " • i i c u n ACACSCASTAVal Thr Lra Ala A la S l u Lra Ala Thr Lra S i r Lra Lra • • •

3495 STAOCAGCAA CAACAAOCAT ATAACCAACA TCATAATCCA CCCAACAACA CCACTCAATA ATACCASCAA CAOCAOCAOC OUCACAATA

3515 OTAOTATAAC ACCAACACCT OTCCTSCOCA AOATOACCOA TAASATSATS TTTCACCACA ACCATAAGTT TUTTTCTTC CATCOAAAOG

3675 AGTTTCGACO OATACCUTG CTAAATCCAC ACCASCCCGC CTTCACTOOS AAATCOSTCS ATCCCAAOOA TAAOAOTSCA CACTSSCAAA

3765 ACACTTOCAT TTATOCATCC ACTCCTCATC CACTTCCCCO TC01H.1I I t OTTTACTAAA TATGCTATOA TSCACSCAOT TOACTTCOTT

3155 TTATCATATC ATATATAOOA ATCCTCTOTA OCATTTATOA TATCOTTTU ATTAACCTTT ATACTTTOAT ATOTATCATT TATCTTACCC

3945 TACTTTTOCA CACACTACTT TSTACACAAC AAAAOUCCA 0UT1OAAOC OATAAACTAT ATTTACAAAA AAUTAAAAA CCCTATTTTT

4*35 OTATTTCTTT TSTTTTTACC ACCCAOCCCO TAAAAGACCA CTCTCTTTTT OOTTOTTOCC TCCCOATTT

FIGURE 1Sequence of the EF - la,Fl gene region. Throughout the codingportion aminoacids are written below the sequence. The intronarea is boxed. Transcription start and polyadenylation site areindicated by an arrow and a dot, respectively. A sequenceheterogeneity observed in cDNA cDml9 is indicated at position686. The oligonucleotide primer i s shown by an arrow (Pos.2049-2068). Conserved sequence blocks are underlined.

isolated several Fl cDNA clones from a AgtlO library which had

been prepared from embryonal poly (A)* RNA. By comparison of

their restrict ion s i t e patterns with the genomic map some

isolates seemed to contain a mRNA copy including the very

5'end. One such cDNA (cDm 19, Fig.3) was subsequently seguenced

and showed the same 5' end portion when compared to the

indirect RNA sequencing results. The heterogeneity observed at

nucleotide 56 of the f i r s t exon may be attributed to a

difference in the inbred strains Canton S and Oregon R that

3179

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 6: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

were used for genomic and cDNA library construction,

respectively (Fig. 1).

In order to locate the 5' end of EF - la,Fl mRNA within the

genomic sequence, primer extended 32P - labeled cDNA was

hybridized to a blot of DNA containing the subcloned BamHI and

two Hindlll fragments proceeding about 10 kb upstream of the

EF - la,Fl coding portion. Alternatively, cDNA clone cDml9 was

hybridized to a blot of the Fl gene and its upstream sequences

included in phage ACS1 (Fig. 3). Consistently, we observed

hybridization to the 1.3 kb and the neighbouring 1.2 kb EcoRI

fragments. In order to be able to precisely localize the 51

exon(s) in the genomic DNA we subsequently sequenced both EcoRI

fragments (Fig. 1). The resulting data showed that the first

exon is contained within the 1.2 kb EcoRI fragment. EF - la,Fl

thus extents over 3.4 kb and consists of two exons that are

separated by a 1.3 kb intron (Fig 3).

Structure and sequence of the EF - la.F2

An EF - la,F2 containing 4.0 kb EcoRI fragment of genomic DNA

that had been identified by crosshybridization to the EF -

lcr,Fl cDm49 probe (1) has been sequenced using both the methods

of Maxam and Gilbert and of Sanger (Fig. 2). Comparison of this

genomic sequence with those of a series of cDNA clones isolated

from cDNA libraries of pupal RNA by hybridization with the same

4.0 kb EcoRI fragment revealed a single long open reading

frame. It encoded a second, related elongation factor protein

that is one amino acid shorter and shows 90% homology when

compared to EF - la,Fl (Fig. 8). Two introns that interrupt the

coding portion of the F2 gene had already been mapped (1). To

determine the transcriptional start of the EF - la,F2 gene we

proceeded the same way as described for Fl and used a synthetic

oligonucleotide primer complementary to nucleotides -14 to +6

relative to the translation start. The major cDNA product

primed with this oligonucleotide on pupal RNA was 142

nucleotides long (Fig. 4). In addition, cDNA clones that

possibly reached the 5' end of the F2 mRNA were isolated from a

library prepared from pupal poly (A)* RNA. cDNA pc3 was

selected as the clone that extended farthest to the 5' end.

3180

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 7: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

1 TAAGCCAATA GTGTGCACAA I H l l u l l C AATTAOTOGT CAATOTGCAT ACTTTAOTGA CAGTCCCTGA AACTACTATA TTATTTTATC

»1 TOCAAAAGAC TCAOTTTAAO U U T t T U I ATATTCCATO AATOOTAOTA AAATTGTATT ACTAI1UIA TTTTGOTACO TTTTATACTT

1 1 1 AAOOCATOOA AACTTTATTT AAGTCAAGAA ATCCGCATAA TOCAATAGOA AACCCAAOGC CIUU11ATA CATGCAATCC TGTGCCATCT

271 CTAGOTCGGA ATCAOTTCAG CTCCGTTCAC CTCACCATCO U U l i U U L OQTCTTTCCG TTTTCTOATT TCOAobTAAG TGCACGCAGA

361

7 2 1

• 11

OAODT

OCTCCCOTTA AAATTOTGAA AATATTAATA OGCATTOATT ASMJTOOAA ATGTAAAAAO OGAAAGTCCC AGAATTCCCT ACCCTGCATT

ATTAOOCGAA TTTCOOTTCG ATTTCCAACC TAAAGAAACT TCTAAAGTAA ACAAAOTTCC 0AAAASTOA0 AOACTCTAAC TOATTTOCOC

TCCCOOCCCO TCTTCTCATT CCTTTTCCAT AATAOCTGTO TAAATCOATT CCAATTOOAA ATTOCTTTTC CAGCOACCTT AAATTGCAAG

TAAATTAATA AACTTCCATA OACTTTCOAA TTCCAACATG OCOACCGGCT GCATGTOTCT GCOCOTTCOA TTPTOCCTOO ATTOTACCCG

TTTCTCCTTC CCOTTCTCAA GCCGTTTATT CCCOAGTAGT TTCTATTOOA ATTCGCAGOC u m i n n TATCCGCGGC ATGATGOCAC

ATOOTTAOCA OATTATTTTC TTCCCCTCCA TCTCTOACOA AOTATTTTOC ATATTCTTTC CCCCTTCATT CCCATTCCTT CTTCCAATTT

OCACTTCOAT OCAAATACAA AGATTTAAAA ATOGCATCCA GOAAAATCOG CAAOTOAAAC TOTCACTOOG OTAOAAAATA AATCACAACG

CCCTOCAOTT CTCGCCGTCT CTTTCCCTTC CTTCTCTOCA TGACCAGCAA GTGCACTOCO CCCGTTCOCC GTCCCTTTCT CTCCCOCTCT

CTCCATCTCC CTCTACAOTT TTTCACCCTT TGOAATCGCG OOATTTTCOC COCACOACCC CCACCOAATC CCGATOCTTT TOOCCATTTC

CCTTTOOATT TTCTTCCACC GTGCTGCGAA AOTTGCCAAA TTTCGGCATT TCQACA111U OCTTAATTOA AATCCGTTTO GGTGTOCOAT

TTTCATTOGT TTTCCCACTA AAAACGCCGG CCOOCACATT TTCOCCATOC ACTOCCOCAC TTCCCOOCTT TCCOACOAOO U111CTCTTC

OGCTTAATCC TCTCCAOCCG AGGAGAGTGC ATTTTCCCAO TACOCACACT TCOOCTCCAT TCOTTTCTGT CT1WGOCTCO TTATTGATTT

TTCGCCCOGT GCACTTCGGC AGAGGATATA CACOOCAOTC TTTAACCAAC AGACACTTOG CCCOGTCOTO OTCCOOCTOC AGAGTACGGA

AOATCCGCAT AGAGTTTAAA AACTOCCATT TTTATGACAA CGATTTCCTT CTAATTCTAO OATATAOCOT COCOTOOOTT TOTOATCAOT

1 6 2 1 TTCTAAGTCC GCCAOTTGCC OAOTAATAAG AAACTCTAGA AAOTCTCOTO AAAACAG )TO A 0 I 1 U I I . H . CTTGTAAATT CTTGCTGCAT

AOATTTOTOO OCAAAAATAT TATGGOAATA TOOOTOTATT TCTCAATCGT ACA

OTATTTATAT TTTTCCTATT ATTCAGTATA AOOCTTAATT TOAACTAATT OOTAAACTTT TCGCGTOATT TTCGTCTTTA CTCTTOAATT

OTTTAAAATT CGTATTTTCO AAATATAAAA OTTCAACOOT TTTCCCTGTC TACGTTTOTO CCOTCCOTAT OAAOTOTOCT 1"1TUU1\.T<.G

CCACCACGAT GACACGACCC ACAGCATACA OACOTCACTC GTCTOCACCA CCCA1TAAGT TCAGACCCAC ATTOOCATGC TACCTCCCCO

ACTACGGAAA CCACCCACTT TGCTCATCCO AATACCTOCA TCCCTTCTGT CTCCCAC:AG CTCTAAAAAA TAGCTTAATC TGCAAGG

2 1 5 1 ATG OOC AAO GAG AAG ATC CAT ATT AAC ATT GTG GTC ATT OGC CAT OTO OAC TCC GGC AAG TCO ACO ACC ACC GGCK.t Glr Lr> Olu Lra I l« l i a I l « kmo I I . V.I V.I I I . GlT His Val Asp l . r Glr Lrs » .r Tbr Tbr Tbr Oly

2333 CAC TTO ATC TAC AAA TOC GGC GGC ATC GAC AAO CGT ACO ATT GAG AAG TTC OAG AAO OAG GCC CAG GAA ATO GGAH i s L«u U s Trr Lrs Crs Olj Clj U s Asp Lrs Arg Thr I I . Olu Lrs Ph. Olu Lrs Olu Ala Oln Glu Hat Olr

3 3 « « AAA OOC TCC TTT AAO TAC OCT TGO CTA CTO GAC AAG CTG AAO OCA GAG COO CAG CGO COC ATC ACC ATC GAC ATTL T S G l r Ssr Pha Lra Trr Ala Trp Val Lau Asp Lrs L«u Lrs Als Olu Arg Olu Arg Glr I l a Thr I l a Asp I l a

33*3 OCC CTA TOO AAO TTC GAG ACG TCC AAO TAC TAT OTO ACC ATC ATC OAT OCC CCT OOT CAC AOO OAT TTC ATC AAOAls Lau Trp Lra Ph. Glu Thr l ar Lrs Trr Trr Vsl Thr U s U s Asp Als Pro Olr l l s Arg Asp Fha U s Lrs

2431 AAC ATO ATT ACC GGT ACC TCT CAO OCC OAT TOT GCG CTG CTG ATC GAC GCC OCC GOA ACT OOA GAO TTC OAO OCCAsn H a t I la Thr Glr Thr far OlD Ala Asp Crs Ala Val Lau Ila Asp Ala Ala Olr Thr Olr Glu Pha Olu Ala

2 3 3 3 GOO ATC TCO AAO AAC OOC CAO ACC COC CAO CAC GCC CTT CTO OCA TTC ACG CTO OOC OTO AAO CAO CTT ATT OTOGlr I la far Lrs Asa Olr Oln Thr Arg Glu His Ala L*u L«u Ala phs Thr Lau Olr Val Lrs Gin Lmu Zla Val

26«I OOC OTC AAC AAG ATO GAC TCC ACT OAO CCO CCO TAC AGC OAO GCC CGC TAC OAO GAG ATC AAO AAO OAO GTG TCCOlr Vsl Aso Lrs Hat Asp Isr Thr Oln Pro Pro Trr Ssr Olu Ala Arg Trr Olu Olu I la Lrs Lrs Olu Val Ssr

3181

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 8: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

2613 TCC TAC ATC AAG AAG ATC OGC TAC AAT CCC GCC»ar Tyr Il« Lya Lya Ila Gly Tyr Ajn Pro Ala

2751 ATC CTC CAO CCO TCC OAC AAO ATC CCC TCC TTCHat L«u Olu Pro far Olu Lya Hat Pro Trp Pba

2133 AAC TCC TTC ATC CAC CCO CTC OAC OCO ATC CTTLys Cya L«u Ila Aap Ala Lau Aap Ala Ila Lau

2908 CAC CAC CTC TAC AAC ATC CCA COC ATC COA ACCCln Aap Val Tyr Lya Ila Cly Oly lit Oly Tbr

TCO OTC CCC TTC OTO CCC ATC TCC CCA TCC CAC CCC CAC AATfx Val Ala Pba Val Pro Ila far Gly Trp Hla Gly Aap Aan

AAO OCA TCG TCC CTC OAO COC AAG GAA COC AAC CCA OAC CCCLya Oly Trp lar Val Clu Arg Lya Olu Oly Lya Ala Glu Cly

CCA CCC CAO COT CCC ACC OAC AAC CCC CTC CCC CTG CCC CTCPro Pro Cln Arg Pro Tbr Asp Lya Pro Lau Arg Lau Pro Lau

CTA CCA CTA COT COT CTO OAO ACT CCT CTC CTC AAC CCA CVal Pro Val Oly Arg Val Glu Thr Cly Lau Lau Lya Pro

2*11

3071

3161

3251

3341

3431

GTAAGCCTCC

GGGCTGGCAT

AOCOAGACCC

ATAACCCTAG

CTCTCTCTCT

TTTCAC

GGCTTGATCA

CCAACACCCA

TCCACCTCAT

CTAACATATT

ATCCACCCAC

CC ATC OTC

GGTCGGGTCT

CCOCCTCCAA

OCACTTOCTC

AATATATOTA

CCCCATCCAA

OTC AAC m

OOOCCCTCTT

AOCOOAOCCO

AAATTCTCAC

OAOCTATTOA

ATOTCTACAC

TTCTCTTTGG

CAACGAAGTC

TCCGAAGAGC

TTCAAATAAA

ATAACCTCCC

P CCO CCO OTC AAC CTC

OCACTTCATA

TTOCOCATOT

TTCCATTTTC

AATAAATTCG

CATATGTAAC

CTC ACC CAA

CATCTATTCT

ATOCATTATT

AACATOAAAC

AGTTACTTCC

rrcorrrcoo

OCAAAATTTO

CAOCCAACCT

TGAAAGGCCA

AATAATATCG

TCOCTTCCTT

OOTCOACACT

CTTCOTCGAC

TTAAAATAAA

CTCCACOTTT

TCCCCTTTCC

OTA AAC TCT CTO CAC ATC CAC CAC

3502 CAC OCT CTC ACC OAA OCC ATO CCC GGC GAC AAC CTT CCC TTC AAC OTO AAO AAC CTC TCC CTC AAC CAC CTC CCTClu Ala L.u Thr olu Ala IMt Pro Cly Asp Asn Val Clr Pba Aan Val Lys A«n Val lar Val Lys Clu Lau Arg

3577 CCT CCC TAT OTC OCC OCC OAT TCC AAC AAC AAT CCT CCT AOC OCA OCA CCC CAC TTT ACC CCT CAGArg Gly Tyr Val Ala Cly Aap Sar Lys Asn Aan Pro Pro Arg Gly Ala Ala Aap Pha Thr Ala Cln

GTAGCGTAAC

3653 AAACATCACA AATCTTTOAT ACTTCAACTC ATCTTTCTTT GTO ATT GTC CTC AACVal Ila Val Lau Asn

373. CAT CCO OOC CAO ATC OCC AAT OOO TAC ACT CCC GTC TTO OAT TOC CAC ACO OCO CAC ATT OCC TOC AAG TTT TCCHis Pro Oly Oln Ila Ala Asn Oly Tyr Thr Pro Val l*u Aap Cys lis Thr Ala Hia Ila Ala Cys Lys Pha lar

3111 GAO ATC AAO OAO AAG TAC OAC COC COT ACO GGC GGA ACC ACC OAA OAC OGO CCO AAO OCT ATC AAO TCC OGG GATGin Ila Lya Olu Lys Tyr Asp Arg Arg Thr Oly Oly Thr Thr Olu Asp Oly Pro Lys Ala Ila Lys lar Oly Asp

3116 OCO OCC ATC ATT GTG CTG CTO CCC AGC AAO CCO TTO TOC OTA OAO AOC TTC CAO GAG TTC CCA CCO CTO OOA COOAla Ala Ila Ila Val Lau Val Pro Sar Lya Pro Lau Cya Val Olu Sar Pha Oln Glu Pha Pro Pro Lau Oly Arg

39*1 TTC OCT GTG CCC OAC ATO AOO CAO ACC OTO OCC GTG OOC GTC ATC AAG TCG GTO AAC TTT AAA OAG ACG ACC TCGPha Ala Val Arg A«p Mat Arg Oln Thr Val Ala Val Oly Val Il« Lys Sar Val Asa Pha Lys Olu Thr Thr lar

4t3i CGC AAO GTG ACA AAA OCC OCT OAO AAO OCA CAO AAO AAO AAA TAA CTAGCCTACC ACCAGAACAA CCTCATCACTOly Lya Val Thr Lys Ala Ala olu Lys Ala Oln Lys Lys Lys •••

4111 COAACCCAAC AACAACAAAA ACAGACGGCT AOAGCAACAO CAOCAACAAC ACACAACAAC AATACACATO TCAAAATTAT AATACCCACT

42tl CGACGATCAA ATTCACACCT TOACTCCATO OCAAOAOAOA CACCAATTAC TACTATTACT AOCTOCTCOO AOAAOCOOCA OATATTAACC

42«1 OAAATCOAOC AOATTATACC CTATATAATA ACCACACGTA COATTAOCOA OOAOAOOAOC ATCAOOTOCA OCOAOOATOC CAAGOAGOAO

4311 CCCTTCCAOC CTCOCCOOOT COOTTTTOOT COCCTTCCCC OTOGTOOTCT ACTCCAOCTA TCTOAACATO TATCOTCACC GCAAGTCCTT

4471 TCOTAGGAAA CCACCCOCTA OCCACTCCOC AGAGTOGATA GOOGCCTCCO GAGCACTOCT OTAOCCCOCC CCTTCOATAT ATACTCATCT

45*1 CTAAAACTAA CCTTACACTT OATTAGCAGC CACACATCCG OTCGCATCCA CCTOTTTCOA ATCCATTTTA AACACTTTTT ATACTTTTOA

4«51 TAAOTCAAOT COOAOOCATT COATTTAAAA TCTATTOAAA TATOTAATTT CCOAATTTAO TTTTAAACCA COTCCOCOCT CCCAAAAATC

4741 CCCCOAACCO AAAAGACTAC ATTCOCOATO AATTCAAAAT TTCTCTTOAA ACCAAAAAAA ACAAATOCTT AACAAOTATT ACAAAAAAOA

4131 AATCAACATT ACACACATAA TCATOCOOTT TTTGAAAACA TTATAAATOT TTAATCOAGC CTCATTTGCA TTTOCATATT ACATAATATA

4»21 COTTA«CCAC ATOTCATCTC ATTOCCCATA ATAACCTOCA TCCTOCATAT TATACACOTT AATCTCACAC TCTOAATTTA TACAAACCOA

5»11 AOACAATTOT AACCOACACC AOAACAATTC TTOOATACAO AACATCTTCC CTTGATAAAA OATCTTTTAA ATOATCAOAA AAATAAAGGA

51»1 AOCTTAACCO TAAAATACCA CACACGAACO CCTTTTAATT OAAAAATACT TOAATATCTA TOAAOAAAAT CAATTC

3182

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 9: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

FIGURE 2Sequence of the EF - la,F2 gene region. The localization oftranscription start, polyadenylation site, intron, exon, codingand noncoding portions is denoted as in Figure 1. According tocDNA pC3 and pC4 different splice acceptor sites at position2129 and 2132, respectively, are used. Oligonucleotide primersequences are indicated by an arrow above the sequence. Notethat due to the intron the arrow representing the primer closeto the 5'end is split. A conserved sequence motif isunderlined.

Hybridization of the oligonucleotide primed cDNA of 142

nucleotides in length to an EcoRI Xbal digest of the subcloned

fragment 1 (Fig.4) and of the 5'EcoRl - Xbal fragment of cDNA

pc3 to a blot of A17 DNA containing 10 kb sequence upstream of

the F2 reading frame gave rise to hybridization signals in two

different regions (Fig.4). Extending the sequence analysis of

the genomic DNA to about 2.5 kb upstream from the translation

start and determination of the cDNA pc3 sequence resulted in

the localization of two 51 miniexons. They were separated from

the coding portion of the gene by a 0.45 kb intron at position

-27 and a 1.25 kb intron at position -114 relative to the

translational start. When compared with the length of the

oligonucleotide extension product of 142 nucleotides, cDNA pc3

was missing 1 nucleotide at the 51 end. In order to map the

transcription start of F2 to the nucleotide precisely, we

repeated the primer extension experiment with an

oligonucleotide complementary to position

-124 to -104 relative to the translational start (position 336

in Fig: 2). An extension product of 12 nucleotides (data not

shown) confirmed the result already obtained with the first

primer. In summary: EF - la,F2 is organized in five exons

separated by intron sequences of 1.24, 0.45, 0.45, and 0.08 kb

in length and extends over 4.8kb.

Stable promoter elements

The use of alternative promoter elements has been shown to be a

means of differential gene regulation. In order to find out

whether alternative promoters were also used for the expression

of EF - la genes during p_. melanoaaster development, we

performed primer extension experiments using poly (A)* RNA from

3183

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 10: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

T EE E EI I 1.2 I 1.3 I 3 .9 1

Hu B B B BB BH H

1kb

•« P

cDm 49

cDm 19

EF-1 ,F1

II

— 66

FIGURE 3Structure of the EF - lcr,Fl gene locus .Part I shows a r e s t r i c t i o n map of phage x CS1 ( %J) containingthe Fl gene. Res tr ic t ion s i t e s are shown for EcoRI (E), Hindll l

3184

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 11: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

(H), and BamHI (B). The size of the EcoRI fragments harbouringthe Fl gene is indicated in kilobases. Subcloned Hindlll andBamHI fragments used to localize exon I are numbered 1-3. Belowthe restriction site map the localization of cDNAs 19 and 49 isdenoted by horizontal lines. The complete Fl gene structure isgiven as thin line and boxed area for intron and exonsequences, respectively. The open box confines the coding part.P together with a filled triangel denotes the position fromwhich the oligonucleotide primed cDNA synthesis is started.Part II: Panel A shows the cDNA synthesis product and theindirect RNA sequencing result. Panel B shows an Ethbr stainedagarose gel of the subcloned Hindlll and BamHI fragments 1-3digested with Hindlll and EcoRI, respectively, together with anSouthern blot of this gel hybridized with the "P - labelledoligonucleotide primed cDNA. Panel C: Agarose gel separation ofthe EcoRI fragments of the genomic DNA cloned in phage X CS1.Hybridization of 32P labelled cDNA cDM 19 gives rise to thepattern shown to the right.

0 - 16 hr embryos (E), climbing third instar larvae ( L ) , 2 - 4

day old pupae (P), and of adult flies (A). Since both primers

were complementary to sequences encompassing the beginning of

the respective EF - la protein coding regions (Fig.l and 2), a

switch to an alternative promoter would have been reflected in

the appearance of cDNAs exhibiting different lengths. As can be

seen in Figure 5, the primer extension products remained of the

same length throughout development therefore indicating, that a

promoter switch is obviously not apparent during either Fl or

F2 expression. Since the radioactive label incorporated into

the cDNAs represents the relative amount of RNA available for

primer extension at the respective stage of development, our

results indicate that EF - lo,Fl RNA is present in all stages

of development, although to a variable amount. EF - la,F2 RNA,

on the other hand, is highly expressed in the pupal stage but

is also present to a lesser extend in third instar larvae and

flies. In addition to the major cDNA product of 142 nucleotides

in length a weak additional band of about 160 nucleotides

appeared possibly indicating the existence of a minor

transcription start site for EF - la,F2.

Only two gene copies for cvtoplasmic EF - la exist in

Drosophila melanooaster

E. coli or Saccharomvces cerevisiae have been reported to

contain two nearly identical gene copies of EF - la. In

3185

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 12: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

St

IBXh S B EEE Xb S S E E E

i1 kb

Pc3

Pc4

Pc21

EF-1, F2

U2 - I

D

/.o -

2 1 - • «3

02

FIGURE 4S t r u c t u r e of t h e EF - l a , F 2 g e n e l o c u s . Part I shows ar e s t r i c t i o n map o f t h e DNA c l o n e d i n phage A17 ( U ) .Res tr i c t ion s i t e s are indicated for Styl (S t ) , Sa i l (S) , Xhol(Xh), Xbal (Xb) and as in Figure 3 . 1-3 denote s subcloned

3186

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 13: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

fragments used for localization of exon 1 and 2. Primersequence, cDNA localization and resulting gene structure areshown as in Figure 3.Part II: In panel A the cDNA synthesis products are shown.In panel B,C, and D Ethbr stained agarose gels are showntogether with their Southern blot hybridization results: Bsubcloned fragment 1 digested with EcoRI and Xbal andhybridized with primer extended cDNA, C X17 digested with EcoRIand hybridized with the 5'EcoRI-XbaI fragment of cDNA pC3. Dsubcloned fragment 3 digested with EcoRI and Styl hybridized asin in C. Only the Styl site that gives rise to the hybridizingfragment is indicated.

contrary, our results for Drosophila melanoaster suggest the

existence of two different elongation factor genes. In order to

decide whether the two identified gene copies are the only ones

that code for cytoplasmic elongation factor EF - la in

Drosophila. we performed genomic Southern blot hybridization

experiments using EF - la,Fl cDNA cDm49 (Fig. 3) as a probe.

DNA was isolated from Canton S embryos, digested with either

EcoRI or Hindlll, blotted and hybridized. EcoRI digested DNA

gave rise to a 4.0 kb hybridization signal of twice the

intensity when compared to a second signal of 1.3 kb. Two bands

appeared with Hindlll digested genomic DNA (Fig 6). In each

case, the hybridizing DNA fragments could be assigned to the

map of the already cloned Fl and F2 DNA (Fig. 3 and 4). The

part of the Fl gene, for example, that is contained in cDm49,

spans a portion of the 1.3 kb and the 4.0 kb EcoRI fragments

whereas the crosshybridizing part of the F2 gene is also

F 1 F 2

E L P A E L P A

8 6 - 4 ^ ^ ^ ^ _ M.2

FIGURE 5Primer extension products obtained using Fl and F2 specificoligonucleotides (see Fig.l and 2). Poly (A)* RNA from 0-16hour embryos (E), third instar larvae (L) , two to four day oldpupae (P) and flies (A) is used.

3187

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 14: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

U 5 -

63-1 4 0 -

13-

IHtndBI Eco RI

FIGURE 6Genomic Southern hybridization with cDm49 (see Fig.3) DNA ashybridization probe. The size of the identified DNA fragmentsis indicated in kilobases.

located on a 4.0 kb EcoRI fragment. Titration experiments with

gene specific hybridization probes revealed that both genes are

present as single copy per haploid genome (34) . Therefore, we

can conclude that additional related gene copies encoding

cytoplasmic EF - la do not exist in Dj. melanoaaster•

Codon usage in the EF - la.Fl and F2 gene.

Since Fl (464 aminoacids) expression is generally markedly

stronger when compared to F2 (463 aminoacids), we examined the

codon usage of both coding regions in light of the assumption

that highly expressed genes are subject to a more extreme codon

bias (27-29). Consistent with the hypothesis, codon preference

in Fl is restricted to 44 triplets whereas in F2 55 different

codons are used (Fig. 7 ) . Moreover, if one neglects those

triplets that are used only once or twice in Fl, codon usage in

the residual 97% of the reading frame is biased to 34

triplets. This is different in the F2 gene where codon usage is

random with a slight bias against A-T richness.

3188

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 15: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

I T Fl IF2 I C Fl IF2 I A Fl IF2 I C Fl |F2 I

ITTTTITTCITTAITTC

ICTTCICTCICTAICTG

IATTAIATCIATAIATO

IGTTGIGTCIGTAIGTG

PHBPBELEULEU

LEULEULIULEU

1LEILBILEMET

VALVALVALVAL

017

e6

12•15

1120

e8

7141

22

511

e4

361

12

ie23•ie

I95

25

ITCTITCCITCAITCG

ICCTICCCICCAICCG

IACTIACCIACAIACG

IGCTIGCCICCAIGCG

SERSIRSERSER

PROPROPROPRO

THRTHRTHRTHR

ALAALAALAALA

411

e3

21750

42310

113400

1 211211

111

e6

374

111

1 411611

1

17

61191156

ITATITACITAAITAG

ICATICACICAAICAG

IAATIAACIAAAIAAG

IGATIGACIGAAIGAG

TYRTYROCHAMB

HISHISGLNGLN

ASNASNLYSLYS

ASPASPGLUGLU

11001

011011

0181

45

1212725

1 2110I 11 0

1 31 81 0111

1 41121 5142

1 61161 5129

ITGTITGCITGAITGG

ICGTICGCICGAICGG

IAGTIAGCIAGAIAGG

IGGTIGGCIGGAIGGG

CYSCYSOPATRP

ARGARGARGARG

SERSERARGARG

GLYGLYGLYGLY

0505

11400

0102

1812130

1 15 10 15 1

6 16 10 13 1

0 13 10 13 1

4 125 1104

FIGURE 7Codon usage as deduced from the nucleotide sequence of the Fland F2 reading frames.

Evolutionary stability of EF - la genes

When we compared the amino acid sequences of EF - la,Fl and F2

with all eucaryotic cytoplasmic elongation factors (Fig.8)

known to us, we observed a strong conservation of their primary

structure. Some of the highly conserved regions have already

been correlated with functional domains. The most strongly

conserved NH2 - terminal end sequence of EF - la had been

assigned to GTP binding activity. EF - la also shares homology

with several classes of nucleotide binding proteins and even

the procaryotic elongation factor genes (37,38). Another highly

conserved region further substantiated in our comparison is

comprised in the sequences around Ala 92, Lys 244 and Lys 273.

The corresponding amino acids in EF - Tu of £_,_ coli were

considered to be important for tRNA binding. This remarkable

degree of sequence conservation is most easily explained if one

considers the multifunctional nature of the EF - la protein. In

the future, cloning, in combination with site directed

mutagenesis, will open the way to design modified factors that

can be tested for single functional steps in the elongation

process.

3189

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 16: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

UXUIirYWIOHTDSOKVTTTGILIYKCWIIDU'iKEUHICTYVTIGETDSGirTTTCHLIYlCaCIDU'

IKCKIHIMIWICETDtCKnTTOHLIYZCOOIDEl'

IKEIZHIIflVVlOVTDSOESTTTOaLIYECCGIDat-iKEXnilBIVTIGHYDSGlCnTTGHLIYKCOOIDER1

I EXmXAAKLG KG!11EXTKUAAXLC EGIEXTSKXAQEKCKCIEKFKKEAQEHOKGIEXTSKKAgEHCKC

'IEXFEKEAAEKOKG

L WLOIOJUEU101'

, WIDIOJU EJLEXG I

.trVLDIOJLAXREXCI

ALtnUTTP KTOVTTI DA PCHU) r:ALVXTBTFETOVTVIDAPOBRDFAL WOXTPCnrVTV ID A PCH£D FALVXFETAXYTVTIIDAPGERDF

K HUCOK LACE

K D. HEL FIALTCFETSETTVTIIDAPGHRDFAnnrrrAxyrvTiiDAPc MOUSEILWFETfKTTVTIIDAFGlfltDFIi: HUMAfl

\xitiuiMrwiownMirrrtcuLiricGGiDiM.TittnrxxxinGKGtTirrk\riu}r&EJ&MXxoitt KYTVTIIDAPGBRDFIK COWJEHSUS

Xl^#TSOADCAILIIAOGTGEF*EAGISECCOTVXEJll<IJ^TLOVRQLITAVpijQSVK--VDESR:F^EIVKETSJ(FIUVGYHFrTVPFVPISCUVGra YEASTITVTtOADCAILIlAOGTGEFXAOISEOO^n»EBAUJirnjVIQLIYAVinaa)SVIC--WDESIFOEIVIXT««FIKEVOYHPETTFF^ YEAST

[ I T t ^ O A P C A I L I I A O C T O E r i A O I f C M O T i n t T J i i r n ^ F R Q L I T A I i n M ^ HUCO* RACEMHI TtJT»OADC ATLI'm ITCT»O AIX: 1VQ11

JCHHCDJI AJtTEHlAtGVHGDH 0 . HIL FISCWHGOD D. HEL FIItCVVGDN HOUSESOWnGDfl HUWAJI

MITCTtQADCATLI AJLCTGITEAGISnrCOTREHALLAFTLCVKOLITOVIfXKOSTEFPTII RYEItTEIVS YIKXIOYBP TTAFVPISGW CDK CtWIEHSUS

MEATTW^WYIOWEirriJLOVVKOETlAEAIDAIEQPIIFTOItFLRMFWDVTItlGClOTVPVORVrreVII^^ YEASTIUIJ^TWJmmDO¥EXITXAOVTKOKTU*EAIDAIEgF«PTDMLaiJLO^VTKIO^ YEAST

. _ TIRAGKWHFAPAAVTTEVKSVEIIHHETLT HUCOR RACE

TFAFAJIirraVUVIHHHESLE AJtTEHIA

•OTVWFAPAHITTEVKIVEJUniEALO D- MEL "HLEPSE»t»lffI0»STERXXOBJJ0ECLIDAI*OAIlJF^FTOltPLRL*LQDVTEI0OI0TVFV0RVirrGLlJCPCMVVHFA^ D. HEL F I IIl^PSAJIMPVrimnrrTljaWSlICTTUJLfclJ>CILPPT»FTOE*LJUjPLQirVTEIGOIGTVFV HOUSE

201 mjW!AJWFWIO¥ITT*HKIULS<nTUJyUJ>CIlJPTtPTDItPL*lJLO.DVT^ HUHAJI

T1JUC 0 A QETLLXALDAILPP IPTDltFLJILFLflDVTEICCICTVPVCRVITOVSITCWTVTFAPAirVTTIVlUTKHHHEAL COMSEUSUS

IUUJ*VCGDJJOrDPPIUK:MF1U^TVIYlJral^ YEAST(»VF«Dir/orilVDfVSTEEItR<nr/CODAIOrDPPi:0<^UFl(ATTITljniPOOIIAOY«PVUKBTAflIAC^^ YEAiTEGLF^^WVGrirVDrriTEI)ItR0inrCtDfl3fDPAKE«ASrrAQTIILJrHPO0ISAGYA^VU}CHTAHlACB7SELIIKIDRRS0iaQtID8 HUCOC RACEQASFGIXVGnnrnr/tTK£IJULGTVAJDSIDnn>ARGlvrTA^IVljniPOQISirOYTPTUXntTA£UC17UIU AJtTEHIAIATP<IDirVGrirnVVfTUUUlGTVACDSIUJ^FKOAADrrAO^ITLjraP<^IAjrOYTPTU>CaTAHIACRFUILE D. HEL FImMMrwvanfvyyvfTyTHrFWTTVft""!Fin^FB?ftAPrrAQ*r*LJ"tK^TiKnvTPTTjy'WT*HTftr|^>iTFT»>n»»^^TTTgnti>»tTt^cnAA 0 . HEL F I IEALPgCWVGrilTRjrVSfEPVtJtGyVACDSlOntT HOUIE

3*1 EALP<H»nwrinri3rVSTIPYER0MVJUlDliarDPFTiEAJ^rTA0VIIIjraFqqi RUHAM

EA PCnrVCnmanrSTKI RXOWVAODSEJfDPPKO A FTAOVTVLiniPOQmOY P T L O C H T A H I A C K F EL EK ORRSGKX ED PKF MQDAA COHSEMSUS

L VltFTP IKFMCTVAFIET F:L VIFT P f KFHC V U F • EYT:IVXKVP S UPMCVKATTDT PIHITLTPIEPLCVEAFIDTTiIVWL TP SK P LCVEAFQETH

L^ftFATRDtaOTVAVOTIUVD-KTXKAAJCVTIUUkQXJLAKK*'URFAVR^taOTVAVOVIUVD-ETEKAAJCVTIUUQCAJUCE-'LOEFAVRCTttOTVAVCVIEAVE-KTDEAGKVTtAAArAIIlI'

' AVRDMRQTV AVOTIMYWT KBTTSGEVTILAUKAQICU •

IV VPSEFKCTKAFSEYPPLORFATROtaOTVAVOTIUT KJC AOKVTTUU EA K

YIASTHUCCHt RACEARTEHIAD. HBL FI

D. HEL F I IHOUSEHUMAN

COHSSMSUI

FIGURE 8Comparison of the aminoacid sequences deduced from cloned EF -la gene sequences (11-19). For mouse only part of the sequenceis known (17,18). Asterisks denote identical aminoacids, pointsindicate a conservation of at least 50% between the differentproteins,

CONCLUDING REMARKS

cDNA clones were isolated that span the complete mRNA sequences

of the two n^ melanogaster elongation factors EF - la (FI and

F2). Using these clones in addition to oligonucleotide primer

extended cDNA probes, we were able to characterize miniexons

encoding the 5'end of the untranslated leader of both EF - la

mRNAs. By comparison of cDNA and genomic sequences the gene

structures could be determined. In contrast to the two nearly

identical genes present in fij. coli or Saccharomvces cerevlsiae.

3190

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 17: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

Drosophila contains two copies that are clearly different from

each other with respect to sequence and structure. Even though

it is not proven yet that both messages are actually translated

in vivo, we detect both sequences in the high molecular weight

fraction of polysomal gradients at their respective time of

expression indicating active translation (Richter and Hovemann,

unpublished result). The amount of cDNA synthesized using Fl-

and F2 specific oligonucleotide primers reflects the extent of

RNA expression in the various developmental stages. According

to this expression profile, Fl should represent the

housekeeping gene that gives rise to the elongation factor

needed in all growing cells. F2 transcription, peaking in pupal

stage, represents on the other hand an elongation factor gene

that is specifically expressed in certain developmental stages,

possibly,even in a tissue specific manner. It is also

conceivable that EF - lo,F2 exerts a specific function by

preferring its own pool of aminoacyl tRNAs. This would then

mean that it would by itself be involved in some kind of

translational control.

The complete gene structures allow us now to perform a

comparison at the nucleotide level. The coding region of Fl and

F2 differ in sequence between each other to a degree that is

similar when compared with the Artemia salina or the human

gene (15,16). We, therefore, conclude that the genetic

separation of Fl and F2 is not a recent event. Since both genes

established themselves with their unique expression profile,

they could express independent functions. A stage specific EF -

la like activity was also observed in Xenopua laevis

previtellogenic oocytes (35,36). However, a molecular analysis

of the oocyte specific activity is still missing.

Promoter regions of housekeeping function genes have been

characterized of being devoid of the TATA motif (31). We did

not recognize the TATA box sequence in either one of the two

EF - la genes. Still, there is little sequence homology at all

in front of the transcription start of Fl and F2. A comparable

promoter motif, however, that is common to housekeeping genes

is not yet established. Since EF - la belongs to the group of

proteins that are engaged in protein synthesis, we searched for

3191

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 18: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

common sequence motifs that might be indicative for the

concerted regulation of genes encoding translation factors and

ribosomal proteins. In the yeast Saccharomvces cerevisiae a

general promoter enhancer motif, the HOMOL box (32) , has been

identified in front of the EF - la and several ribosomal

protein genes. We find strong homology to this motif 373

nucleotides in front of the EF - la,Fl gene (position 258 in

Fig.l). To our knowledge this is the first case that the yeast

consensus sequence has been observed at the right distance in

front of this group of genes in Drosophila. A sequence homology

between Fl and F2 that might also be worth mentioning is

located at the end of the intron that is preceeding the

translational reading frame. The sequence of 12 nucleotides at

position 1566 in Figure 1 and position 1797 in Figure 2 is

completely conserved. In light of the highly divergent

surrounding this consensus sequence is very unlikely to have

evolved by accident.

EF - la,Fl transcription strength is comparable with that of

the induced vitellogenin I gene (33). The usage of a limited

number of codons in the Fl message, therefore, supports the

hypothesis of a codon bias for such highly expressed genes. In

Drosophila heat shock exerts a strong transcriptional and

translational control. Translation of nonheatshock message is

restrained by a slow down of initiation and elongation (34).

The availability of both elongation factor genes will allow us

to address questions regarding the possible involvement of one

of the EF - la factors in this regulation phenomenon.

Acknowledgment

We are grateful to T. Kornberg for making cDNA libraries

available to us. We would like to thank I. Sures for helpful

criticism in the preparation of this manuscript. This work was

supported by the Deutsche Forschungsgemeinschaft (SFB 229/C3).

S. Richter is recipient of a G.F.G stipend of the Land Baden-

Wurttemberg.

REFERENCES1 Walldorf,U., Hovemann,B., and Bautz, E.K.F. (1985) Proc.

Natl. Acad. Sci. USA SZ, 5795-57992 Kaziro, Y. (1978) Biochim. Biophys. Acta 505. 95-127

3192

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 19: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

3 Drews,J., Bednarik,K., and Grasmuk,H. (1974) Eur. J.Biochem. 41, 217-227

4 Kemper,W.H., Merrick.W.C., Redfield,B., Liu, C.-K.,andWeissbach,H. (1976) Arch. Biochem. Biophys. 174. 603-612

5 Nagata,S., Motoyoshi,K., and Iwasaki,K. (1978) J. Biochem.fil, 423-429

6 Slobin,L.I., and Holler,W. (1978) Eur. J. Biochem. Q£, 69-77

7 Murakami,K., and Miyamoto,K. (1982) J. Neurochem. 38.1315-1322

8 Lauer,S.J., Burks,E., Irvin,J.D., and Ravel,J.M. (1984) J.Biol. Chem. 252, 1644-1648

9 Thiele,D., Cottrelle,P., Iborra,F., Buhler,J.-M.,Sentenac.A., and Fromageot,P. (1985) J. Biol. Chem. 260.3084-3089

10 Crechet,J.-B., Canceill.D., Bocchini,V., and Parmeggiani,A.(1986) Eur. J. Biochem. 1£1, 635-645

11 Nagata,S., Nagashima,K., Tsunetsugu-Yokota, Y. , Fuj imura, K.,Miyazaki,M., and Kaziro,Y. (1984) EMBO J. 2, 1825-1830

12 Schirmaier,F. and Philippsen,P. (1984) EMBO J. 2, 3311-331513 Cottrelle,P., Thiele,D., Price,V.L., Memet,S., Micouin,J.-

Y., Marck,C, Buhler, J.-M., Sentenac,A., and Fromageot,P.(1985) J. Biol. Chem. 2M, 3090-3096

14 Linz,J.E., Katayama,C, and Sypherd,P.S. (1986) Mol. Cell.Biol. £, 593-600

15 Lenstra,J.A., Van Vliet,A., Arnberg,A.C., Van Hemert,F.J.and Moller.W. (1986) Eur. J. Biochem. 155., 475-483

16 Brands,J.H.G.M., Maassen,J.A., Van Hemert,F.J., Amons R.,and Moller.W. (1986) Eur. J. Biochem. 155. 167-171

17 Rao,T.R., Slobin,L.I. (1986) Nucl. Acids Res. 14, 240918 Kohl,A., (1986) Diploma thesis, University of Heidelberg19 Roth,W.W., Bragg,P.W. Corrias,M.V., Reddy,N.S.,

Dholakia,J.N., and Wahba,A.J. (1987) Mol. Cell. Biol. 2,3929-3936

20 Maxam, A.M., and Gilbert,W. (1977) Proc. Natl. Acad. Sci.USA 21, 560-564

21 Sanger, F., Nicklen,S., and Coulson,A.R. (1977) Proc. Natl.Acad. Sci. USA 24, 5463-5467

22 Geliebter,J., Zeff, R.A., Melvold,R.W., and Nathenson,S.G.(1986) Proc. Natl. Acad. Sci. USA 8_3_, 3371-3375

23 Hovemann,B., Walldorf,U., and Ryseck,R.-P. (1986) Mol. Gen.Genet. 201, 334-340

24 Maniatis,T., Fritsch,E.F.,and Sambrook,J. (1982) MolecularCloning (A Laboratory Manual), Cold Spring Harbor

25 Roberts,D.B. (1986) In: "Drosophila: a practical approach",D.B. Roberts, ed. (Oxford, IRL Press), 1-38

26 Keller, E.B., and Noon, W.A. (1985) Nucl. Acids Res. 124971-4981

27 Ashburner,M., Bodmer,M., and Lemeunier,F. (1984) Develop.Genet. 4, 295-312

28 0'Connell,P., and Rosbash,M. (1984) Nucl. Acids Res. 12,5495-5513

29 Sharp,P.M., Tuohy, T.M.F., and Mosurski,K.R. (1986) Nucl.Acids Res. 14, 5125-5143

30 Linz,J.E., Lira,L.M., and Sypherd,P.S. (1986) J. Biol.Chem. 2&1, 15022-15029

31 Dynan,W.S. (1986) Trends Genet. 2., 196-197

3193

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from

Page 20: Two genes encode related cytoplasmic elongation factors 1α (EF-1α) in               Drosophila melanogaster               with continuous and stage specific expression

Nucleic Acids Research

32 Huet,J., Cottrelle,P., Cool,M., Vignais,M.-L., Thiele,D.,Marck,C, Buhler,J.-M., Sentenac,A. , and Fromageot,P.(1985) EMBO J. 4_, 3539-3547

33 Walldorf,U. (1985) Ph.D. thesis, University of Heidelberg34 Ballinger,D.G., and Pardue,M.L. (1983) Cell H , 103-11435 Viel,A., Dje',M.K., Hazabraud,A., Denis,H., and le Maire,M.

(1987) FEBS Letters 222, 232-23636 Mattaj,l.W., Coppard,N.J., Brown,R.S., Clark,B.F.C., and De

Robertis,E.H. (1987) EMBO J. £, 2409-241337 Leberman,R., and Egner,U. (1984) EMBO J. 2, 339-34138 Dever,T.E., Glynias,H. J., and Merrick,W.C. (1987) Proc.

Natl. Acad. Sci. USA 8_i, 1814-1818

3194

at Washington U

niversity at St Louis on June 18, 2014

http://nar.oxfordjournals.org/D

ownloaded from