Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf ·...

102
UNIVERSITATIS OULUENSIS ACTA A SCIENTIAE RERUM NATURALIUM OULU 2006 A 460 Prasenjit Bhaumik PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND THE REACTION MECHANISM OF ENZYMES: α-METHYLACYL-CoA RACEMASE AND ARGININOSUCCINATE LYASE FACULTY OF SCIENCE, DEPARTMENT OF BIOCHEMISTRY, BIOCENTER OULU, UNIVERSITY OF OULU

Transcript of Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf ·...

Page 1: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

U N I V E R S I TAT I S O U L U E N S I SACTAA

SCIENTIAE RERUMNATURALIUM

OULU 2006

A 460

Prasenjit Bhaumik

PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND THE REACTION MECHANISM OF ENZYMES:α-METHYLACYL-CoA RACEMASE AND ARGININOSUCCINATE LYASE

FACULTY OF SCIENCE,DEPARTMENT OF BIOCHEMISTRY, BIOCENTER OULU,UNIVERSITY OF OULU

ABCDEFG

UNIVERS ITY OF OULU P.O. Box 7500 F I -90014 UNIVERS ITY OF OULU F INLAND

A C T A U N I V E R S I T A T I S O U L U E N S I S

S E R I E S E D I T O R S

SCIENTIAE RERUM NATURALIUM

HUMANIORA

TECHNICA

MEDICA

SCIENTIAE RERUM SOCIALIUM

SCRIPTA ACADEMICA

OECONOMICA

EDITOR IN CHIEF

EDITORIAL SECRETARY

Professor Mikko Siponen

Professor Harri Mantila

Professor Juha Kostamovaara

Professor Olli Vuolteenaho

Senior assistant Timo Latomaa

Communications Officer Elna Stjerna

Senior Lecturer Seppo Eriksson

Professor Olli Vuolteenaho

Publication Editor Kirsti Nurkkala

ISBN 951-42-8090-3 (Paperback)ISBN 951-42-8091-1 (PDF)ISSN 0355-3191 (Print)ISSN 1796-220X (Online)

A 460

AC

TA P

rasenjit Bhaum

ik

A460etukansi.fm Page 1 Friday, May 26, 2006 11:40 AM

Page 2: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 3: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

A C T A U N I V E R S I T A T I S O U L U E N S I SA S c i e n t i a e R e r u m N a t u r a l i u m 4 6 0

PRASENJIT BHAUMIK

PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTANDTHE REACTION MECHANISM OF ENZYMES: α-METHYLACYL-COA RACEMASE AND ARGININOSUCCINATE LYASE

Academic Dissertation to be presented with the assent ofthe Faculty of Science, University of Oulu, for publicdiscussion in Kuusamonsal i (Auditorium YB210),Linnanmaa, on June 6th, 2006, at 12 noon

OULUN YLIOPISTO, OULU 2006

Page 4: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Copyright © 2006Acta Univ. Oul. A 460, 2006

Supervised byProfessor Rik Wierenga

Reviewed byDoctor Anette HenriksenProfessor Reijo Lahti

ISBN 951-42-8090-3 (Paperback)ISBN 951-42-8091-1 (PDF) http://herkules.oulu.fi/isbn9514280911/ISSN 0355-3191 (Printed )ISSN 1796-220X (Online) http://herkules.oulu.fi/issn03553191/

Cover designRaimo Ahonen

OULU UNIVERSITY PRESSOULU 2006

Page 5: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Bhaumik, Prasenjit, Protein crystallographic studies to understand the reactionmechanism of enzymes: α-methylacyl-CoA racemase and argininosuccinate lyaseFaculty of Science, Department of Biochemistry, University of Oulu, P.O.Box 3000, FI-90014University of Oulu, Finland, Biocenter Oulu, University of Oulu, P.O. Box 5000, FI-90014University of Oulu, Finland Acta Univ. Oul. A 460, 2006Oulu, Finland

AbstractEnzymes catalyze chemical changes in biological systems. Therefore, to understand the chemistry ofliving systems, it is important to understand the enzyme structure and the chemistry of the enzyme'sfunctional groups which are involved in catalysis. In this study, structure and function relationshipsof two enzymes, (1) α-methylacyl-CoA racemase from Mycobacterium tuberculosis (MCR) and (2)argininosuccinate lyase from Escherichia coli (eASL) have been studied using X-ray crystallography.The main focus of this study has been understanding the structure-function relationship of MCR. TheeASL has been crystallized from a highly concentrated sample of purified recombinant α-methylacyl-CoA racemase in which it occurred as a minor impurity.

The structure of eASL has been solved using molecular replacement at 2.44 Å resolution. Theenzyme is a tetramer, but in this crystal form there is a dimer in the asymmetric unit. Each active siteis constructed from loops of three different subunits. One of these catalytic loops, near residue Ser277and Ser278, has been disordered in the previous structures of active lyases, but is very well orderedin this structure in one of the subunits due to the presence of two phosphate ions in the respectiveactive site cavity. The positions of these phosphate ions indicate a plausible mode of binding of thesuccinate moiety of the substrate in the competent catalytic complex and therefore this structure hasprovided new information on the reaction mechanism of this class of enzymes.

α-Methylacyl-CoA racemase (Amacr) catalyzes the racemization of α-methyl-branched CoAesters. An Amacr homologue from the eubacteria Mycobacterium tuberculosis, referred to as MCR,was taken as a model protein. MCR was purified, crystallized and the structure of unliganded proteinwas determined at 1.8 Å resolution using the MIRAS procedure. The structure shows that the enzymeis an interlocked dimer.

To understand the reaction mechanism and the mode of substrate binding, several crystallographicbinding studies were done using both wild type MCR and mutant H126A MCR crystals. In particular,the structures of the wild type MCR-complexes with (R, S)-ibuprofenoyl-CoA (1.85 Å), (R)-2-methylmyristoyl-CoA (1.6 Å) and (S)-2-methylmyristoyl-CoA (1.7 Å) were important in this respect.These crystal structures show that Asp156 and His126 are the two catalytic residues which areinvolved in proton donation and abstraction, respectively; when the (S)-enantiomeric substrate isbound in the active site and vice versa when the (R)-enantiomeric substrate is bound. The tightgeometry of the active site also shows that His126 and Asp156 are involved in stabilizing thetransition state. These crystal structures show that in the active site of MCR, there is one bindingpocket for the CoA part and there are two different binding pockets (R-pocket and S-pocket)connected by a hydrophobic methionine rich surface for binding the fatty acyl part of the substrate.After substrate binding, proton abstraction takes place which produces a planar intermediate. Then,donation of a proton to the other side of the planar intermediate changes the configuration at the chiralcenter. During the stereochemical interconversion of the two enantiomers, the acyl group movesbetween R-pocket and S-pocket by sliding over the hydrophobic surface connecting these twopockets.

Keywords: α-methylacyl-CoA racemase, argininosuccinate lyase, CoA transferase, protontransfer, racemase

Page 6: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 7: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

To Shrii Shrii Anandamurti

Page 8: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 9: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Acknowledgements

This work was carried out in the Department of Biochemistry and Biocenter Oulu at University of Oulu, during years 2001-2006. Financial support was from University of Oulu, Biocenter Oulu graduate school fellowship, and Tauno Tönningin Research Foundation.

Firstly, I would like to warmly thank Professor Rik Wierenga for his excellent supervision, constant enthusiasm and support during this study. When I started my PhD research project I did not have any background in protein crystallography. However, he has been very patient to teach me both theoretical and practical aspects of protein crystallography. During the years of my thesis, there have been several ups and downs in my research, but he has always provided me enthusiasm and encouragement to make new discoveries. His patience, unquenchable curiosity and love for the research have been a great inspiration to me, and his continual support and encouragement have enablead me to complete my PhD thesis. I am also grateful to Professor Kalervo Hiltunen for his collaboration in this project, inspiration and helpful attitude during these years. I also wish to express my sincere thanks to Dr. Petri Kursula, who has been a great tutor and helped me a lot at the beginning of this research project. He has been a great inspiration to me during these years. I would like to thank also the other professors and group leaders of our department for creating such excellent research facilities and enjoyable working atmosphere.

I would like to express my sincere thanks to Dr. Werner Schmitz and Professor Ernst Conzelmann from Theodor-Boveri-Institut für Biowissenschaften (Biozentrum) der Universität Würzburg, Würzburg, Germany for their collaboration in this project and giving me an opportunity to work in their laboratory. Dr. Schmitz has been very helpful and provided several interesting compounds for crystallographic binding studies. I would like to thank Dr. Päivi Pirilä for her effort to synthesize important ligands for this research project.

I wish to thank all my co-authors for their contributions, especially Dr. Kalle Savolainen for his mutagenesis studies and Dr. Ulrich Bergmann for his help in mass spectrometric analysis of proteins.

I am grateful to Dr. Anette Henriksen and Professor Reijo Lahti for their valuable comments on the manuscript of the thesis. I also thank the members of my thesis

Page 10: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

committee, especially Docent Tuomo Glumoff for his constructive criticism and comments which have helped me to carry out the research in the right direction.

I would like to thank all the former and present members of RW group for everyday collaboration and many cheerful moments. I would like to specially thank Ville Ratas for his help at the beginning of this project. I wish to thank Dr. Antti Haapalainen for sharing tension and lots of fun during several synchrotron trips. I would like to thank Dr. Inari Kursula, Dr. Anu Mursula, Dr. Kristian Koski, Jukka Taskinen, Mikko Salin, Viivi Majava, Gitte Meriläinen, Markus Alahuhta, Mira Pekkala and Sanna Partanen for their help, company during these years.

I wish to extend my sincere thanks to all the staff of our department. I want to thank Virpi Hannus, Anneli Kaattari, Pia Askonen and Tuula Koret for their kind help with many practical tasks. I would like to thank Kyösti Keränen and Jaakko Keskitalo for maintaining the X-ray equipment in proper condition for the datacollection during these years (especially in the summer time!). Jyrki Hänninen, André Juffer, Miki Kallio and Ari-Pekka Kvist are also acknowledged for setting up and maintaining computer systems.

I sincerely thank all the people of the “Indian and Asian community” who have given me company and have always helped me during my stay in Oulu. I would like to thank Dr. Mahesh Somani, Satyan Sharma and Sharat Khungar for their friendship. I would like to specially thank my very good friend Dr. Baylie Damtie for his friendship and help. I also wish to express my big and special thanks to my very close and old friends Ranada, Saurabhda and Saurabhda’s wife Gargee for their support and help.

It would have been very difficult to complete my dissertation without the help and support of my family members and close relatives. I would like to express my sincere thanks to my father and mother for their love, support and encouragement. I would like to thank my brother Surajit (Sintu) for his support and cheers. I would like to thank my friends, relatives and in-laws for their support. I would also like to thank my sweetheart, my wife Dipanwita from the bottom of my heart for her love, care and encouragement.

Finally, I would like to express my sincere gratitude and pranams to Param Purush Shrii Shrii Anandamurti (Baba).

Oulu, April 2006 Prasenjit Bhaumik

Page 11: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

List of original articles

This thesis is based on the original articles, which are referred to in the text by their Roman numerals:

I Bhaumik P, Koski MK, Bergmann U & Wierenga RK (2004) Structure determination and refinement at 2.44 Å resolution of argininosuccinate lyase from Escherichia coli. Acta Cryst D60: 1964-1970.

II Savolainen K, Bhaumik P, Schmitz W, Kotti TJ, Conzelmann E, Wierenga RK & Hiltunen JK (2005) α-Methylacyl-CoA racemase from Mycobacterium tuberculosis: mutational and structural characterization of the active site and the fold. J Biol Chem 280: 12611-12620.

III Bhaumik P, Schmitz W, Hassinen A, Hiltunen JK, Conzelmann E & Wierenga RK (2006) The 1,1-proton transfer reaction mechanism by α-methylacyl-CoA racemase is catalyzed by an aspartate/histidine pair and involves a smooth, methionine rich surface for binding the fatty acyl moiety. Manuscript.

Page 12: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 13: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Abbreviations

A. pyrophilus Aquifex pyrophilus ASL argininosuccinate lyase ASLD argininosuccinate lyase deficiency Amacr α-methylacyl-CoA racemase CaiB carnitine CoA transferase CIP Cahn-Ingold-Prelog CoA coenzyme A DEAE diethylaminoethyl eASL argininosuccinate lyase from E. coli E. coli Escherichia coli FRC formyl CoA transferase MCR α-methylacyl-CoA racemase from M. tuberculosis MIR multiple isomorphous replacement MIRAS multiple isomorphous replacement with anomalous scattering MMA Methyl mercuric acetate MR mandelate racemase M. tuberculosis Mycobacterium tuberculosis NAD Nicotinamide adenine dinucleotide NCS non-crystallographic symmetry O. formigenes Oxalobacter formigenes PCMPS P-Chloromercuriphenyl sulfonic acid PDB protein data bank PEG Polyethylene glycol P. horikoshi Pyrococcus horikoshi PLP pyridoxal phosphate PLP-D-Ala N-5′-phosphopyridoxyl-D-alanine PLP-L-Ala N-5′-phosphopyridoxyl-L-alanine RacE Bacillus subtilis glutamate racemase r. m. s. root mean square SDS-PAGE sodium dodecyl sulfate ployacrylamide gel electrophoresis TIM triose phosphate isomerase

Page 14: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 15: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Contents

Abstract Acknowledgements List of original articles Abbreviations Contents 1 Introduction ................................................................................................................... 17 2 Review of the literature ................................................................................................. 19

2.1 Carbon-hydrogen bond breaking ............................................................................19 2.2 Stability of carbanion..............................................................................................20 2.3 Delocalization of the carbanion’s negative charge on the neighbouring

keto group..............................................................................................................21 2.4 Delocalization of the carbanion’s negative charge with a Schiff base ...................22 2.5 Delocalization of the carbanion’s negative charge by formation of a

double bond with the neighbouring carbon atom and elimination of the leaving group.........................................................................................................22

2.6 Carbon-hydrogen bond breaking is the first step of the reaction mechanism in both α-methylacyl-CoA racemase and argininosuccinate lyase.......................................................................................................................23

2.7 Argininosuccinate lyase ..........................................................................................23 2.7.1 Argininosuccinate lyase belongs to a superfamily of enzymes which

act as a tetramer ...............................................................................................24 2.7.2 Overall structure of ASL..................................................................................24 2.7.3 Location of the active site................................................................................27 2.7.4 Closed and open conformation of the C3 loop ................................................28 2.7.5 Reaction mechanism of argininosuccinate lyase .............................................28 2.7.6 Argininosuccinate lyase deficiency .................................................................30

2.8 Racemases ..............................................................................................................31 2.8.1 Enantiomers: R and S nomenclature................................................................31 2.8.2 Racemization ...................................................................................................32 2.8.3 Racemases and epimerases ..............................................................................33

Page 16: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

2.8.4 Understanding the general strategy of the racemization reaction mechanism.......................................................................................................36

2.8.4.1 Alanine racemase......................................................................................36 2.8.4.2 Glutamate racemase..................................................................................39 2.8.4.3 Mandelate racemase .................................................................................41 2.8.4.4 Aspartate racemase ...................................................................................45

2.9 α-Methylacyl-CoA racemase..................................................................................46 2.9.1 Reaction catalyzed by Amacr ..........................................................................48 2.9.2 Some properties of Amacr ...............................................................................49 2.9.3 Amacr belongs to family III CoA-transferases ................................................49

2.9.3.1 Structural fold of family III CoA-transferases..........................................50 2.9.3.2 Structural comparison among the members of family III CoA-

transferases...............................................................................................51 2.9.3.3 Active site cleft and mode of CoA binding in the active site ....................52 2.9.3.4 Ligand binding induces domain movement in CaiB.................................54

2.9.4 Amacr in cancer...............................................................................................55 3 Aims of the present study .............................................................................................. 56 4 Materials and methods................................................................................................... 57

4.1 Recombinant protein expression and purification (I-III) ........................................57 4.2 Enzyme assay .........................................................................................................57 4.3 Dynamic light scattering experiments ....................................................................58 4.4 Crystallization of eASL (I) .....................................................................................58 4.5 Data collection, processing, structure determination, model building and

refinement of eASL (I) ..........................................................................................58 4.6 Crystallization of wild type MCR (II-III) ...............................................................59 4.7 Crystallization of H126A MCR mutant ..................................................................59 4.8 Data collection from native and heavy atom soaked wild type MCR

crystals (II) ............................................................................................................60 4.9 Structure determination, model building and refinement of unliganded

wild type MCR (II)................................................................................................61 4.10 Soaking experiments and data collection of wild type MCR crystals

using substrate analogues (III)...............................................................................61 4.11 Data collection and structure determination of unliganded and 2-

methylmalonyl-CoA bound H126A MCR mutant .................................................62 4.12 Glutaraldehyde cross linking of MCR crystals to prevent cracking during

the soaking experiments (III).................................................................................63 4.13 Soaking experiments and data collection of the cross linked wild type

crystals using (R, S)-ibuprofenoyl-CoA, (R)-, (S)-2-methylmyristoyl-CoA (III) ........................................................................................................................63

4.14 Structure determination, model building and refinement of the liganded wild type MCR (III)...............................................................................................64

4.15 Structural comparison (I-III).................................................................................64 5 Results ........................................................................................................................... 65

5.1 Identification of the band of eASL on the SDS-PAGE (I) ......................................65 5.2 Structure of eASL and it’s open and closed active site (I) ......................................66

Page 17: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

5.3 Crystallization of MCR and H126A MCR mutant and other variants (II-III) ....................................................................................................................68

5.4 Structure of MCR (II) .............................................................................................69 5.5 Structures of MCR complexed with acetyl-CoA, crotonyl-CoA and

acetoacetyl-CoA (III).............................................................................................71 5.6 Structure of unliganded H126A MCR mutant ........................................................72 5.7 Structure of H126A MCR mutant complexed with (S)-2-methylmalonyl-

CoA .......................................................................................................................73 5.8 Structure of MCR complexed with (R, S)-ibuprofenoyl-CoA (III).........................77 5.9 Crystal structure of MCR complexed with (R)- and (S)-2-

methylmyristoyl-CoA (III) ....................................................................................78 6 Discussion ..................................................................................................................... 81

6.1 Presence of two phosphate ions in the closed active site of eASL shows the possible mode of binding of the succinate moiety of the substrate in the competent catalytic complex (I) ......................................................................81

6.2 Structural comparison of MCR with FRC and YfdW (II) ......................................82 6.3 Structural comparison of wild type MCR structure with liganded and

unliganded H126A MCR mutant structures ..........................................................84 6.4 The characteristic features of the extended active site of MCR (III) ......................85 6.5 Understanding the reaction mechanism of MCR (III) ............................................86

7 Conclusions ................................................................................................................... 89 References Original articles

Page 18: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 19: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

1 Introduction

Enzymes are essential to enhance the rate of many critical biochemical reactions under physiological conditions. A reaction that takes many hours, days or years to reach completion can not be metabolically useful to a bacterium that must reproduce in few minutes or to a human nerve cell that must respond to a stimulus. The enzymes are known to enhance the reaction rate by up to 1017 –fold (Radzicka & Wolfenden 1995).

Εnzymes are macromolecular protein catalysts, consisting of α-amino acids joined by an amide linkage. Some enzymes use additional cofactors, like NAD, metal ions or a simple water molecule for their catalytic function. Each enzyme has a unique primary sequence, and it folds in three dimension into a precise conformation with remarkable catalytic activity. To understand the chemistry of living systems and the kind of transformation that occur in them, it is important to understand (1) the composition, size and structure of the proteins that display the enzymatic activity and (2) the chemistry of its functional groups that actually engage in catalysis. These investigations provide insights into the two most distinguishing characteristics of enzymes as catalysts: specificity and rate accelerations of enzyme-mediated transformations.

This study was initiated to understand the structure and function of the enzyme α-methylacyl-CoA racemase (Amacr). Amacr is a key enzyme in the degradation pathway of branched chain fatty acids. An Amacr homologue from the eubacterium Mycobacterium tuberculosis, referred to as MCR, was taken as a model protein. MCR was purified, crystallized and its three dimensional structures with and without ligand were determined. The liganded crystal structures have given a clear insight into the mode of binding of the two enantiomeric substrates in the MCR active site as well as a first understanding of the reaction mechanism.

During the process of optimization of crystallization condition of MCR, a very minor impurity in the protein sample was crystallized. The protein was identified to be argininosuccinate lyase from Escherichia coli (eASL) and the structure of the enzyme was also determined.

These structural studies have given a clear understanding about the reaction mechanism of both enzymes. The first step of the reaction catalyzed by each of these enzymes is the same: C-H bond breaking. In case of MCR, first there is proton abstraction followed by formation of a planar intermediate; subsequently, the donation of a proton to the planar intermediate gives rise to formation of the inverted product.

Page 20: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

18

Similarly, eASL starts the reaction also by proton abstraction to form a planar carbanion intermediate (Sampaleanu et al. 2002). The transfer of the negative charge generates the products fumarate and arginine.

In the subsequent sections the chemistry of breaking a C-H bond as well as the literature on the argininosuccinate lyase (ASL) superfamily is briefly discussed, and subsequently the literature on racemases is reviewed more extensively.

Page 21: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

2 Review of the literature

2.1 Carbon-hydrogen bond breaking

In every chemical reaction, there is bond breaking and bond formation. It is important to understand how this bond breaking takes place in the enzyme catalyzed reaction. There are many enzymatic reactions which are involved in carbon-hydrogen bond breaking at some stage of the reaction. The C-H bond can be cleaved in two ways. (1) Homolytic cleavage (Fig. 1a), with one electron remaining on the carbon and one on the hydrogen, producing carbon radicals and hydrogen radicals (Walsh 1979, Finar 1963).

Fig. 1. Different modes of C-H bond cleavage. The figure was modified form Walsh 1979.

Generally radicals are very reactive and unstable and in most enzyme catalyzed reactions radicals are not formed. (2) Reactions occur much more often by heterolytic cleavage of bonds, with one atom retaining both electrons. When a C-H bond cleaves heterolytically,

C H.. C H. .+

homolytic

cleavage(a)

C H

.

. C H+heterolytic

cleavage(b) ..

radicals

carbanion proton

C H

.

. C H+heterolytic

cleavage(c) ..

carbonium ion hydride ion

Page 22: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

20

there are two options: (a) the electrons can remain on the carbon, yielding a carbanion intermediate and a proton, H+ (Fig. 1b), or (b) the electrons can depart with the hydrogen, producing an electron deficient carbonium ion and a hydride ion (Fig. 1c). Carbon is a more electronegative atom than hydrogen and so in general the electrons remain on the carbon. Indeed, much enzyme chemistry involves controlled generation of carbanions. The fragmentation pattern (c), to yield the equivalent of a hydride ion is seen in for example dehydrogenases that use nicotinamide coenzymes as redox cofactors.

2.2 Stability of carbanion

The carbanion can be stabilized by delocalizing the negative charge among neighbouring atoms (Walsh 1979, Finar 1963). Three different ways of stabilizing the carbanion in the enzyme catalyzed reaction are shown (Fig. 2). These three ways are, (a) delocalization of the negative charge on the neighbouring keto group, (b) delocalization of the negative charge with a Schiff base, (c) formation of a double bond with the neighbouring carbon atom and the elimination of the leaving group.

Fig. 2. Three different ways of stabilizing carbanion. The figure was modified form Walsh 1979.

These three different ways of stabilization of the carbanion during different enzymatic catalytic reactions will be discussed in the next sections.

C C

O

C C

O

Oxyanion

C C

N

C C

N..

Schiff base

C C

X

C C + X

Elimination

(a)

(b)

(c)

Page 23: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

21

2.3 Delocalization of the carbanion’s negative charge on the neighbouring keto group

There are many enzymes in which after C-H bond breaking, the negative charge is delocalized to a neighbouring keto group. Because of the delocalization of the negative charge an oxyanion is formed. In the enzyme active site there is an oxyanion hole to stabilize the oxyanion (Wilmouth et al. 2001, Branden & Tooze 1998). Generally main chain NH groups of the peptide chain are involved in formation of the oxyanion hole. The classical example is the oxyanion hole in chymotypsin, a serine protease. In the serine proteases the oxyanion is generated after nucleophilic attack of a serine (Dodson & Wlodawer 1998) on the reactive carbon atom of the peptidyl group, resulting in a tetrahedral intermediate. In chymotripsin the main chain NH groups of Gly193 and Ser195 are involved in formation of the oxyanion hole (Branden & Tooze 1998). In case of porcine pancreatic elastase which is also a serine protease, the main chain NH groups of Gly193 and Ser195 are also involved in formation of the oxyanion hole (Wilmouth et al. 2001).

One good example where the carbanion is formed because of abstraction of proton and the negative charge is stabilized by neighbouring keto group, is Δ3-Δ2-enoyl-CoA isomerase (Mursula et al. 2001). The first step of the reaction mechanism of yeast Δ3-Δ2-enoyl-CoA isomerase is the proton abstraction from Cα atom by Glu158 resulting in formation of a carbanion. The negative charge on the Cα atom is stabilized by delocalization with neighbouring thioester group. The delocalization of the negative charge leads to formation of an oxyanion which is stabilized by the oxyanion hole present in the enzyme active site. In yeast Δ3-Δ2-enoyl-CoA isomerase the oxyanion hole is formed by main chain NH groups of Ala70 and Leu126 (Fig. 3). In this case the oxyanion containing intermediate is a planar enolate moiety.

Fig. 3. Oxyanion hole in yeast Δ3-Δ2-enoyl-CoA isomerase. The figure was modified from Mursula et al. 2001.

SCoA

O

R

HNNH

H

BBH

SCoA

O

R

HNNHAla70Leu126

Ala70Leu126

Page 24: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

22

2.4 Delocalization of the carbanion’s negative charge with a Schiff base

In pyridoxal phosphate (PLP) dependent enzymes the negative charge of the carbanion is stabilized by charge redistribution with the Schiff base (Eliot & Kirsch 2004). The Schiff base is formed by the reaction of the aldehyde of the pyridoxal phosphate with an amino group of the substrate, for example an amino acid. A good example of such enzymes are PLP dependent amino acid racemases, like, alanine racemase from Bacillus stearothermophilus (Watanabe et al. 2002) in which, for example, the racemization reaction is initiated through proton abstraction by a tyrosine (Tyr265) (Fig. 4). The resulting carbanion is stabilized by delocalizing its negative charge with the Schiff base formed because of the reaction between the amino group of the amino acid and the enzyme bound PLP cofactor. More literature review about the reaction mechanism of PLP dependent alanine racemase is presented in the section 2.8.4.1.

Fig. 4. Stabilization of carbanion during racemization reaction in alanine racemase. The figure was modified from Eliot & Kirsch 2004.

2.5 Delocalization of the carbanion’s negative charge by formation of a double bond with the neighbouring carbon atom and elimination of

the leaving group

Another way of stabilizing the carbanion is the elimination of a neighbouring group. One example of this kind is the enzyme argininosuccinate lyase. In this enzyme, after abstraction of the proton the leaving group is an arginine molecule (Fig. 5) (Sampaleanu et al. 2002). More literature review about the reaction mechanism of argininosuccinate lyase is presented in the section 2.7.5.

NH

NH

PO

CO2

H

ONH3

TyrLys

Page 25: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

23

Fig. 5. Elimination of arginine by argininosuccinate lyase. B deprotonates the substrate; A protonates the leaving group.

2.6 Carbon-hydrogen bond breaking is the first step of the reaction mechanism in both α-methylacyl-CoA racemase and

argininosuccinate lyase

Racemases catalyze the inversion of stereochemistry of a chiral carbon atom of the substrates. The reaction involves abstraction of a proton from one side and donation of a proton to the other side of the chiral carbon atom. So, racemases start the racemization reaction by abstraction of a proton from the susbtrates. In case of enzymes belonging to the ASL superfamily, the reaction is a β-elimination reaction, in which the first step is also abstraction of proton from the substrate. In argininosuccinate lyase and α-methylacyl-CoA racemase, which belong to the different superfamilies, the first step in the catalytic reaction is the same. In the next sections, the literature on the ASL superfamily is briefly discussed; subsequently the literature on racemases is reviewed more extensively.

2.7 Argininosuccinate lyase

Argininosuccinate lyase (ASL, EC 4.3.2.1) is an interesting enzyme because of its structure and function relationship. The enzyme belongs to a superfamily which is active as a tetramer. Most of the structural studies for this enzyme have been done using duck δ2 crystallin which has argininosuccinate lyase activity (Sampaleanu et al. 2001a). The structure of the complex of argininosuccinate bound to inactive S283A duck δ2 crystallin mutant has been determined to understand its reaction mechanism better (Sampaleanu et

NH2+

COO-

A:H

N-OOC H :B

NH H

NH3+

COO-

NH2+ COO-

H:BA:

N-OOC H

NH H

NH3+ COO-

+

Argininosuccinate

Arginine Fumarate

+-

Page 26: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

24

al. 2002). Nevertheless, the reaction mechanism is unclear as one of the catalytic loops is disordered in all of the available structures of the active lyases.

2.7.1 Argininosuccinate lyase belongs to a superfamily of enzymes which act as a tetramer

Argininosuccinate lyase (ASL) participates in arginine synthesis in all organisms to catalyze the reversible breakdown of argininosuccinate to arginine and fumarate (Fig. 5). The reaction is also a part of the urea cycle, the major pathway for the detoxification of ammonia in ureotelic species. ASL is a member of a superfamily of enzymes, which are active as tetramers and which catalyze similar β-elimination reactions in which a C-N or C-O bond is cleaved with subsequent release of fumarate as one of the products. Other members of the superfamily include δ crystallin (Sampaleanu et al. 2002), fumarase (Weaver & Banaszak 1996), aspartase (Fujii et al. 2003), adenylosuccinate lyase (Toth & Yeates 2000). Also 3-carboxy-cis,cis-muconate lactonizing enzyme (Williams et al. 1992, Yang et al. 2004) belongs to this superfamily. Structural studies of the above enzymes from different species have been done (Table 1).

Table 1. Enzymes of the superfamily which are active as tetramer.

Enzyme name Source PDB entry code Argininosuccinate lyase Human 1AOS, 1K62 δ crystallin Duck 1K7W, 1HY1, 1AUW,1HY0,

1I0A, 1DCN, 1TJU, 1TJV, 1TJW, 1U15, 1U16

Escherichia coli 1FUO, 1FUP, 1FUQ, 1FUR, 1KQ7, 1FUS

Thermus thermophilus 1VDK Archaeoglobus fulgidus 1VPJ

Fumarase

Yeast 1YFM Bacillus sp. 1J3U Aspartase Escherichia coli 1JSW Thermotoga maritima 1C3C, 1C3U Pyrobaculum Aerophilum 1DOF, Bacillus subtilis 1F1O

Adenylosuccinate lyase

Caenorhabditis Elegans 1YIS 3-carboxy-cis,cis-muconate lactonizing enzyme Pseudomonas putida 1RE5

2.7.2 Overall structure of ASL

The crystal structures of ASL from several species have been studied, in particular the duck δ1 and δ2 eye lens crystallins (Sampaleanu et al. 2001a, Sampaleanu et al. 2002,

Page 27: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

25

Vallee et al. 1999, Abu-Abed et al. 1997, Tsai et al. 2004) which are inactive and active homologues of ASL, respectively. These studies also concern the structures of the human ASL at 4 Å resolution (Turner et al. 1997), inactive human Q286R ASL (Sampaleanu et al. 2001b) and inactive turkey δ1 crystallin (Simpson et al. 1994). These enzymes have very similar three dimensional structures.

The monomer is approximately 100 Å long and can be divided into three predominantly helical domains. In human ASL, domain 1 includes residues 1-112, domain 2 residues 113-362 and domain 3 residues 363-436 (Turner et al. 1997). Domain 2 is elongated, and is essentially a bundle of five long helices. Domains 1 and 3 are more compact, spherical domains. Fig. 6a shows the overall structure of the monomer of ASL.

ASL is catalytically active as a tetramer. Fig. 6b shows the symmetric arrangements of the four monomers in the tetramer. The formation of the tetramer can be described as the arrangement of two dimeric pairs where each dimer is formed through mainly hydrophobic interactions between helices h8, h11, and h12. The numbering of the helices is consistent with that described for turkey δ1 crystallin (Simpson et al. 1994). Less extensive interactions between dimers result in the formation of a tetramer with h12 of each monomer forming the primary, innermost interface in the 20-helix bundle of the tetramer.

The active site is built from regions of three different subunits (Fig. 6b), such that there are four active sites per tetramer. At each end of the tetrameric helical bundle there are two active sites.

Page 28: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

26

Fig. 6. (a) Structure of the monomer of human ASL. The three structural domains are indicated in three different colours (domain 1: cyan, domain 2: green, domain 3: orange), and the conserved amino acid regions C1, C2 and C3 are shown in magenta. (b) Structure of the ASL tetramer. Four different subunits are shown in different colours (A: cyan, B: orange, C: green, D: blue). The active site regions are also shown and coloured in magenta. The tetramer contains four active sites, highlighted by an arrow.

Page 29: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

27

2.7.3 Location of the active site

The very interesting feature of the ASL tetramer is the formation of its active site cleft. The active site is formed by three different loops (C1, C2 and C3) from three different subunits. These three regions are conserved among all the members of the superfamily as shown in the Fig. 7. According to the duck δ2 crystallin numbering scheme, the C1 loop includes residues 114-121, C2 loop, residues 159-168 and C3 loop, residues 282-296. These conserved regions are mapped onto the structure of the ASL monomer in Fig. 6. The spatial relationship between these conserved regions becomes more obvious in the tetrameric arrangement of monomers in the active protein where three different monomers contribute a section of conserved sequence to form the active site at each of the four corners of the ASL tetramer as shown by arrows in Fig. 6b.

In case of duck δ2 crystallin the C1 loop is involved in the substrate binding and the C2 and C3 loops carry the postulated catalytic base (His162 or Thr161) and the postulated catalytic acid (Ser283), respectively (Sampaleanu et al. 2002). Among these three different loops, C3 loop (280’s loop in duck δ2 crystallin (Sampaleanu et al. 2002)) is very flexible.

Fig. 7. The three regions of high sequence conservation among members of the ASL superfamily, being C1, C2 and C3 respectively. The catalytic histidine and serine are labelled by a *. D2C, δ2 crystallin; ADS, adenylosuccinase; CMLE, 3-carboxy-cis-cis-muconate lactonizing enzyme. The figure was modified from Turner et al. 1997.

Page 30: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

28

2.7.4 Closed and open conformation of the C3 loop

Among the three catalytic loops, the most interesting loop, from the structural point of view is the C3 loop. In the duck δ2 crystallin the loop carries the postulated catalytic acid Ser283 (Sampaleanu et al. 2002). The loop is very flexible and invisible in most of the structures. There are two duck δ crystallin structures where the loop is visible.

In the argininosuccinate bound S283A duck δ2 crystallin mutant structure the C3 loop is visible and it is in the open conformation. But in the structure of inactive duck δ1 crystallin, there is one sulfate ion bound in the active site and the C3 loop is in closed conformation (Fig. 8) (Sampaleanu et al. 2001a). The sulfate ion interacts directly with Ser281, Lys287, and Asn289, rigidifying the loop and locking it into a specific conformation. The sulfate ion appears to mimic one of the carboxylate acid groups of the succinate moiety of the argininosuccinate substrate. The binding of sulfate ion in the active site and rigid conformation of the closed C3 loop shows that the C3 loop must be involved in the sequestering the substrate from the solvent during catalysis.

Fig. 8. Stereo view of the open and closed conformation of C3 loops in duck δ1 and δ2 crystallin structures. Duck δ1 and δ2 crystallin structres are shown in olive-green and magenta colour respectively. The bound sulfate ion in the closed active site of duck δ1 crystallin is shown as ball-and-stick model.

2.7.5 Reaction mechanism of argininosuccinate lyase

There is much structural and enzymological data available on duck δ2 crystallin and its mutated variants (Sampaleanu et al. 2002, Wu et al. 1998). From the structural and mutational studies (Chakraborty et al. 1999) precise roles of different residues of duck δ2 crystallin in binding of argininosuccinate in the active site have been understood. However, the precise catalytic reaction mechanism of ASL is unclear, because (i) there are no structures of functional wild type ASL enzymes and (ii) in the structures of the active eye lens homologues the C3 loop is disordered.

Page 31: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

29

Analysis of argininosuccinate bound structure of inactive S283A duck δ2 crystallin mutant gives some idea about the reaction mechanism of argininosuccinate lyase. Previously His162 has been hypothesized to be involved in proton abstraction (Simpson et al. 1994). In the duck δ2 crystallin S283A mutant structure, His162 and Thr161 are quite close to the Cβ atom of succinate moiety of the substrate, so proton abstraction can be done by either His162 or Thr161. Although the mutagenesis data (Chakraborty et al. 1999) has highlighted the importance of His162 possibly as a base, but the role of His162 as the catalytic base is disputed by the observation that this histidine is not completely conserved in the superfamily. Thr161 is highly conserved across the superfamily, and could also be the potential catalytic base. The activation of Thr161’s hydroxyl oxygen for proton abstraction could potentially be accomplished by its interaction with Lys289. From the structural data it is seen that Lys289 might also be involved in stabilizing the negatively charged carbanion intermediate (Sampaleanu et al. 2002).

The catalytic acid (Fig. 5) of the ASL reaction mechanism is still unclear. It has been proposed that Ser283 in duck δ2 crystallin may be acting as a proton donor (Sampaleanu et al. 2001a). But in the S283A duck δ2 crystallin mutant structure the 280s loop is in the open conformation with the Cβ atom of Ala283 ~7.75 Å from the Cβ atom of the substrate (Sampaleanu et al. 2002) making it difficult to confirm the catalytic role of this side chain. In contrast, in the duck δ1 crystallin structure (Sampaleanu et al. 2001a) where the 280s loop is in the closed conformation, the Oγ atom of Ser283 is only 4.5 Å away from the substrate.

The possible reaction mechanism of breakdown of argininosuccinate by duck δ2 crystallin has been proposed as follows (Sampaleanu et al. 2002). After substrate binding, Thr161 (or His162) abstracts a proton from Cβ atom of argininosuccinate to form a carbanion intermediate. Either during or following the base catalysis step, the 280s loop closes over the active site and sequesters the substrate from the solvent. The negative charge developing on the substrate is stabilized by Lys289. The movement of the 280s loop together with the change in hybridization state of the substrate’s Cα and Cβ atoms from sp3 to sp2, place Ser283 in the position to donate a proton to the N1 atom. After proton donation, the unstable negatively charged Ser283 will need to be reprotonated and it is proposed, that might be the driving force for the loop to return to the open conformation, where Ser283 could acquire a proton from the solvent. The loop movement would also allow the products, fumarate and then arginine, to be released (Raushel & Garrard 1984). Alternately, if there is no requirement for an acid group, an uncharged arginine would be produced, and the solvent could donate the proton required to produce charged arginine. The schematic representation of the possible reaction mechanism is shown in Fig. 9.

Nevertheless a precise description of its reaction mechanism has been elusive due to the lack of a crystal structure of an active metabolic ASL having a liganded competent active site. More crystallographic binding studies using active metabolic ASL are required to understand the reaction mechanism in details.

Page 32: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

30

Fig. 9. Schematic representation of breakdown of argininosuccinate by duck δ2 crystallin. The figure was modified from Sampaleanu et al. 2002.

2.7.6 Argininosuccinate lyase deficiency

The urea cycle disorder argininosuccinate aciduria is caused by deficiency of argininosuccinate lyase. Argininosuccinate lyase deficincey (ASLD), which was first recorded in the late1950s (Allan et al. 1958), is transmitted as an autosomal-recessive trait and occurs with an incidence of approximately 1:70,000 live birth (Levy et al. 1980). It leads to hyperammonaemia, accumulation of argininosuccinic acid in the body fluids and depletion of arginine. The clinical course of ASLD is of marked heterogeneity. The onset ranges from neonatal (in the majority of cases with severe symptoms, such as life-threatening hyperammonaemic crises, seizures, hypothermia, hyperventilation, vomiting and lethargy) to late (mainly with psychomotor radiation and mental disorders). Therapy consists of protein restriction, supplementation of arginine and stimulation of alternative routes of nitrogen detoxification (Brusilow & Horwich 2001, Brusilow & Maestri 1996, Burton 1998, Wu 1991). The clinical outcome is unsatisfactory in many cases, although asymptotic indivuals, diagnosed in routine urine tests, have been reported (Applegarth et al. 1975, Brusilow & Horwich 2001).

-OOC

NH3+

NH

NH

H

COO-

COO-NH2+

-OOC

NH3+

NH

NH

COO-

COO-NH2+

-OOC

NH3+

NH

NH

COO-

COO-NH2+

:B

-OOC

NH3+

NH

NH

NH2

COO-

COO-

-OOC

NH3+

NH

NH2

NH2+

HO:H

-OOC

NH3+

NH

NH

COO-

COO-NH2+

-OOC

NH3+

NH

NH2

NH2+

B:H

COO-

COO-

Argininosuccinate

Base catalysis: His162 or Thr161 ?(280s loop closed)

Arginine Fumarate

Fumarate

Arginine

Acid catalysis: S283(or another residue)Without acid catalysis

Carbanion intermediate(Stabilized by K289 ?)

Page 33: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

31

2.8 Racemases

Enzymes are commonly referred to as catalytic proteins with a unique substrate specificity, including the specific recognition of the (R)- form or the (S)- form of a chiral molecule. Racemases are enzymes which differ in this respect as they have evolved to bind both the (R)- form and the (S)- form of the substrate with similar affinity in order to interconvert these (R)- and (S)- forms. In the next section the R, S nomenclature will be summarized. Subsequently, the currently available structural data of racemases will be reviewed and there after the properties of the α-methylacyl-CoA racemase will be discussed.

2.8.1 Enantiomers: R and S nomenclature

“Chirality is a property of any geometrical figure if its image in a plane mirror, ideally realized, cannot be brought to coincide with itself” (Lord Kelvin 1893) (Stryer 1988). The word ´chiral´ is derived from the Greek word ´cheir´, meaning hand (Lamzin et al. 1995). It is well known from geometric principles that any three points always lie in a plane. If the number of points is increased to four, they no longer need to lie in a plane and therefore become candidates to serve as a chiral geometric structure. All chiral molecules have non-superimposable mirror images. And as a general rule of thumb, chiral molecules must have one or more chiral centers that is, carbons that have four non-identical substituents around it. A classic case of a simple chiral molecule is the following halogenated methane derivative (Fig. 10).

Fig. 10. Two enantiomers of the same compound are shown. The compounds (a) and (b) are non superimposable mirror images of each other.

This carbon atom has four non-identical substituents around it, making this carbon a chiral center, and as proof of its chirality the molecule has a non-superimposable mirror image. A term used in the literature to describe molecules that are mirror images of each other is “enantiomer”: one of a pair of molecular species that are mirror images of each other and non-superimposable (Eliel & Wilen 1994). In Fig. 10, the enantiomeric mate of the left molecule is the molecule on the right, its mirror image.

Br

I

Cl

F

BrCl

F I

(a) (b)

Page 34: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

32

To distinguish between enantiomers, it is desirable to give differentiating symbols to each enantiomer. Nowadays the Cahn-Ingold-Prelog (CIP) system is used to define the configuration of the enantionmers. In this system, the (italicized) descriptors R and S, which are placed in parentheses and connected with a hyphen only when used as prefix of chemical names, are used to denote the configurations of enantiomers. To determine the proper configuration of a chiral center, the CIP chirality rule (Cahn et al. 1966) is employed. In general, the four ligands attached to the chiral center can be arranged in a unique sequence; for the moment let us call these ligands A, B, D, and E with the sequence of priority is A precedes B precedes D precedes E (A > B > D > E). The CIP chirality rule requires that the model be viewed from the side opposite to that occupied by the ligand E of lowest priority (Fig. 11). The remaining three ligands then present a tripodal array, with the legs extending towards the viewer. If the sequential arrangement or sense of direction of these three ligands (A to B to D) is then clockwise (Fig. 11a), the configurational descriptor is R (for Latin rectus meaning right), if it is counterclockwise (Fig. 11b), it is S (for Latin sinister meaning left).

In general the priority of the ligands attached to the chiral center is determined by the atomic number. Ligands with higher atomic number precede those with lower ones, e.g., Br precedes Cl (Br > Cl). This is the simplest way of determining the priority of different ligands. More detailed explanation about the definition of the priority of more complex ligands in the CIP chirality rule is given by Eliel and Wilen (Eliel & Wilen 1994).

2.8.2 Racemization

A chiral molecule is said to have “optical activity”. This refers to the observation that the plane of the polarized light is changed when passing through a solution of a pure R or pure S compound. The pure R or pure S compounds rotate the plane of the polarized light in opposite directions. If in a solution of pure R compound, this compound is converted into the S compound then eventually a mixture of 50 % R compound and 50 % S compound is formed. This chemical conversion is referred to as racemization. The final mixture, the racemic mixture, has no optical activity (Finar 1959). In the process of racemization, the chemistry of the compound is unaffected but the configurational integrity of the sample is changed. Racemization is an irreversible process arising from the reversible interconversion of enantiomers; it is always associated with the disappearance of optical activity. Uncatalyzed racemization is generally a quite slow process. Compounds are capable of racemization under the influence of heat, light or chemical reagents like acids and bases.

Page 35: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

33

Fig. 11. Chirality rule for (R) and (S) nomenclature. E is the lightest atom and is pointing away from the viewer. D is lighter than B which is lighter than A. The figure was modified from Eliel & Wilen 1994.

2.8.3 Racemases and epimerases

In biological systems, the enzymes which catalyze the inversion of stereochemistry of biological molecules are known as racemases and epimerases (Tanner 2002). The racemases typically interconvert enantiomers which have only one chiral center (like amino acids), whereas epimerases usually refer to enzymes which act on substrates which have several chiral centers like sugar molecules. It has been found that racemases and epimerases catalyze with equal efficiency the (R)- to (S)- as well as the (S)- to (R)- interconversion of their natural substrates (Tanner 2002). Most of the biological molecules are chiral, including proteins, nucleic acids, polysaccharides, lipids and steroids. If polymers, they are built up of units possessing chiral centers and the resulting three dimensional fold is chiral. Indeed, any polymer chain folded into a three dimensional structure will generally adopt one of two mirror images: for proteins where the individual units are chiral, only one of two images is energetically feasible. Only one of these mirror images is synthesized by living organism (Lamzin et al. 1995). So enzymes are inherently asymmetric. The interesting question is, how (and why) can the racemases and epimerases, which are inherently asymmetric, deal with both enantiomers with almost equal efficiency. Although most enzymes are famous for being very stereoselective, racemases and epimerases act on both enantiomers with almost equal efficiency. The logical answer to this question is that racemase and epimerases must have evolved a functional asymmetry or pseudosymmetry in their active site. There are several racemases and epimerases in the living organism. Structural studies of some racemases

B

AD

E

A B

D

E

B

DA

E

D B

A

E

(R)-Compound

(S)-Compound

(a)

(b)

Page 36: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

34

and epimerases have been done to understand the structure-function relationship of these racemases (Table 2).

Although racemases and epimerases can act on both enantiomeric substrates, nevertheless, under physiological conditions, usually one enantiomer is the essential substrate. For example, although Amacr can act on the both enantiomeric substrates of 2-methyl branched chain fatty acyl-CoAs, nevertheless in the living organism the enzyme is only converting (R)-enantiomers to the (S)-enantiomers, which then get oxidized by the β-oxidation pathway (Savolainen et al. 2004). A literature review on several racemases, in particular those whose structure-function relationships are well characterized, is presented in the next sections.

Page 37: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

35

Table 2. Different epimerases and racemases in the PDB (December 2005).

Enzyme name Source PDB entry code Epimerase

Escherichia coli 1A9Y, 1A9Z, 1KVQ, 1KVR, 1KVS, 1KVT, 1KVU, 1LRJ, 1LRK, 1LRL, 1NAH, 1NAI, 1UDA, 1UDB, 1UDC, 2UDP

Homo sapiens 1EK5, 1EK6, 1HZJ, 1I3K, 1I3L, 1I3M, 1I3N,

UDP-galactose-4-epimerase

Trypanosoma brucei 1GY8 GDP-4-Keto-6-Deoxy-D-Mannose Epimerase

Escherichia coli 1BWS

Dtdp-4-Keto-6-Deoxy-Glucose-5-Epimerase

Amycolatopsis orientalis 1OFN

Diaminopimelate epimerase Haemophilus influenzae 1BWZ, 1GQZ Salmonella typhimurium 1DZR, 1DZT Dtdp-6-Deoxy-D-Xylo-4-Hexulose

3,5 Epimerase Methanobacterium thermoautotrophicum

1EPZ

Udp-N-Acetylglucosamine 2-Epimerase

Escherichia coli 1F6D, 1VGV

Pseudomonas aeruginosa 1SB8, 1SB9 Udp-N-Acetylglucosamine 4-Epimerase Bacillus subtilis 1O6C Methylmalonyl-Coenzyme A Epimerase

Propionibacterium shermanii 1JC4, 1JC5

L-Ala-D/L-Glu Epimerases Escherichia coli 1JPD, 1JPM Bacillus Subtilis 1TKK L-Ribulose-5-Phosphate 4-Epimerase

Escherichia coli 1K0W

D-Ribulose 5-Phosphate 3-Epimerase

Synechocystis sp. 1TQJ, 1TQX

Lactococcus lactis 1L7J, 1L7K, 1MMU, 1MMX, 1MMY, 1MMZ, 1MN0, 1NS0, 1NS2, 1NS4, 1NS7, 1NS8, 1NSM, 1NSR, 1NSS, 1NSU, 1NSV, 1NSX, 1NSZ

Galactose Mutarotase

Homo sapiens 1SNZ, 1SO0 Tyvelose Epimerase Salmonella typhi 1ORR Rmlc Epimerase Mycobacterium tuberculosis 1UPI ManNac-6-P Epimerase Staphylococcus aureus 1Y0E Acetylmannosamine-6-P Epimerase Staphylococcus aureus 1YXY GDP-mannose 3,5-epimerase Arabidopsis thaliana 2C54, 2C59, 2C5A, 2C5E

Racemase Aquifex pyrophilus 1B73, 1B74 Glutamate racemase Bacillus subtilis 1ZUW Bacillus stearothermophilus 1BD0, 1EPV, 1FTX, 1L6F, 1L6G, 1NIU,

1XQK, 1XQL, 2SFP Pseudomonas aeruginosa 1RCQ, 1SFT Streptomyces lavendulae 1VFH,1VFS, 1VFT

Alanine racemase

Mycobacterium tuberculosis 1XFC Pseudomonas aeruginosa 1DTN, 1MDL Mandelate racemase Pseudomonas putida 1MDR, 1MNS, 1MRA, 2MNR

Aspartate racemase Pyrococcus horikoshii 1IU9, 1JFL N-Acylamino Acid Racemase Deinococcus radiodurans 1R0M Proline racemase Brucella melitensis 1TM0 Serine racemase Schizosaccharomyces pombe 1V71

Page 38: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

36

2.8.4 Understanding the general strategy of the racemization reaction mechanism

To catalyze an inversion of stereochemistry, a racemase must break and reform a bond in a nonstereospecific manner. Concerning a chiral carbon atom, the racemization can be achieved for example by breaking and reforming a C-C bond, but in the biological system it very often concerns breaking and reforming a C-H bond. For such enzymes the puzzling question concerns the mechanisms by which these racemases catalyze the rapid proton exchange involving carbon-hydrogen bond breaking of protons with relatively high pKa. Most racemases act at a stereogenic center adjacent to a carbonyl functionality and reversibly cleave a carbon-hydrogen bond (Tanner & Kenyon 1998). In this case the pKa of the hydrogen is lowered because of resonance stabilization of the resulting anion, allowing the enzymes to use a deprotonation and reprotonation mechanism. Detailed structural studies on different racemases have been reported, for example on mandelate racemase (Hasson et al. 1998), alanine racemase (Watanabe et al. 2002), glutamate racemase (Hwang et al. 1999) and aspartate racemase (Liu et al. 2002), which are all proposed to work via the initial abstraction of a proton from the chiral carbon atom. Each of these enzymes catalyzes an overall 1,1-proton transfer reaction and have two catalytic protic residues which are required for (i) the initial proton abstraction from the substrate and for (ii) the subsequent proton donation to the intermediate (Fig. 12). The reaction mechanism of some of these racemases will be discussed briefly in the next sections.

Fig. 12. General acid base catalysis of the racemization mechanism. In this reaction scheme B1 is acting as a catalytic base and B2:H is acting as a catalytic acid in the left to right direction.

2.8.4.1 Alanine racemase

The D-isomer of alanine is an essential building block in the synthesis of the peptidoglycan layer of cell walls in Gram-positive and Gram-negative bacteria. It is known that the synthesis of the bacterial cell wall starts with UDP-N-acetylmuramic acid, to which free amino acids are added (Adams 1976). UDP-N-acetylmuramyl-L-Ala-D-Glu-meso-diaminopimelate-D-Ala-D-Ala is a key intermediate in the initial stage of

B1:

R3

R1

O

H R2

B2:H

B1:H

B2:H

R2

R3

R1

O-

B1:H

B2:

R3

R1

O

R2H

Page 39: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

37

synthesis of peptidoglycan layer. D-alanine is added onto the growing amino acid chain as D-alanine dipeptide. This dipeptide is produced by two enzymes, alanine racemase and D-Ala:D-Ala ligase. Alanine racemase, the first enzyme of the biosynthetic pathway of peptidoglycan synthesis catalyzes the conversion of the common L isomer of alanine to the D isomer (Shaw et al. 1997).

Alanine racemase belongs to the fold-type III group of PLP dependent enzymes (Eliot & Kirsch 2004). The first three dimensional structure of alanine racemase from Bacillus stearothermophilus has been determined by Shaw et al (Shaw et al. 1997) at 1.9 Å resolution. The monomer of alanine racemase is composed of two domains, an eight stranded α/β barrel at the N-terminus, which includes residues 1-240, and a C-terminal domain primarily composed of β-strand (residues 241-388). The crystal structure shows that the PLP cofactor is present in and above the mouth of α/β barrel and covalently linked to Lys39 which is present at the C-terminus of the first β-strand of the α/β barrel. This has been the first structure of a PLP cofactor binding in the active site of a α/β barrel. A number of other residues are involved in maintaining the proper position of the PLP in the protein. Among those residues, Arg219 is the most interesting, as it forms a hydrogen bond with the nitrogen atom of the pyridine moiety of the cofactor. Another arginine residue, Arg136, donates a hydrogen bond to the phenolic oxygen of PLP and may be involved in the binding of substrate as well as stabilization of the intermediates. One more interesting residue is Tyr265, which is coming from the second monomer, and plays an important role in the catalysis (Fig.13).

Fig. 13. Zoomed in view of the active site of alanine racemase from Bacillus stearothermophilus complexed with (a) PLP-L-Ala and (b) PLP-D-Ala. Two different subunits are shown in orange and green colours. The active site amino acid residues are shown as stick model and the compounds are shown as ball-and-stick model. The acid base pair concern the Lys39 and Tyr265 residues.

For the progress of the racemization reaction, alanine racemase needs to abstract the substrate’s α-proton and subsequently donates the proton to the opposite face of the resulting planar anionic intermediate. In the fully protonated form of the amino acid, the pKa of the α-proton is ∼ 21; however, in solution at neutral pH, this value would be considerably higher because of the presence of carboxylate anion (Tanner 2002). So at this situation deprotonation of the α-proton is a difficult task. One way in which nature

Page 40: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

38

has managed to overcome this difficulty is through use of PLP. The formation of an imine linkage between the cofactor and the substrate greatly acidifies the α-proton as the anionic intermediate is stabilized by resonance (Watanabe et al. 2002).

The mechanism followed by PLP-dependent alanine racemase proceeds via an initial transaldimination between the lysine bound PLP cofactor and the amino group of the substrate. The resulting adduct is then deprotonated to give a resonance stabilized carbanion. Reprotonation on the opposite face and transaldimination yields the enantiomer of the substrate. To understand the reaction mechanism of alanine racemase, the structures of the enzyme complexes with (R)-1-aminoethylphosphonic acid, a synthetic L-alanine analog serving as a slow tight inhibitor (Stamper et al. 1998, Morollo et al. 1999) have been studied. These crystallographic studies have suggested that the enzyme uses two catalytic bases, Tyr265 and Lys39, for the reaction. Recently Watanabe et al has determined the crystal structure of alanine racemase from Bacillus stearothermophilus bound with reaction intermediate analoges, N-5′-phosphopyridoxyl-L-alanine (PLP-L-Ala) and N-5′-phosphopyridoxyl-D-alanine (PLP-D-Ala) (Watanabe et al. 2002). In this analogue the C-N double bond (Fig. 4) of the Schiff base has been reduced to a C-N single bond. This structure is the first structure of this enzyme which shows the proper orientation of Tyr265 and Lys39 with respect to the α-proton of the two enantiomeric substrates. Fig. 13 shows the zoomed in view of the active site. The crystal structure clearly shows the orientation of the methyl group of the substrate and other functional residues in the active site. It is clear that the α-hydrogen of either enantiomer points to the corresponding residue: α-hydrogen of the L-enantiomer toward Tyr265 and that of the D-enantiomer toward Lys39. The interatomic distances are also close enough to cause a reaction: 3.6 Å from Cα of PLP-D-Ala to the OH of Tyr265, and 3.5 Å from Cα of PLP-D-Ala to NZ of Lys39 (Watanabe et al. 2002). Based on these crystallographic data the reaction mechanism of L-alanine by alanine racemase is proposed to proceed as shown in Fig. 14. According to the two-base mechanism, Tyr265 is unprotonated and Lys39 is protonated in the reaction with L-alanine. The deprotonated Tyr265 abstracts the α-hydrogen from L-alanyl-PLP aldimine to produce the alanyl-PLP carbanion intermediate, and the tyrosine itself is converted to the protonated form. On the other hand, the protonated Lys39 provides its proton to the carbanion intermediate to produce D-alanyl-PLP aldimine, and Lys39 itself becomes unprotonated.

Page 41: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

39

Fig. 14. Classical two-base dependent reaction mechanism of the alanine racemase reaction through the quinonoid intermediate formation. Lys39 and Tyr265 act as the catalytic bases abstracting the α-proton from D-alanine and L-alanine respectively. Lys39 and Tyr265 are from two different subunits. The figure was modified from Watanabe et al. 2002.

2.8.4.2 Glutamate racemase

The enzyme glutamate racemase catalyzes the interconversion of the enantiomers of glutamic acid and provides bacteria with a source of D-glutamate. D-glutamate is used in the construction of peptidoglycan, which rigidifies the bacterial cell wall and protects the organism from osmotic lysis (Glavas & Tanner 1999). As discussed before, it is a very difficult task to remove the α-proton from amino acids. The enzyme alanine racemase overcomes this problem by using a cofactor PLP. The reaction mechanism catalyzed by glutamate racemase is very interesting because it is among a group of enzymes which does not require any cofactor. Mechanistic studies on glutamate racemase show that the reaction in 2H2O proceeds with the incorporation of solvent-derived deuterium at Cα atom and that the racemization of either enantiomer of [2-2H] glutamate is slowed by primary kinetic isotope effect (Nakajima et al. 1988, Tanner et al. 1993). These observations support a deprotonation/reprotonation mechanism by an acid-base pair. Further studies showed that glutamate racemase uses two active site cysteine residues in promoting the 1,1-proton transfer process. Experiments with deuterated substrates confirm the two-base mechanism involving monoprotic bases that do not undergo proton exchange with solvent during the lifetime of the carbanionic intermediate (Choi et al.

N

H

P

NH

O

HO

O

N

H

P

NH

O

O

O

O

Lys39-NH3

COO

NH2

Lys39-NH2

N

H

P

NH

O

Lys39

N

H

P

NH

O

O

O

H

Lys39-NH2

COO

NH2

D-Ala

L-Ala

Tyr265*

Tyr265*-OH

Lys39-NH2

Page 42: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

40

1994, Gallo et al. 1993). In all cases, the cofactor-independent amino acid racemases are thought to employ cysteines as the catalytic acid/base residues (Koo et al. 2000). Mutagenesis studies of Lactobacillus fermenti glutamate racemase, have identified two cysteine residues (Cys73 and Cys184) to be involved in catalysis (Tanner et al. 1993). Based on the biochemical and mutagenesis data the possible reaction mechanism by glutamate racemase is proposed to proceed as shown in the Fig. 15.

Fig. 15. Mechanism of racemization reaction by glutamate racemase. The figure was modified from Tanner 2002.

According to this reaction mechanism scheme Cys73 is responsible for the deprotonation of D-glutamate and Cys184 is responsible for the deprotonation of L-glutamate.

The X-ray structure of Aquifex pyrophilus glutamate racemase has been determined at 2.3 Å resolution with the weak inhibitor glutamine (Hwang et al. 1999) and it confirms the presence of two corresponding cysteine residues in the active site. The enzyme forms a dimer and each monomer has a α + β topology. This structure has given the first structural details of the architecture of the active site of this enzyme. The crystal structure shows that the substrate analogue, D-glutamine is bound in the deep pocket formed by conserved residues from two monomers. However, from the crystal structure it is quite difficult to analyse the role of the different active site residues since the glutamine is bound in a reversed orientation and likely does not represent the Michaelis complex. It is also not clear how the oxyanion formed in the intermediate is stabilized in the enzyme active site.

Recently the crystal structure of Bacillus subtilis glutamate racemase (RacE) complexed with D-glutamate has been determined (Ruzheinikov et al. 2005). The crystal structure shows that the glutamate is buried in a deep pocket. Binding of glutamate in the active site pocket causes large scale conformational changes in the structure. In this liganded structure the active site is much more shielded from solvent, however the relative positions of the catalytic residues are not effected. The active site of RacE is shown in Fig. 16.

O

O O

OH NH3

+

S

Cys73

SH

Cys184

O

O O

O

SH

Cys73

SH

Cys184

NH3+

O

O O

O+H3N H

SH

Cys73

S

Cys184

D-Glutamate L-Glutamate

Page 43: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

41

Fig. 16. Zoomed in view of the active site of Bacillus subtilis glutamate racemase complexed with D-glutamate. Two different domains are shown in orange and green colour. The active site residues are shown as stick model and the bound D-glutamate is shown as ball-and-stick model.

Two catalytic residues Cys74 and Cys185 are located at equivalent positions on the two domains of RacE, with a 7.4 Å spacing between their respective sulfur atoms. These two residues are also close to the side chains of Asp10 (3.9 Å) and His187 (4.2 Å) respectively. Furthermore, they are also in close proximity to the α-carboxyl (Cys74 and Cys185) and the α-amino and side chain carboxyl (Cys74 only) of the glutamate substrate. Spatial arrangements shows that the α-carbon atoms of the glutamate substrate and the sulfur atoms of Cys74 and Cys185 are located approximately 4 Å away from the N-terminal ends of helices α3 and α8. It has been proposed that the developing negative charge generated during the catalytic reaction could also be stabilized by the dipoles of these helices (Hol et al. 1978).

2.8.4.3 Mandelate racemase

Mandelate racemase (MR) catalyzes the interconversion between (R)-mandelate and (S)-mandelate (Hegeman et al. 1970, Gerlt et al. 1992). MR is a Mg2+ dependent racemase. In this enzyme, the reaction also proceeds by abstraction of the α-proton from one face of the substrate, reprotonation of the α-carbon from the other face completes the reaction with formation of the stereochemically inversed product (Fig.17).

Page 44: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

42

Fig. 17. The reaction catalyzed by mandelate racemase.

To understand the reaction mechanism of MR several structural studies (Neidhart et al. 1991, Kallarakal et al. 1995, Landro et al. 1994, Mitra et al. 1995, Schafer et al. 1996) have been done. MR has an α,β-barrel structure similar to the structure of triose phosphate isomerase (TIM). Fig. 18 shows the active site of MR bound with (S)-atrolactate (Kallarakal et al. 1995). The two key catalytic bases, His297 and Lys166 (Powers et al. 1991, Neidhart et al. 1991) which are involved in the catalytic reaction by abstraction and donation of proton at the α-carbon are shown. It also shows the other key catalytic amino acid residues involved in the catalysis, namely, the carboxylates of Asp195, Glu221 and Glu247, which chelate the absolutely required Mg2+. Glu317 is hydrogen bonded with the carboxylate group of the substrate. Lys164 and the essential Mg2+ provide charge neutralization of the substrate carboxylate, and have been postulated to stabilize the enolate reaction intermediate. The important interaction is the interaction between Glu317 and the substrate carboxylate group. Glu317 is hydrogen bonded through its side chain carboxylate to the peptide backbone and to Ser139 on the opposite side of the substrate, thus positioning it very precisely. This is reflected in the fact that the effective concentration of Glu317 is higher than that of any other catalytic residue (Bearne & Wolfenden 1997). Glu317 forms a charged hydrogen bond to the substrate carboxylate group, suggesting that it is protonated. Consequently, it has been proposed that concerted acid-base catalysis takes place: Glu317 protonates the enolate intermediate to stabilize it and lower the pKa of the α-proton. Indeed, protonation of the neutral acid carboxyl group could lower the pKa of α-proton to ~7 (Gerlt et al. 1991). Consequently, Gerlt et al (1991) suggested that deprotonation of the α-carbon is concerted with protonation of the nascent enolate species by Glu317, and that the enolate is further stabilized by hydrogen bonds and Mg2+.

H OH

O

O

Mg2+

O

O

OH

Mg2+

B2H+B1 B1H+ B2H+ B1H+

HO H

O

O

B2

Mg2+

(R)-Mandelate (S)-Mandelate

Page 45: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

43

Fig. 18. The stereo picture showing the zoomed in view of the active site of mandelate racemase complexed with (S)-atrolactate. (S)-atrolactate, is similar to (S)-mandelate, except the Cα atom has one methyl group instead of hydrogen. The important active site residues are shown as stick model and the compound is shown as ball-and-stick model. The important magnesium ion is shown as sphere.

The racemization reaction proceeds via a two-base mechanism, with His297 and Lys166 abstracting the α-proton from (R)-mandelate and (S)-mandelate respectively. In addition it has been observed that Glu317 acts as an electrophilic catalyst (Mitra et al. 1995), Lys164 interacts with the carboxyl function of mandelate (Neidhart et al. 1991). Mg2+ is interacting with the hydroxyl group and one of the oxygen of carboxylic acid group of the substrate. The proposed reaction mechanism of racemization by mandelate racemase is shown in the Fig. 19.

MR can catalyze the interconversion of both (R)-mandelate and (S)-mandelate. So the enzyme must be capable of binding both enantiomers. However, all crystal structures of MR complexes have either (S)-mandelate or analogues of (S)-mandelate bound in the active site. From the crystal structures it is quite clear that there is a hydrophobic pocket in the active site that binds the phenyl group of the substrate. It has been shown by kinetic studies and molecular dynamics simulation that there are two different binding pockets (R- and S- pockets) in the active site of MR for binding of the two different enantiomers (Siddiqi et al. 2005). First, the phenyl group of either (R)- or (S)-mandelate is bound at its corresponding pocket. The phenyl group then moves through the hydrophobic pocket to form a planar intermediate upon substrate deprotonation, and then comes to rest in the pocket for the product enantiomer after stereospecific reprotonation. The schematic representation of two different pockets in the MR active site is shown in Fig. 20.

Page 46: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

44

Fig. 19. The proposed reaction mechanism of mandelate racemase. The figure was modified from Mitra et al. 1995.

In summary, the major contributions to MR catalysis is due to the general acid and base (Lys166, His297), and the precise positioning of the substrate and enzyme catalytic residues. Additional contributions are due to hydrogen bonding and increased electrostatic stabilization of the transition state. The most important component of electrostatic catalysis must be the Mg2+ cofactor. It seems that, during the catalytic reaction the hydroxyl group and the carboxylate acid group are fixed. The phenyl group is changing its position as well as the α-proton (Siddiqi et al. 2005). During the catalysis the phenyl ring is moving between the R- and S- pocket.

Fig. 20. Two different binding pockets in the active site of mandelate racemase. The figure was modified from Siddiqi et al. 2005.

HO O

O H O C

O

Glu317

NH3+ Lys164

Ph

Mg2+

Lys166 H2NH+

HO O

O

Ph

H H O C

O

Glu317

NH3+ Lys164

NH+HN

His297

Mg2+

Lys166 H2N

NH+HN

His297

HO O

O

H

Ph H O C

O

Glu317

NH3+ Lys164

Mg2+

Lys166 H2NH+

NHN

His297

O O-

OH

H

-O O-

OH

O O-

OHH

S-pocket

R-pocket

Page 47: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

45

2.8.4.4 Aspartate racemase

Aspartate racemase catalyzes the reversible interconversion of L and D-aspartate. The initial characterization and partial purification of aspartate racemase was performed on the enzyme from Streptococcus faecalis (Lamont et al. 1972). Subsequently the recombinant enzyme from Streptococcus thermophilus has been overexpressed in E. coli and purified (Okada et al. 1991, Yohda et al. 1991). The enzyme is similar to glutamate racemase, it shows high substrate specificity, is inactivated by thiol-specific reagents and does not require any cofactor for the catalysis. Mechanistic studies in D2O indicated that a two-base mechanism is utilized and that a primary isotope effect slows the reaction of α-deuterated substrates (Yamauchi et al. 1992). Aspartate racemase shows significant sequence homology with glutamate racemase and both the active site cysteine residues are conserved (Gallo & Knowles 1993, Yohda et al. 1991). This suggests that two different cysteine residues serve as the base, as in the case of glutamate racemase.

Recently the crystal structure of aspartate racemase from Pyrococcus horikoshi OT3 has been determined at 1.9 Å resolution (Liu et al. 2002). Each subunit of P. horikoshi OT3 aspartate racemase consists of 228 amino acids. In the crystal form the enzyme is present as a dimer. Each subunit consists of two compact domains. The N-terminal domain is composed of residues 1-102 and 214-228, and the C-terminal domain includes residues 103-213. In each domain, a central four-stranded parallel open twisted β-sheet is flanked by six α-helices, three on each side. The structural comparison shows that the folding of P. horikoshi OT3 aspartate racemase subunit is similar to that of A. pyrophilus glutamate racemase although they have very low sequence identity (~26 %). The dimeric structure of P. horikoshi OT3 aspartate racemase is formed by two mononomers related by a twofold non-crystallographic symmetry (NCS). One interesting feature of the dimeric structure is the inter-subunit disulfide linkage formed by Cys73 and Cys73′. This disulfide bond may play a significant role in increasing the thermostability of the dimer.

Sequence and structural comparison shows that P. horikoshi OT3 aspartate racemase has two catalytic cysteine residues, Cys82 and Cys194. The spatial arrangement of the conserved residues Cys82 and Cys194 strongly indicated that the active site of P. horikoshi OT3 aspartate racemase is located in the cleft between the two domains. The two strictly conserved regions, Cys82-Asn83-Thr84 and Gly193-Cys194-Thr195 are exposed to the cleft from both side and face to each other. The active site residues are organized in a pseudo mirror-symmetry (Fig. 21). Cys82 and Cys194, which are located at the waist of the pocket, are related to each other by the pseudo mirror. The orientation of their thiol groups are related by this pseudo mirror-symmetry. At the bottom of the cleft two threonines, Thr84 and Thr124, are located symmetrically on both sides of the pseudo mirror. Side chains of Asn83 and Thr127 are located on the pseudo mirror, indicating a self-symmetric arrangement. Interestingly, in the mouth of the cleft, Arg48 and Lys164 are found in the mirror symmetric position. These residues are conserved in all aspartate racemase from different organism.

Page 48: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

46

Fig. 21. Pseudo mirror-symmetric arrangement of the active residues of Pyrococcus horikoshi OT3 aspartate racemase. Two domains are coloured as green and orange, The active site residues are shown as stick model. The view of this figure was adapted from Liu et al. 2002.

The arrangement of the residues in the active site allows the substrate to enter the cleft and then two cysteine residues act synergistically as the catalytic base/acid pair to abstract/donate a proton from/to the α-carbon of the substrate/intermediate. The transition state of proton transfer between two atoms is a hydrogen-bond interaction (Schowen et al. 2000). The proton transfer in the racemization (-S-…H-Cα…H-S-) reaction by aspartate racemase should proceed through the approximate distance of two hydrogen bonds. So, the distance between the γ-sulfur atoms of the two active site cysteine residues should be about 8.0 Å, which is twice the sulfur involved hydrogen-bond distance (Gregoret et al. 1991). However, in the crystal structure, the distance between the two γ-sulfur atoms of Cys82 and Cys194 is 9.6 Å. This distance raises the question about the involvement of Cys82 and Cys194 in the catalytic mechanism (for example in the competent glutamate-racemase-substrate complex the distance between the two catalytic sulfurs is 7.4 Å). Being apart at such a large distance, cooperation between the two cysteine residues is probably difficult. Since Pyrococcus is a hyperthermophilic archaeum preferring temperature of 98oC, a so-called “hinge motion” (Gerstein et al. 1994) between two domains could be responsible for narrowing the cleft bringing the two cysteine residues close enough for the catalysis.

2.9 α-Methylacyl-CoA racemase

α-Methylacyl-CoA racemase (Amacr) catalyzes the racemization of (S)- and (R)-enantiomers of a wide spectrum of α-methyl-branched carboxyl coenzyme A thioesters (Fig. 22). In mammals, such substrates are derived from (i) C27-bile acid intermediates (Van Veldhoven et al. 1996, Pedersen et al. 1996), or from (ii) branched-chain fatty acids

Page 49: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

47

(pristanic acid), but also from (iii) 2-arylpropionic acids (ibuprofen). The latter compound is used as a non-steroidal anti-inflammatory drug (Knights 1996). The conversion of α(R)-methylacyl-CoA into α(S)-methylacyl-CoA is a requirement for the subsequent degradation by the β-oxidation pathway, due to the chiral specificity of the β-oxidation oxidases (Kotti et al. 2000). Consequently, Amacr is required for bile acid synthesis and β-oxidation of the α(R)-methylacyl-CoA molecules (Fig. 23). Detailed literature review on Amacr is discussed in the following sections.

Fig. 22. Different substrates of Amacr.

CoA-S

O

OH

OH

CoA-S

O

HHO

Cα Cβ

C2

C2

C2

CoA-S

O

Pristanoyl-CoA

3,7,12-Trihydroxycoprostanoyl-CoA (THCA-CoA)

Ibuprofenoyl-CoA

Page 50: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

48

Fig. 23. Involvement of Amacr in the degradation of phytanic acid.

2.9.1 Reaction catalyzed by Amacr

Amacr does not require any cofactor for the catalytic reaction. The reaction is assumed to proceed via formation of a carbanion which gets stabilized via keto/enol tautomerization with the adjacent thioester group (Schmitz et al. 1994, Mukherji et al. 2003, Reichel et al. 1997) (Fig. 24). Studies with deuterated analogues of ibuprofen at C-2 show loss of label during chiral inversion (Chen et al. 1991, Baillie et al. 1989).

It is assumed that the reaction is catalyzed via a two-base mechanism (Walsh 1979). Schmitz et al (1994) proposed that the reaction is initiated via abstraction of the α-proton which leads to the formation of a carbanion. The carbanion is stabilized by resonance with neighbouring thioester group leading to formation of a planar intermediate. Subsequently the intermediate is reprotonated by an acid delivering the proton from the other side (Fig. 24). Similar kind of two-base racemization reaction mechanism has been proposed for other cofactor independent racemases (Tanner et al. 1993). Schematic representation of the reaction mechanism catalyzed by Amacr is shown in Fig. 24.

Fig. 24. Schematic representation of the reaction mechanism catalyzed by Amacr.

S-CoA

O

CH3 H

(3R)-Phytanic acid (3S)-Phytanic acid

(2R)-Pristanoyl-CoA (2S)-Pristanoyl-CoA

α - Oxidation

β -Oxidation

α - Oxidation

S-CoACH3H

O

α-Methylacyl-CoA racemase

CH3CoA-S

O

HR

B2:H+ B2:H+

CoA-S

O-

R

CH3

B1:H+ B2:

CH3CoA-S

O

RH

B1:H+

(S)-compound (R)-compoundreaction intermediate

B1:

Page 51: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

49

2.9.2 Some properties of Amacr

Amacr was first purified some 5600-fold from rat liver, to apparent homogeneity (Schmitz et al. 1994). The rat enzyme was thought to be a monomer of 45 kDa with an isoelectric point of pH 6.1 and it is optimally active between pH 6 and pH 7. The enzyme gets inactivated by micromolar concentrations of Hg2+ and to a lesser extent by Cu2+ but not by iodoacetamide and only slightly by N-ethylmalemide and thimerosal (Schmitz et al. 1994). Subsequently, the enzyme was also purified from human liver with an isolelectric point of pH 6.1 and is optimally active between pH 7 and pH 8 (Schmitz et al. 1995). Later the cDNA was cloned from rats and mice and the encoded protein had a molecular mass of approximately 39.7 kDa (Schmitz et al. 1997). In rats the highest activity of Amacr was found in liver followed by the Harderian gland, kidneys and intestinal mucosa (Schmitz et al. 1997, Van Veldhoven et al. 1997). In human Amacr expression has been detected in liver, kidneys and at lower levels in the intestinal tract and salivary glands (Schmitz et al. 1995, Zha et al. 2003). The substrate specificities and kinetic constants of human and rat Amacrs are very similar.

Amacr plays an important role in the formation of bile acids and degradation of pristanic acid by peroxisomal β-oxidation (Prydz et al. 1986, Verhoeven et al. 1998). So, probable localization of the enzyme would be peroxisomes. Subcellular localization of Amacr is not similar among different species. It has been shown that human and rat Amacr is primarily (75-90 %) located in peroxisomes and rest is distributed in mitochondria (Schmitz et al. 1995, Van Veldhoven et al. 1997). In mice, the Amacr activity is evenly distributed between peroxisomes and mitochondria (Schmitz et al. 1997, Kotti et al. 2000). In mouse, Amacr is an unmodified product of a single gene which carries both an N-terminal and a C-terminal targetting sequence for import into mitochondria and peroxisomes, respectively (Kotti et al. 2000). The precise factor(s) that determine the bimodal subcellular localization of Amacr is not fully understood yet.

2.9.3 Amacr belongs to family III CoA-transferases

Recent sequence comparison studies have shown that Amacr belongs to a new family of CoA-transferases called family III CoA transferases (Heider 2001). Most of the known enzymes of this family III catalyze CoA-transfer reactions in a highly substrate and stereospecific manner. The amino acid sequences of the family III CoA-transferases are completely different from those of families I and II. The known members of the family III CoA-transferases are formyl CoA-transferase (FRC, AAC45398), (R)-benzylsuccinate CoA-transferase (BbsE, AAF89840; and BbsF, AAF89841), (R)-phenyllactate CoA-transferase (PLCT), carnitine CoA-transferase (CaiB, CAA52112), putative cholate CoA-transferase (BaiF, AAC45415), and α-methylacyl-CoA racemase (Amacr, XP043772). Sequence comparisons (Heider 2001) suggest that these enzymes belong to the same suerfamily. The respective sequence identities are tabulated in Table 3.

Page 52: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

50

Table 3. Identity matrix of members of CoA-transferase family III. The values represent identical amino acids (in %) between the individual full length sequences (Heider 2001).

BbsE BbsF FRC PLCT Caib BaiF BbsF 26 FRC 23 28 PLCT 24 21 25 CaiB 25 18 24 24 BaiF 22 23 27 24 37 Amacr 26 21 25 21 22 22

2.9.3.1 Structural fold of family III CoA-transferases

Recent structural studies of formyl-CoA transferase (FRC) from Oxalobacter formigenes (Ricagno et al. 2003, Jonsson et al. 2004), carnitine CoA transferase (CaiB) (Stenmark et al. 2004, Rangarajan et al. 2005) and the closely related E. coli YfdW protein (Gruez et al. 2003, Gogos et al. 2004) have revealed the fold of this family III CoA-transferases. In FRC the subunit consists of two domains, the large domain and the small domain and two linker regions (α10 and α17-α18) that connect these two domains (Fig. 25a). The large domain consists of both the N-terminal (residues 2-199, β1-β6) and C-terminal part of the protein chain (residues 367-428, β10-α20). The small domain contains residues 219-351, α11-β9). The large domain contains an open α/β-twisted sheet structure with a Rossmann like fold consisting of a central six-stranded parallel β-sheet with helices packed on both sides of it. The small domain starts with a short helix (α11) and a long loop (residues 232-268) connected to a three stranded, anti-parallel β-sheet. The first two strands (β7 and β8) are joined by a short loop, while five helices (α12-α16) are inserted between the second and third strand. After strand β9, the second linker (α17-α18) connects the small domain back to the large domain. The C-terminal part of the large domain consists of a two-stranded β-sheet (β10-β11), a long loop and two short helices (α19 and α20). An interesting feature of the subunit is the hole formed between the domains. In FRC, the hole is approximately 13 Å wide and 22 Å long.

Another interesting feature of the structure is the tight interlocked dimer. The two-fold related subunits are passing through each other to form a tight interlocked dimer (Fig. 25b). The linker helix α10 of molecule A is passing through the hole present in molecule B and vice versa. This puts the domains of one subunit on top of and below the hole of the other subunit. The hole is filled further by α7-α8, but this does not form a knot since this is only a protrusion from the large domain.

Page 53: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

51

Fig. 25. Structure of FRC. (a) Monomer of FRC. Different domains are shown in different colours. Large domain in cyan, small domain in green, two linker regions in orange and two protruding helices are shown in blue. α10 is a kinked helix which connects the large domain with the small domain. α7 and α8 are protruding out of the large domain. α9 is the active site helix. (b) Dimer of FRC. Two subunits are shown in orange and green colour. Bound CoAs in both active sites are also shown as ball-and-stick models.

Sequence comparison between FRC and CaiB shows that there is 30 % identity among the residues from the large domain, whereas the residues from the small domain have no significant sequence identity. This is also reflected in the structures of these two proteins, having very similar large domains, but the small domains are rather different in these two proteins. Both of these CoA transferases are folded as interlocked dimers. So it has been proposed that most likely the other members of the family III CoA transferase have this structural fold although they have very low sequence identity (Stenmark et al. 2004). It is unlikely that monomeric proteins of this fold exist. So all the members of this family are dimeric or consist of several dimeric units. However, how the folding of the polypeptide chain takes place must be quite complex because the dimer cannot be formed by the individually prefolded monomers. One possibility is that the monomers are able to exist in a partly unfolded state in solution before dimerization induces the final folding of individual monomers into dimers (Ricagno et al. 2003).

2.9.3.2 Structural comparison among the members of family III CoA-transferases

The structures of three enzymes belonging to the family III CoA-transferase are known, namely formyl-CoA transferase (FRC) from O. formigenes (Ricagno et al. 2003, Jonsson

Page 54: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

52

et al. 2004), carnitine CoA transferase (CaiB) (Stenmark et al. 2004, Rangarajan et al. 2005) and the E. coli YfdW protein (Gruez et al. 2003, Gogos et al. 2004). FRC has good sequence similarity (57 %) with YfdW protein from E. coli, but shares very low sequence identity (22 %) with CaiB (Rangarajan et al. 2005). Structural comparison between these three proteins has been done by Rangarajan et al (Rangarajan et al. 2005). Structures of the FRC and the YfdW protein are quite similar and they superimpose very well. Although FRC and CaiB do not share high sequence identity, they have similar structure. Structural comparison of CaiB and FRC shows that there are two extra loops (L1 and L2) present in FRC (Fig. 26). The short loop (L1) is in the large domain and in FRC (Trp48-Leu58) this loop is 8 residues longer than the corresponding loop in CaiB (Gln55-Asn57). The second one is a longer loop (L2) near the first linker region; in FRC the region (Glu219-Val251) is 29 residues longer than the corresponding loop (Gly219-Cys222) in CaiB. The absence of this longer loop in CaiB has caused the removal of the glycine-rich loop that points directly into the active site of FRC (Ricagno et al. 2003), which creates a large pocket in the active site region of CaiB. Structural comparison of the dimers of CaiB and FRC also shows that the large domain of these two proteins superimposes much better than the small domain.

Fig. 26. Superposition of the monomers of FRC (blue), CaiB (Green) and YfdW protein (orange). Two extra loop regions (L1 and L2) in FRC are shown in thicker line and coloured in magenta. 87 Cα atoms from the large domain have been used for the superposition.

2.9.3.3 Active site cleft and mode of CoA binding in the active site

Several structural studies have been done to understand the mode of substrate binding in the active site of the enzymes belonging to the family III CoA-transferase. The structures of formyl-CoA transferase (FRC) from O. formigenes complexed with CoA (Ricagno et

Page 55: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

53

al. 2003), E. coli YfdW protein complexed with acetyl-CoA (Gruez et al. 2003) and E. coli carnitine CoA transferase (CaiB) complexed with CoA and carnitinyl-CoA (Rangarajan et al. 2005) have shown the mode of binding of the substrate in the enzyme active site cleft. The substrate recognized by the enzymes of this family III CoA-transferase can be divided into two parts, being the CoA part and the fatty acyl part. CoA is common to all substrates recognized by the enzymes. However, the fatty acyl parts of the substrates differ from enzyme to enzyme. As the CoA binding is common to all of the members of this family, it is not surprising that the large domain which is much more conserved is responsible for binding of CoA. Binding of CoA to a Rossmann fold has also been reported previously for other enzymes, e. g., succinyl-CoA synthetase (Wolodko et al. 1994) which binds the nucleotide part in the standard mode of binding. Interestingly the mode of binding of the CoA moiety to FRC is different from the classical mode of binding of nucleotides to this fold, for example, the pyrophosphate moiety of CoA is not interacting with the N-terminal end of the central α-helix of the Rossmann fold (Ricagno et al. 2003).

In the CoA bound FRC structure (Ricagno et al. 2003) the adenine part of CoA is bound in a cleft and buried from the solvent, while the ribose and the ribose phosphate are exposed to the solvent (Fig. 27). The pantetheine chain is buried in a cleft formed mainly by residues coming from the larger domain. The adenine moiety is positioned properly by the side chain of Arg38 and on the other side by interactions with Met74 and Met105. The phosphoribose is anchored to FRC via four hydrogen bonding interactions: one between O4′ and the side chain of Arg38, another two because of O2′ and one of the phosphate oxygen atoms interacting with the main chain carbonyl oxygen atom of Ala101 via a water molecule and the fourth one is by the hydrogen bonding of a phosphate oxygen atom to the side chain of Arg104. The basic side chains of Lys137, Arg38 and His15 interact with the pyrophosphate moiety. The two methyl groups of the pantetheine chain make hydrophobic interactions with Val16, Met44 and Tyr139. The hydroxyl group of the pantetheine chain froms hydrogen bonds to the side chain of His15 and to the main chain carbonyl oxygen of Asn96. One of the peptide NH groups on the pantetheine chain has interaction with this carbonyl oxygen. The other amino group forms a hydrogen bond to the carbonyl oxygen of Val38. The thiol group (SH) of CoA is pointing towards the N-terminus of α-helix 1. Similar binding mode of CoA has been observed in E. coli YfdW protein (Gruez et al. 2003) and in CaiB (Stenmark et al. 2004, Rangarajan et al. 2005).

As the fatty acyl part of the substrates are different for different enzymes, the architecture of active sites to bind that part is also organized in a different way. In case of FRC there is an interesting glycine rich loop (Gly258-Gly261) present at the end of the active site cleft where SH group of CoA is bound. In the apo enzyme the loop is in open conformation and in the liganded structure the loop is in closed conformation (Ricagno et al. 2003). Opening of this loop in the active site of FRC forms an entry gateway which allows the entry of oxalate in the active site. Closing of this glycine rich loop might be involved in protecting the activated CoAS- anion and/or the enzyme-anhydride intermediate during the reaction (Jonsson et al. 2004). Similar glycine rich loop (Gly246-Gly249) is also present in E. coli YfdW protein and the loop is also in open conformation in the apo structure and in the closed conformation in the ligand bound structure (Gruez et al. 2003). In CaiB there is no glycine rich loop, however a loop consisting of residues Asp230-Cys236, is found at an equivalent position and is more closely packed against the

Page 56: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

54

C-terminal domain, resulting in a more open active site cleft in CaiB than in FRC and E. coli YfdW protein. The loop is always in the open conformation in apo and liganded forms of CaiB (Stenmark et al. 2004, Rangarajan et al. 2005). The opening of the loop creates a large pocket in the active site region of CaiB. A larger binding pocket in CaiB is required, because carnitine is twice the size of oxalate. It has been reported that this loop is also missing in a number of other enzymes in this family, specifically those that bind larger substrates than FRC (Stenmark et al. 2004).

Fig. 27. Active site cleft of FRC showing the mode of CoA binding. Two domains are shown in green and orange colour. The active site residues are shown as stick model. Two important water molecules involved in CoA binding are shown as sphere. The CoA molecule is shown as ball-and-stick model.

2.9.3.4 Ligand binding induces domain movement in CaiB

The crystal structures of E. coli CaiB in unliganded form and complexed with CoA, and carnitinyl-CoA have been determined (Stenmark et al. 2004, Rangarajan et al. 2005). It has been observed that when CoA or carnitinyl-CoA is bound in the active site, the small domain moves closer to the large domain, which leads to a more closed active site. The domain movement closes the active site in CaiB. This domain movement is observed only in CaiB. Binding of CoA or CoA derivative to FRC and E. coli YfdW protein does not induce any movement of the smaller domain. However, closing of the glycine rich loop has been observed in FRC and YfdW protein. It has been proposed that a closed active site might be necessary to protect or stabilize the activated CoAS- anion and/or the enzyme-anhydride intermediate during the reaction (Jonsson et al. 2004). It might be that, the flexible glycine rich loop in FRC or E. coli YfdW protein and the domain movement in CaiB, are two different modes of obtaining this shielding effect.

Page 57: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

55

2.9.4 Amacr in cancer

It has been found that Amacr is overexpressed in prostate cancer (Xu et al. 2000, Dhanasekaran et al. 2001, Luo et al. 2001, Welsh et al. 2001). This has been verified with immunohistochemistry and the overexpression of Amacr was introduced as a new marker for prostate cancer (Jiang et al. 2001, Rubin et al. 2002). Nine fold increase in the expression level of Amacr in prostate carcinoma compared to that in benign prostate tissue has been shown by quantitative reverse transcriptase polymerase chain reaction (Luo et al. 2002).

Overexpression of Amacr as high as in prostate cancer was found also in colorectal cancer by microarray screening (Zhou et al. 2002). Various other cancers e.g., breast, ovarian, bladder, lung and renal carcinomas showed overexpression of Amacr using the same screening method. Increased Amacr mRNA levels have been proposed as useful in differential diagnosis of renal carcinomas as they are elevated in papillary renal carcinomas, but not in other subtypes (Tretiakova et al. 2004).

Page 58: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

3 Aims of the present study

α-Methylacyl-CoA racemase (Amacr) catalyzes the racemization of α-methyl-branched CoA esters. Several mutational and biochemical studies have been done on this enzyme to understand its properties. However, there was no structural information available about this enzyme. The study was initiated to understand the structure-function relationship of this enzyme. Because of difficulties in the purification procedure of the mammalian enzyme, the structural studies were done with bacterial enzyme. Amacr from Mycobacterium tuberculosis (MCR) was taken as a representative of the racemases. MCR has a high sequence identity with mouse, rat and human Amacr of 41%, 44% and 43%, respectively. So MCR serves as a good model for studying the structural properties of Amacr. The main aims of this study were:

1. To purify MCR overexpressed in E. coli 2. To determine the crystal structure of MCR 3. To determine the crystal structure of MCR complexed with different ligands to

understand the structure-function relationship of this enzyme.

During the optimization procedure of the crystallization condition of MCR, a very minor impurity of argininosuccinate lyase from E. coli (eASL) was crystallized. There was no structural information about this bacterial enzyme and the reaction mechanism of this enzyme was quite unclear because of lack of the availability of a competent liganded structure of the active enzyme. As eASL was crystallized, that initited the motivation to determine the crystal structure of the enzyme.

Initially, our crystallographic studies on MCR concerned crystals grown in the absence of active site ligands. Subsequently these crystals were used in extensive binding studies. Also crystallization studies of active site mutated variants were done in particular concerning the H126A MCR mutant. Several structures of the H126A MCR mutant were also determined.

Page 59: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

4 Materials and methods

The materials and methods have been described in more detail in the original articles referred to by their Roman numerals (I-III).

4.1 Recombinant protein expression and purification (I-III)

Wild type and all mutants of α-methylacyl-CoA racemase from M. tuberculosis (MCR) have been over expressed in E. coli BL21(DE3)plysS cells following the supplier’s instructions (Novagen). Initially the protein purification was performed using three purification steps involving 60 % ammonium sulfate precipitation, diethylaminoethyl (DEAE) anion exchange chromatography and hydrophobic interaction chromatography (octyl sepharose) (Bhaumik et al. 2003). The purity of the protein samples were analyzed using sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) (Laemmli 1970) and Coomassie Blue staining (Ausubel et al. 1989). The purified MCR sample is not completely pure after these three steps of purification. Presence of very minor quantity of impurity of argininosuccinate lyase from E. coli (eASL) has been confirmed after the structure determination trial from the crystals produced from the purified sample. The MCR sample was further purified using gel filtration chromatography. Purity of the MCR sample was further analyzed using SDS-PAGE and confirmed by crystallization experiment and structure determination.

4.2 Enzyme assay

The purified protein (1mg ml-1) is stored at 277 K in 10 mM sodium phosphate, pH 8.8, containing 1 mM reduced dithiothreitol, 1mM ethylenediaminetetraacetic acid and 1 mM NaN3. Enzyme assay was performed to measure the tritium release activity of the purified enzyme. Formation of [3H]H2O from [2-3H]pristanoyl-CoA was measured as described previously (Schmitz et al. 1994). Shortly, the enzyme source was incubated with the labelled substrate in 50 μl 50 mM Tris/HCl pH 8.0, at 310 K, for 30 min. The reaction was stopped with 450 μl 1 % trichloroacetic acid and the [3H]H2O was separated from

Page 60: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

58

unreacted substrate on reverse-phase silica gel (RP-18) and quantified by liquid scintillation counting.

4.3 Dynamic light scattering experiments

Dynamic light scattering experiments were performed to check the monodispersity of the purified protein sample. These measurements were done with the DynaPro instrument from Protein Solutions. The concentration of the protein sample used for the measurement was 3 mg ml-1. The temperature was maintained at 277 K during the experiment (Bhaumik et al. 2003).

4.4 Crystallization of eASL (I)

The E. coli argininosuccinate lyase (eASL) has been crystallized from a highly concentrated sample of purified recombinant α-methylacyl-CoA racemase in which it occurred as a minor impurity. The purified sample of MCR after octyl sepharose column was used for these crystallization experiments. The crystallization screen was performed at 295 K by the hanging-drop vapor diffusion method (Zeelen et al. 1994). In all experiments the crystallization drops, containing 1 μl protein solution and 1 μl reservoir solution, were equilibrated against 1 ml reservoir solution. The best crystals grow in 1.26 M ammonium phosphate, pH = 7.0 using a protein concentration of 24 mg ml-1. These crystals grow to their maximum size within three weeks (Bhaumik et al. 2003). These crystals were used for the structure determination trial for MCR. Because of weak diffraction due to the small size of these crystals, studies aimed at finding suitable growth conditions for larger crystals were carried out. The mother liquor and crystals in the crystallization drops were analyzed by SDS-PAGE. From this SDS-PAGE it was found that the bipyramidal crystals were not from ΜCR but from a very minor contaminant while the intact racemase was still present in the mother liquor. A subsequent analysis of the dissolved crystalline protein by mass spectrometry showed it to be eASL.

4.5 Data collection, processing, structure determination, model building and refinement of eASL (I)

Diffraction data from eASL crystals were collected from a cryo-cooled crystal at 100 K using a solution of 1.26 M ammonium phosphate (pH=7.0) containing also 30 % glycerol as cryoprotectant. A data set was collected by the oscillation method with 0.5o rotations per frame at wavelength 1.134 Å at the beamline I711, Max-lab, Lund, Sweden. The dataset was processed using program packages DENZO (Otwinowski & Minor 1997), XDS (Kabsch 1993) and CCP4 (Collaborative Computational Project, Number 4 1994).

The structure of eASL was solved by molecular replacement using the program CNS (Brunger et al. 1998) and the A-subunit of the mutant of duck δ2 crystallin (PDB ID code

Page 61: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

59

is 1K7W) (Sampaleanu et al. 2002) as a search model. The best molecular replacement solutions were obtained using space group P6122 (cell dimensions: a = b = 122.0, c = 256.4). The orientations and positions of two subunits were found. After rigid body refinement, simulated annealing refinement was done with CNS, followed by a REFMAC5 refinement (Murshudov et al. 1997) and a density modification cycle with the program DM (Cowtan & Zhang 1999). Finally an initial model was built using program RESOLVE (Terwilliger 2000). After another round of REFMAC5 refinement, a second automated model building step was done using program MAID (Levitt 2001). From this model (70 % complete) the complete model was built manually by visual inspection of the electron density map using the graphics package O (Jones et al. 1991). The final model was refined using REFMAC5. PROCHECK (Laskowski et al. 1993) was used to analyze the stereochemistry of the model. The data collection and refinement statistics have been presented in Table 1 of the original article I.

4.6 Crystallization of wild type MCR (II-III)

Crystallization trials using purified and highly concentrated MCR sample after an octyl-sepharose column produced the crystals of eASL. It has been observed that if the crystallization drops containing the bipyramidal crystals of eASL are left for a long time (several months), very small plate like crystal appeared. These crystals were crystals of MCR. But these crystals were very small for a diffraction experiment. This observation has driven us to purify the MCR sample further to get better MCR crystals.

To get very pure MCR, the protein sample after octyl sepharose column was further purified using gel filtration chromatography. Then the pure sample of MCR was used for crystallization. First, a crystallization screen was performed at 295 K by the hanging-drop vapor diffusion method using different concentrations of ammonium phosphate (pH=7.0). The crystallization screen produced very small plate like crystals of MCR. But bipyramidal crystals of eASL did not appear anymore in the drop. Then the condition was further optimized by adding barium chloride (BaCl2) to the crystallization drop. Well diffracting crystals of the wild type MCR were grown at 295 K by the hanging-drop, vapor-diffusion method (reservoir 1.52 M ammonium phosphate and 10 mM BaCl2 at pH 7.0) in drops of 2 μl volume containing 1:1 (v/v) mixture of reservoir and protein solution (6 mg ml-1 in 10 mM potassium phosphate buffer, pH = 8.8). The crystal size was further improved using different volumes of crystallization drops. Best quality crystals grow in the drops of 6 μl volume containing 1:1 (v/v) mixture of reservoir and protein solution.

4.7 Crystallization of H126A MCR mutant

Purified H126A MCR mutant protein sample was dialyzed into 10 mM Tris buffer pH 8.5, containing 1 mM reduced dithiothreitol, 1mM ethylenediaminetetraacetic acid and 1 mM NaN3. Then the sample was concentrated to 6 mg ml-1 by ultrafiltration. The concentrated sample was used for crystallization. Well diffracting crystals of the H126A

Page 62: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

60

MCR mutant were grown at 295 K by the hanging-drop, vapor-diffusion method. Two different types of crystals of H126A MCR mutant were grown.

One type of crystal (type I) was grown in presence of 1.5 mM of (S)-ibuprofenoyl-CoA. The protein sample was mixed with stock solution of (S)-ibuprofenoyl-CoA to give a final concentration of protein to be 6 mg ml-1. The mixture was used for crystallization. The crystallization was done by the hanging-drop, vapor-diffusion method. The best crystals were produced in drops of 2 μl volume containing 1:1 (v/v) mixture of reservoir (10.0 % PEG8K, 10.0 % PEG1K, 0.08 M calcium acetate, 0.10 M Tris-HCl, pH=7.0) and protein solution. Another type of crystal (type II) was grown without any ligand in drops of 2 μl volume containing 1:1 (v/v) mixture of reservoir (10.0 % PEG8K, 10.0 % PEG1K, 0.20 M calcium acetate, 0.10 M Tris-HCl, pH=7.0) and protein solution (6 mg ml-1). The crystallization was also done by the hanging-drop, vapor-diffusion method using a protein solution of concentration 6 mg ml-1. As discussed later (section 5.6), H126A MCR mutant is unliganded in the type I crystals and the structure of the type I H126A MCR mutant is essentially identical to the structure of H126A MCR mutant from the type II crystals (grown in the absence of ligand).

4.8 Data collection from native and heavy atom soaked wild type MCR crystals (II)

Diffraction data from a native MCR crystal was collected from a cryo-cooled crystal at 100 K using a cryosolution of 1.52 M ammonium phosphate (pH=7.0) containing also 24 % glycerol as cryoprotectant. A data set was collected by the oscillation method with 0.5o rotations per frame using synchrotron radiation source at wavelength 0.8122 Å at the beamline X11, EMBL-Hamburg, DESY.

The crystals complexed with heavy atoms were required for the determination of the crystal structure of MCR, as there was no protein structure available which could be used for molecular replacement to solve the structure. The crystals of MCR were derivatised for solving the structure by multiple isomorphous replacement (MIR) by soaking the crystals in heavy atom compound solutions. The heavy atom derivatives of methyl mercuric acetate (MMA, C3H6HgO2) and monosodium salt of p-chloromercuriphenyl sulfonic acid (PCMPS, C6H4ClHgO3SNa) were obtained by soaking native protein crystals in a solution of heavy atom compounds in mother liquor at a concentration of 1.0 mM for 30-40 minutes. The datasets were also collected under cryogenic conditions using a synchrotron radiation source (EMBL-Hamburg, DESY). For the data collection of the heavy atom derivatised crystals, the crystals were frozen using the same cryoprotectant solution, containing also 1.0 mM heavy atom derivative. The data collections were performed by the oscillation method using a CCD detector.

All the datasets were processed using program packages DENZO (Otwinowski & Minor 1997), XDS (Kabsch 1993) and CCP4 (Collaborative Computational Project, Number 4 1994). The crystals belong to space group C2 and the asymmetric unit contains four subunits (two dimers). The data collection statistics are presented in Table I of the original article II.

Page 63: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

61

4.9 Structure determination, model building and refinement of unliganded wild type MCR (II)

The initial phases were obtained with the MIRAS method. The program SOLVE (Terwilliger & Berendzen 1999) was used for finding and refining the heavy atom positions. The mean figure of merit of the refined phases was 0.50 at 2.5 Å resolution. Extensive fragments of each dimer were built in this map by the automatic model building procedures of RESOLVE. By superimposing common fragments of the four different subunits on each other the matrices relating the four subunits were obtained and four-fold averaging plus phase extension till 1.8 Å was carried out with DM. Then the improved electron density map at 1.8 Å resolution was used for automated model building with RESOLVE. The program could build about 60 % of the side chains of all four subunits. Then iterative cycles of model refinement with REFMAC5 and model building in electron density maps using O were carried out. PROCHECK was used to monitor the stereochemistry of the model. The refinement statistics are presented in Table I of the original article II.

4.10 Soaking experiments and data collection of wild type MCR crystals using substrate analogues (III)

After solving the crystal structure of unliganded MCR, structure determination trials of MCR complexed with its substrate (ibuprofenoyl-CoA) were initiated. Several soaking experiments and co-crystallization experiments have been done. None of the co-crystallization trials of MCR with (S)- and (R, S)-ibuprofenoyl-CoA produced any crystal. Soaking experiments with these compounds ((S)-, (R, S)-ibuprofenoyl-CoA and (R)-, (S)-2-methylmyristoyl-CoA) using MCR crystals also failed, as crystals always cracked. This observation has given us the idea of doing soaking experiments with other substrate analogues of MCR.

Three different compounds, acetyl-CoA, acetoacetyl-CoA, and crotonyl-CoA have been used for this study. The crystals were soaked in 2 mM of solution (prepared in mother liquor) of the compound for one hour. After soaking, the crystals were briefly soaked (30 seconds) in cryosolution (mother liquor containing 24 % glycerol and 2 mM compound) and immediately flash-frozen in a cold nitrogen gas stream at 100 K just before data collection. The data collections were done using the oscillation method. The datasets from acetyl-CoA and crotonyl-CoA soaked crystals were collected using the home X-ray source. The datset from acetoacetyl-CoA soaked MCR crystal was collected using synchrotron radiation source at the beamline I711, Max-lab, Lund. All the datasets were processed using program packages XDS and CCP4. The crystals belong to space group C2 and asymmetric unit contains four subunits of the protein, just as the unliganded MCR crystals. The data collection statistics of acetyl-CoA and acetoacetyl-CoA are presented in Table 1 of the original article III. Data collection statistics from crotonyl-CoA soaked MCR crystal are presented in Table 4. The structure determination and

Page 64: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

62

refinement procedure of these substrate analogue bound MCR structures have been discussed later.

4.11 Data collection and structure determination of unliganded and 2-methylmalonyl-CoA bound H126A MCR mutant

For the data collection from the type I crystal of H126A MCR mutant, the crystal was transferred briefly to the cryoprotectant solution (35 % PEG1K and 1.5 mM (S)-ibuprofenoyl-CoA) and then flash cooled in a cold nitrogen gas stream at 100 K just before data collection. The dataset was collected by the oscillation method using the home X-ray source.

To solve the structure of the 2-methylmalonyl-CoA bound H126A MCR mutant, the type II crystals were used. 1 μl of 6 mM 2-methylmalonyl-CoA (prepared in mother liquor) was added to the 2 μl drop containing the type II crystals of the H126A MCR mutant. The crystal was soaked in the drop contating 2-methylmalonyl-CoA for nine hours. Then the crystal was flash frozen using cryoprotectant solution (35 % PEG1K and 2 mM 2-methylmalonyl-CoA). A dataset was collected by the oscillation method under cryogenic conditions using a synchrotron radiation source (EMBL-Hamburg, DESY).

Both datasets were processed using program packages XDS and CCP4. The crystals belong to space group P1 with similar cell dimensions and the asymmetric unit contains four subunits of the protein (two dimers). The data collection statistics are presented in Table 4. The structures were determined by molecular replacement using the program MOLREP (Vagin & Teplyakov 1977) and a dimer of the wild type MCR (PDB access code is 1X74) as a search model. In this way the orientation and position of the two dimers were found. Then the rigid body refinement of the tetrameric model was performed using the program REFMAC5. The initial model contained the side chain of His126, and visual inspection of the electron density showed negative density at the position of the imidazole ring of this histidine side chain. The histidine was mutated to alanine using the program O and then subsequently the mutated structures were refined.

After two cycles of refinement, the careful inspection of the unbiased (Fo - Fc)αc electron density showed that the structure determined form type I crystal of H126A MCR mutant does not contain ibuprofenoyl-CoA although the crystallization was performed in presence of 1.5mM (S)-ibuprofenoyl-CoA; also this ligand was included in the cryoprotectant. In case of the structure determined using the type II crystal of H126A MCR mutant soaked with 2-methylmalonyl-CoA, strong σa-weighted (Fo - Fc)αc electron density allowed to model 2-methylmalonyl-CoA in the four active sites. The topology, parameter, and energy minimization files for 2-methylmalonyl-CoA was generated by LIBCHECK (Collaborative Computational Project, Number 4 1994). Water molecules with proper hydrogen-bonding coordination in electron densities higher than 3.0σ on (Fo - Fc)αc σa-weighted map were progressively introduced while monitoring the decrease in Rfree. Then iterative refinement cycles of the final model with REFMAC5 were done. PROCHECK was used to monitor the stereochemistry of the model. The refinement statistics are presented in Table 4.

Page 65: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

63

4.12 Glutaraldehyde cross linking of MCR crystals to prevent cracking during the soaking experiments (III)

To understand the reaction mechanism of MCR, it is important to solve the structure of the enzyme with real substrate. Several trials have been done to do successful soaking experiments with (R, S)-ibuprofenoyl-CoA, (R)- and (S)-2-methylmyristoyl-CoA using MCR crystals. The crystals always crack as soon as they get exposed to the soaking solution containing these substrates.

Finally successful soaking experiments were done using glutaraldehyde cross linked MCR crystals. In this procedure glutaraldehyde has been added directly to the drop containing the protein crystal. The process is very sensitive and lack of attention during the experiment may cause serious damage to the crystal because of too extensive cross linking. The crystal drop containing the protein crystal needs to be examined carefully. The crystal should be moved to one corner of the crystallization drop. Then 1 μl of 25 % glutaraldehyde solution was added to the other corner of the 6 μl crystallization drop containing the protein crystals. The drop was mixed slowly using the crystal mounting loop and the crystal in the drop was allowed to get cross linked for 30 seconds. Subsequently, the crystal was transferred to a fresh drop of mother liquor supplemented with the appropriate substrate.

4.13 Soaking experiments and data collection of the cross linked wild type crystals using (R, S)-ibuprofenoyl-CoA, (R)-, (S)-2-

methylmyristoyl-CoA (III)

The cross linked crystals were used for doing these soaking experiments. Three different crystals were soaked in 2 mM solution of the compound prepared in mother liquor. Different soaking times were used for the three different soaking experiments. For (R, S)-ibuprofenoyl-CoA, (R)- and (S)-2-methylmyristoyl-CoA the soaking time was two hours, twenty two hours and twenty four hours, respectively. After soaking, the crystals were briefly soaked (30 seconds) in cryosolution (mother liquor containing 24 % glycerol and 2 mM compound) and immediately flash-frozen in a cold nitrogen gas stream at 100 K just before data collection. The data collections were done using the oscillation method. The dataset from the (R, S)-ibuprofenoyl-CoA soaked MCR crystal was collected using a home X-ray source. The datasets from (R)- and (S)-2-methylmyristoyl-CoA soaked MCR crystals were collected using the synchrotron radiation X-ray source, at Max-Lab, Sweden (beamline I911). The crystals belong to space group C2 and the asymmetric unit contains four subunits like the unliganded crystals. All the datasets were processed using program packages XDS and CCP4. The data collection statistics are presented in Table 1 of the original article III.

Page 66: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

64

4.14 Structure determination, model building and refinement of the liganded wild type MCR (III)

Structures of MCR complexed with acetyl-CoA, crotonyl-CoA, acetoacetyl-CoA, (R, S)- ibuprofenoyl-CoA, (R)- and (S)- 2-methylmyristoyl-CoA have been determined. All the structures were determined by molecular replacement using the program MOLREP and a dimer of the wild type MCR as a search model. Then the rigid body refinement of the two dimers of the asymmetric unit was performed using the program REFMAC5. Strong σa-weighted (Fo - Fc)αc electron density allowed to model the ligands in the four active sites. The topology, parameter, and energy minimization files for the ligands were generated by LIBCHECK. Water molecules with proper hydrogen-bonding coordination in electron densities higher than 3.0σ in (Fo - Fc)αc σa-weighted maps were progressively introduced while monitoring the decrease in Rfree. Then iterative refinement cycles of the new model with REFMAC5 were done. PROCHECK was used to monitor the stereochemistry of the model. The refinement statistics of crotonyl-CoA bound MCR structure are presented in Table 4 and for the other structures the refinement statistics are presented in Table 1 of the original article III.

4.15 Structural comparison (I-III)

The structure of eASL was compared by superposition studies with the other known structures of argininosuccinate lyase. For the comparisons, the following PDB entries were used: inactive duck δ2 crystallin S283A mutant (liganded with argininosuccinate, 1K7W), wild type duck δ1 crystallin (two sulfate ions are bound in the two active sites, 1HY0).

The structure of wild type MCR, liganded MCR and H126A MCR mutant were compared by superposition. All superpositions were done with the lsq option in O. For the MCR superposition the Cα atoms of the core of large domain were used (residues 9-14, 19-30, 32-37, 54-59, 78-82, 107-112, 160-173, 181-186).

All images describing structural informations of the crystal structures were made using the program PYMOL (DeLano 2002) and DINO (Philippsen 2000).

Page 67: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

5 Results

5.1 Identification of the band of eASL on the SDS-PAGE (I)

MCR was purified using three different steps of purification protocols: (a) 60 % ammonium sulfate precipitation, (b) anion exchange chromatography using DEAE-sepharose column and (c) hydrophobic interaction chromatography using octyl-sepharose column. Fig. 28a shows the purity of the sample after octyl-sepharose column. The purified protein sample was concentrated and used for the crystallization. Small bipyramidal crystals were grown using this protein sample obtained from many different purification batches.

Fig. 28. (a) Purification of MCR using three steps purification. Lane 1: molecular mass marker, lane 2: protein sample after 60 % ammonium sulfate precipitation, lane 3: protein sample after DEAE-column purification and lane 4: protein sample after octyl sepharose column purification. (b) Analysis of bipyramidal crystals and the crystallization drop. Lane 1: molecular mass marker, lane 2: protein sample used for crystallization, lane 3: sample obtained after dissolving the bipyramidal crystals and lane 4: crystallization drop after removal of the bipyramidal crystals.

Initially it was thought that the small bipyramidal crystals are MCR crystals. Structure determination trials using these crystals were initiated. The weak diffraction quality

Page 68: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

66

(because of small size of these crystals) initiated the process to find out the way to produce the bigger crystals. We decided to analyze the crystals and the crystallization drop using SDS-PAGE to make sure that the protein is not getting degraded during the crystallization. This analysis has produced the surprising result (Fig. 28b) that the bipyramidal crystals do not contain MCR. The minor impurity present in the purified sample got crystallized. In Fig. 28b, lane 2 shows the protein which was used for crystallization experiment. It seems that the presence of eASL is very neglisible (<2%). Lane 2 shows the band of eASL. Lane 3 shows the crystallization drop after removal of the bipyramidal crystals. MCR is very nicely present in the crystallization drop and it is not degraded. This experiment also proved that MCR is a very stable enzyme in the solution. The impurity was identified as argininosuccinate lyase from E. coli using mass spectrometry.

5.2 Structure of eASL and it’s open and closed active site (I)

When the bipyramidal crystals were identified as the crystals of argininosuccinate lyase from E. coli (eASL), database searches for the structural information of this enzyme were done. The search results showed that the crystal structures of ASL from several species have been studied, in particular the duck δ1 and δ2 eye lens crystallins which are inactive and active homologues of ASL, respectively. Structural studies have also been done using the human enzyme. However, there was no structural information about the E. coli enzyme. We decided to solve the structure of this E. coli enzyme using our bipyramidal crystals.

The structure of eASL was solved using molecular replacement at 2.44 Å resolution. The structure was refined to R-factor and free R-factor of 16.6 % and 21.7 % respectively. In this crystal form, the asymmetric unit contains two subunits. The structure of one subunit of eASL consists of three domains like other enzymes of this β-elimination superfamily. Domain 1 is compact and composed of residues 3-105. Domain 2 consisting of residues 106-363 is elongated and is essentially a bundle of five long helices. Domain 3 is also compact and composed of residues 364-457. The active enzyme is a tetramer. The core of the tetramer is formed by the interactions between the helical bundles of domain 2 of each of the four subunits (Fig. 3a, original article I).

The most interesting part of the eASL structure is the active site of the enzyme. The tetramer contains four active sites. Each active site is composed of the regions from three different subunits. Each end of tetramer contains two active sites which are related by the NCS-2-fold axis. In this structure these two active sites are dissimilar. One active site is in an open conformation. The other active site is in a closed conformation and contains two phosphate ions (phos-1 and phos-2). The three highly conserved peptide regions C1, C2 and C3 are present in the loops near the active site. The active site complexed with the phosphate ions is formed by the C1-residues 106-116 (just after domain 1 of subunit A), C2-residues 153-163 (protruding out of domain 2 of subunit C) and the C3-residues 273-291 (protruding out of domain 2 of subunit B). Among these loops, the most interesting is the C3 loop with two different conformations (Fig. 29).

Page 69: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

67

Fig. 29. Open and closed conformations of the eASL C3 loop. Subunit A and B are shown in cyan and green colour respectively. Two bound phosphate ions in the closed active site are also shown as ball-and-stick model.

Careful examination of the closed active site (Fig. 30) shows that the phos-1 is near to the C2-region and the phos-2 is near to the C1-region. Structural comparison shows that the phos-1 has occupied the position of the sulfate ion in the structure of the inactive duck δ1 crystallin. Two phosphate ions have very good hydrogen bonding interactions with the protein side chains in the active site. For example, phos-1 is hydrogen bonded to NZ(Lys283) and ND2(Asn285), phos-2 is hydrogen bonded to NH2(Arg109), N(Ser278) and N(Arg109).

Fig. 30. Zoomed in view of the eASL active site. The three different loops are shown in different colours. C1 in orange, C2 in green and C3 in cyan. The bound phosphate ions (P1 and P2) are also shown as ball-and-stick model.

Page 70: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

68

5.3 Crystallization of MCR and H126A MCR mutant and other variants (II-III)

It was observed, when the crystallization drops containing bipyramidal crystals of eASL were left for long time (several months) some plate like crystals appeared (Fig. 31a). These crystals were tested by SDS-PAGE and proved to be MCR crystals. This observation showed the crystallizabiliy of MCR in presence of ammonium phosphate. To remove the very minor quantity of eASL, the protein sample after octyl sepharose column was further purified using gel filtration chromatography. Then crystallization screen was setup with more pure MCR using different concentrations of ammonium phosphate. The crystallization screen produced very small plate like MCR crystals (Fig. 31b) which were inadequate for the diffraction experiment. Then optimization using additive screen (Hampton Research) showed that addition of BaCl2 to the crystallization drop improved further the crystal quality and size. Then appropriate concentration of BaCl2 was optimized by another screen. Good quality of MCR crystals were grown using 1.52 M ammonium phosphate (pH=7.0) and 10 mM BaCl2. Further optimization of the drop volume produced very good quality crystals with bigger size which were used for all structural studies (Fig. 31c).

Fig. 31. The crystals of MCR. (a) Old crystallization drop containing bipyramidal crystals of eASL and very small plate like crystals of MCR. (b) Crystallization screen using ammonium phosphate produced very small plate like crystals of wild type MCR. (c) Good quality crystals (1.2 × 0.2 × 0.1 mm) of wild type MCR used for structural studies. (d) H126A MCR mutant type II crystal (0.2 × 0.2 × 0.2 mm).

H126A MCR mutant crystals (type I and type II) have been crystallized almost at the same crystallization condition (see section 4.7) with and without ibuprofenoyl-CoA respectively. The crystals grown form these two different conditions have same morphology and they belong to same space group with similar cell dimensions. Type II crystal is shown in Fig. 31d. Many crystallization attempts with the E241A and D156A MCR mutant were also carried out; however they did not produce any successful result.

High quality crystals of wild type MCR and H126A MCR mutant are only obtained in presence of at least 10 mM BaCl2 and Ca(Ac)2, respectively. Interestingly, in none of the structures a specific Ba2+ or Ca2+ sites have been identified. Possibly, these ions prevent undesirable interactions between MCR molecules, thereby favouring those interactions that result in the formation of high quality crystals.

Page 71: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

69

5.4 Structure of MCR (II)

The structure of α-methylacyl-CoA racemase from M. tuberculosis (MCR) was solved by the MIRAS procedure using two mercury compounds, methyl mercuric acetate (MMA, C3H6HgO2) and the monosodium salt of p-chloromercuriphenyl sulphonic acid (PCMPS, C6H4ClHgO3SNa). The final model of MCR was refined at 1.8 Å resolution to a R-factor and free R-factor of 16.0 % and 19.3 %, respectively. In the crystal form the asymmetric unit contains two dimers of MCR.

The overall fold of MCR is similar to that of FRC from O. formigenes (Ricagno et al. 2003) and the E. coli YfdW protein (Gruez et al. 2003, Gogos et al. 2004). The monomer is composed of two domains, the large domain and the small domain, two linker regions (α8 and α12-β11, respectively) which connect the two domains and the C-terminal region (α14) (Fig. 4A, original article II). The large domain has a Rossmann-like fold and consists of the residues from the N-terminal region (residues 2-187) as well as the residues from the C-terminal region (residues 336-360). The small domain consists of residues 224-300, β7-β9. One interesting feature of the structure of one subunit of MCR is the big hole inside the protein molecule. Two molecules are passing through each other through this hole to form a very tight interlocked dimer (Fig. 6A, original article II). In this interlocked dimer two subunits are related by a local two fold axis. Different views of the interlocked dimer are shown in Fig. 32.

The active site of MCR has been identified by structural and mutagenesis studies. The active site is present in a cleft formed by the large domain of one subunit and the small domain of the other subunit. Structural comparison and mutagenesis studies showed that the important residues involved in the catalysis might be His126, Asp127, Asp156 and Glu 241. Among these residues the first three are from one subunit and Glu241 is from the small domain of the other subunit. His126 and Asp127 are present at the beginning of α5. Asp156 is protruding out of α7 near the N-terminal end of this helix. The central helix α7 is a unique feature of the MCR structure. This helix starts as a π-helix and continues as an α-helix. Formation of the π-helix is important to position Asp156 properly for the catalysis. Fig. 33 shows the active site of unliganded MCR. The active site of unliganded MCR contains one glycerol molecule which has very good hydrogen bonding interaction with ND1(His126), N(Asp127) and OD1, OD2(Asp156). Structural comparison with acetyl-CoA bound E. coli YfdW protein showed that the thioester oxygen of acetyl-CoA is pointing towards the main chain NH group of Asp127, which suggested that NH(Asp127) might be involved in formation of oxyanion hole to stabilize the oxyanion produced during the catalytic reaction. Another interesting feature in the MCR active site is the presence of a methionine rich binding pocket. The pocket consists of six methionine residues. Met50, from one subunit and Met198, Met199, Met202, Met207, Met216 from the other subunit. These methionine residues might be involved in binding of the hydrophobic fatty acyl part of the substrate during the catalysis.

Page 72: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

70

Fig. 32. The dimer of MCR is shown in three different orientations. Different domains of two subunits are painted in different colours. For subunit-1, the large domain is coloured as green, the small domain as orange, the protruding helix and the first linker region as pink and the C-terminal region as blue. For subunit-2, the large domain is coloured as cyan, the small domain as yellow, the protruding helix and the first linker region as red and the C-terminal region as grey. (a) The dimer is viewed from a direction perpendicular to the two-fold axis. (b) The dimer in (a) has been rotated 90o along the two-fold axis. (c) The dimer is viewed from the top along the two-fold axis.

Page 73: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

71

Fig. 33. The active site of wild type MCR. Three important active site residues are shown as a ball-and-stick model. The bound glycerol molecule (used as a cryoprotectant) in the enzyme active site is also shown as a stick model. The refined (2Fo - Fc)αc electron density map is shown around the glycerol molecule. The hydrogen bonding interactions of the terminal hydroxyl group of glycerol are shown by thin lines.

5.5 Structures of MCR complexed with acetyl-CoA, crotonyl-CoA and acetoacetyl-CoA (III)

After solving the crystal structure of wild type MCR, crystallographic binding studies were initiated to solve the liganded structure of MCR. The crystal structures of MCR complexed with acetyl-CoA, crotonyl-CoA and acetoacetyl-CoA were determined to understand the mode of substrate binding in the enzyme active site. Interestingly, soaking experiments with different concentrations (highest, 6 mM) of CoA never resulted in any binding.

First, the crystal structure of acetyl-CoA bound MCR was determined and the structure was refined at 2.30 Å resolution with R-factor of 22.0 % and free R-factor of 27.1 %. The σa-weighted (Fo - Fc)αc electron density was good in three active sites (A, B and D) to model the compound in these three active sites. In the C active site the σa-weighted (Fo - Fc)αc electron density was good for the CoA part of the ligand, but the acetyl part was disordered because of presence of a glycerol molecule in that active site. In A, B and D active site the thioester oxygen of acetyl-CoA is hydrogen bonded to the main chain NH group of Asp127 via a water molecule. This structure gave a good idea of binding of the CoA part of the substrate in the active site, however, it was quite difficult to understand the possible catalytic role of His126 and Asp156 from this structure.

Subsequently, the crystal structure of crotonyl-CoA bound MCR was determined and the structure was refined at 2.20 Å resolution with R-factor of 23.6 % and free R-factor of 28.2 %. In all four active sites, the σa-weighted (Fo - Fc)αc electron density was good for the CoA part of the ligand but was quite fragmented for the crotonyl part of the ligand.

Page 74: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

72

However, the structure was refined with the model of crotonyl-CoA in all four active sites. Careful analysis of all the active sites showed that the Cα atom of the ligand is not positioned properly to assign the correct role of His126 and Asp156 in the catalytic mechanism. Apparently crotonyl-CoA is not a tight binding transition state analogue.

Then the crystal structure of acetoacetyl-CoA bound MCR was determined and the structure was refined at 1.70 Å resolution with R-factor of 20.8 % and free R-factor of 25.0 %. The σa-weighted (Fo - Fc)αc electron density was good in A and D active sites to model the compound in these two active sites. In B and C active sites the σa-weighted (Fo - Fc)αc electron density was good for the CoA part but the acetoacetyl part was disorderdered because of presence of glycerol in these active sites. The final structure was refined with acetoacetyl-CoA in the A and D active sites and with CoA and glycerol in the B and C active sites. In active sites A and D, the thioester oxygen is hydrogen bonded to N(Asp127) and this mode of binding of acetoacetyl-CoA gave the first insight about the position of the Cα atom of the substrate during the catalysis. This structure also gave some idea about the orientation of the fatty acyl chain of the substrate in the enzyme active site. The position of the Cα atom of acetoacetyl-CoA clearly showed that Asp156 and His126 form the catalytic acid/base pair involved in the catalysis, however, the precise role of these two residues was still unclear because of the absence of the 2-methyl group in the ligand.

5.6 Structure of unliganded H126A MCR mutant

Diffraction data from type I crystals of H126A MCR mutant were used to determine this structure. The crystals were grown in presence of 1.5 mM (S)-ibuprofenoyl-CoA. Calculation of σa-weighted (Fo - Fc)αc omit density map did not show the density for the ligand in any of the four active sites. The structure was refined at 1.95 Å resolution with R-factor of 17.3 % and free R-factor of 23.5 %.

The overall structure of the H126A MCR mutant is similar as the structure of the unliganded wild type MCR described before. It has also two domains, the large domain and the small domain and the domains are linked by two linker regions. The striking feature of the unliganded H126A MCR mutant structure is the movement of the small domain towards the large domain (Fig. 40). The dimeric structure of unliganded H126A MCR mutant has been superimposed on the dimer of apo wild type MCR structure. Residues from α-helices and β-sheets in the large domain have been used for structural superposition. 122 Cα atoms have been used for superposition and have an r. m. s. fit of 0.2 Å.

This structure shows very nice arrangement of water molecules in the active site (Fig. 34). In case of all other liganded and unliganded structures of wild type MCR the active site water molecules are not visible properly because of presence of either glycerol (used as the cryoprotectant) or ligand in the active site. In this crystal form, the crystals were obtained in presence of polyethylene glycol and that was also used as a cryprotectant. So there was no possibility of perturbation of the hydrogen bonding network between the water molecules and the active site residues. Asp156 is very nicely hydrogen bonded to W1, W2 and W3. OD1(Asp156) is hydrogen bonded to NH(Asp127) via W3.

Page 75: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

73

OD2(Asp156) is interacting with OH(Tyr130) via W1 and W4. There are also several other water molecules which are also involved in good hydrogen bonding network. His126 has been mutated to alanine, W5 has taken the position of the side chain of histidine in the wild type structure. Because of absence of histidine, Glu241 is pointing outwards and it is hydrogen bonded to side chain –NH2 group of Gln243.

Fig. 34. Stereo picture showing the zoomed in view of the active site of the unliganded H126A MCR mutant structure. Two subunits are shown in orange and green colour. The important active site residues are shown as stick model. The water molecules are shown as sphere.

5.7 Structure of H126A MCR mutant complexed with (S)-2-methylmalonyl-CoA

The crystal structure of 2-methylmalonyl-CoA bound H126A MCR mutant has been determined and refined at 1.7 Å resolution with good refinement statistics. In this crystal form there are also two dimers per asymmetric unit. The soaking experiment was done with a racemic mixture of (R, S)-2-methylmalonyl-CoA. However, after solving the structure the σa-weighted (Fo - Fc)αc density in all four active sites showed that the (S)-enantiomer is present in the active sites (Fig. 35).

The overall structural fold of H126A MCR mutant complexed with (S)-2-methylmalonyl-CoA is similar as described for unliganded H126A MCR mutant. The structural comparison between liganded and unliganded H126A MCR mutant shows that only the small domain is slightly shifted (Fig. 40). For the superposition the 122 Cα atoms of the large domains of the dimer were used; the r. m. s. fit for these 122 Cα atoms is 0.3 Å. Close inspection of the active site of liganded H126A MCR mutant structure shows that except the Glu241, all the other active site residues have almost the same conformation as observed in the case of apo H126A MCR mutant structure. Glu241 does not have any hydrogen bonding with Gln243 as observed in the apo H126A MCR mutant structure. Glu241 is pointing away from Gln243. Careful examination of the active site shows the mode of binding of (S)-2-methylmalonyl-CoA in the active site of the H126A MCR mutant. The mode of binding of the CoA part of the substrate is very much like as expected on the basis of the structural comparison with the CoA bound FRC and the

Page 76: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

74

acetyl-CoA bound E. coli YfdW protein. The adenine part is wedged in a thin cleft and buried from solvent while the ribose, the ribose phosphate and the pyrophosphate are solvent exposed. The pantetheine chain is buried in a cleft formed mainly by the large domain. The small domain participates by closing off the cleft. The adenine moiety is positioned by a stacking interaction (Fig. 36) with the side chain of Arg38, and on the other side Leu61 and Leu92. The –NH2 group of the adenine moiety forms a hydrogen bond with the carbonyl oxygen of Ala59. The hydroxyl group of ribose is hydrogen bonded to the side chain of Lys62. The 3′-phosphate group is anchored to the protein through three hydrogen bonds: one between the phosphate oxygen O9 and the side chain of Lys62, another one between the phosphate oxygen O7 and the side chain of Arg91 and the third one between the hydrogen bonding of the phosphate oxygen O8 and side chain of Arg85 mediated by a water molecule. Two oxygen atoms of the pyrophosphate part of CoA are hydrogen bonded to the side chain of Arg85. The dimethyl group of the pantetheine moiety has hydrophobic contacts with side chains of Ile16 and Phe244 (of the small domain of the other subunit). The pantetheine hydroxyl group is hydrogen bonded to a neighbouring water molecule. The first peptide NH group of the pantetheine chain has hydrogen bonding interaction with carbonyl oxygen of Gly83. The last carbonyl group of the pantetheine chain is hydrogen bonded to a neighbouring water molecule. The thioester oxygen of the ligand is anchored by the hydrogen bonding interactions with main chain NH group of Asp127 and carboxylic acid group of Asp156. The α-methyl group is pointing towards the hydrophobic pocket formed at the interface between the large and the small domain. The α-methyl group has hydrophobic interactions with Leu217 and Ile240 of the other subunit.

Fig. 35. Initial difference Fourier electron density map at 2.5 σ for the (S)-2-methylmalonyl-CoA bound in the active site of H126A MCR mutant.

The α-proton is pointing towards Ala126. The carboxylic acid group is pointing towards a hydrophobic pocket formed by Met198, Met50 and Phe157. The mode of binding of (S)-2-methylmalonyl-CoA gives a good idea of the possible binding mode of (S)-2-methylacyl-CoA substrate in the enzyme active site of MCR. In the subsequent sections it

Page 77: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

75

is shown that the 2-methyl group of true substrates binds in the same pocket of wild type MCR.

Fig. 36. Stereo picture showing the zoomed in view of the active site of H126A MCR mutant complexed with (S)-2-methylmalonyl-CoA. Two subunits are coloured as orange and green. The important residues in the active site involved in catalysis and substrate binding are shown as stick model. The ligand is shown as ball-and-stick model and the important water molecules involved in binding the CoA part of the ligand are shown as spheres.

Page 78: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

76

Table 4. Data collection and refinement statistics.

H126A MCR mutant (unliganded)

H126A MCR mutant complexed with 2-methylmalonyl-CoA

MCR complexed with crotonyl-CoA

A. Data collection statisticsa

Space group P1 P1 C2

Unit cell parameters a, b, c (Å) α, β, γ (deg.)

62.4, 68.9, 85.1 90.0, 106.8, 95.2

62.1, 66.9, 86.7 90.0, 72.58, 84.94

181.2, 80.0, 118.6 β = 90.1

Temperature (K) 100 100 100

Wavelength (Å) 1.54179 0.8048 1.54719

Resolution (Å) 35.0-1.95 (2.50-1.95) 35.0-1.70 (2.50-1.70) 35.0-2.20 (2.25-2.20)

Rmerge (%)b 2.4 (6.1) 6.0 (15.6) 7.3 (27.5)

Completeness (%) 93.5 (92.0) 96.8 (96.7) 95.3 (58.8)

I/σ(Ι) 23.3 (12.8) 10.30 (5.6) 15.8 (3.6)

Mosaicity (deg.) 0.3 0.2 0.3

B-factor from Wilson plot (Å2) 28.2 26.3 23.5

B. Refinement statistics

Resolution (Å) 20.0-1.95 20.0-1.70 20.0-2.20

Total number of reflections 133816 193228 106133

Working set: number of reflections 127126 183567 100827

Rfactor (%) 17.3 18.4 23.6

Test set: number of reflections 6690 9661 5306

Rfree (%) 23.5 23.1 28.2

Total number of atoms 11823 12109 11523

C. Geometry statistics

rmsd (bond distance) (Å) 0.019 0.015 0.019

rmsd (bond angle) (deg.) 1.7 1.6 1.7

Coordinate errorc 0.19 0.12 0.32

rmsd B:

Main-chain bonded atoms (Å2) 1.0 0.9 0.2

Side-chain bonded atoms (Å2) 2.3 2.2 1.3

Ramachandran plotd

Most favored region (%) 90.6 90.3 89.5

Additionally allowed regions (%) 8.8 8.7 10.0

Generously allowed regions (%) 0.4 0.6 0.5

Disallowed regions (%) 0.2 0.3 a The values in parentheses are for the highest resolution shell, b Rmerge = Σh Σi |<I>h – Ih,i| / ΣhΣi Ih,i. , c Cruickshanks DPI for coordinate error, d As defined by PROCHECK, rmsd: root mean square deviation

Page 79: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

77

5.8 Structure of MCR complexed with (R, S)-ibuprofenoyl-CoA (III)

The crystal structure of MCR complexed with (R, S)-ibuprofenoyl-CoA was determined by soaking the cross-linked crystals in the racemic mixture of (R, S)-ibuprofenoyl-CoA. The structure was refined at 1.85 Å resolution. After the first cycle of refinement using REFMAC5, strong σa-weighted (Fo - Fc)αc electron density for (S)-ibuprofenoyl-CoA was observed in all four active sites, which allowed to model (S)-ibuprofenoyl-CoA in each active site. Then another cycle of refinement was done with the model of (S)-ibuprofenoyl-CoA in the four active sites. After the refinement, there was still some residual electron density left in each active site. This residual electron density was most clear in the C active site. Initially it was thought that a glycerol molecule (with low occupancy) may be present in the active site. Glycerol was modeled and both (S)-ibuprofenoyl-CoA and glycerol were refined together in each active site. But the model of glycerol did not fit nicely in the electron density; the residual density was bigger than the size of glycerol. Careful examination of the residual density showed that, a phenyl group can be nicely placed in the density, but a simple rotation of the benzene moiety of the ibuprofenoyl group of the (S)-ibuprofenoyl-CoA did not position the phenyl ring in the density. As the soaking experiment was done with racemic mixture of (R, S)-ibuprofenoyl-CoA, there was a possibility that both the compounds were bound in the active site. We modeled the (R)-ibuprofenoyl-CoA in the active site and both enantiomers were refined together with occupancy 0.5 (Fig. 37). The final model was refined with R-factor of 20.2 % and free R-factor of 24.6 %.

Overall structure of MCR complexed with (R, S)- ibuprofenoyl-CoA is similar with the structure of unliganded MCR. Structural comparison showed also no structural changes near the active site. The mode of binding of the CoA part of the substrate is the same as observed in case of 2-methylmalonyl-CoA bound to the H126A MCR mutant structure. In both enantiomeric forms of the ibuprofen moiety, only small adjustments are seen for the thioester oxygen and the α-carbon, whereas the 2-methyl group and the β-carbon display a larger movement. Because of the movement of the β-carbon, the phenyl moieties of the two enantiomeric substrates are pointing in two different directions. The benzene moiety of the (S)-enantiomer binds between the side chains of Ile16, Pro20, Met50 and Met198 (from the other subunit). The side chains of Ile16 and Met50 adopt slightly different conformations because of the presence of ligand in the active site (as compared to the unliganded structure). The benzene moiety of the (R)-enantiomer binds near the side chains of Met216, Leu217 and Ile240 of the other subunit. The side chains of Met50 from one subunit and Met198, Met199, Met202, Met207, Trp208 from the other subunit form a large hydrophobic surface over which the benzene ring slides when the catalytic enantiomeric interconversion takes place.

In both enantiomeric forms of ibuprofenoyl-CoA the thioester oxygen atom and the 2-methyl group of ibuprofenoyl-CoA are in a cis-conformation with respect to each other and both are pointing towards the bottom of the catalytic pocket. The thioester oxygen is hydrogen bonded to N(Asp127) as also seen in the case of 2-methylmalonyl-CoA. The thioester oxygen atom is also in contact with ND1(His126), OD1(Asp156) and OD2(Asp156). The 2-methyl groups are in Van der Waals contact with side chain atoms of His126, Asp127, Asn152 and Asp156 of the large domain of one subunit and of

Page 80: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

78

Leu217, Tyr224 and Ile240 of the small domain of the other subunit. The mode of binding of ibuprofenoyl-CoA confirms the importance of His126 and Asp156 as catalytic residues (Fig. 37).

Fig. 37. Stereo picture showing the zoomed in view of the active site of MCR complexed with (R, S)-ibuprofenoyl-CoA. Two subunits are shown in orange and green colours. The ligands are shown in ball and sticks. (S)- and (R)-enantiomeric forms of ibuprofenoyl-CoA are coloured as pink and olive-green respectively. The important active site residues are shown as sticks and two important water molecules are shown as spheres.

The crystal structure of MCR complexed with (R, S)-ibuprofenoyl-CoA has given a better understanding about the reaction mechanism of racemization by this enzyme. The structure also nicely describes the mode of binding of the 2-methyl group and the fatty acyl part of the two enantiomeric substrates in the enzyme active site. However, still two different questions remain: (a) how do the bigger substrates (e.g., pristanoyl-CoA, trihydroxycoprostanoyl-CoA) get accommodated in the active site, and (b) how does the enzyme change the position of the fatty acyl group of the substrates during the catalysis. To answer these questions, two different liganded structures of MCR, one complexed with (R)-2-methylmyristoyl-CoA and the other complexed with (S)-2-methylmyristoyl-CoA were determined.

5.9 Crystal structure of MCR complexed with (R)- and (S)-2-methylmyristoyl-CoA (III)

High resolution crystal structure of MCR complexed with (R)-2-methylmyristoyl-CoA was refined at 1.60 Å resolution with R-factor of 21.0 % and free R-factor of 24.4 %. The σa-weighted (Fo - Fc)αc electron density was very clear for CoA part of the ligand in all four active sites, but for the alkyl part, it was completely visible only in B and D active site. In A and C active sites, there was no density for the alkyl chain after C5. So, complete model of the ligand was built in B and D active sites whereas partial model (till C5 of the alkyl chain) of the ligand was built in A and C active sites. The alkyl chain

Page 81: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

79

adopts two different conformations after C5 in the B active site (Fig. 38). The positions of these two conformations are near Met216, Leu217 and Ile240 from the other subunit.

Fig. 38. Stereo picture showing the zoomed in view of the active site of MCR complexed with (R)-2-methylmyristoyl-CoA. Two subunits are shown in orange and green colours. The ligand is shown as ball-and-stick model. Two different conformations of the alkyl part of the ligand are coloured as pink and olive-green respectively. The important active site residues are shown as stick model.

The structure of MCR complexed with the (S)-2-methylmyristoyl-CoA was refined at 1.70 Å with R-factor of 22.5 % and free R-factor of 26.5 %. The σa-weighted (Fo - Fc)αc electron density was good in all four active sites to model the compound in the four active sites. However, the highest occupancy of the alkyl chain was observed in the B active site. The alkyl chain of (S)-2-methylmyristoyl-CoA binds in a different direction compared to the binding of the alkyl chain in the (R)-enantiomer. The alkyl chain of the (S)-enantiomer is pointing upward in the binding pocket and has contact with Met198, Met202 and Met207 from the other subunit (Fig. 39).

Structural comparison shows that, the 2-methyl group and the thioester oxygen of (R)- and (S)-2-methylmyristoyl-CoA are bound in the same way as observed for (R, S)-ibuprofenoyl-CoA. These structures also highlight the importance of the methionine rich surface for binding large fatty acyl moieties (Fig. 38, Fig. 39).

Page 82: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

80

Fig. 39. Stereo picture showing the zoomed in view of the active site of MCR complexed with (S)-2-methylmyristoyl-CoA. Two subunits are shown in orange and green colours. The ligand is shown as ball-and-stick model. The important active site residues are shown as stick model.

Page 83: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

6 Discussion

6.1 Presence of two phosphate ions in the closed active site of eASL shows the possible mode of binding of the succinate moiety of the

substrate in the competent catalytic complex (I)

Several structural and enzymological studies have been done on duck δ2 crystallin and its mutated variants (Sampaleanu et al. 2002, Wu et al. 1998) to understand the reaction mechanism of ASL and mode of substrate binding in the enzyme active site. From the structural and mutational studies (Chakraborty et al. 1999), possible roles of different residues of duck δ2 crystallin in binding of argininosuccinate in the active site have been suggested. However, the precise catalytic reaction mechanism of ASL is unclear. Based on several structural and mutagenesis studies it has been hypothesized that, in duck δ2 crystallin His162 is possibly the catalytic base and Ser283 is possibly the catalytic acid (Sampaleanu et al. 2002, Chakraborty et al. 1999). However, His162 is not completely conserved in the superfamily. The structure of the duck δ2 crystallin S283A mutant complexed with argininosuccinate shows that another residue Thr161 which is highly conserved across the superfamily, is quite close to the Cβ atom of the succinate moiety of the substrate. So there is also a possibility that, Thr161 can act as a proton abstractor during the catalysis. The possible role of Ser283, which is fully conserved across the superfamily, is uncertain because of the absence of a closed competent active site.

Our 2.44 Å resolution crystal structure of eASL is a dimer having two dissimilar active sites, one is in the open form and another is in the closed form complexed with two phosphate ions. The closed active site arrangement of the eASL structure provides new information about the possible catalytic role of histidine (His162 in duck δ2 crystallin) and serine (Ser283 in duck δ2 crystallin). In eASL the corresponding residues are His156 and Ser277. The possible mode of binding of the succinate moiety (to eASL) has been obtained by assuming that the positions of the two phosphate ions mimick the positions of the two carboxylate moieties of the succinate (Fig. 5b, original article I). The crystal structures of duck δ2 S283A (Sampaleanu et al. 2002) and H162N (Valle et al. 1999) mutant complexed with argininosuccinate shows the mode of binding of the substrate in the open active site. From the structural comparison of the predicted mode of binding of succinate in the closed active site of eASL and the observed mode of binding of

Page 84: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

82

argininosuccinate in the open active site it is apparent that the succinate moiety of argininosuccinate rotates into a new position when the active site closes. This re-arrangement of the succinate moiety of the substrate is essential to avoid clashes with atoms of the closed active site loop. In this orientation of the succinate, the arginine moiety of the substrate points downward into its corresponding binding pocket and the hydrogen attached to the Cβ atom points to the side chain oxygen atom of Ser277. In this arrangement of the substrate in the active site, Ser277 side chain oxygen could initiate the catalysis by abstracting the proton from the Cβ atom. The role of Ser277 as a catalytic base is facilitated by enhanced nucleophilicity of its side chain oxygen atom because it points into an oxyanion-hole like environment formed by the main chain NH groups of residues 279 and 280 (Fig. 5b, original article I). A similar role for the corresponding serine residue has been proposed in aspartase (Fujii et al. 2003). The proposed orientation of the substrate in the closed active site also helps to assign possible role for His156 in the catalysis. In the proposed orientation of the substrate, the His156 side chain points to the guanidine group of the arginine moiety. So it can donate a proton to the leaving arginine moiety in the second half of the reaction. However, to become a proton donor, His156 needs to be doubly protonated both at ND1 and NE2. In the closed active site complexed with the two phosphate ions, ND1 and NE2 of His156 are hydrogen bonded to Glu290 and Phos-1 respectively. In the modeled complex with the succinate moiety His156 is doubly protonated: ND2 and NE2 of His156 are hydrogen bonded to Glu290 and the substrate carboxylic acid moiety (which is in the same position as Phos-1) respectively. Presence of Glu290 is quite important for the protonation of His156. The sequence conservation of Glu290 is highly correlated with the sequence conservation of histidine (Sampaleanu et al. 2002).

From the available structural data it is quite clear that the C3 loop adopts a closed conformation in the presence of the substrate in the active site and that Ser277 acts as a catalytic base during the catalysis.

6.2 Structural comparison of MCR with FRC and YfdW (II)

When the structural study on MCR was initiated, there was no structural information available for any enzyme belonging to the family III CoA transferases. When we solved the structure of MCR, at the same time the structure of FRC (native and complexed with CoA) from O. formigenes was published (Ricagno et al. 2003). A few months later also, the structures of E. coli YfdW protein complexed with CoA and acetyl-CoA were published (Gruez et al. 2003, Gogos et al. 2004). MCR has a similar structural fold as observed in FRC and E. coli YfdW protein.

Sequence comparison shows that MCR has sequence identity of 24 % and 25 % with FRC and E. coli YfdW protein respectively. Sequence comparison between FRC and E. coli YfdW protein shows that they have 60 % sequence similarity and both are around 60 residues longer than MCR. Structural comparison between FRC and E. coli YfdW protein shows that they have similar structure. Sequence and structural comparisons between MCR and FRC show that FRC has two extra loops which are not present in MCR (Fig. 6B, original article II). The first loop is 13 residues long and located in the large domain

Page 85: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

83

after β2. The second one is longer, 27 residues long and located after α8, the kinked linker helix. It is of interest that the comparison of the structures of FRC and CaiB (Rangarajan et al. 2005) also highlight structural differences in these two loops. It is also interesting to point out that in MCR, the first loop region is the only disordered part of the molecule. It is disordered in all structures. In FRC, there is also an extra C-terminal helix (α20). Comparison between the dimers of MCR and FRC shows that the structure of the large domains are quite similar, however, the small domains are more different. This is also observed in the sequence comparison. The residues from the large domain have 31 % sequene identity whereas the residues from the small domain have only 13 % sequence identity.

Structural comparison between FRC and E. coli YfdW protein shows that the mode of binding of CoA is similar. The active site cleft and the mode of CoA binding in the active site of MCR was predicted using the structure of E. coli YfdW protein complexed with acetyl-CoA. Structural comparison showed, Arg38, Lys62, Arg85 and Arg91 might be involved in the binding of CoA in the active site cleft. Involvement of Arg91 in the catalysis was confirmed by mutagenesis studies. From the mutagenesis studies it was predicted that His126 and Asp156 might be involved in the catalysis. Structural comparison of the MCR structure with the acetyl-CoA bound E. coli YfdW protein showed that the active site of MCR is near Asp156 and His126. In E. coli YfdW protein structure the bound acetyl-CoA has two modes of binding for the acetyl group. However in MCR structure, the downward conformation of the acetyl group is acceptable as the upward conformation clashes with Pro18. In this downward conformation of the acetyl group, the thioester oxygen enters inside an oxyanion hole created by ND1(His126) and N(Asp127). This kind of oxyanion hole, facilitating thioester-dependent catalysis, is commonly found in CoA- dependent enzymes (Holden et al. 2001).

Structural comparison shows that the position of Asp156 in MCR is same as the coresponding aspartate observed in FRC and E. coli YfdW protein. In case of FRC the corresponding aspartate (Asp169) is the catalytic base. This aspartate is conserved across the superfamily. So, structural comparison confirms Asp156 to be one of the catalytic residues. The other catalytic residue might be His126 as it is positioned opposite to Asp156. This histidine is not conserved in the superfamily. His126 also froms a hydrogen bond with Glu241 from the other subunit. The hydrogen bonding interaction between His126 and Glu241 also enhances the catalytic properties of the histidine side chain. This His-Glu pair of MCR is not conserved in FRC. The corresponding residues in FRC are Tyr139 and Ala285. The active site architecture of FRC and MCR are different. In the active site of FRC, there is a small glycine rich loop (residues 258-261, corresponding to the region near residues 218-221 in MCR) which is in open conformation in the unliganded structure and in closed conformation in the liganded structure. The corresponding loop in MCR is shorter.

The structural comparison shows that MCR, FRC and E. coli YfdW protein have a similar structural fold, despite a very low sequence similarity. However, there are some structural features wich are different in FRC and MCR. The unique structural features in MCR, not seen in the structures of the sequence related FRC, suggest that the family III CoA transferases can be subdivided in at least two classes, being racemases and CoA transferases.

Page 86: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

84

6.3 Structural comparison of wild type MCR structure with liganded and unliganded H126A MCR mutant structures

The unliganded structure of H126A MCR mutant was determined at 1.95 Å resolution. The structure of the H126A MCR mutant complexed with (S)-2-methylmalonyl-CoA was also determined at 1.70 Å resolution. These two mutant structures were superimposed on the unliganded MCR structure. The structural superposition was done using the residues from α-helices and β-sheets in the large domains of both subunits and 122 Cα atoms had an r. m. s. fit of 0.2 Å and 0.3 Å for the apo and liganded H126A MCR mutant structures, respectively. Structural superposition shows that the large domains superimpose very well, but the small domains are in a different position in case of both liganded and unliganded H126A MCR mutant structure (Fig. 40). The relative positions of the small domains of the unliganded MCR and the two mutant structures have been compared using a superposition analysis (Table 5). In both mutant structures the small domain has moved towards the large domain.

In case of CaiB, it has been reported that the small domain moves towards the large domain upon binding of CoA or the CoA derivative in the enzyme active site (Stenmark et al. 2004, Rangarajan et al. 2005). However, a domain movement because of mutation, has not been reported in any of the enzymes of the family III CoA transferases. Structural comparison between liganded and unliganded H126A MCR mutant structure shows no significant structural change because of ligand binding.

Fig. 40. Structural superposition of the dimers of wild type MCR (blue), apo H126A MCR mutant (green) and (S)-2-methylmalonyl-CoA bound H126A MCR mutant (magenta). Important catalytic active site residues along with Gln243 are shown as stick model.

Page 87: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

85

Table 5. Superposition of the small domains.

Structures r. m. s. fit (Å) Rotation angle K (o) Unliganded H126A MCR mutant on wild type MCR

0.3 3.0

Liganded H126A MCR mutant on wild type MCR

0.6 5.0

For the superposition the Cα-atoms of residues 224 to 300 were used.

6.4 The characteristic features of the extended active site of MCR (III)

The active site of MCR was first identified by the structural comparison of wild type unliganded MCR structure with acetyl-CoA bound E. coli YfdW protein structure. Afterwards, several liganded structures of MCR were determined, which include the structure of wild type MCR complexed with acetyl-CoA, crotonyl-CoA, acetoactyl-CoA, (R, S)-ibuprofenoyl-CoA, (R)-, (S)-2-methylmyristoyl-CoA and H126A MCR mutant complexed with (S)-2-methylmalonyl-CoA. These structures confirm the presence of the active site of MCR in the center of a large extended pocket. The extended pocket provides the binding site for the CoA part of the substrate on one side of the catalytic center, as well as the binding pocket for the acyl part on the other side of the catalytic center. The CoA binding pocket and the acyl binding pocket are pointing in the opposite directions away from the catalytic center. The mode of CoA binding in the CoA binding pocket has been discussed in section 5.7. The CoA binding pocket is mainly formed by the residues from the large domain from subunit-1. The acyl binding region is formed partly by the residues from the large domain from subunit-1, but predominantly by the residues from subunit-2. These residues (190-223) are coming from the linker region-1 connecting the large and small domain of subunit-2. The residues from the small domain of subunit-2 complete the acyl binding pocket.

The crystal structure of MCR complexed with (R, S)-ibuprofenoyl-CoA shows the mode of binding of (R)- and (S)-enantiomeric forms of 2-methylacyl-CoAs in the active site. The ibuprofen parts of the two different enantiomeric substrates are pointing in two different directions. There are several structures of other racemases complexed with different ligands. In all these cases, the structures have been determined with either (R)- or (S)-enantiomeric forms of the compound complexed in the enzyme active site. This is the first structure of a racemase complexed with both enantiomeric substrates in the enzyme active site. To understand the mode of binding of the bigger substrate in the MCR active site, the structures of MCR complexed with (R)- and (S)-2-methylmyristoyl-CoA were determined at 1.6 Å and 1.7 Å resolution, respectively. Both the structures show that the CoA moiety, the thioester oxygen and the 2-methyl group are bound in the same way as seen for ibuprofenoyl-CoA.

The crystal structures of MCR complexed with (R, S)- ibuprofenoyl-CoA, (R)- and (S)-2-methylmyristoyl-CoA show the mode of binding of the acyl part of the substrate in the enzyme active site. The mode of binding is determined by the nature of the acyl group

Page 88: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

86

and the chirality at Cα. The structures show that the acyl groups bind in two different directions inside the acyl binding pocket consisting of several methionine residues. In case of (S)-ibuprofenoyl-CoA, the ibuprofen part of the substrate binds in a pocket (the S-pocket) near Ile16 and Pro20. In case of (R)- ibuprofenoyl-CoA, the ibuprofen part of the substrate binds in a pocket (the R-pocket) near Ile240 from subunit-2. In these structures it is seen that the acyl groups of the two different enantiomeric substrates are bound in two different ways in the active site and the geometry at the catalytic center is quite tight. This observation suggests that during the catalytic stereochemical inversion of the substrate, the acyl group moves between these two pockets (the R and S pocket) by sliding over the methionine rich surface which connects these two pockets (Fig. 41).

Fig. 41. Stereo picture showing the zoomed in view of the active site of MCR complexed with (R, S)-ibuprofenoyl-CoA. The protein part is shown as a molecular surface. The surfaces of the different protein atoms are coloured according to atom type (carbon in grey, nitrogen in blue, oxygen in red and sulfur in orange). Bound ibuprofenoyl-CoAs are shown as ball-and-stick model. (S)- and (R)-ibuprofenoyl-CoA are shown in pink and olive-green, respectively.

6.5 Understanding the reaction mechanism of MCR (III)

From the crystal structures of MCR complexed with (R, S)-ibuprofenoyl-CoA, (R)- and (S)-2-methylmyristoyl-CoA it is clear that His126 and Asp156 are the two key catalytic residues involved in the raction mechanism. However, to assign the correct role of these two residues during the catalytic reaction it is important to know the protonation state of these two residues (Puig et al. 2006), as the racemization reaction involves proton abstraction and donation. From the biochemical studies it is known that the pH optimum of Amacr is around 7.0 (Schmitz et al. 1994). So the pKa of His126 and Asp156 should be

Page 89: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

87

around 7.0. The key features of the active site (Fig. 4, original article III) which facilitate the catalytic properties are briefly discussed below.

It is quite reasonable to have a pKa of 7.0 for His126, as it is quite common for a histidine side chain, especially when it is hydrogen bonded to a glutamate side chain, which is also observed in other active sites (Gutteridge & Thornton 2005). Also the pKa of Asp156 is critical; it will be influenced by its neighbouring residues like Met188, Asn152 and Asp127 (Fig. 4, original article III). The side chain of Asp127 is not directly interacting with Asp156, however the position of Asp127 at a close proximity to Asp156 will influence the pKa of Asp156. Asp127 is positioned in such a way that the side chain carboxylic acid group is hydrogen bonded to the main chain NH groups of helix-7 and two active site water molecules (Fig. 42). This polar environment suggests that Asp127 will be deprotonated at pH 7.0 like in bulk water. The presence of a negative charge on the side chain carboxylic acid group of Asp127 will raise the pKa of Asp156. Analysis of the geometry of Asp156 in the active site gives some explanation for the protonation state of the carboxylic acid group of Asp156. OD1(Asp156) is interacting with SD(Met188) and the geometry suggests that the carboxylate oxygen is a nucleophilic partner of this interaction and this is not a hydrogen bonding interaction (Rosenfield et al. 1977, Pal & Chakrabarti 2001). The OD2(Asp156) atom is nicely positioned to exchange a proton with the Cα atom of the substrate. The OD2(Asp156) atom is also nicely positioned to form a hydrogen bonding interaction with ND2(Asn152), which allows the exchangeable proton of OD2(Asp156) to interact with the Cα atom of the substrate. From the geometry and the surrounding environment of the side chain carboxylic acid group of Asp156, it is expected that OD1 is unprotonated and OD2 is involved in proton exchange with the substrate and also the protonated OD2 can be involved in stabilizing the thioester oxygen anion which is formed in the transition state during the catalysis. So the acid form of Asp156 is neutral and the basic form is singly negatively charged.

Fig. 42. Stereo picture of the zoomed in view of the active site of MCR complexed with (R, S)-ibuprofenoyl-CoA showing the hydrogen bonding interactions. The ligands are shown as ball-and-stick models. (R)- and (S)-enantiomeric forms of ibuprofenoyl-CoA are coloured as pink and olive-green, respectively. The important active site residues are shown as stick models and two important water molecules are shown as spheres. Hydrogen bonding interactions are shown by blue dotted lines.

Page 90: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

88

Binding mode of (R, S)-ibuprofenoyl-CoA in the active site of MCR confirms that Asp156 and His126 are the two catalytic residues. Analysis of the geometry and the surrounding environment of these two residues show that proper protonation state of His126 and Asp156 will be achieved during the catalysis. In this proposed reaction mechanism for conversion of (S)- to (R)-enantiomer, His126 is neutral and Asp156 is also neutral and protonated (Fig. 8, original article III). For the conversion of (R)-enantiomer to (S)-enantiomer, the competent active site will be the protonated positively charged His156 and negatively charged Asp156. After each catalytic cycle, the protonation state of the active site will be regenerated by proton exchange with the bulk solvent. The first step of the reaction is the abstraction of the α-proton. The generated negative charge of the thioester oxygen of the planar intermediate is stabilized primarily by hydrogen bonding interactions with N(Asp127). The tight geometry of the MCR active site complexed with (R, S)-ibuprofenoyl-CoA shows that during the catalytic conversion between the two enantiomers, the thioester oxygen atom and the 2-methyl group remain bound almost in the same position. The acyl group changes its position between the R- and S- pocket by sliding over the methionine rich surface connecting these two pockets in the active site.

Page 91: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

7 Conclusions

High resolution protein crystallographic studies on the enzyme α-methylacyl-CoA racemase from M. tuberculosis have been done. The crystal structure of wild type unliganded MCR shows that the enzyme is a tight interlocked dimer with similar fold as observed in the other members of the familiy III CoA-transferases. The important catalytic active site residues, His126 and Asp156 were identified from the structral analysis.

Several crystallographic binding studies have been done using wild type MCR crystals and crystals of the active site H126A MCR mutant. Crystal structures of wild type MCR complexed with acetyl-CoA, acetoacetyl-CoA, crotonyl-CoA, (R, S)-ibuprofenoyl-CoA, (R)- and (S)-2-methylmyristoyl-CoA have been determined. Crystal structures of the H126A MCR mutant in unliganded form and also complexed with 2-methylmalonyl-CoA have been determined. These crystal structures suggest that His126 and Asp156 act as the catalytic base and acid, respectively, when the (S)-enantiomeric substrate is bound in the active site and vice versa when the (R)-enantiomeric substrate is bound. The crystal structures also suggest that the negative charge developing on the thioester oxygen during the catalytic reaction is stabilized by the main chain NH group of Asp127. These structural studies suggest that during the interconversion, the thioester oxygen and 2-methyl group of the two enantiomeric substrates remain bound in the same position whereas the fatty acyl groups change their positions between the R and S pocket by sliding over a methionine rich surface which connects these two pockets in the enzyme active site.

During the process of optimization of the crystallization condition of MCR, a very minor impurity in the protein sample, being argininosuccinate lyase from E. coli (eASL) was crystallized. The crystal structure was the first structure of the active enzyme where the important active site 270s loop was visible. The 2.44 Å resolution crystal structure suggests that this loop adopts a closed conformation during the presence of the substrate in the active site and Ser277 acts as the catalytic base during the catalysis.

Page 92: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

References

Abu-Abed M, Turner MA, Vallee F, Simpson A, Slingsby C & Howell PL (1997) Structural comparison of the enzymatically active and inactive forms of δ crystallin and the role of histidine 91. Biochemistry 36: 14012-14022.

Adams, E (1976) Catalytic aspects of enzymatic racemization. Adv Enzymol Relat Areas Mol Biol 44: 69-138.

Allan JD, Cusworth DC, Dent CE & Wilson VK (1958) A disease, probably hereditary, characterized by severe mental deficiency and constant gross abnormality of amino acid metabolism. Lancet I: 182-187.

Appelgarth D, Davidson A, Perry L, Podon S, Chrichton J, Flandwick D (1975) Argininosuccinate academia in a healthy individual detected by urine screening program. Clin Chem 21: 950-951.

Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA & Struhl K (1989) Current protocols in Molecular Biology. John Wiley & Sons, Inc. New York, USA.

Baille TA, Adams WJ, Kaiser DG, Olanoff LS, Halstead GW, Harpootlian H & Van Giessen GJ (1989) Mechanistic studies of the metabolic chiral inversion of (R)-ibuprofen in humans. J Pharmacol Exp Ther 249: 517-523.

Bearne SL & Wolfenden R (1997) Mandelate racemase in piece: Effective concentrations of enzyme functional groups in the transition state. Biochemistry 36: 1646-1656.

Bhaumik P, Kursula P, Ratas V, Conzelmann E, Hiltunen JK, Schmitz W & Wierenga RK (2003) Crystallization and preliminary X-ray diffraction studies of an α-methylacyl-CoA racemase from Mycobacterium tuberculosis. Acta Cryst D59: 353-355.

Branden C & Tooze J (1998) Introduction to Protein Structure. Garland Publishing Inc. New York. Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang J-S,

Kuszewski J, Nilges N, Pannu NS, Read RJ, Rice LM, Simonson T & Warren GL (1998) Crystallography and NMR system (CNS): A new software system for macromolecular structure determination. Acta Cryst. D54: 905-921.

Brusilow SW & Horwich AL (2001) Urea cycle enzymes. In: Scriver CR, Beaudet AL, Valle D, Sly WS (eds) The metabolic and molecular bases of inherited disease, 8th edn. McGrawHill, New York: 1909-1963.

Brusilow SW & Maestri NE (1996) Urea cycle disorders: diagnosis, pathophysiology, and therapy. Adv Pediatr 43:127-170.

Cahn RS, Ingold C & Prelog V (1966) Specification of Molecular Chirality. Angew Chem Int E. Engl 5: 385-390

Chakraborty AR, Davidson A & Howell PL (1999) Mutational analysis of amino acid residues involved in argininosuccinate lyase activity in duck delta II crystallin. Biochemistry 38: 2435-2443.

Page 93: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

91

Chen CS, Shieh WR, Lu PH, Harriman S & Chen SY (1991) Metabolic stereoisomeric inversion of ibuprofen in mammals. Biochim Biophys Acta 1078: 411-417.

Choi SY, Esaki N, Ashiuchi M, Yoshimura T & Soda K (1994) Bacterial glutamate racemase has high sequence similarity with myoglobins and forms an equimolar inactive complex with hemin. Proc Natl Acad Sci USA 91: 10144-10147.

Collaborative Computational Project, Number 4 (1994). The CCP4 suite: programs for protein crystallography. Acta Cryst D50: 760-763.

Cowton KD & Zhang KY (1999) Density modification for macromolecular phase improvement. Prog Biophys Mol Biol 72: 245-270.

DeLano WL (2002) The PyMOL molecular graphics system. http://www.pymol.org. Dhanasekaran SM, Barrette TR, Ghosh D, Shah R, Varambally S, Kurachi K, Pienta KJ, Rubin MA

& Chinnaiyan AM (2001) Delineation of prognostic biomarkers in prostate cancer. Nature 412: 822-826.

Dodson G & Wlodawer A (1998) Catalytic triads and their relatives. Trends Biochem Sci 23: 347-352.

Eliel EL & Wilen SH (1994) Stereochemistry of organic compounds. John Wiley & Sons, Inc New York.

Eliot AC, & Kirsch JF (2004) PYRIDOXAL PHOSPHATE ENZYMES: Mechanistic, Structural, and Evolutionary Considerations. Annu Rev Biochem 73: 383-415.

Finar IL (1959) Organic chemistry, Volume II Longmans. Finar IL (1963) Organic chemistry, Volume I Longmans. Fujii T, Sakai H, Kawata Y & Hata Y (2003) Crystal structure of thermostable aspartase from

Bacillus sp. YM55-1: structure-based exploration of functional sites in the aspartase family J Mol Biol 328: 635-654.

Gallo KA & Knowles JR (1993) Purification, cloning, and cofactor independence of glutamate racemase from Lactobacillus. Biochemistry 32: 3981-3990.

Gallo KA, Tanner ME & Knowles JR (1993) Mechanism of the reaction catalyzed by glutamate racemase. Biochemistry 32: 3991-3997.

Gerlt JA, Kenyon GL, Kozarich JW, Neidhart DJ, Petsko GA & Powers VM (1992) Mandelate racemase and class-related enzymes. Curr Opin Struct Biol 2: 736-742.

Gerlt JA, Kozarich JW, Kenyon GL & Gassman PG (1991) Electrophilic catalysis can explain the unexpected acidity of carbon acids in enzyme-catalyzed reactions. J Am Chem Soc 113: 9667-9669.

Gerstein M, Lesk AM & Chothia C (1994) Structural mechanisms for domain movements in proteins. Biochemistry 33: 6739-6749.

Glavas S & Tanner ME (1999) Catalytic acid/base residues of glutamate racemase. Biochemistry 38: 4106-4113.

Gogos A, Gorman J & Shapiro L (2004) Structure of Escherichia coli YfdW, a type III CoA transferase. Acta Cryst D60: 507-511.

Gregoret LM, Rader SD, Fletterick RJ & Cohen FE (1991) Hydrogen bonds involving sulfur atoms in proteins. Proteins: Struct Funct Genet 9: 99-107.

Gruez A, Roig-Zamboni V, Valencia C, Campanacci V & Cambillau C (2003) The crystal structure of the Escherichia coli YfdW gene product reveals a new fold of two interlaced rings identifying a wide family of CoA transferases. J Biol Chem 278: 34582-34586.

Gutteridge A & Thronton JM (2005) Understanding nature’s catalytic toolkit. Trends Biochem Sci 30: 622-629.

Hasson MS, Schlichting I, Moulai J, Taylor K, Barrett W, Kenyon GL, Babbitt PC, Gerlt A, Petsko GA & Ringe, D (1998) Evolution of an enzyme active site: the structure of a new crystal form of muconate lactonizing enzyme compared with mandelate racemase and enolase. Proc Natl Acad Sci USA 95: 10396-10401.

Page 94: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

92

Hegeman GD, Rosenberg EY & Kenyon GL (1970) Mandelic acid racemase from Pseudomonas putida. Purification and properties of the enzyme. Biochemistry 9: 4029-4036.

Heider J (2001) A new family of CoA-transferases. FEBS lett 509: 345-349. Hol WGJ, van Duljen PT & Berendsen HJC (1978) The alpha-helix dipole and the properties of

proteins. Nature 273: 443-446. Holden HM, Benning MM, Haller T & Gerlt JA (2001) The crotonase superfamily: divergently

related enzymes that catalyze different reactions involving acyl coenzyme a thioesters. Acc Chem Res 34: 145-157.

Hwang KY, Cho CS, Kim SS, Sung HC, Yu YG & Cho Y (1999) Structure and mechanism of glutamate racemase from Aquifex pyrophilus. Nat Struct Biol 6: 422-426.

Jencks WP (1980) When is an intermediate not an intermediate ? Enforced mechanisms of general acid-base catalyzed, carbocation, carbanion, and ligand exchange reactions. Chem Soc Rev 10: 264-270.

Jiang Z, Woda BA, Rock KL, Xu Y, Savas L, Kahn A, Pihan G, Cai F, Badcook JS, Rathanaswami P, Reed SG, Xu J & Fanger GR (2001) P504S: a new molecular marker for the detection of prostate carcinoma. Am J Surg Pathol 25: 1397-1404.

Jones TA, Zou JY, Cowan SW & Kjeldgaard M (1991) Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Cryst A47: 110-119.

Jonsson S, Ricagno S, Lindqvist Y & Richards NJ (2004) Kinetic and mechanistic characterization of the formyl-CoA transferase from Oxalobacter formigenes. J Biol Chem 279: 36003-36012.

Kabsch W (1993) Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. J Appl Cryst 26: 795-800.

Kallarakal AT, Mitra B, Kozarich JW, Gerlt JA, Clifton JG, Petsko GA & Kenyon GL (1995) Mechanism of the reaction catalyzed by mandelate racemase: structure and mechanistic properties of the K166R mutant. Biochemistry 34: 2788-2797.

Knights KM (1998) Role of hepatic fatty acid: coenzyme A ligase in the metabolism of xenobiotic carboxylic acids. Clin Exp Pharmacol Physiol 25: 776-782.

Koo CW, Sutherland A, Vederas JC & Blanchard JS (2000) Identification of active site cysteine residues that function as general bases: diaminopimelate epimerase. J Am Chem Soc 122: 6122-6123.

Kotti TJ, Savolainen K, Helander HM, Yagi A, Novikov DK, Kalkkinen N, Conzlmann E, Hiltunen JK & Schmitz W (2000) In mouse α-methylacy-CoA racemase, the same gene product is simultaneously located in mitochondria and peroxisomes. J Biol Chem 275: 20887-20895.

Laemmli UK (1970) Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227: 680-685.

Lamont HS, Staudenbauer WL & Strominger JL (1972) Partial Purification and Characterization of an Aspartate Racemase from Streptococcus faecalis. J Biol Chem 247: 5103-5106.

Lamzin VS, Dauter Z & Wilson KS (1995) How Nature deals with stereoisomers. Curr Opin Struct Biol 5: 830-836.

Landro JA, Gerlt JA, Kozarich JW, Koo CW, Shah VJ, Kenyon GL, Neidhart DJ, Fujita S & Petsko GA (1994) The role of lysine 166 in the mechanism of mandelate racemase from Pseudomonas putida: Mechanistic and crystallographic evidence for stereospecific alkylation by (R)-α-phenylglycidate. Biochemistry 33: 635-643.

Laskowski RA, MacArthour MW, Moss DS & Thronton JM (1993) PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Cryst 26: 283–291.

Levitt DG (2001) A new software routine that automates the fitting of protein X-ray crystallographic electron-density maps. Acta Cryst D57: 1013-1019.

Page 95: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

93

Levy HL, Coulombe JT & Shih VE (1980) Newborn urine screening In: Bickel H, Guthrie R, Hammersen G (eds) Neonantal screening for inborn errors of metabolism. Springer, Berlin Heidelberg New York: 89-103.

Liu L, Iwata K, Kita A, Kawarabayasi Y, Yohada M & Miki K (2002) Crystal structure of aspartate racemase from Pyrococcus horikoshi OT3 and its implications for molecular mechanism of PLP-independent racemization. J Mol Biol 319: 479-489.

Luo J, Duggan DJ, Chen Y, Sauvageot J, Ewing CM, Bitner ML, Trent JM & Isaacs WB (2001) Human prostate cancer and benign prostatic hyperplasia: molecular dissection by gene expression profiling. Cancer Res 61: 4683-4688.

Luo J, Zha S, Gage WR, Dunn TA, Hicks JL, Bannett CJ, Ewing CM, Platz EA, Ferdinandusse S, Wanders RJ, Trent JM, Isaacs WB & De Marzo AM (2002) α-methylacyl-CoA racemase: a new molecular marker for prostate cancer. Cancer Res 62: 2220-2226.

Mitra B, Kallarakal AT, Kozarich JW, Gerlt JA, Clifton JG, Petsko GA & Kenyon GL (1995) Mechanism of the reaction catalyzed by mandelate racemase: importance of the electrophilic catalysis by glutamic acid 317. Biochemistry 34: 2777-2787.

Morollo AA, Petsko GA, & Ringe D (1999) Structure of a Michaelis complex analogue: propionate binds in the substrate carboxylate site of alanine racemase. Biochemistry 38: 3293-3301.

Mukherji M, Schofield CJ, Wierzbicki AS, Jansen GA, Wanders RJA & Lloyd MD (2003) The chemical biology of branched-chain lipid metabolism. Prog Lipid Res 42: 359-376.

Murshudov GN, Vagin AA & Dodson EJ (1997) Refinement of Macromolecular Structures by the Maximum-Likelihood Method. Acta Cryst D53: 240-255.

Mursula AM, van Aalten DMF, Hiltunen JK & Wierenga RK (2001) The crystal structure of Δ3-Δ2-enoyl-CoA isomerase. J Mol Biol 309: 845-853.

Nakajima N, Tanizawa K, Tanaka H & Soda K (1988) Distribution of glutamate racemase in lactic acid bacteria and further characterization of the enzyme from Pediococcus pentosaceus. Agric Biol Chem 52: 3099-3104.

Neidhart DJ, Howell PL, Petsko GA, Powers VM, Li RS, Kenyon GL & Gerlt JA (1991) Mechanism of the reaction catalyzed by mandelate racemase 2. Crystal structure of mandelate racemase at 2.5 Å resolution: Identification of the active site and possible catalytic residues. Biochemistry 30: 9264-9273.

Otwinowski Z & Minor W (1997) Processing of X-ray diffraction data collection in oscillation mode. Methods Enzymol 276: 307-326.

Okada H, Yohda M, Giga-Hama Y Ueno Y, Ohdo S & Kumagai H (1991) Distribution and purification of aspartate racemase in lactic acid bacteria. Biochem Biophys Acta 1078: 377-382.

Pal D & Chakrabarti P (2001) Nonhydrogen bond interactions involving the methionine sulfur atom. J Biomol Struct Dyn 19: 115-128.

Pedersen JI, Veggan T & Björkhem I (1996) Substrate stereospecificity in oxidation of (25S)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA by peroxisomal trihydroxy-5β-cholestanoyl-CoA oxidase. Biochem Biophys Res Commun 22: 37-42.

Philippsen A (2000) DINO: Visualizing structural biology. http://www.bioz.unibas.ch/ ~xray /dino. Powers VM, Koo CW, Kenyon GL, Gerlt JA & Kozarich JW (1991) Mechanism of the reaction

catalyzed by mandelate racemes 1. Chemical and kinetic evidence for a two-base mechanism. Biochemistry 30: 9255-9263.

Prydz K, Kase BF, Bjorkhem I & Pedersen JI (1986) Formation of chenodeoxycholic acid from 3α,7α-dihydroxy-5β-cholestanoic acid by rat liver peroxisomes. J Lipid Res 27: 622-628.

Puig E, Garcia-Viloca M, Gonzalez-Lafont A & Liuch JM (2006) On the ionization state of the substrate in the active site of glutamate racemase. A QM/MM study about the importance of being zwitterionic. Phys Chem A Mol Spectrosc Kinet Environ Gen 110: 717-725.

Radzicka A & Wolfenden R (1995) A proficient enzyme. Science 267: 90-93.

Page 96: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

94

Rangarajan ES, Li Y, Iannuzzi P, Cygler M & Matte A (2005) Crystal structure of Escherichia coli crotonobetainyl-CoA: carnitine CoA-transferase (CaiB) and its complexes with CoA and carnitinyl-CoA. Biochemistry 44: 5728-5738.

Raushel FM & Garrard LJ (1984) A positional isotope exchange study of the argininosuccinate lyase reaction. Biochemistry 23:1791-1795.

Reichel C, Brugger R, Bang H, Geisslinger G & Brune K (1997) Molecular cloning and expression of a 2-arylpropionyl-coenzyme A epimerase: a key enzyme in the inversion metabolism of ibuprofen. Mol Pharmacol 51:576-582.

Ricagno S, Jonsson S, Richards N & Lindqvist Y (2003) Formyl-CoA transferase encloses the CoA binding site at the interface of an interlocked dimer. EMBO J 22: 3210-3219.

Rosenfield RE, Parthasarathy JR & Dunitz JD (1977) Directional preferences of nonbonded atomic contacts with divalent sulfur. 1. Electrophiles and nucleophiles. J Am Chem Soc 99: 4860-4862.

Rubin MA, Zhou M, Dhanasekaran SM, Varambally S, Barrette TR, Sanda MG, Pienta KJ, Ghosh D & Chinnaiyan AM (2002) alpha-Methylacyl coenzyme A racemase as a tissue biomarker for prostate cancer. JAMA 287: 1662-1670.

Ruzheinikov SN, Taal MA, Sedelnikova SE, Baker PJ & Rice DW (2005) Substrate-induced conformational changes in Bacillus subtilis glutamate racemase and their implications for drug discovery. Structure 13: 1707-1713.

Sampaleanu LM, Vallee F, Slingsby C & Howell PL (2001a) Structural studies of duck δ 1 and δ 2 crystallin suggest conformational changes occur during catalysis. Biochemistry 40: 2732-2742.

Sampaleanu LM, Vallee F, Thompson GD & Howell PL (2001b) Three-dimensional structure of the argininosuccinate lyase frequently complementing allele Q286R. Biochemistry 40: 15570-15580.

Sampaleanu LM, Yu B, & Howell PL (2002) Mutational analysis of duck δ2 crystallin and the structure of an inactive mutant with bound substrate provide insight into the enzymatic mechanism of argininosuccinate lyase. J Biol Chem 277: 4166-4175.

Savolainen K, Kotti TJ, Schmitz W, Savolainen TI, Sormunen RT, Ilves M, Vainio SJ, Conzelmann E, & Hiltunen JK (2004) A mouse model for α-methylacyl-CoA racemase deficiency: adjustment of bile acid synthesis and intolerance to dietary methyl-branched lipids. Hum Mol Genet 13: 955-965.

Schafer SL, Barrett WC, Kallarakal AT, Mitra B, Kozarich JW, Gerlt JA, Clifton JG, Petsko GA & Kenyon GL (1996) Mechanism of the reaction catalyzed by mandelate racemase: Structure and mechanistic properties of the D270N mutant. Biochemistry 35: 5662-5669.

Schmitz W, Albers C, Fingerhut R & Conzelmann E (1995) Purification and characterization of an α-methylacyl-CoA racemase from human liver. Eur J Biochem 231: 815-822.

Schmitz W, Fingerhut R & Conzelmann E (1994) Purification and properties of an α-methylacyl-CoA racemase from rat liver. Eur J Biochem 222: 313-323.

Schmitz W, Helander HM, Hiltunen JK & Conzelmann E (1997) Molecular cloning of cDNA species for rat and mouse liver α-methylacyl-CoA racemases. Biochem J 326: 883-889.

Schowen KB, Limbach HH, Denisov GS & Schowen RL (2000) Hydrogen bonds and proton transfer in general-catalytic transition-state stabilization in enzyme catalysis. Biochim Biophys Acta 1458: 43-62.

Shaw JP, Petsko GA, & Ringe D (1997) Determination of the structure of alanine racemase from Bacillus stearothermophilus at 1.9 Å resolution. Biochemistry 36: 1329-1342.

Siddiqi F, Bourque, JR, Jiang H, Gardner M, St. Maurice M, Blouin C & Bearne SL (2005) Perturbing the hydrophobic pocket of mandelate racemase to probe phenyl motion during catalysis. Biochemistry 44: 9013-9021.

Simpson A, Bateman O, Driessen H, Lindley P, Moss D, Mylvaganam S, Narebor E & Slingsby C (1994) The structure of avian eye lens δ-crystallin reveals a new fold for a superfamily of oligomeric enzymes. Nat Sruct Biol 1: 724-734.

Page 97: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

95

Stamper GF, Morollo AA & Ringe D (1998) Reaction of alanine racemase with 1-aminoethylphosphonic acid forms a stable external aldimine. Biochemistry 37: 10438-10445.

Stenmark P, Gurmu D & Nordlund P (2004) Crystal structure of CaiB, a type-III CoA transferase in carnitine metabolism. Biochemistry 43: 13996-14003.

Stryer L (1988) Biochemistry, 3rd edn. New York, Freeman WH and Co: 386. Tanner ME & Kenyon GL (1998) Inversion of Stereocenters. In Comprehensive Biological

Catalysis; Sinnot, M., Ed.; Academic Press: San Diego, Vol II, Chapter 21. Tanner ME (2002) Understanding nature’s strategies for enzyme-catalyzed racemization and

epimerization. Acc Chem Res 35: 237-246. Tanner ME, Gallo KA & Knowles RJ (1993) Isotope effects and the identification of catalytic

residues in the reaction catalyzed by glutamate racemase. Biochemistry 32: 3998-4006. Terwilliger TC (2000) Maximum-likelihood density modification. Acta Cryst D56: 965-972. Terwilliger TC & Berendzen J (1999) Automated MAD and MIR structure solution. Acta Cryst

D55: 849-861. Toth EA & Yeates TO (2000) The structure of adenylosuccinate lyase, an enzyme with dual

activity in the de novo purine biosynthetic pathway. Structure 8: 163-174. Tretiakova MS, Sahoo S, Takahashi M, Turkyilmaz M, Vogelzang NJ, Lin F, Krausz T, The BT &

Yang XJ (2004) Expression of α-methylacyl-CoA racemase in papillary renal cell carcinoma. Am J Surg Pathol 28: 69-76.

Tsai M, Sampaleanu LM, Greene C, Creagh L, Haynes C & Howell PL (2004) A duck δ1 crystallin double loop mutant provides insight into residues important for argininosuccinate lyase activity. Biochemistry. 43: 11672-11682.

Turner MA, Simpson A, Mcinnes RR & Howell PL (1997) Human argininosuccinate lyase: a structural basis for intragenic complementation. Proc Natl Acad Sci USA 94: 9063-9068.

Vagin A & Teplyakov A (1977) MOLREP: an automated program for molecular replacement. J Appl Cryst 30: 1022-1025.

Vallee F, Turner M, Lindley P & Howell P (1999) Crystal structure of an inactive duck δ II crystallin mutant with bound argininosuccinate. Biochemistry 38: 2425-2434.

Van Veldhoven PP, Croes K, Asselberghs S, Herdewijn P & Mannaerts GP (1996) Peroxisomal β-oxidation of 2-methyl-branched acyl-CoA esters: stereospecific recognition of the 2S-methyl compounds by trihhydroxycoprostanoyl-CoA oxidase and pristanoyl-CoA oxidase. FEBS Lett 388: 80-84.

Van Veldhoven PP, Croes K, Casteels M & Mannaerts GP (1997) 2-methylacyl-CoA racemase: a coupled assay based on the use of pristanoyl-CoA oxidase/peroxidase and reinvestigation of its subcellular distribution in rat and human liver. Biochim Biophys Acta 1347:62-68.

Verhoeven NM, Wanders RJ, Poll-The BT, Saudubray JM & Jakobs C (1998) The metabolism of phytanic acid and pristanic acid in man: a review. J Inherit Metab Dis 21: 697-728.

Walsh C (1979) Enzymatic reaction mechanism. W.H. Freemann San Francisco. Watanabe A, Yoshimura T, Mikami B, Hayashi H, Kagamiyama H & Esaki N (2002) Reaction

mechanism of alanine racemase from Bacillus stearothermophilus: X-ray crystallographic studies of the enzyme bound with N-(5'-phosphopyridoxyl)alanine. J Biol Chem 277: 19166-19172.

Weaver T & Banaszak L (1996) Crystallographic studies of the catalytic and a second site in fumarase C from Escherichia coli. Biochemistry 35: 13955-13965.

Welsh JB, Sapinso LM, Su AI, Kern SG, Wang-Rodriguez J, Moskaluk CA, Frierson HF, Jr. & Hampton GM (2001) Analysis of gene expression identifies candidate markers and pharmacological targets in prostate cancer. Cancer Res 61: 5974-5978.

Page 98: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

96

Williams SE, Woolridge EM, Ransom SC, Landro JA, Babbitt PC & Kozarich JW (1992) 3-Carboxy-cis,cis-muconate lactonizing enzyme from Pseudomonas putida is homologous to the class II fumarase family: a new reaction in the evolution of a mechanistic motif. Biochemistry 31: 9768-9776.

Wilmouth RC, Edman K, Neutze R, Wright PA, Clifton IJ, Schneider T, Schofield CJ & Hajdu J (2001) X-ray snapshots of serine protease catalysis reveal a tetrahedral intermediate. Nat Struct Biol 8: 689-694.

Wolodko WT, Fraser ME, James MN & Bridger WA (1994) The crystal structure of succinyl-CoA synthetase from Escherichia coli at 2.5 Å resolution. J Biol Chem 269: 10883-10890.

Wu CY, Lee HJ, Wu SH, Chen ST, Chiou SH & Chang GG (1998) Chemical mechanism of the endogenous argininosuccinate lyase activity of duck lens δ2-crystallin. Biochem J 333: 327-34.

Wu YT (1991) Screening for inborn errors of amino acid metabolism. Ann Clin Lab Sci 21: 123-142.

Xu J, Stolk JA, Zhang X, Silva SJ, Houghton RL, Matsumura M, Vedvick TS, Leslie KB, Badaro R & Reed SG (2000) Identification of differentially expressed genes in human prostate cancer using subtraction and microarray. Cancer Res 60: 1677-1682.

Yamauchi T, Choi SY, Okada H, Yohda M, Kumagai H, Esaki N & Soda K (1992) Properties of aspartate racemase, a pyridoxal 5'-phosphate-independent amino acid racemase. J Biol Chem 267: 18361-18364.

Yang J, Wang Y, Woolridge EM, Arora V, Petsko GA, Kozarich JW & Ringe D (2004) Crystal structure of 3-carboxy-cis,cis-muconate lactonizing enzyme from Pseudomonas putida, a fumarase class II type cycloisomerase: enzyme evolution in parallel pathways. Biochemistry 43: 10424-10434.

Yohda M, Okada H & Kumagai H (1991) Molecular cloning and nucleotide sequencing of the aspartate racemase gene from lactic acid bacteria Streptococcus thermophilus. Biochim Biophys Acta 1089: 234-240.

Zeelen JP, Hiltunen JK, Ceska TA & Wierenga RK (1994) Crystallization experiments with 2-enoyl-CoA hydratase, using an automated 'fast-screening' crystallization protocol. Acta Cryst D50: 443-447.

Zha S, Ferdinandusse S, Denis S, Wanders RJ, Ewing CM, Luo J, De Marzo AM & Isaacs W (2003) α-methylacy-CoA racemase as an androgen-independent growth modifier in prostate cancer. Cancer Res 60: 1677-1682.

Zhou M, Chinnaiyan AM, Kleer CG, Lucas PC & Rubin MA (2002) Alpha-Methylacyl-CoA racemase: a novel tumor marker over-expressed in several human cancers and their precursor lesions. Am J Surg Pathol 26: 926-931.

Page 99: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

Original articles

I Bhaumik P, Koski MK, Bergmann U & Wierenga RK (2004) Structure determination and refinement at 2.44 Å resolution of argininosuccinate lyase from Escherichia coli. Acta Cryst D60: 1964-1970.

II Savolainen K, Bhaumik P, Schmitz W, Kotti TJ, Conzelmann E, Wierenga RK & Hiltunen JK (2005) α-Methylacyl-CoA racemase from Mycobacterium tuberculosis: mutational and structural characterization of the active site and the fold. J Biol Chem 280: 12611-12620.

III Bhaumik P, Schmitz W, Hassinen A, Hiltunen JK, Conzelmann E & Wierenga RK (2006) The 1,1-proton transfer reaction mechanism by α-methylacyl-CoA racemase is catalyzed by an aspartate/histidine pair and involves a smooth, methionine rich surface for binding the fatty acyl moiety. Manuscript.

The permission from the following copyright owners to reproduce the original articles have been obtained:

I International Union of Crystallography

II The American Society for Biochemistry and Molecular Biology

Original papers are not included in the electronic version of the dissertation.

Page 100: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies
Page 101: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

A C T A U N I V E R S I T A T I S O U L U E N S I S

Distributed byOULU UNIVERSITY LIBRARY

P.O. Box 7500, FI-90014University of Oulu, Finland

Book orders:OULU UNIVERSITY PRESSP.O. Box 8200, FI-90014University of Oulu, Finland

S E R I E S A S C I E N T I A E R E R U M N A T U R A L I U M

444. Kiviahde, Maija (2005) Effects of authentic learning and e-learning in anintroductory chemistry laboratory course

445. Meissner, Kristian (2005) Consequences of predator-prey interactions in borealstreams. Scaling up from processes to large-scale patterns

446. Oksanen, Pirita O. (2005) Development of palsa mires on the northern Europeancontinent in relation to Holocene climatic and environmental changes

447. Harjumaa, Lasse (2005) Improving the software inspection process with patterns

448. Ylianttila, Mari (2005) Structure-function studies of the peroxisomalmultifunctional enzyme type 2 (MFE-2)

449. Moisio, Kari (2005) Numerical lithospheric modelling: rheology, stress anddeformation in the central Fennoscandian Shield

450. Pöykkö, Heikki (2005) Host range of lichenivorous moths with special referenceto nutritional quality and chemical defence in lichens

451. Kinnula, Marianne (2006) The formation and management of a softwareoutsourcing partnership. A case study

452. Autio, Jyrki (2006) Environmental factors controlling the position of the actualtimberline and treeline on the fells of Finnish Lapland

453. Rautiainen, Pirjo (2006) Population biology of the Primula sibirica group speciesinhabiting frequently disturbed seashore meadows: implications for management

454. Taskinen, Jukka (2006) Protein crystallographic studies of CoA-dependentproteins: new insight into the binding mode and exchange mechanism of acyl-CoA

455. Molin-Juustila, Tonja (2006) Cross-functional interaction during the early phasesof user-centered software new product development: reconsidering the commonarea of interest

456. Thomson, Robert L. (2006) Breeding habitat selection and its consequences inboreal passerines. Using the spatial dispersion of predators and heterospecifics asa source of information

457. Iivari, Netta (2006) Discourses on "culture" and "usability work" in softwareproduct development

458. Vähäoja, Pekka (2006) Oil analysis in machine diagnostics

459. Mutanen, Marko (2006) Genital variation in moths—evolutionary and systematicperspectives

A460etukansi.fm Page 2 Friday, May 26, 2006 11:40 AM

Page 102: Protein crystallographic studies to understand the …jultika.oulu.fi/files/isbn9514280911.pdf · PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND ... Protein crystallographic studies

U N I V E R S I TAT I S O U L U E N S I SACTAA

SCIENTIAE RERUMNATURALIUM

OULU 2006

A 460

Prasenjit Bhaumik

PROTEIN CRYSTALLOGRAPHIC STUDIES TO UNDERSTAND THE REACTION MECHANISM OF ENZYMES:α-METHYLACYL-CoA RACEMASE AND ARGININOSUCCINATE LYASE

FACULTY OF SCIENCE,DEPARTMENT OF BIOCHEMISTRY, BIOCENTER OULU,UNIVERSITY OF OULU

ABCDEFG

UNIVERS ITY OF OULU P.O. Box 7500 F I -90014 UNIVERS ITY OF OULU F INLAND

A C T A U N I V E R S I T A T I S O U L U E N S I S

S E R I E S E D I T O R S

SCIENTIAE RERUM NATURALIUM

HUMANIORA

TECHNICA

MEDICA

SCIENTIAE RERUM SOCIALIUM

SCRIPTA ACADEMICA

OECONOMICA

EDITOR IN CHIEF

EDITORIAL SECRETARY

Professor Mikko Siponen

Professor Harri Mantila

Professor Juha Kostamovaara

Professor Olli Vuolteenaho

Senior assistant Timo Latomaa

Communications Officer Elna Stjerna

Senior Lecturer Seppo Eriksson

Professor Olli Vuolteenaho

Publication Editor Kirsti Nurkkala

ISBN 951-42-8090-3 (Paperback)ISBN 951-42-8091-1 (PDF)ISSN 0355-3191 (Print)ISSN 1796-220X (Online)

A 460

AC

TA P

rasenjit Bhaum

ikA460etukansi.fm Page 1 Friday, May 26, 2006 11:40 AM