membrane proteins and drug discovery: what is the...

34
Drug Discovery Today: Informatics (Computational Drug Discovery) Computational Analysis of Membrane Proteins: the Largest Type of Drug Targets Yalini Arinaminpathy* ,1 , Donald M. Engelman 1 , Mark B. Gerstein 1 Address: 1 Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520-8114, USA * Corresponding Author: Dr Yalini Arinaminpathy; E-mail: [email protected] 25 – 30 Word Teaser: The sheer pharmacological importance of membrane proteins contrasts starkly with their limited number of known structures. Computational tools are valuable in bridging the gap between their structure, function and mechanism. Keywords: Membrane Proteins, Molecular Dynamics, KcsA, GPCR, ion channels. 1 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 19 20 21 22 23 24 25 2

Upload: others

Post on 10-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

Computational Analysis of Membrane Proteins: the Largest Type of Drug Targets

Yalini Arinaminpathy*,1, Donald M. Engelman1, Mark B. Gerstein1

Address:

1 Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney

Avenue, New Haven, CT 06520-8114, USA

* Corresponding Author:

Dr Yalini Arinaminpathy; E-mail: [email protected]

25 – 30 Word Teaser: The sheer pharmacological importance of membrane proteins

contrasts starkly with their limited number of known structures. Computational tools are

valuable in bridging the gap between their structure, function and mechanism.

Keywords: Membrane Proteins, Molecular Dynamics, KcsA, GPCR, ion channels.

1

1

123456789

10

1112

13

16

17

18

19

2

Page 2: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

Abstract

The biological importance of integral membrane proteins, from the level of the

cell to entire organism is highlighted by the profound effects of various genetic

deficiencies in transporters and channels. Given their key roles and significance, it is

therefore necessary to understand their structures and thence mechanisms and regulation,

at the molecular level. Membrane proteins represent approximately 30% of currently

sequenced genomes. Paradoxically, however, at present, > 45,000 crystal structures are

deposited in the protein data bank (PDB), of which only 2% are of membrane proteins.

Interestingly, 971 protein folds are currently known, of which only 5% (49 folds) are of

membrane proteins. The great disparity between our understanding of soluble proteins

and membrane proteins has occurred largely due to the many practical problems of

working with membrane proteins, specifically difficulties in expression, purification and

crystallization. Thus, technological developments have been increasingly utilized in order

to make crucial advances in understanding membrane protein structure and function.

2

3

20

21

22

23

24

25

26

27

28

29

30

31

32

33

4

Page 3: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

1.1 Introduction

Integral membrane proteins play essential roles in numerous physiological

functions, such as molecular recognition, energy transduction and ion regulation. Despite

the experimental challenges of studying these proteins, they are critical to understand

since they represent more than 60% of drug targets [1], [2]. For example, G-protein

coupled receptors (GPCRs), a class of membrane proteins, are intensively studied using

computational resources since the malfunction of these receptors results in serious

disorders such as hypertension, congestive heart failure, stroke and cancer. On a similar

scale, genetic disorders of ion channels result in ‘channelopathies’ such as cystic fibrosis,

Bartter syndrome and paralysis. Therefore, ongoing technological advances are exploited

to study membrane proteins, in order to improve or develop novel pharmacological drug

targets.

The availability of complete or partial genome sequences for a number of

organisms from a number of domains including the eubacterial, archaean and eukaryotic

domains now makes possible much more detailed studies of membrane protein topology.

Compounded by their genomic abundance, the use of computational tools in this field is

essential and timely. In combination with the advancement of simulation techniques, the

advent of structural genomics has spurred the membrane protein field to consider high-

throughput methods, which can help redress the disparity between soluble proteins and

membrane proteins. Indeed, numerous bioinformatics and proteomic analyses (e.g.[3],

[4], [5], [6], [7]) have been carried out to examine membrane protein architecture and

3

5

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

6

Page 4: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

even to closely analyze detailed stabilizing and mediating interactions between

transmembrane helices in membrane proteins.

Membrane proteins are, in many respects, easier to investigate computationally

than experimentally, due to the uniformity of their structure and interactions [8], [9]. The

high propensity to form secondary structures reduces the number of degrees of freedom,

which determine the protein’s fold, and hence lowers the complexity of predicting the

structures of these proteins. Computational techniques represent key methods for relating

the few static experimental membrane protein structures to dynamic biological systems,

thereby yielding maximum benefit from the limited structural and mechanistic

information available. The computational techniques employed in the biological arena to

explore membrane proteins are dizzyingly vast, thus in this review, we will focus on one

intensively utilized technique: MD simulations.

1.2 Membrane Protein Structure

Membrane proteins are essentially divided into two main classes: some contain a

significant portion of their mass within the interior of the membrane (intrinsic or integral

membrane protein (IMP)) while other proteins are only associated to the membrane

surface (extrinsic or peripheral proteins). For IMPs, two common structural motifs have

been observed for the transmembrane (TM) domains of membrane proteins: (1) α-helical

or (2) a β-sheet topology [10]. These two folds [Figure 1] are the simplest solutions to

satisfying the hydrogen bonding potential of the polypeptide backbone amide groups

within the lipid bilayer. The majority of IMPs display α-helical transmembrane segments

4

7

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

8

Page 5: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

and can be further divided into two types: bitopic (those which traverse the lipid bilayer

with a single α-helix) and polytopic (an α-helical bundle).

Membrane proteins that are α-helical typically form well-packed bundles as, for example,

found in bacteriorhodopsin, photosynthetic reaction centers and cytochrome C oxidase.

Formation of β-sheets is seen in bacterial outer membrane proteins (e.g. OmpA [11] and

FecA [12]), which span the membrane as β-barrels.

1.3 Experimental structure determination

The disparity between our knowledge of soluble proteins and membrane proteins

is largely due to the practical difficulties involved in expressing and crystallizing the

latter [13]. Their inherent membrane-bound nature makes structure determination a

particular challenge, and thus requires special treatment. This is particularly true for α-

helical membrane proteins as they tend to be hydrophobic and are therefore difficult to

unfold and refold in vitro. β-barrel proteins, on the other hand, are more hydrophilic and

amenable to the traditional methods of denaturation and refolding into detergents, or

directly into lipids [14], [15].

Three main bottlenecks exist with obtaining structural information of membrane

proteins. Firstly, it is difficult to obtain the protein of interest since membrane proteins

5

9

77

78

79

80

81

82

83848586878889909192939495

96

97

98

99

100

101

102

103

104

105

10

Page 6: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

are usually only present in the cell at low concentrations. Overexpression is therefore a

necessity for the majority of membrane proteins that cannot be readily obtained in

sufficient amounts from their native environments [16]. Many different expression

systems are used though each have their drawbacks, including low yield (often due to

toxicity), heterogeneous post-translational modification, low stability and partial

proteolysis [13]. The majority of membrane protein crystal structures result from proteins

that naturally occur at high concentrations or have been overexpressed in a homologous

system. Secondly, membrane proteins are naturally embedded in a heterogeneous

dynamic environment of the mosaic lipid bilayer [Figure 2] and it is impossible to use

high-resolution experimental techniques in their native environment.

The proteins therefore need to be extracted from the native membrane and studied in

detergent or lipid environment in vitro, which leads to difficulties in sample preparation

for biophysical methods, such as X-ray crystallography and NMR. However, cryo-

electron microscopic analysis differs from these techniques in that it can be used to study

membrane proteins in a crystalline or non-crystalline state at high resolution [17]. This

6

11

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123124

125

126

127

128

129

12

Page 7: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

has enabled, for example, the structure of bacteriorhodopsin to be analyzed to a resolution

of 2.8Å [18], [19]. Thirdly, membrane proteins are generally insoluble in aqueous

solution hence detergents are required in concentrations above the critical micellar

concentration (CMC). Too much detergent can denature the protein or impede

crystallization by phase separation, whilst too little and the protein may become

insoluble. The production of three-dimensional (3D) or two-dimensional (2D) crystals

remains one of the major challenges in obtaining structural information.

Despite the inherent difficulties in studying the structure of membrane proteins,

they remain a crucial area of study due to their essential role in the control of important

biochemical processes. A number of experimental methods exist and are continually

being developed with the aid of technological tools to extract structural information on

membrane proteins. Spectroscopic methods [20], [21] such as vibrational spectroscopy,

Raman, FTIR and circular dichroism (CD) have been utilized to determine their

secondary structure and to help distinguish between competing models of structure or

function. Bacteriorhodopsin [22], the acetylcholine receptor [23], lactose permease [24]

and the outer membrane proteins of E. coli [25] are examples of membrane proteins

whose secondary structure content have been determined with such techniques.

Alternative methods, namely crystallographic techniques, have since been used to

determine high-resolution structures. Three methods are generally employed: electron

microscopy, NMR and X-ray crystallography. Despite the difficulties involved in

generating large and sufficiently well-ordered 3D crystals, X-ray crystallography is still

the most successful and least difficult technique for obtaining high-resolution structures.

Electron crystallography [26] and atomic force microscopy (AFM) [27], [28] are also

7

13

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

14

Page 8: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

used as methods for membrane proteins whose natural propensity is to form 2D arrays

[29].

Each method has its own advantages and hence the structural data obtained are

complementary. All methods are generally used in parallel in an attempt to achieve the

best structural description of a membrane protein. In addition, methods are continually

being developed, with the use of computational resources, leading to an increasing rate of

membrane protein structure determination. From the plot [Figure 3], the growth of known

structures is exponential with a growth rate of ~1.3-fold per year, but lags behind the rate

for soluble proteins during the equivalent time period. Assuming a continuous

exponential rate of growth in the number of structures determined, we would expect close

to 300 structures to be known by the end of 2008. Despite the increasing rate of structure

determination, improved structure prediction methods combined with computational tools

are important in studying membrane proteins.

8

0

0.5

1

1.5

2

2.5

3

1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007

Year

Log

(cum

ulat

ive

num

ber o

f str

uctu

res)

15

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

16

Page 9: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

1.4 Computational Structure Determination

Computer simulation methods have provided key insights into the general nature

of protein motion and aspects of motion linked to the function of proteins in their native

state. They are rapidly becoming a standard tool to study the structure and dynamics of

membrane proteins. With the increasing number of high-resolution structures of

membrane proteins, a wide range of membrane proteins can now be simulated over time

spans that capture essential biological processes. Whilst X-ray structures of membrane

proteins provide static, spatially and temporally averaged snapshots of the proteins in

specific crystal environments, simulations enable us to explore the structural dynamics of

the proteins in an attempt to bridge the gap between structure and function of proteins. By

employing computational methods one can combine both experimental and theoretical

data of ion channels in order to describe their physiological properties in terms of

underlying physical processes.

One of the main challenges is to relate molecular structures to the physiological

properties of the protein. For ion channels this has been addressed by describing the

structure of the channel at varying levels of detail and accuracy, where the key regions

can be broken down into the transbilayer pore, the selectivity filter and the gate. A wide

variety of computational approaches such as molecular dynamics (MD) simulations (e.g.

[30], [31], [32]), continuum electrostatic Poisson-Boltzmann (PB) theory [33], [34],

Brownian dynamics (BD) [35], [36], electrodiffusion theory [37] have helped to refine

our understanding of the molecular determinants of channel function. MD arguably

provides the most detailed information in the theoretical studies of membrane proteins.

9

17

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

18

Page 10: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

In an MD simulation, all atoms in the system (including ions and water

molecules) are represented explicitly. In the classical fine-grained approach, simulations

are typically carried out using empirically determined pairwise interaction potentials

between the atoms. Another MD approach, known as ab initio MD, uses interactions

between atoms determined from first principles electronic structure calculations. Since

there are no free parameters with this approach, it could potentially be the ideal approach

to modeling ion channels. However, due to the extremely demanding nature of the

computations, its applications are currently limited to very small systems.

With the atomistic approach of classical MD simulations, rapid advances in

simulation methodologies and computational power has led to accessible timescales of up

to 0.1 s. Amongst all structurally known membrane proteins, the ion channels (KvAP,

KcsA), transporters (AQP, GlpF, ABC transporter), and outer membrane proteins

(OmpA) have been examined via simulation in particular detail. The methods described

will be briefly compared using the bacterial K+ channel, KcsA, as a case study. This is the

first biological ion channel whose tertiary structure was elucidated [38] and has been

studied extensively in terms of ion selectivity, permeation and gating.

1.5 Comparisons between MD techniques

MD simulations on KcsA have been employed to examine channel selectivity, ion

permeation and ion transport energetics in potassium channels, with the main focus being

on the selectivity filter and understanding the permeation properties of K+ ions in the

filter and cavity region. Many of the results from MD simulations based on realistic all-

10

19

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

20

Page 11: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

atom models have been consistent with the information obtained from high-resolution

structural data [39].

The scope of MD simulations can be extended by using commonly utilized

algorithms. For example, three methods employed are (i) umbrella sampling [40]; (ii) the

application of external forces to the system, such as using an expanding sphere inside the

pore at the gate region of the KcsA channel to induce gating [41], and with the

application of steered-MD; (iii) alchemical free energy perturbation (FEP) [42][43].

Although results from these techniques increase our confidence in MD, we cannot build a

complete picture of ion permeation if ion fluxes cannot be simulated or if channel

conductance cannot be calculated. Single-channel measurements reveal the net

translocation of one ion in KcsA to be in 10-20 ns [44] which is the order of timescales

accessible by MD simulations [42][45]).

Despite the significant increase in computational power and molecular dynamics

methods, the direct simulation of ionic fluxes across channels and large conformational

changes using an atomistic description remains computationally challenging at present.

This is due to time scales of 10 – 100 ns being much shorter than the typical timescale for

allosteric effects which is usually in the order of microseconds to milliseconds. Thus,

biased MD methods, such as steered MD (e.g. [46]) and targeted MD (e.g. [47]) have

been developed to circumvent these time scale limitations. However, the above described

weakness of MD is the strength of Brownian Dynamics (BD). The drawback of BD is,

however, the comparatively poor description or parameterization of the biological system

simulated. Thus, diffusion coefficients and free energies for example, which can be

11

21

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

22

Page 12: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

determined with MD, cannot be calculated with BD. Hence, both BD and MD are

complementary.

In BD, ion permeation can be simulated for sufficiently long to measure channel

conductance without having to treat a system in all atomic details explicitly. BD

simulations (for example on KcsA [48] [49]) treat protein atoms forming the channel as

rigid, and the water implicitly as a dielectric continuum performing Brownian motion.

Despite these severe limitations of the continuum electrostatic approximation and the

assumption of a rigid channel structure, BD simulations [50] confirmed the multi-ion

mechanism to be in agreement with the ion flux determined experimentally [51]. The

ability to compute current flow across ion channels confers a distinct advantage to BD

simulations over other techniques with the applications of BD to calculating current and

voltage conductance in ion channels. The assumptions in BD of treating the water-protein

interface as a rigid boundary and the treatment of water in a narrow pore as a continuum

are simplifications since proteins (and lipid bilayers) are in fact dynamical, undergoing

fluctuations on a picosecond timescale, which is much more rapid that the timescale for

ion permeation.

The treatment of the water-protein boundary has been modified in some studies in

an attempt to reduce its simplified stochastic nature. For example, an elaborate treatment

of boundaries was proposed [52][53] using a grand canonical Monte Carlo (GCMC)

method. However, comparison of BD using a simple stochastic boundary and the GCMC

boundary [54] revealed no significant differences with the results obtained when the

boundaries were at a reasonable distance from the channel. MD is also the preferred

12

23

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263

264

24

Page 13: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

technique for size-dependent selectivity among ions with the same valence, since such

ions cannot be distinguished in BD. Whilst microscopic quantities can be deduced from

BD, the increased high-level detail adopted by MD and Monte Carlo (MC) algorithms

enables the analysis of large-scale conformational changes. For example, one study [55]

explored the conformational changes between the open (KcsA crystal structure [38] and

closed forms (model generated from MthK [56]) of KcsA. The simulation of the large-

scale conformational transition was run by imposing lateral forces to the C-termini of the

inner helices and minimizing the energy at each step. As a result of the applied forces the

inner helices converged to form a tightly packed structure, with a change in backbone

geometry in the central region.

Whilst MD provides the most detailed information about the dynamics of ion

channels, currently accessible simulation times are its greatest limitation. Hence it is

unclear at present if the results obtained from MD simulations reflect reality or are an

artifact of the method. However, this problem seems sure to be surmounted in the future

with the doubling of computer speeds over the years. In the meantime faster, more

coarse-grained methods are being employed [57][58] to calculate the conductance of ion

channels. In addition, free energy methods such as replica exchange and ensemble

dynamics [59][60] are methods that are becoming increasingly viable with increasing

computational power. Despite such hurdles, MD simulations in combination with other

computational tools such as homology modelling, and experimental studies such as

mutagenesis analysis, have proved to be essential in the study of membrane protein

structure and function, which in turn enables the development of novel pharmacological

drug targets.

13

25

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

26

Page 14: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

1.6 Conclusions and Future Outlooks

Integral membrane proteins (IMPs) perform key functions in regulating the

physiological state of the cell. This is especially true for receptors and ion channels that

control, for example, the transmembrane (TM) potential. The scarcity of IMP structures is

due to the fact that the route from membrane protein sequences to atomic-resolution

structures is not as straightforward as for their soluble counterparts. This is, in turn,

primarily due to the substantial difficulties with overexpression and crystallization of

IMPs. Thus, the use of computational tools such as protein simulation methods, in

combination with experimental and structural genomic studies is becoming increasingly

valuable in studying the structure and function of membrane proteins.

The explosion of genomic data in combination with huge advances in

computational resources and experimental techniques is leading to a greater

understanding of biological structure, function and mechanisms. Considering the

dramatic advancements in molecular dynamics simulation methodologies in recent years,

it is likely that current drawbacks will be overcome considerably in the near future.

Reassuringly, over the last few years there has been a dramatic increase in the number of

membrane protein crystal structures obtained, with 30 structures being solved in 2006

alone.

Unsurprisingly, given the immense computing power and wealth of genomic and

structural data, recent years have seen a rise in structural genomics initiatives primarily

focusing on membrane proteins in an attempt to harness the synergy between the growing

data and technology available. Examples of such initiatives include the (1) Swiss

14

27

288

289

290

291

292

293

294

295

296

297

298

299

300

301

302

303

304

305

306

307

308

309

28

Page 15: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

National Center of Competence in Research (NCCR; www.structralbiology.ethz.ch), (2)

Membrane Protein Network (MePNet; www.mepnet.org), (3) European Membrane

Proteins (E-MeP; www.e-mep.org), (4) Protein Wide Analysis of Membrane Proteins

(ProAMP; www.pst-ag.com), (5) Biological Information Research Center, Japan (BRIC;

unit.aist.go.jp/birc) and the (6) Membrane Protein Structure Initiative (MPSI;

www.mpsi.ac.uk). At present, the large majority of crystallized membrane proteins are

bacterial proteins, thus there is an urgent need to obtain structures of eukaryotic

membrane proteins as these could be potential drug targets. In this respect, structural

genomics initiatives are essential for rapidly increasing the structure determination

throughput of eukaryotic membrane proteins. Interestingly, this situation is analogous to

that of soluble proteins; slow structure determination in the 1970s was followed by an

exponential increase of structures generated due to improved experimental protocols.

The paradox posed by the sheer number of potential helical membrane proteins

and the lack of high-resolution structural and thermodynamic information for them

emphasizes the extensive work that remains to be done in the field of membrane proteins.

The potential payoff may be great as this class of proteins has historically contained

excellent targets for therapeutics. Advances in our ability to understand and manipulate

membrane proteins may lead to the discovery or design or pharmaceutical agents that can

modulate their functions.

15

29

310

311

312

313

314

315

316

317

318

319

320

321

322

323

324

325

326

327

328

30

Page 16: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

References

[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New Approaches to Maximize the Impact of GPCRs in Drug Discovery. Expert Opinion on Therapeutic Targets 8, 165-170.

[2] Terstappen, G. C., and Reggiani, A. (2001) In silico research in drug discovery. Trends Pharmacol. Sci. 22, 23-26.

[3] Xia, Y. et al. (2006) Integrated prediction of the helical membrane protein interactome in yeast. J. Mol. Biol. 357, 339-349.

[4] Hurwitz, N. et al. (2006) Towards genome-scale structure prediction for transmembrane proteins. Phil. Trans. Roy. Soc. ser. B 361, 465-475.

[5] Lundstrom, K. (2005) Structural Genomics of GPCRs. Trends Biotech. 23, 103-108.

[6] Walian, P. et al. (2004) Structural Genomics of Membrane Proteins. (2004) Genome Biol. 5, 215.

[7] Lehnert, U. et al. (2004) Computational analysis of membrane proteins: genomic occurrence, structure prediction and helix interactions. Q. Rev. Biophys. 37, 121-146.

[8] Sachs, J.N. and Engelman, D.M. (2006) Introduction to Membrane Protein Reviews: Interplay between Structure, Dynamics, and Environment in Membrane Protein Function. Annu. Rev. Biochem. 75, 707-712.

[9] Engelman, D.M. et al. (2003) Membrane Protein Folding: Beyond the Two Stage Model. (2003) Febbs Lett. 555, 122-125.

[10] Cowan, S. W., and Rosenbusch, J. P. (1994) Folding pattern diversity of integral membrane-proteins. Science 264, 914-916.

[11] Pautsch, A., and Schulz, G. E. (1998) Structure of the outer membrane protein A transmembrane domain. Nature Struct. Biol. 5, 1013-1017.

[12] Ferguson, A. D. et al. (2002) Structural basis of gating by the outer membrane transporter FecA. Science 295, 1715-1719.

[13] Grisshammer, R., and Tate, C. G. (1995) Overexpression of integral membrane-proteins for structural studies. Q. Rev. Biophys. 28, 315-422.

16

31

329

330331332333334335

336337

338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368

32

Page 17: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

[14] Buchanan, S. K. (1999) ß-Barrel proteins from bacterial outer membranes: structure, function and refolding. Curr. Opin. Struct. Biol. 9, 455-461.

[15] Schulz, G. E. (2000) -Barrel membrane proteins. Curr. Opin. Struct. Biol. 10, 443-447.

[16] Tate, C. G. (2001) Overexpression of mammalian integral membrane proteins for structural studies. FEBS Lett. 504, 94-98.

[17] Chiu, W. (1993) What does electron cryomicroscopy provide that X-ray crystallography and NMR spectroscopy cannot? Ann. Rev. Biophys. Biomol. Struct. 22, 233-355.

[18] Baldwin, J. M., Henderson, R., Beckmann, E., & Zemlin, F. (1998) Images of purple membrane at 2.8Å resolution obtained by cryo-electron microscopy. J. Mol. Biol. 202, 585-591.

[19] Fujiyoshi, Y. (1998) The structural study of membrane proteins by electron crystallography. Adv. Biophys. 35, 25-80.

[20] Bazzi, M. and Woody, R. W. (1985) Oriented secondary structure in integral membrane proteins. Biophys. J. 48, 957-966.

[21] Krimm, S. and Bandekar, J. (1986) Vibrational spectroscopy and conformation of peptides, polypeptides and proteins. Adv. Protein Chem. 38, 183-364.

[22] Glaeser, R. M. et al. (1991) What spectroscopy can still tell us about the secondary structure of bacteriorhodopsin. Biophys. J. 59, 934-938.

[24] Yager, P. et al. (1984) The secondary structure of acetylcholine receptor reconstituted in a single lipid component as determined by Raman spectroscopy. Biophys. J. 45, 26-28.

[25] Vogel, H. et al. (1985) The structure of the lactose permease derived from Raman spectroscopy and prediction methods. EMBO J. 4, 3625-3631.

[25] Vogel, H. and Jähnig, E. (1986) Models for the structure of outer-membrane proteins of Escherichia coli derived from raman spectroscopy and prediction methods. J. Mol. Biol. 190, 191-199.

[26] Saibil, H. R. (2000) Conformational changes studied by cryo-electron microscopy. Nature 7, 711-717.

[27] Clausen-Schaumann, et al. (2000) Force spectroscopy with single bio-molecules. Curr. Opin. Chem. Biol. 4, 524-530.

17

33

369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414

34

Page 18: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

[28] Scheuring, S. et al. (2002) Sampling the conformational space of membrane protein surfaces with the AFM. Eur. Biophys. J. 31, 172-178.

[29] Stahlberg, H. et al. (2001) Two-dimensional crystals: a powerful approach to assess structure, function and dynamics of membrane proteins. FEBS Lett. 504, 166-172.

[30] Sotomayor, M., Vasquez, V., Perozo, E. & Schulten, K. (2007) Ion Conduction through MscS as Determined by Electrophysiology and Simulation. Biophys. J. 92, 886-902.

[31] Saparov, S.M. et al. (2007) Fast and Selective Ammonia Transport by Aquaporin-8. J. Biol. Chem. 282, 5296-5301.

[32] Mukherjee, P. et al. (2006) Picosecond dynamics of a membrane protein revealed by 2D IR. Proc. Natl. Acad. Sci. U.S.A. 103, 3528-3533.

[33] Peter, C. and Hummer, G. (2005) Ion Transport through Membrane-Spanning Nanopores Studied by Molecular Dynamics Simulations and Continuum Electrostatics Calculations Biophys. J. 89, 2222-2234.

[34] Sengupta, D. et al. (2005) Understanding the energetics of helical peptide orientation in membranes. Proteins 58, 913-922.

[35] Sieber, J.J. et al. (2007) Anatomy and Dynamics of a Supramolecular Membrane Protein Cluster. Science 317, 1072-1076

[36] Jin, S. et al. (2007) Single-Particle Tracking of Membrane Protein Diffusion in a Potential: Simulation, Detection, and Application to Confined Diffusion of CFTR Cl– Channels. Biophys. J. 93, 1079-1088.

[37] Coalson, R.D. (2005) Poisson-Nernst-Plank Theory approach to the calculation of current through biological ion channels. Nanobiosci. 4, 81-93.

[38] Doyle, D. A. et al. (1998) The structure of the potassium channel: molecular basis of K+ conduction and selectivity. Science 280, 69-77.

[39] Roux, B. (2002) Theoretical and computational models of ion channels. Curr. Opin. Struct. Biol. 12, 182-189.

[40] Crouzy, S. et al. (2001) Extracellular blockade of K+ channels by TEA: Results from molecular dynamics simulations of the KcsA channel. J. Gen. Physiol. 118, 207-217.

[41] Biggin, P. C. and Sansom, M. S. P. (2002) Open-state models of a potassium channel. Biophys. J. 83, 1867-1876.

18

35

415416417418

419420421

422423

424425

426427428

429430

431432433434435436437438439440441442443444445446447448449450451452453

36

Page 19: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

[42] Åqvist, J. and Luzhkov, V. (2000) Ion permeation mechanism of the potassium channel. Nature 404, 881-884.

[43] Berneche, S. and Roux, B. (2002) The Ionization State and the Conformation of Glu-71 in the KcsA Channel. Biophys. J. 82, 772-780.

[44] LeMasurier, M. et al. (2001) KcsA: It's a potassium channel. J. Gen. Physiol. 118, 303-313.

[45] Domene, C. and Sansom, M. S. P. (2003) Potassium channel, ions and water: simulation studies based on the high-resolution X-ray structure of KcsA. Biophys. J. 85, 2787-2800.

[46] Treptow, W. and Tarek, M. (2006) K+ conduction in the selectivity filter of potassium channels is monitored by the charge distribution along their sequence. Biochem. 91, L81-L83.

[47] Compoint, P. et al. (2006) Targeted Molecular Dynamics of an open-state KcsA channel. J. Chem. Phys. 13, 134707.

[48] Allen, T. W. and Chung, S. H. (2001) Brownian dynamics study of an open-state KcsA potassium channel. Biochim. Biophys. Acta. 1515, 83-91.

[49] Burykin, A. et al. (2003) Exploring the origin of the ion selectivity of the KcsA potassium channel. Proteins: Structure Function and Genetics 52, 412-426.

[50] Chung, S. H. et al. (2002) Conducting-state properties of the KcsA potassium channel from molecular and Brownian dynamics simulations. Biophys. J. 82, 628-645.

[51] Ermak, D. L. and McCammon, J. A. (1978) Brownian dynamics with hydrodynamic interactions. J. Chem. Phys., 69, 1352-1360.

[52] Im, W. et al (2000) A Grand Canonical Monte Carlo-Brownian dynamics algorithm for simulating ion channels. Biophys. J. 79, 788-801.

[53] Woo, H.-J. et al. (2004) Grand canonical Monte Carlo simulations of water in protein environments. J. Chem. Phys. 121, 6392-6982.

[54] Corry, B. et al. (2002) Reservoir Boundaries in Brownian Dynamics Simulations of Ion Channels. Biophys. J. 82, 1975-1984.

[55] Tikhonov, D. B. and Zhorov, B. S. (2004) In Silico Activation of KcsA K+ channel by Lateral Forces Applied to the C-termini of Inner Helices. Biophys. J. 87, 1526-1536.

19

37

454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499

38

Page 20: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

[56] Jiang, Y., et al. (2002) The open pore conformation of potassium channels. Nature 417, 523-526.

[57] Brannigan, G. et al. (2007) Implicit solvent simulation models for biomembranes. Eur. Biophys. J. 35, 104-124.

[58] Lei, H. and Duan, Y. (2007) Improved sampling methods for molecular simulation. Curr. Opin. Struct. Biol. 17, 187-191.

[59] Bu, L. et al. Membrane Assembly of Simple Helix Homo-Oligomers studied via molecular dynamics simulations. Biophys. J. 92, 854-863.

[60] Chen, Z. and Xu, Y. (2006) Energetics and stability of transmembrane helix packing: A replica-exchange simulation with a knowledge-based membrane potential. Proteins: Structure Function and Genetics 62, 539-552

20

39

500501502503504505506507508509510511512513514

40

Page 21: Membrane proteins and drug discovery: what is the linkhomes.gersteinlab.org/.../Drug_Discovery_Review_Arinami…  · Web view[1] Davey, J. (2004) G-Protein-Coupled-Receptors: New

Drug Discovery Today: Informatics (Computational Drug Discovery)

Figure Legends

Figure 1: Two examples of membrane proteins. KcsA (left) (PDB: 1K4C) is a voltage-

gated K+ selective α-helical protein. OmpA (right) (PDB: 1QJP) is an example of a β-

barrel membrane protein. The dashed lines indicate the position of the bilayer.

Figure 2: Illustration of a membrane protein (KcsA, shown in purple) embedded in a lipid

bilayer. For clarity, the water molecules on either side of the lipid bilayer have not been

included. The hydrocarbon core of a membrane is typically ~25-30Å wide with the

headgroups spanning ~10Å. The polar head groups of the lipids face the aqueous

environment on both sides of the membrane, whereas their hydrophobic chains form the

insulating interior of the bilayer. Owing to the ester carbonyls and water associated to the

lipid headgroups, lipid molecules possess electrical dipoles which result in a considerable

electrical potential (positive inside the bilayer). (Figure generated using KcsA crystal

structure, PDB: 1K4C).

Figure 3: Rate at which new structures (α-helical and β-barrel) have been determined

since 1991. The count of membrane protein structures includes the same protein from

different organisms. The data suggests that there will be ~300 structures at the end of

2008. Data extracted from http://blanco.biomol.uci.edu/Membrane_Proteins_xtal.html

21

41

515516517

518

519

520521522

523

524

525

526

527

528

529

530

531532533

534

535

536

537

42