high throughput technique in structural bioinformatics- application to catalase, an enzyme of 57 kda...
Post on 14-Jan-2016
219 Views
Preview:
TRANSCRIPT
High Throughput Technique in Structural Bioinformatics-
Application to Catalase, an enzyme of 57 kDa molecular weight
By
Prof. D. VELMURUGAN
DEPARTMENT OF CRYSTALLOGRAPHY & BIOPHYSICS
UNIVERSITY OF MADRAS
GUINDY CAMPUS
CHENNAI – 600 025
One of the main interests in the molecular biosciences is in understanding structure function relations and X-ray crystallography plays a major role in this.
ab initio solutions of the crystal structures of small molecules are possible by using atomic-resolution diffraction data, usually at ~0.8 Å. Most of these small molecular crystal structures are usually solved using direct-methods programs.
Macromolecules have mainly been solved at resolutions less than atomic and this has necessitated determination of initial phases either from experimental techniques such as Molecular replacement techniques, MIR or MAD .
During the last decade, admirable advances have taken place in the data-collection facilities and techniques available to the macromolecular crystallographer. To get better X-ray intensity data for this purpose, new techniques like cryo temperature data collection, halide soaking and passing of Ar, Ne, Hg gas have been developed.
With the above advances, more data sets appear to be coming from atomic-resolution data. The above possibility of gaining atomic resolution data even for macromolecules prompted the direct methods practitioners to make attempts to extend the direct methods using other macromolecular techniques to enable them to tackle the structure solution of macromolecules.
X-ray Crystallography has become a central tool in modern drug and target discovery, providing important insights into molecular interactions and biological function. The past few years have seen many advances in the methods underlying macromolecular crystallography such as protein production, crystallization, cryo-crystallography and synchrotron technology. Together these advances mean that X-ray data can be collected extremely quickly for many different crystals and ligand-bound complexes. The challenge is to ensure rapid and accurate interpretation of the data to provide valuable structural information.
The High Throughput Crystallography (HTC) Consortium offers scientists a valuable new dimension to the drug discovery process. The HTC Consortium aims to accelerate crystallographic structure determination by developing new science as well as utilizing current technology to go from initial phasing through to structure refinement and analysis while minimizing the amount of human intervention that is required. The ability to examine in atomistic detail the interactions between many different proteins and ligands provides scientists unprecedented insight into the mechanics of drug binding.
Rapid and revolutionary developments in genome sciences, combinatorial chemistry, informatics and robotics are having major impacts on drug discovery. Genome sequencing projects in man and micro-organisms have provided an unprecedented number of potential drug targets. These have given impetus to the study of protein expression (proteomics) and structure (structural genomics), and have allowed a clearer description of drug targets as molecular components of disease processes. At the same time, there is rapidly expanding range of screening technologies, as well as consolidations in medicinal chemistry arising from the combinatorial approaches that were pioneered in the 1990s. These developments have created an environment for the emergence of new strategies for drug discovery.
High-Throughput Crystallography is essential for structure-based lead discovery – a strategy that combines features of random screening and rational structure-based design.
More than 29,000 protein structures are deposited in the Protein Data Bank (PDB) and more than 1,50,000 sequence (SWISS-PROT) entries exist for which the three dimensional structures are not available.
In Structural Genomics, one is interested in determining the structure in the fastest way to understand new folds and this has opened up the “High Throughput Crystallography”. An understanding of the three-dimensional structure (fold) correlates the function of the molecule.
High Throughput Crystallography using Automated procedures promotes a quicker elimination of the structure having the same fold among the deposited ones when analyzing thousands of macromolecular structures for which functional assignments are yet to be known.
ACORN is a comprehensive and efficient phasing procedure involving direct methods for the determination of protein structures when atomic-resolution data are available (better than 1.2 Å) (Foadi et al., 2000; Mcauley et al., 2001; Yao, 2002; Foadi, 2003; Dodson & Yao, 2003).
The fragment can be as a small-idealized piece of secondary structure (Rajakannan et al., 2004a, b; Selvanayagam et al., 2004) or an experimental substructure, such as a metal or a set of S, Se or similar atoms which can be located from anomalous scattering measurements.
ACORN then uses a combination of approaches, most importantly dynamic density modification, to develop a refined set of phases. Key to the procedure is the use of a correlation factor for the weak amplitudes as a criterion of phase quality.
Dynamic Density Modification (DDM) is designed to modify
the densities in three steps:
’ = 0 if <0
’ = tanh{0.2[/()]3/2} if >0
’ = kn() if ’>kn(),
It sets all negative densities to zero. It modifies the positive densities according the ratio
/(). It truncates the modified densities to a value of kn(),
where k is a constant given by the user (default value is 3); n is the cycle number of DDM.
The reflections are divided into three groups (strong,medium and weak) according to their normalized structure-factor (E) values.
The strong reflections (E > 1.2) are used in the phaserefinement by the Dynamic density modification (DDM)and Patterson superposition (SUPP) procedures.
Both strong and weak reflections (E < 0.1) are used inSayre-equation refinement (SER).
The medium reflections (0.1 < E < 1.2) are used to calculate a correlation coefficient (CC) for each potential solution of DDM.
An important component of ACORN is a CC that describes
the extent to which the magnitudes of the calculated
normalized structure factors (Ec) resemble the observed
normalized structure-factor amplitudes (Eo). A fragment in
a particular position and orientation in the unit cell will have
an associated set of structure factors and the CC will be
expressed by
,οc
ococ
ΕE
EEEE
CC
where = <E2> - <E2>½
Ec and CC values are calculated from the starting
fragment for all reflections to find the correct orientation
and position in molecular replacement (MR) or random MR
or for single random-atom searching.
In phase refinement Ec and CC values are calculated
from the modified map for medium reflections, which are
not used for computing the map, to indicate solutions of
DDM.
The ACORN procedure, as implemented in CCP4, isdivided into two parts, ACORN-MR and ACORN-PHASE,as illustrated in the flow diagram.
L a rgeM o tif
A M oR ein C C P 4
P D B S E Tin C C P 4
P o s it io n e d fra g m e n t
S ta n da rdh e lic e s
K no w np ha se s
S U P P ? S E R ?
Ye s
N o
Ye s
N o
S a y re -e qu a t io nre fin e m e n t
P a t te rs ons up e rp o si t io n
D yn am ic d e n si tym o d ific a t ion
B e s t se t o f re fin e d p h a s e s
A C O R N -P H A S EA C O R N -M R
S m a llm o tif
S in g le ra n d om -a tom se a rc h o n C C
K no w n h ea vy -a tom p os i tio ns
M o lec u la r r e p lac e m e n to f ran do m m o lec u le r e -
p lac em e n t b y sea rch o n C C
S tru c tu re -fa c to r
c a lc u la tion
In it ia l p ha ses e ts
ACORN-MR, deals with finding the position of a fragment of the structure, even a single atom, that provides an initial set of estimated phases. This set is passed into ACORN-PHASE, where phase refinement by a number of real-space processes is performed.
For locating a single atom, this approach randomly generates thousands of positions in the asymmetric unit. For each random position, the calculated normalized structure factor values and corresponding CCs are calculated for all reflections. 1000 sets with highest CCs are saved as starting points for further calculations. In most cases, the solution is normally found in the top 100 sets. This approach can be used to determine a native protein structure from AR data, if the structure contains at least one heavy atom (sulphur or heavier).
Foadi (2003) has given a detailed explanation of the reasons for the failure of ACORN when the resolution is below 1.2 Å. At atomic resolution, two neighbouring atomic peaks will be two separate entities and DDM will enhance both of them. At lower resolutions, these two peaks will merge into a single peak and DDM will just enhance it and no positive phase refinement can be expected in this situation.
The present work overcomes the above problem at low resolution using the fragments for seed phasing information.
The use of ACORN in solving a 57 kDa macromolecule
with atomic resolution (0.88 A) / truncated synchrotron data
(1.5Å resolution)
Micrococcus lysodeikticus catalase (Murshudov et
al., 2002)
1
2
3
456
78
910
11
12
13
14
15 ,16
17
1819
20
13
4
5
678
P D B i.d .:1 g w eTo ta l res id u e s: 5 0 3
Details of the crystallographic data, helices, sheets and sets
Ab initio phasing using ACORNACORN was run with 5000 random single atom
trials and the 40 positions with highest CCs’ were selected.
ACORN refined the phases from the random atom trials using DDM and led to the solution with good agreement of CC.
In this run, 78 cycles of DDM increased the CC for medium reflections with E values from 0.0285 to 0.5246 in 14.2 hours of CPU time.
In this ab initio case 8 chains could be automatically built with the ARP/wARP (Perrakis et al., 1999) followed by REFMAC (Murshudov et al., 1999) (482 residues). Manual model building was carried out for the missing residues and the final Rw and Rf values are 14.0 and 16.2% respectively. The superposition using PROFIT of the backbone atoms of this structure with the backbone atoms of the same structure solved using conventional technique gives the r.m.s deviation of 0.143 Å.
Details of ACORN, ARP/wARP and REFMAC results for ab initio case
Applications of truncated data at 1.5 Å resolutionFor set 23(minimum input), all sheets and one helix
(helix4) containing 76 residues were given as input to ACORN. Here, the ACORN-PHASE option was selected for the structure solution. The R-factor and correlation coefficient for the medium E value reflections of the initial model are 54.2% and 0.0469, respectively. Within 56 cycles of DDM the R-factor and correlation coefficient attained 53.9% and 0.0771 indicating a good solution.
The phases were then fed to ARP/wARP (Perrakis et al., 1999) followed by REFMAC (Murshudov et al., 1999). After the initial model building by ARP/wARP, the Rw and Rf values were 44.8 and 44.4% respectively. This initial model was refined for ten cycles of auto building along with five cycles of REFMAC in each auto-building cycle. Finally, ARP/wARP was able to build 212 residues. At this stage Rw and Rf values were 28.9 and 36.3% respectively. An iterative cycle carried out with these output phases revealed 481 residues out of 503 residues with a connectivity index of 0.97.
Manual model building was carried out in the missing regions as densities were clear. After the manual model building, 20 cycles of maximum-likelihood refinement were performed using REFMAC and solvent atoms were updated after the refinement using ARP/wARP ‘build solvent atoms’ script. The final Rw and Rf values were 13.6 and 15.6% respectively.
The backbone of this final model was superimposed with the structure conventionally solved by the molecular replacement method. The root-mean square deviation was 0.176 Å and the details are shown in Table 2. The results for sets 1-16 and 23 are also shown in Table 2.
Figs 3a to 3q describe the final models obtained after all the sets were used for ‘seed-phasing’ information to ACORN.
Table 2 lists the ACORN statistics and the ARP/wARP details for all these cases. The final results obtained in each case are also mentioned in this table.
Table 2.Details of ACORN phasing, ARP/wARP model building and REFMAC refinement
PR O G RA M SE T 9 SE T 10 SE T 11 SE T 12
ACO R N STA RTING R -factor (% ) CC R -factor (% ) CC R -factor (% ) CC R -factor (% ) CC
Large E (L ) 0 .399 0 .2110 0 .402 0 .2070 0 .411 0 .1871 0 .417 0 .1708
M edium E (M ) 0 .521 0 .1212 0 .523 0 .1153 0 .526 0 .1044 0 .529 0 .0935
Input H 1 ,4-5 ,7 ,10-11 ,13 -14 &17-20 (151a.a) H 1,4 -5 ,7 ,10-11 ,14 &17 -20 (145a .a)
H 1,5 ,7 ,10-11 ,14& 17 -20 (135 a .a .) H 1,5 ,10-11 ,14& 17 -20 (125 a .a )
A fte r 37 cycles of DDM After 39 cyc les of D DM After 37 cycles o f D D M After 41 cyc les of D DM
FIN AL L 0 .269 0 .6329 0 .269 0 .6330 0 .270 0 .6331 0 .271 0 .6308
M 0 .525 0 .1199 0 .526 0 .1166 0 .526 0 .1156 0 .528 0 .1108
ARP /w A R P R -factor (% ) R free R -factor (% ) R free R -factor (% ) R free R -factor (% ) R free
IN ITIAL 43.5 42.8 43.5 42.8 43.6 43.7 43.8 43.2 AU TO BUILD ING : 10 C ycles RE FM AC : 5 C ycles for each auto bu ilding ; S ide dock afte r 6 cyc les of auto bu ild ing FIN AL 14.4 18.1 13.6 17.1 15.3 19.3 14.6 18.2
Deta ils o f ARP /w AR P result
8 cha ins , 475 residues, m iss ing res idues 1 -7 , 59 , 60, 113, 114, 142 , 143 , 175 , 176 , 186, 187 ,195 -202,388, 389 ,503 ,dum my a tom s 1288 ,connectiv ity index 0 .96
7 cha ins, 482 residues, m iss ing res idues 1 -7 , 59 , 60, 113, 114, 142 , 143 , 174-176, 388, 389 , 401 , 402 , 503, d um my atom s 1273 , connectiv ity index 0 .97
8 cha ins, 481 residues, m iss ing res idues 1 -7 , 59 , 60, 113 , 114 , 142 , 143, 174 ,175, 186 , 187, 201 , 202, 388 , 389 , 503, dum m y a tom s 1235, connectivity index 0 .97
8 cha ins, 481 residues, m iss ing res idues 1 -7 , 59 , 60, 1 42, 143 , 186 , 187, 201 , 202 , 331, 332 , 388 , 389, 401 , 402 ,503 , dum m y a tom s1203, connectiv ity index 0 .97
R -factor (% ) R free R -factor (% ) R free R -factor (% ) R free R -factor (% ) R free
W itho ut du m m y ato m s m ade b y A RP/w ARP 28.4 29.4 27.3 28.2 27.6 28.9 27.2 28.3
After m an ual m o del b uilding fo r m issing residu es and solvent bu ild in g 14.2 15.8 13.6 15.4 13.9 15.7 13.8 15.6
r.m .s. deviations of the m ode l w ith backbone a tom s superposed w ith tha t o f 1gwe 0 .145 0 .191 0 .161 0 .169
PR O G RAM S ET 13 SE T 14 SE T 15 SE T 16 SE T 23 ACO R N STA RTING R-factor (% ) CC R-factor (% ) CC R-factor (% ) CC R-factor (% ) CC R-factor
(% ) CC
Large E (L ) 0.423 0 .1513 0 .424 0 .1418 0 .432 0 .1225 0 .437 0 .1058 0 .437 0 .1143 M edium E (M ) 0 .533 0 .0821 0 .534 0 .0760 0 .539 0 .0614 0 .542 0 .0544 0 .542 0 .0469
Input H1,10-11 ,14& 17 -20 (114 a.a ) H 1,10-11 ,14& 18 -20 (103 a .a ) H 10-11 ,14& 18 -20 (91 a .a) H 11,14&18 -20 (79 a .a) Sheets (1 -8) & H4 (76 a .a )
After 42 cycles of DDM After 46 cycles of DDM After 55 cycles of DDM After 55 cycles o f D DM After 56 cycles of DDM FINAL L 0 .272 0 .6286 0 .274 0 .6283 0 .273 0 .6289 0 .272 0 .6275 0 .273 0 .6211 M 0 .529 0 .1085 0 .531 0 .1022 0 .534 0 .0961 0 .536 0 .0869 0 .539 0 .0771
ARP /w AR P R-factor (% )
Rfree R-factor (% ) Rfree R -factor (% ) Rfree R-factor (% ) Rfree R-factor (% )
Rfree
AU TO BUILDING : 10 C ycles RE FM AC : 5 C ycles for each auto bu ilding ; IN ITIAL 43.8 43.8 44.2 44.2 44.5 44.5 45.0 44.8 44.8 44.4
S ide dock afte r 6 cyc les o f auto bu ild ing FINAL 14.9 19.0 15.2 19.1 14.5 18.1 29.1 35.7 28.9 36.3
DE TAILS O F ARP /w ARP result
6 cha ins, 485 residues, m iss ing res idues 1 -7 , 59 , 60, 113 , 114 , 142, 143 , 174, 175 , 388, 389 , 503, dum m y a tom s 1200, connectiv ity index 0 .98
8 cha ins, 474 residues, m iss ing residues 1 -9 , 59 60, 1 42, 143 , 174-180, 186, 187, 201 , 202, 258 , 259, 388, 389 , 503,dum m y a tom s 1250, connectivity index 0 .97
7 cha ins, 482 residues, m issing residues 1 -8 , 59 , 60, 113, 114, 142 , 143 , 186 ,187, 201, 202, 503 , dum m y atom s 1224, connectiv ity index 0 .97
20 cha ins, 3 10 residues, connectiv ity index 0 .88
22 cha ins, 212 residues, connectiv ity index 0 .79
ARP /w AR P R-factor (% ) Rfree R-factor (% ) Rfree
29.1 35.8 28.9 36.3 AU TO BUILDING : 10 C ycles RE FM AC : 5 C ycles for each auto bu ilding ;
S ide dock afte r 6 cyc les o f auto bu ild ing 13.0 16.6 14.0 17.6
Details o f ARP /w AR P resu lt
8 cha ins, 481 residues, m issing residues 1-7 , 59 , 60, 142 , 143, 186, 187 , 258, 259, 388 , 389, 401, 402 , 503, dum m y atom s 1337, connectiv ity index 0 .97
7 cha ins, 481 residues, m issing residues 1-9 , 59 , 60, 142 , 143, 174, 175 , 186 , 187, 388 , 389, 401, 402 , 5 03, dum m y a tom s 1287, connectivity index 0 .97
R -factor (% )
Rfree R-factor (% ) Rfree R -factor (% ) Rfree R-factor (% ) Rfree R-factor (% )
Rfree
W ithout dum m y atom s m ade by A RP/w ARP 27.0 28.2 28.0 29.3 27.3 28.3 27.2 28.1 27.1 28.2
After m anual m odel building fo r m issing residues and solvent bu ild ing 13.3 14.9 15.0 16.9 14.3 15.8 14.6 16.2 13.6 15.6
r.m .s. deviations o f the m odel w ith backbone atom s superposed w ith that o f
1gw e 0 .146 0 .182 0 .146 0 .204 0 .176
F ig . 1P D B -id : 1 G W ETo ta l:5 0 3 re s id u e s
F ig . 2
A u to B u ilt : 4 8 2 re s id u e sA b in i tio
F ig . 3 aS E T 1
A u to B u ilt : 4 7 6 re s id u e sInpu t: 187 res idues
F ig . 3 bS E T 2
A u to B u ilt : 4 7 0 re s id u e sInpu t: 184 res idues
F ig . 3 cS E T 3
A u to B u ilt : 4 7 4 re s id u e sInpu t: 181 res idues
F ig . 3 dS E T 4
A u to B u ilt : 4 8 2 re s id u e sInpu t: 177 res idues
F ig . 3 eS E T 5
A u to B u ilt : 4 7 7 re s id u e sInpu t: 172 res idues
F ig . 3 fS E T 6
A u to B u ilt : 4 7 2 re s id u e sInpu t: 167 res idues
F ig . 3 gS E T 7
A u to B u ilt : 4 7 9 re s id u e sInpu t: 162 res idues
F ig . 3 hS E T 8
A u to B u ilt : 4 8 4 re s id u e sInpu t: 157 res idues
F ig . 3 iS E T 9
A u to B u ilt : 4 7 5 re s id u e sInpu t: 151 res idues
F ig . 3 jS E T 1 0
A u to B u ilt : 4 8 2 re s id u e sInpu t: 145 res idues
F ig . 3 kS E T 11
A u to B u ilt : 4 8 1 re s id u e sInpu t: 135 res idues
F ig . 3 lS E T 1 2
A u to B u ilt : 4 8 1 re s id u e sInpu t: 125 res idues
F ig . 3 mS E T 1 3
A u to B u ilt : 4 8 5 re s id u e sInpu t: 114 res idues
F ig . 3 nS E T 1 4
A u to B u ilt : 4 7 4 re s id u e sInpu t: 103 res idues
F ig .3 oS E T 1 5
A u to B u ilt : 4 8 2 re s id u e sInpu t: 91 res id ues
F ig . 3 pS E T 1 6
A u to B u ilt : 4 8 1 re s id u e sInpu t: 79 res id ues
F ig . 3 qS E T 2 3
A u to B u ilt : 4 8 1 re s id u e sInpu t: 76 res id ues
Seed phasing using Cα atoms
Only the 503 Cα atoms from the known structure were used for seed phasing to ACORN with the truncated data extending to 1.3 Å resolution. Successful model could be built with 474 amino acids (a.a), the backbone atoms of which had an r.m.s deviation 0.132 Å with the actual structure (1gwe).
To mimic the above ‘seed feeding’ in real situations, mean positional errors (MPE, hereafter) of 0.1, 0.2 Å were introduced for the above Cα atoms using MOLEMAN (Kleywegt, 1992-2004). Successful model could be built with 483, 481 a.a corresponding to input fragments with MPE of 0.1 and 0.2 Å respectively. The backbone atoms of these had an r.m.s deviation of 0.169, 0.163 Å respectively with the actual structure (1gwe).
Results of ACORN and ARP/wARP using only Cα atoms (1gwe)
Resolution 20-1.3 ÅMean Positional Error (MPE) of Cα atoms PROGRAM Cα atoms alone
0.1 Å 0.2 Å
ACORN STARTING R-factor (%) CC R-factor (%) CC R-factor (%) CC
Large E (L) 0.446 0.0607 0.445 0.0640 0.446 0.0543
Medium E (M) 0.551 0.0368 0.551 0.0348 0.552 0.0303
Input (1 atom /a.a - ~13% of the total structure) 503 atoms 503 atoms 503 atoms
After 123 cycles of DDM After 121 cycles of DDM After 165 cycles of DDM
FINAL L 0.254 0.6542 0.255 0.6520 0.257 0.6452
M 0.503 0.1944 0.505 0.1904 0.508 0.1826
ARP/wARP R-factor (%) Rfree R-factor (%) Rfree R-factor (%) Rfree
INITIAL 39.9 40.6 40.5 40.6 40.7 40.4 AUTOBUILDING : 10 Cycles REFMAC : 5 Cycles for each auto building; Side dock after 6 cycles of auto building FINAL 13.3 16.8 14.7 18.4 14.5 18.2
Details of ARP/wARP result
9 chains, 474 residues, missing residues 1-7, 34-40,59,60,113,114,142,143,174,175,186,187,388,389,401, 402,503, dummy atoms 1389, connectivity index 0.96
7 chains, 483 residues, missing residues 1-7, 59, 60, 113, 114, 142, 143, 186, 187, 331, 332, 388 ,389, 503, dummy atoms 1225, connectivity index 0.97
8 chains, 481 residues, missing residues 1-7, 59, 60, 113, 114, 142, 143, 175 ,176, 201, 202, 331, 332, 388 ,389, 503, dummy atoms 1247, connectivity index 0.97
R-factor (%) Rfree R-factor (%) Rfree R-factor (%) Rfree
Without dummy atoms made by ARP/wARP 28.2 28.9 26.9 28.1 27.5 28.5
After manual model building for missing residues and solvent building
15.1 17.1 14.2 16.0 14.4 16.1
r.m.s. deviations of the model with backbone atoms superposed with that of 1gwe 0.132 0.169 0.163
PDB i.d. : 1gweTotal residues:503
Input: Calpha atoms (503)Auto built: 474 residues
Input: 0.1Angstrom error at calpha atomsAuto built: 483 residues
Input: 0.2Angstrom error at calpha atomsAuto built: 481 residues
Seed phasing using 120 a.a as polyala
The first 120 a.a from the actual structure were treated as polyala model and the above procedures were carried out to obtain the final model. Results are detailed in Table.
With the 120 residues as polyala model, ARP/wARP was able to build 111 residues in 15 chains when the above procedures were followed. An iterative cycle carried out with this output as input revealed 480 residues out of 503 residues with a connectivity index of 0.98. In the case of first 120 residues of polyala model with 0.1 Å MPE, ARP/wARP initially built only 6948 dummy atoms. Two iterative cycles carried out with this as input finally built 481 residues. These two models have an r.m.s deviation of 0.176, 0.173 Å respectively with the backbone atoms of the actual structure (1gwe).
Results of ACORN and ARP/wARP using polyala model (5atoms/a.a) (1gwe)
Resolution 20 – 1.5 ÅPROGRAM First 120 residues of polyala
First 120 residues of polyala with a mean positional error (MPE) of 0.1 Å
ACORN STARTING R-factor (%) CC R-factor (%) CC Large E (L) 0.439 0.0915 0.442 0.0853
Medium E (M) 0.541 0.0525 0.541 0.0499 After 56 cycles of DDM After 55 cycles of DDM
FINAL L 0.276 0.6196 0.274 0.6195 M 0.537 0.0807 0.540 0.0775
ARP/wARP R-factor (%) Rfree R-factor (%) Rfree
INITIAL 45.1 44.4 45.1 44.9 AUTOBUILDING : 10 Cycles REFMAC : 5 Cycles for each auto building;
Side dock after 6 cycles of auto building FINAL 32.3 42.7 24.5 45.1
DETAILS OF ARP/wARP result 15 chains, 111 residues, connectivity index 0.88, dummy atoms 4437
0 chains, 0 residues, connectivity index 0.00, dummy atoms 6948
ARP/wARP R-factor (%) Rfree R-factor (%) Rfree
INITIAL 32.3 42.8 24.5 45.2 AUTOBUILDING : 10 Cycles REFMAC : 10 Cycles for each auto building;
Side dock after 6 cycles of auto building FINAL 13.1 16.6 26.8 35.8
Details of ARP/wARP result
6 chains, 480 residues, missing residues 1-9, 59, 60, 110-114, 142, 143, 174, 175, 388, 389, 503, dummy atoms 1354, connectivity index 0.98
20 chains, 282 residues, dummy atoms 3271, connectivity index 0.85
ARP/wARP R-factor (%) Rfree
INITIAL 26.9 35.8 AUTOBUILDING : 10 Cycles REFMAC : 10 Cycles for each auto building;
Side dock after 6 cycles of auto building FINAL
13.6 17.5
Details of ARP/wARP result 8 chains, 481 residues, missing residues 1-7, 39, 40, 59,
60, 142, 143, 174, 175, 186, 187, 388, 389, 401, 402, 503, dummy atoms 1201, connectivity index 0.97
R-factor (%) Rfree R-factor (%) Rfree
Without dummy atoms made by ARP/wARP 27.3 28.0 27.1 28.0
After manual model building for missing residues and solvent building
14.0 15.7 14.0 15.5
r.m.s. deviations of the model with backbone atoms superposed with that of 1gwe
0.176 0.173
PDB i.d. : 1gweTotal residues:503
Input: First 120 a.a as polyala modelAuto built: 480 residues
Input: First 120 a.a as polyala model after introducing the MPE of 0.1Angstrom
Auto built: 481 residues
STEREO VIEW OF THE ELECTRON DENSITY (2FO-FC|) MAP SUPERPOSED WITH FINAL MODEL (Input: Polyala model for the first 120a.a with a MPE of 0.1 Å)
STEREO VIEW OF THE FINAL ELECTRON DENSITY (2FO-FC|) MAP STARTING WITH THE POLYALA MODEL OF FIRST 120A.A
WITH MPE OF 0.1 Å
FINAL ELECTRON DENSITY (2FO-FC|) MAP FOR POLY ALA MODEL
ELECTRON DENSITY (2FO-FC|) MAP FOR HEME GROUP IN POLYALA MODEL
Seed phasing using Ncap, Ccap and Middle portions of helices/sheets
Instead of feeding the entire helices or sheets [Selvanayagam et al., 2004 (a minimum of 76 residues were found to be sufficient for seed phasing with 1.5 Å truncated data to solve the three dimensional structure of catalase)] either the N cap/C cap regions or the mid portion in the helices or sheets could also be fed as input for seed phasing. Successful model can be built in these cases also. The results obtained are listed in Table.
Results of ACORN and ARP/wARP using Ncap, Ccap and Middle portions of helices/sheets (1gwe)
Resolution 20-1.5 Å
PROGRAM Ncap region of helices/sheets
Ccap region of helices/sheets
Middle region of helices/sheets
ACORN STARTING R-factor (%) CC R-factor (%) CC R-factor (%) CC
Large E (L) 0.435 0.1118 0.437 0.1076 0.434 0.1319
Medium E (M) 0.539 0.0569 0.542 0.0553 0.538 0.0623
Input 76 a.a 76 a.a 76 a.a
After 51 cycles of DDM After 53 cycles of DDM After 52 cycles of DDM
FINAL L 0.275 0.6270 0.271 0.6307 0.272 0.6248
M 0.534 0.0901 0.533 0.0946 0.533 0.0926
ARP/wARP R-factor (%) Rfree R-factor (%) Rfree R-factor (%) Rfree
INITIAL 44.5 44.8 44.4 44.9 44.7 44.5 AUTOBUILDING : 10 Cycles REFMAC : 5 Cycles for each auto building; Side dock after 6 cycles of auto building FINAL 14.9 18.6 14.5 18.2 15.2 19.1
Details of ARP/wARP result
10 chains, 470 residues, missing residues 1-7, 39, 40, 59, 60, 113, 114, 142, 143, 174, 175, 176, 195-202, 331, 332, 388, 389, 401, 402, 503, dummy atoms 1308, connectivity index 0.95
9 chains, 474 residues, missing residues 1-7, 39-40, 59, 60, 142, 143, 174, 175, 186, 187, 201, 202, 331, 332, 388, 389, 503, dummy atoms 1277, connectivity index 0.96
8 chains, 479 residues, missing residues 1-7, 60, 110-114, 142, 143, 175, 176, 186, 187, 388, 389, 503, dummy atoms 1230, connectivity index 0.97
R-factor (%) Rfree R-factor (%) Rfree R-factor (%) Rfree
Without dummy atoms made by ARP/wARP
28.9 29.3 28.3 29.3 27.8 28.7
After manual model building for missing residues and solvent building
13.3 16.2 14.6 17.4 13.0 15.8
r.m.s. deviations of the model with backbone atoms superposed with that of
1gwe 0.218 0.183 0.151
Input: Ncap region of helices/sheets(76 a.a)Auto Built: 470 residues
Input: Ccap region of helices/sheets(76 a.a)Auto Built: 474 residues
Input: Middle region of helices/sheets(76 a.a)Auto Built: 479 residues
Black shaded regions correspond to the input residues from 1gwe
Conclusion• Based on the published work and the work being carried
out by our group (Rajakannan et al., 2004a; 2004b), it has now become very clear that very little information (15%) is needed to determine the structure of a protein using ACORN.
• Ours is the first case of ACORN applications using seed-phasing information to solve even larger molecular weight protein (57 kDA) when the resolution extends to 1.5 Å.
• Among the multiple solutions, the correct solutions can be obtained in all trials with high reliability by the working of correlation coefficient and hence high resolution and fairly complete diffraction data enable one to solve a protein ab initio, in a relatively short amount of time.
• ACORN has the great potential to establish itself as program for high-throughput structure determination.
• Currently, in order to extend the applicability of ACORN to lower resolutions, the seed phasing has been obtained from the native structure itself (as the structure had already been solved by traditional macromolecular crystallographic methods). Data mining approach to feed fragments using the PDB entries is in progress.
ReferencesBanumathi, S., Rajakannan, V., Velmurugan, D., Dauter, Z., Dauter, M., Tsai,
M. D. & Sekar, K. (2002). Japanese Crystallographic Society Meeting, Poster, P3-II-27, 123.
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760-763.
Dodson, E. J. & Yao, J. -X. (2003). Crystallogr. Rev. 9, 67-72.Foadi, J. (2003). Crystallogr. Rev. 9, 43-65.Foadi, J., Woolfson, M. M., Dodson, E. J., Wilson, K. S., Yao, J. -X. & Chao-de,
Z. (2000). Acta Cryst. D56, 1137-1147.Kleywegt, G. J. (1992-2004). Uppsala University, Uppsala, Sweden.
Unpublished program.McAuley, K. E., Yao, J. –X., Dodson, E. J., Lehmbeck, J., Ostergaard, P. R. &
Wilson, K. S. (2001). Acta Cryst. D57, 1571-1578.Murshudov, G. N., Lebedev, A., Vagin, A. A., Wilson, K. S. & Dodson, E. J.
(1999). Acta Cryst. D55, 247-255.Murshudov, G. N., Grebenko, A. I., Brannigan, J. A., Antson, A. A., Barynin, V.
V., Dodson, G. G., Dauter, Z., Wilson, K. S. & Melik-Adamyan, W. R. (2002). Acta Cryst. D58, 1972-1982.
Perrakis, A., Morris, R. M. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458-463.
Rajakannan, V., Velmurugan, D., Yamane, T., Dauter, Z., Dauter, M., Tsai, M. D. & Sekar, K. (2002). Japanese Crystallographic Society Meeting, Poster, P3-I-22, 84.
Rajakannan,V., Yamane, T., Shirai, T., Kobayshi, T., Ito, S. & Velmurugan, D. (2003). International Symposium on Diffraction Structural Biology, Tsukuba, Japan, 28-31 May 2003, Poster P-085.
Rajakannan, V., Yamane, T., Shirai, T., Kobayshi, T. Ito, S. & Velmurugan, D. (2004a). J. Synchrotron Rad. 11, 64-67.
Rajakannan, V., Selvanayagam, S., Yamane, T., Shirai, T., Kobayshi, T., Ito, S. & Velmurugan, D. (2004b). J. Synchrotron Rad. 11, 358-362.
Selvanayagam, S., Velmurugan, D., Yamane, T. (2004). Asian Crystallographic Association Meeting (AsCA’04) Poster(P0165).
Velmurugan, D., Rajakannan, V., Yamane, T., Dauter, Z. & Sekar, K. (2002). Japanese Crystallographic Society Meeting, Poster, P3-II-26, 122.
Yao, J. -X. (2002). Acta Cryst. D58, 1941-1947.
top related