This tutorial illustrates the use of BUSTER for Maximum Likelihood rigid-body refinement of a model as obtained from a Molecular Replacement solution.
The structure of the SCAFet lectin from Scilla Campanulata bluebell bulbs was solved by Wright et al. in 1999 at 3.3 Å at room temperature [1,2,3]. There are 6 monomers in the a.u., arranged in a doughnut-shaped hexamer. The structure was deposited with PDB accession code 1DLP.
A molecular replacement search was run against a new low-temperature 2.9 Å dataset for the same protein; the search model was the NCS-averaged SCAFet monomer from the 3.3 Å structure, AAs 1-235. This search model is fairly complete but has main-chain and side-chain errors in it. The 6 correct solutions of this MR search are used for the rigid-body refinement of the SCAFet hexamer in this tutorial.
A few parameters need some attention: see the simple quick input guide
| The R values at cycle 0 are between 38 and 54%, across all resolutions. | The R values at
cycle 10 show a marginal improvement at medium-high resolution, mainly due to the
scaling and a more adequate error model; the low
resolution values are worse, due to the fact that the model after
rigid-body refinement now clashes with the bulk solvent mask.
Upon entering a new cycle of rigid-body refinement the initial masks for the bulk solvent will be more accurate, and the initial R factors improve the ones shown aside for cycle zero. |
The (Fobs,Ffrag) and (Fobs,Fcalc) curves only differ at low resolution because in Fcalc there is a contribution from the bulk solvent, while in Ffrag only the atoms in the PDB model for the partial structure are taken into account.
Most importantly, the (Fcalc,Fexpct) curve depends on the imperfection parameters that parameterise the BUSTER internal error model. The larger the internal estimate for the error on the calculated F, the more this CC curve departs from unity. A comparison between the (Fcalc,Fexpct) and the (Fobs,Fcalc) correlation coefficients curves can inform as to the adequacy of the BUSTER internal error model: if the latter is correct, after the first cycle the two curves should be close to one another.
The (Fobs,Fobs+d) curve is a measure of the noise on the data vs. resolution. This correlation coefficient is lower than unity when the noise on the data becomes large (typically at high resolution, where the I/s(I) is lowest).
|
Cycle 0: At the outset, you can see that the ``observed correlation''
(Fobs,Fcalc) is slightly lower than the
``predicted correlation''
(Fcalc,Fexpct). The initial error model is
overoptimistic (especially at high resolution).
This is because the initial imperfection parameter for the
partial structure and the solvent are too low,
(the starting partial structure is VERY imperfect).
|
Cycle 10: At the end of the refinement the medium- and high-res CC values are
closer to unity than they were at the beginning:
this is due to the improved placement of the partial structure.
The imperfection Bfactors for the various model components (solvent, partial structure and missing atoms) adjust so that the ``predicted correlation'' (Fcalc,Fexpct) keeps in good agreement with the ``observed'' one, (Fobs,Fcalc). Still the lowest-resolution CCs are not optimal. They will improve upon entering a new cycle of rigid-body refinement: the initial masks for both solvent and missing atoms will be more accurate due to the improvement brought about to the partial structure by the rigid-body refinement. |
Inspection of this PDB file together with the starting strcutrue and the fully refined SCAFet hexamer confirms that the shifts were in the correct direction.