BUSTER User Manual
Refinement and completion of a partial
structure tutorial: IF3-C
This tutorial illustrates the use of BUSTER for Maximum Likelihood refinement of a partial model against 2.0 Å data, followed by 2.5 Å Maximum Entropy completion of the missing part of the structure. The degree of incompleteness is about 30%.
This tutorial is a synthetic example of refinement of a "bad model". To
prepare it, the published structure [1] was shaken by
torsion angle dynamics to an rms of about 1 Å away from the
published structure. The worst one third of the structure - as judged
on the basis of the main-chain B factors - was removed;
the solvent molecules were
also excluded from the partial model and all fragment's B factors
were set to 25 Å2.
What does BUSTER do here
There are three separate steps in this job:
A few parameters need some attention: see the simple quick input guide
| The R values at
after the first round of scaling, at cycle 0 are around 40%. By chance
the free R is lower than the working R.
During refinement (cycles 1-21) the R factors decrease; the final working-set and free-set Rfactors do not differ by less than one percent. The Maximum Entropy completion (cycles 22-23) introduces some overfitting - although still managing to improve the fit to the free set. |
The plot of Log-Likelihood gain vs. cycle number
shows an increase of the LLG, starting from the initial value of 0;
The working-set LLG is bound to increase because the refinement is driven by maximising the likelihood of the model with respect to the working set; more important is to check that the free-set LLG increases as well. Again, as observed with R factors, some overfitting is evident during Maximum Entropy completion. |
The (Fobs,Ffrag) and (Fobs,Fcalc) curves only differ at low resolution because in Fcalc there are contributions from the solvent and the low-resolution envelope for the missing atoms, while in Ffrag only the atoms in the PDB model for the partial structure are taken into account.
Most importantly, the (Fcalc,Fexpct) curve depends on the imperfection parameters that parameterise the BUSTER internal error model. The larger the internal estimate for the error on the calculated F, the more this CC curve departs from unity. A comparison between the (Fcalc,Fexpct) and the (Fobs,Fcalc) correlation coefficients curves can inform as to the adequacy of the BUSTER internal error model: if the latter is correct, after the first cycle the two curves should be close to one another.
The (Fobs,Fobs+d) curve is a measure of the noise on the data vs. resolution. This correlation coefficient is lower than unity when the noise on the data becomes large (typically at high resolution, where the I/s(I) is lowest).
|
Cycle 0: After the first round of scaling, the ``observed correlation'' (Fobs,Fcalc) agrees with the ``predicted correlation'' (Fcalc,Fexpct): the initial error model is adequate. The (Fobs,Fcalc) correlation coefficients are better than the (Fobs,Ffrag) at low resolution because of the improvements brought about by solvent and missing atoms models. |
Imperfection B factors: The model imperfection parameter for the partial structure, BImpffrag, decreases while the structure improves. Solvent and missing atoms models are roughly constant during this partial structure refinement (only the missing atoms scale factor and B factor are being refined here), so that the changes in the missing atoms and solvent imperfection parameters are mostly due to the coupling with the partial structure model: as the partial structure contributes less variance ( BImpffrag decreases), the solvent and missing atoms models contribute more ( BImpfsolv and BImpfmiss increase). The Luzzati parameters BLuzzWork, BLuzzFree and BLuzz, are not refined: rather, they are estimated at each cycle from a sigmaA plot, and should also decrease as the structure gets better /more complete. |
|
Cycle 21: At the end of the refinement all the CC values are
closer to unity than they were at the beginning:
this is due to the improvement of the model for the partial structure.
The imperfection Bfactor for the partial structure at the end of refinement has a value around 2 (see curves above): the ``predicted correlation'' (Fcalc,Fexpct) is still in good agreement with the ``observed'' one, (Fobs,Fcalc). |
Cycle 23:After Maximum Entropy completion the
(Fobs,Fcalc)curve improves
at resolution ranges up to 2.5 Å - due to a better model for the missing
atoms. No improvement is brought about to the missing atoms model past
the 2.5 Å limit because no Lagrange multipliers were used for
those data.
The (Fobs,Ffrag) CC curve remains unchanged because no changes are made to the partial structure. The ``predicted correlation'' (Fcalc,Fexpct) is now too pessimistic - during Maximum Entropy completion all imperfection parameters are frozen and not refined anymore. |
|
As refinement progresses, and if the X-ray weight is adequate, the normalised sum of observed residuals should tend towards the ideal value of 1 (meaning that ideally every restraint should be obeyed within 1 associated e.s.d.). Final values of the observed residuals higher than unity suggest too loose a geometry - and point to the need for a lower X-ray weight; values lower than unity pertain to too tight a geometry - the X-ray weight could be increased. In this case the refinement hasn't converged yet - so no changes to the X-ray weight are advised. Other useful stereochemistry statistics such as the list of the top violations of stereochemistry restraints are found in the geometry output file; |