| SHARP/autoSHARP Tutorials | previous next |
| Chapter 1 |
Copyright © 2004 by Clemens Vonrhein
and the Buster Development Group.
All rights reserved.
The "molecule" is more precisely the "unique entity" in your asymmetric unit: so everything of which there might be more than one copy of.
Unless you are sure about this it is not necessary to know this value before starting to run the program: autoSHARP will try to detect this automatically.
This is the most accurate information about the contents of the asymmetric unit (since the number of residues and molecular weight can be calculated from that).
Make sure to check all extinctions to know the correct space group - or maybe a list of possible space groups (if the extinctions are misleading or not measured).
Even if several different types of heavy atoms might be available for phasing (e.g. S and Ca), there is usually one type that is most visible within a dataset.
Unless you are sure that the dataset was collected far enough away from the edge for the given heavy atom element, you should have a fluorescence scan that gives you these values. If you are far enough away from the edge, the calculated values should be fine. autoSHARP will use these calculated values if no f'/f'' values are known - but it then relies on an accurate wavelength (see below).
autoSHARP uses these in Ångstrom - so you might need to convert energies into Å.
Usually, any reflection file should have only valid data in it. However, if e.g. data was processed to the high resolution limit of the detector (rather than to the resolution limit of the data) you might want to remember the limits for which the data follows the standard quality criteria (I/sig(I), Rmerge, completeness ... or whatever you prefer).
This can be in merged MTZ format (i.e. usually from the CCP4 program TRUNCATE) or merged SCALEPACK format (i.e. using the keyword 'MERGE' in scalepack). Ideally, the files should have the correct spacegroup, cell parameters and resolution limits (e.g. in the header).
A simple ASCII file with the one-letter amino-acid sequence can be used to describe the contents of the asymmetric unit. If you are sure about the number of molecules in the asymmetric unit, you can cut-and-paste the same sequence several times into this file. The complete file needs to follow the format for sequence files.
You can then start the autoSHARP input interface (using the 'Start' autoSHARP button at the top). Normally, you would leave the selection at the default value of 'None' - since you are probably starting a completely new autoSHARP run.
Note : the hyperlinks explanation can be used to bring up the manual page with the relevant information for any given item (so please make use of it!).
Some related information can be given as several items (e.g. content of asymmetric unit or f'/f'' values). Only one of these need to be input (leave the others at the default values). It is recommended to use a sequence file and the f'/f'' values when available.
The least known value might be the number of heavy atoms to search for: it shouldn't matter too much if this value is overestimated - but not by a factor of 5 or 10.
It is usually a god idea to let autoSHARP run through the whole sequence of steps (it will not try automatic building if only 5Å data is available). Especially the density modification step will (hopefully) show a clear difference in correlation between the two hands - a good sign that things are on the right track.
The main log file should have a header that says 'autoSHARP run'. This log file will give all crucial information: which steps the program performed and what messages each step generated. For each step there should be a details and an explanation link to give you more information.
If you are not happy with the results of an autoSHARP run try following these warning messages: maybe you are able to correct the reasons for these. Especially if the warnings are related to data quality or how well different datasets relate to eac other, it can be helpful to start with fewer (but better and better matching) datasets than trying to do everything at once.
Furthermore, some warnings might be fixed when using a stricter resolution cut (either by changing the reflection file or by setting the limits directly - after specifying yourself as Expert in the Preferences page).
A weird looking difference or anomalous Patterson map can point to scaling problems or data outliers. These should have been detected (to some degree) during the initial data analysis stage - so going back to any warning messages there is advisable.
Any solution used based on a set of such sites is hardly trustworthy (the same applies for solutions based on 'funny' or poor looking Patterson maps).
If the peak wavelength in a MAD experiment doesn't give a sensible solution it is very unlikely that the remote anomalous difference will. But maybe there were things that happened during data collection that explain this?
The starting correlation for the correct hand should ideally be above 0.20 (or higher). The wrong hand should then have a correlation of maybe 0.10 to 0.15. With very good phases this might also be 0.40 (correct hand) versus 0.20 (wrong hand).
A very low starting correlation (e.g. below 0.15) for both hands nearly always shows that either the heavy atoms sites are wrong (most likely) or that there isn't enough signal in the data.
Either use the SHARP run that represents the correct hand (if the density modification statistics enable you to decide) or just the latest one to start the SHARP Input Editor. This allows you to change the exact parameters for the heavy atom refinement and phasing step (i.e. what SHARP does).
A visual inspection of the various residual maps (as opposed to the automatic analysis done inside the autoSHARP pipeline) can help getting a more complete or corrected heavy atom model.