Content:

Introduction

Here are some viewpoints, explanations and background information regarding recent and numerous discussions (e.g. on the CCP4bb) regarding various aspects of data handling - and what apparently should be done or should not be done. We try to stay as neutral as possible and refrain from pointing to particular software: we might misrepresent or misunderstand software from other groups and it is better for readers to form their own opinions anyway.

Please also check more software-specific pages at the BUSTER Wiki (since October 2020) - espcially the section about combining mmCIF reflection data from processing and refinement, STARANISO server (since December 2019) and the BUSTER discussion list (exchange February 2023).

Data truncation

Selecting reflection data can be a balancing act: are we excluding useful signal or are we including problematic noise? When talking about data truncation we might want to consider different ways this might happen or be done:

Anisotropic correction

This is usually applied only to amplitudes at different stages:

Conversion from intensities to amplitudes

Multiple programs exist for doing this: all of those will carry the input intensities through into output. Downstream programs/processes need to take care not to disrupt the connection between the original intensities and the resulting amplitudes: MTZ files are a very good and structured format to keep collections of tightly related data items (intensities, amplitudes, anomalous data etc) intact and together.

Deposition considerations

There are good reasons for applying any of those truncation or correction methods described above - and good reasons not to do so: different stages of data analysis, structure solution or model refinement might require different views or versions of the same underlying diffraction data. The important point is to provide as complete a record of the reflection data as possible so that downstream programs (and future methods) have acccess to the data they require or want. Then all of the above topics, methods and approaches don't really matter since a user is able to do something different with the same data - maybe because new methods are available, personal preferences differ or as a check and validation of a deposited structural model. Therefore, the deposition process itself is as important as writing any part of the methods section of a paper: reproducibility is key here.

For those reasons, we think access to the following would be ideal: