WORK-IN-PROGRESS

Content:


6W9C

Introduction

The data was collected on 18th March 2020 at beamline 19-ID (SBC-CAT, APS).

Images

We have two distinct sweeps of data available:

     y3_b07_#####.cbf   : 1 - 60
     y3_b07b_#####.cbf  : 2 - 140

At first sight this seems a slightly unusual strategy given the characteristics of the detector used (Förster et al, 2019). We would usually want to see low-dose, high-multiplicity, fine-sliced data collected at modern synchrotrons with the current generation of detectors (Pflugrath, 1999, Mueller at al, 2012).

It is also unfortunate that the second sweep was started again at the same Omega angle (–90) as the first one. So although 100 degrees of data was collected, only 70 degrees of rotation is covered - which for monoclinic C2 results in very poor overall completeness (but then: even 100 degrees would have been problematic): see e.g. PDBpeep.

The images have a very high average background value of about 40 counts

y3_b07_00001.png

This seems to suggest that there could have been at least a factor of 10 reduction in dose per 0.5 degree image. A better sampling of the reflection profile (through fine-slicing) would have been possible by using 0.1 degree per image with an exposure time of 0.1 seconds: this would have put the same amount of dose into the crystal as the current 0.5sec/0.5deg. The Pilatus3 6M can run at a frame rate of 25Hz, i.e. at 0.04 sec/image). So one could collect data as

  sweep 1: 0.1  deg/image
           0.05 sec/image
           100% transmission
           60   degrees of data
           start at Omega=-90
  sweep 2: 0.1  deg/image
           0.05 sec/image
           100% transmission
           140  degrees of data
           start at Omega=-30

which would have put the exact same dose into the crystal with the same collection wall clock time - but would result in a low-dose and fine-sliced dataset of adequate multiplicity (200 degrees of rotation covered instead of 70 degrees). It is most likely, that halving the transmission would have been a better strategy on top of this (achieving high multiplicity via 400 degrees of data on top of low-dose and fine-slicing), so

  sweep 1: 0.1  deg/image
           0.05 sec/image
           50%  transmission
           120  degrees of data
           start at Omega=-90
  sweep 2: 0.1  deg/image
           0.05 sec/image
           50%  transmission
           280  degrees of data
           start at Omega=+30

The total collection time would have been longer though: 200 seconds instead of 100 seconds. but given the fact that it took 224 seconds to recenter (?) the crystal between sweep 1 and 2, those additional 100 seconds of time do not seem crucial. If we assume that the initial centering took a similar amount of time, we have for the current data collection

  sweep 1 centering  : 225 sec
          collection :  30 sec
  sweep 2 centering  : 225 sec
          collection :  70 sec
 ------------------------------
  total                550 sec =  9 min 10 sec

and a slightly different strategy described above:

  sweep 1 centering  : 225 sec
          collection :  60 sec
  sweep 2 centering  : 225 sec
          collection : 140 sec
 ------------------------------
  total                650 sec = 10 min 50 sec

Remember: we can always glue fine-sliced images together (creating 0.5 degree images out of five 0.1 degree images) - but we can never split images collected in wide-slicing mode.

(Re)processing

After downloading the images (and placing the mini-cbf files into a directory Images), we can run autoPROC on that dataset via

    process ReverseRotationAxis=yes -I Images -d 00 | tee 00.lis

The extra keyword is recorded on our beamline settings wiki page.

Radiation damage

Both sweeps of data show strong indications of radiation damage:

Sweep Spots found R-values
1 00_y3_b07_SPOT.XDS.SpotsPerImage.png 00_y3_b07_aimless.mrfana.Rmerge_Rmeas_Rpim_batch.png
2 00_y3_b07b_SPOT.XDS.SpotsPerImage.png 00_y3_b07b_aimless.mrfana.Rmerge_Rmeas_Rpim_batch.png

If these had been low-dose, high-multiplicity datasets we could have (1) restricted the image ranges to the earlier half for each sweep, or (2) analysed the data for early and late parts (and use those results for F(early)–F(late) radiation-damage detection maps in BUSTER).

The problem with missing data is visible in the STARANISO plots:

00_staraniso_alldata.ismean-0kl.png 00_staraniso_alldata.ismean-h0l.png 00_staraniso_alldata.ismean-hk0.png

and the merging statistics:

anisotropic (STARANISO) analysis:

  Spacegroup name          C2
  Unit cell parameters     190.8751 110.2789 64.2203 90.0000 96.2532 90.0000
  Wavelength               0.97918 A
 
  Diffraction limits & principal axes of ellipsoid fitted to diffraction cut-off surface:
    2.503         0.7227   0.0000   0.6912       0.962 _a_* + 0.273 _c_*
    2.168         0.0000   1.0000   0.0000       _b_*
    3.488        -0.6912   0.0000   0.7227      -0.933 _a_* + 0.360 _c_*
 
  Number of active ice-rings within this resolution range =  0

  Number of RUNs (sweeps) contributing to this dataset =   2

  Criteria used in determination of diffraction limits:
  -----------------------------------------------------
     local(I/sigI)  >=    1.20

                                              Overall  InnerShell  OuterShell
  ---------------------------------------------------------------------------
     Low resolution limit                      63.838      63.838       2.571
     High resolution limit                      2.207       6.865       2.207


     Rmerge  (all I+ & I-)                      0.161       0.060       0.637
     Rmerge  (within I+/I-)                     0.141       0.058       0.582
     Rmeas   (all I+ & I-)                      0.197       0.073       0.845
     Rmeas   (within I+/I-)                     0.190       0.078       0.802
     Rpim    (all I+ & I-)                      0.110       0.040       0.547
     Rpim    (within I+/I-)                     0.126       0.053       0.548
     Total number of observations               56301        3992        1823
     Total number unique                        20852        1408         849
     Mean(I)/sd(I)                                8.8        26.3         1.5
     Completeness (spherical)                    31.4        61.6         3.5
     Completeness (ellipsoidal)                  47.2        61.6         8.4
     Multiplicity                                 2.7         2.8         2.1
     CC(1/2)                                    0.983       0.994       0.428

traditional (isotropic) analysis:

                                              Overall  InnerShell  OuterShell
  ---------------------------------------------------------------------------
     Low resolution limit                      51.754      51.754       2.697
     High resolution limit                      2.652       7.192       2.652


     Rmerge  (all I+ & I-)                      0.205       0.057       1.264
     Rmeas   (all I+ & I-)                      0.251       0.070       1.785
     Rpim    (all I+ & I-)                      0.142       0.040       1.261
     Total number of observations               64946        3441        1130
     Total number unique                        25685        1226         961
     Mean(I)/sd(I)                                6.7        29.5         0.9
     Completeness                                66.8        62.0        50.0
     Multiplicity                                 2.5         2.8         1.2
     CC(1/2)                                    0.975       0.994       0.370

For comparison the values from the deposited structure (as available):

                                              Overall  InnerShell  OuterShell
  ---------------------------------------------------------------------------
     Low resolution limit                      43.680      43.680       2.750
     High resolution limit                      2.700       7.320       2.700


     Rmerge  (all I+ & I-)                      0.140       0.060       0.637
     Rmeas   (all I+ & I-)                      0.173       0.071       0.769
     Rpim    (all I+ & I-)                      0.099       0.040       0.492
     Total number unique                        20799        1163         679
     Mean(I)/sd(I)                                4.9          NA          NA
     Completeness                                57.3        62.6        38.4
     Multiplicity                                 2.5         2.9         1.9
     CC(1/2)                                       NA       0.991       0.566

These show better R-values (but poorer <I/sigma(I)> values) and lower completeness: so maybe some images (towards the end of each sweep where radiation damage becomes very pronounced) were exclued? This would also explain the better R-values (but not the lower <I/sigma(I)> value).