Experimental Phasing

From Phaserwiki
Revision as of 10:48, 23 May 2011 by Airlie (talk | contribs) (How to Define Data)

Phaser performs SAD phasing in two modes. In the Automated Experimental Phasing mode, Phaser corrects for anisotropy, puts the data on absolute scale, does a cell content analysis, refines heavy atom sites to optimize phasing, and completes the model from log-likelihood gradient maps. Alternatively, the SAD Phasing mode can be used, which only refines heavy atom sites to optimize phasing, and completes the model from log-likelihood gradient maps. For this mode, the data should be pre-corrected for anisotropy and put on an absolute scale. This mode should only be used as part of automation pipelines, where the correct preparation of the data can be guaranteed and it saves cpu time

Automated Experimental Phasing

MODE EP_AUTO combines the anisotropy correction, cell content analysis, and SAD Phasing modes to automatically solve a structure by experimental phasing. The final solution is output to the files FILEROOT.sol, FILEROOT.mtz and FILEROOT.pdb. Many structures can be solved by running an automated experimental phasing job with defaults.

How to Define Data

You need to tell Phaser the name of the mtz file containing your data and the columns in the mtz file to be used. For SAD phasing, a single CRYSTAL and DATASET with anomalous data (F(+), SIGF(+), F(-) and SIGF(-)) must be given. The columns must have the correct CCP4 column type: 'G' for F(+) and F(-) and 'L' for SIGF(+) and SIGF(-). If the columns on your mtz file have somehow acquired the incorrect column type, you should change the column type with an mtz editing programme (e.g. sftools).

   CRYSTAL insulin DATASET sad LABIN F+ = F(+) SIG+ = SIGF(+) F- = F(-) SIG- = SIGF(-)

How to Define Atoms

Atom sites are defined with the ATOM keyword. Atoms sites may be entered one at a time specifying fractional or orthogonal coordinates, occupancy and B-factor, or from a PDB file, or from a mlphare-style HA file. The crystal to which the atoms correspond must be specified in the input.

A partial structure, for example a partial and poor MR solution (which may or may not contain anomalous scatterers) can be used to start the phasing, either alone or in addition to some anomalous scattering sites entered with the ATOM keyword. The partial structure is entered with the PART keyword. If you do not have a partial MR structure, anomalous scatterers must be found using Patterson methods, Direct methods or Dual Space methods (e.g. phenix.hyss, SHELXD, SnB, Rantan) prior to running Phaser. Note that SHELXD outputs a pdb file with the scatterers labelled sulphur (S) regardless of the correct scattering type: you need to change the pdb so that it contains the correct scattering type prior to using it in Phaser.

Cluster Compounds

Cluster compounds can be used as the scattering type. The coordinates for Ta₆Br₁₂ are stored internally and have the cluster name TX. Coordinates of other cluster compounds must be supplied (coordinates are translated to the origin internally before spherical averaging the structure factors) and are given the cluster name XX.

   CLUSTER PDB <PDBFILE>

How to Control Output

The output of Phaser can be controlled with the following keywords:

Basic Modes for Experimental Phasing

SAD Phasing

MODE EP_SAD phases SAD data and completes the structure from log-likelihood gradient maps. The final solution is output to the files FILEROOT.sol, FILEROOT.mtz and FILEROOT.pdb .

Do SAD phasing of insulin. This is the minimum input, using all defaults (except the ROOT filename)

  #insulin_sad.com
  phaser << eof
  MODE EP_SAD
  TITLe sad phasing insulin with intrinsic sulphurs
  HKLIn S-insulin.mtz
  CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
  WAVElength 1.5418
  LLGComplete COMPLETE ON SCATterer S
  ATOM CRYStal insulin PDB S-insulin_hyss.pdb
  ROOT insulin_sad
  eof

Combined MR and SAD Phasing

MODE EP_SAD when used with the PART keyword phases SAD data and completes the structure from log-likelihood gradient maps using a partial structure as (at least part of) the initial atomic substructure. Only the real (i.e. non-anomalous) signal from the partial structure is used. The final solution is output to the files FILEROOT.sol, FILEROOT.mtz and FILEROOT.pdb. Note that, because the substructure after log-likelihood-gradient completion includes the partial model, the phases automatically combine information from the partial model and the anomalous scatterers.

Do SAD phasing of insulin starting from partial model of one helix only. This is the minimum input, using all defaults (except the ROOT filename)

  #insulin_mr_sad.com
  phaser << eof
  MODE EP_SAD
  TITLe sad phasing insulin with intrinsic sulphurs, starting from a helix
  HKLIn S-insulin.mtz
  CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
  WAVElength 1.5418
  LLGComplete COMPlete ON SCATterer S
  PART PDB helix.pdb ID 100
  ROOT insulin_mr_sad
  eof

Density Modification after Phaser

Phaser produces map coefficients that reduce the model bias coming from the real contribution of the anomalous scatterers, and there is a small but significant improvement by starting density modification from these FWT/PHWT coefficients rather than FP, PHIB and FOM.

The FWT/PHWT coefficients should be used when running Resolve. If you run Phaser then Resolve from the AutoSol wizard of phenix, FWT/PHWT are picked up automatically.

If you're using DM from ccp4i, you have to provide PHIB, FOM and the HL coefficients in the interface, then use the "Run&View Com File" option to add "FDM=FWT PHIDM=PHWT" to the LABIN line of the DM input.