Difference between revisions of "Keyword Example Scripts"

From Phaserwiki
m (Protected "Phaser-2.2: Keyword Example Scripts" ([edit=sysop] (indefinite) [move=sysop] (indefinite)) [cascading])
(No difference)

Revision as of 13:05, 7 September 2009

Phaser can be run from the command line

$prompt> phaser

After printing the Phaser Banner, the Preprocessor will ask:

ENTER KEYWORD INPUT FROM FILE OR FROM STANDARD INPUT

Enter the keyword input

At the end of the input, start Phaser with one of the commands

  • END
  • QUIT
  • STOP
  • KILL
  • EXIT
  • GO
  • RUN
  • START

Alternatively, phaser can be run from command scripts as below

Automated Molecular Replacement

Example command script for finding BETA and BLIP. This is the minimum input, using all defaults (except the ROOT filename).

  #''beta_blip_auto.com''
  phaser << eof
  TITLe beta blip automatic
  MODE MR_AUTO
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  SEARch ENSEmble beta NUM 1
  SEARch ENSEmble blip NUM 1
  ROOT beta_blip_auto # not the default
  eof

Example command script for finding BETA and BLIP. The spacegroup recorded on the mtz file is P3221 but the other hand is also a possibility. Both search orders (BETA first, BLIP second and BLIP first, BETA second) are tried, using the PERMutations ON keyword. We would not normally recommend using the PERMutations ON keyword for this case, as it is obvious that the larger molecule should be easier to find first. To speed up the calculation only the top peak after the translation function is taken into refinement. As one of a number of alternatives to providing sequence files to specify the composition, the molecular weights of the two components are given in this example.

  #''beta_blip_auto_sg.com''
  phaser << eof
  TITLe beta blip automatic
  MODE MR_AUTO
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein MW 28853 NUM 1 #beta
  COMPosition PROTein MW 17522 NUM 1 #blip
  SEARch ENSEmble beta NUM 1
  SEARch ENSEmble blip NUM 1
  PERMutations ON # not the default
  SGALternative HAND # not the default
  ROOT beta_blip_auto_sg # not the default
  FINA TRA SELEct NUM 1 # not the default
  eof

How to Define Models

Building an Ensemble from Coordinates

You have one structure as a model with 44% sequence identity to the protein in the crystal.

  ENSEmble mol1 PDB structure1.pdb IDENtity .44

You have three structures as models with 44%, 39% and 35% identity to the protein in the crystal.

  ENSEmble mol2   PDB structure1.pdb IDENtity .44 PDB structure2.pdb IDENtity .39 PDB structure3.pdb IDENtity .35

You have an NMR Ensemble as a model. There is no need to split the coordinates in the pdb file provided that the models are separated by MODEL and ENDMDL cards. In this case the sequence identity is not a good indication of the rms deviation of the structural coordinates to the target structure. You should use the RMS option; several test cases have succeeded where the ID was close to 100% with an RMS value of about 1.5Å (see table below).

  ENSEmble mol3 PDB nmr.pdb RMS 1.5

Building an Ensemble from Electron Density

You have low resolution electron density of your model. This density has been cut out and converted to structure factors in a large cell.

  ENSEmble mol1  HKLIn mol1.mtz F = Fmol1 P = Pmol1 EXTEnt 23 25 29 RMS 2.0 CENTre 4 3 30 PROTein MW 10241 NUCLeic MW 0

How to Define Composition

Composition by Molecular Weight

You have one protein (with MW 21022) in the asymmetric unit

COMPosition PROTein MW 21022

You have three copies of a protein (with MW 21022) in the asymmetric unit

  COMPosition PROTein MW 21022
  COMPosition PROTein MW 21022
  COMPosition PROTein MW 21022

Another way of entering the same thing is

  COMPosition PROTein MW 21022 NUMber 3

Yet another way of entering the same thing is

  COMPosition PROTein MW 63066

You have two copies of a protein (with MW 21022), two copies of a protein (with MW 9843) and RNA with (MW 32004) in the asymmetric unit

  COMPosition PROTein MW 21022 NUMber 2
  COMPosition PROTein MW 9843 NUMber 2
  COMPosition NUCLeic MW 32004

Composition by Sequence

You have one protein (with sequence in fasta format in the file prot1.seq) in the asymmetric unit

  COMPosition PROTein SEQuence prot1.seq

You have three copies of a protein (with sequence in fasta format in the file prot1.seq) in the asymmetric unit

  COMPosition PROTein SEQuence prot1.seq
  COMPosition PROTein SEQuence prot1.seq
  COMPosition PROTein SEQuence prot1.seq

Another way of entering the same thing is

  COMPosition PROTein SEQuence prot1.seq NUMber 3

Yet another way of entering the same thing is to make a sequence file with all the amino acids concatenated together (prot1.seq3)

  COMPosition PROTein SEQuence prot1.seq3

You have two copies of a protein (with sequence in fasta format in the file prot1.seq), two copies of a protein (with sequence in fasta format in the file prot2.seq) and RNA with (with sequence in fasta format in the file nucl1.seq) in the asymmetric unit

  COMPosition PROTein SEQuence prot1.seq NUMber 2
  COMPosition PROTein SEQuence prot2.seq NUMber 2
  COMPosition NUCLeic SEQuence nucl1.seq

Composition by Percentage Scattering

Each copy of Ensemble mol1 gives 22% of the scattering

  COMPosition ENSEmble mol1 FRACtional 0.22

Each copy of Ensemble mol2 gives 78% of the scattering

  COMPosition ENSEmble mol2 FRACtional 0.78

How to Define Solutions

To include the files you should use the preprocessor command @

  @ filename.sol
  @ filename.rlist

"sol" Files

One copy of mol1 with known orientation and position (fractional coordinates)

   SOLUtion 6DIM ENSEmble mol1 EULEr 17 20 32 FRACtional 0.12 0.05 0.74

One copy of mol1 with known orientation only

   SOLUtion 3DIM ENSEmble mol1 EULEr 17 20 32

One copy of mol1 with known orientation and only the coordinates in 2 dimensions is known. The degenerate direction is defined as the direction perpendicular to the plane in which the position is given.

   SOLUtion 5DIM ENSEmble mol1 EULEr 17 20 32 DEGEnerate X FRACtional 0.05 0.74 

If the rotation function and translation function for mol1 were very clear, then there will only be one type of 6DIM solution for mol1. If the rotation and translation functions for mol2 were then not clear, there will be a series of possible 6DIM solutions for mol2.

  SOLUtion SET 
  SOLUtion 6DIM ENSEmble mol1 EULEr 17 20 32 FRACtional 0.12 0.05 0.74
  SOLUtion 6DIM ENSEmble mol2 EULEr 5 183 230 FRACtional 0.71 0.54 0.81 
  SOLUtion SET
  SOLUtion 6DIM ENSEmble mol1 EULEr 17 20 32 FRACtional 0.12 0.05 0.74
  SOLUtion 6DIM ENSEmble mol2 EULEr 51 93 75 FRACtional 0.08 0.57 0.25 

"rlist" Files

THere are three trial orientations to search

  SOLUtion TRIAl ENSEmble mol1 EULEr 17 20 32 SCORE 4.5
  SOLUtion TRIAl ENSEmble mol1 EULEr 67 65 51 SCORE 4.4
  SOLUtion TRIAl ENSEmble mol1 EULEr 67 112 81 SCORE 4.3

There are two possibilities for the position of the first molecule, and two orientations to search for the first and three for the second.

  SOLUtion SET
  SOLUtion 6DIM ENSEmble mol1 EULEr 17 20 32 FRACtional 0.12 0.05 0.74
  SOLUtion TRIAl ENSEmble mol1 EULEr 44 20 32 SCORE 5.8
  SOLUtion TRIAl ENSEmble mol1 EULEr 67 65 51 SCORE 5.2
  SOLUtion SET
  SOLUtion 6DIM ENSEmble mol1 EULEr 17 20 32 FRACtional 0.13 0.55 0.76
  SOLUtion TRIAl ENSEmble mol1 EULEr 83 9 180 SCORE 6.3
  SOLUtion TRIAl ENSEmble mol1 EULEr 8 36 92 SCORE 4.2
  SOLUtion TRIAl ENSEmble mol1 EULEr 48 87 10 SCORE 4.0

If a degenerate translation function is performed, then a SOLUtion TRIAl line is produced with the degenerate translation information present, ready for performing the translation function on the third dimension.

  SOLUtion TRIAl ENSEmble mol1 EULEr 17 20 32 DEGEnerate X FRACtional 0.05 0.74 

Fixed Partial Structure

If you have the coordinates of a partial solution with the pdb coordinates of the known structure in the correct orientation and position, then you can force Phaser to use these coordinates. Use this pdb file to define an ensemble (named "mol1" in this example). Then manually create a .sol file of the following form and include it in the Phaser command script with the @filename preprocessor command (or include it directly in the script)

  SOLUtion SET 
  SOLUtion 6DIM ENSEmble mol1 EULEr 0 0 0 FRACtional 0 0 0

Fast Rotation Function

Example command script for fast rotation function to find the orientation of BETA.

  
  #beta_frf.com
  phaser << eof
  TITLe beta FRF
  MODE MR_FRF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  SEARCH ENSEmble beta
  ROOT beta_frf
  eof

Example command script for fast rotation function to find the orientation of BLIP knowing the position and orientation of BETA, with the position and orientation of BETA input from the command line.

  #blip_frf_with_beta.com
  phaser << eof
  TITLe blip FRF with beta rotation and translation
  MODE MR_FRF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq #beta
  COMPosition PROTein SEQuence blip.seq #blip
  SEARch ENSEmble blip
  SOLUtion 6DIM ENSEmble beta EULEr 201 41 184 FRACtional -0.49408 -0.15571 -0.28148
  ROOT blip_frf_with_beta
  eof

Example command script for fast rotation function to find the orientation of BLIP knowing only the orientation of BETA, with the orientation of BETA input using the output solution file from the beta_frf.com job above.

  #blip_frf_with_beta_rot.com
  phaser << eof
  TITLe blip FRF with beta R
  MODE MR_FRF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  SEARch ENSEmble blip
  @beta_frf.sol # solution file output by phaser
  ROOT blip_frf_with_beta_rot
  eof

Brute Rotation Function

Example command script for brute rotation function to find the orientation of BETA

  #beta_brf.com
  phaser << eof
  TITLe beta BRF
  MODE MR_BRF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  SEARch ENSEmble beta
  ROOT beta_brf
  eof

Example command script for brute rotation function to find the optimal orientation of BETA in a restricted search range and on a fine grid around the position from the fast rotation search.

  #beta_brf_around.com
  phaser << eof
  TITLe beta BRF fine sampling
  MODE MR_BRF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  SEARch ENSEmble beta
  ROTAte AROUnd EULEr 201 41 184 RANGE 10
  SAMPling ROTation 0.5
  XYZOut ON # not the default
  ROOT beta_brf_around
  eof

Fast Translation Function

Example command script for finding the position of BETA after the rotation function has been run and the results output to the file beta_frf.rlist

  #beta_ftf.com
  phaser << eof
  TITLe beta FTF
  MODE MR_FTF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  @beta_frf.rlist
  ROOT beta_ftf
  eof 

Example command script for finding the position of BLIP after the rotation function has been run and the results output to the file blip_frf_with_beta.rlist, which has the SOLUtion 6DIM keyword input for BETA and the SOLUtion TRIAL keyword input for the orientations to try for BLIP with the translation function.

  #blip_ftf_with_beta.com
  phaser << eof
  TITLe beta FTF
  MODE MR_FTF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  @blip_frf_with_beta.rlist
  ROOT blip_ftf_with_beta
  eof

Brute Translation Function

Example command script for brute Translation function to find the position of BETA after the rotation function has been run

  
  #beta_btf.com
  phaser << eof
  TITLe beta BTF
  MODE MR_BTF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  @beta_frf.rlist
  TRANslate AROUnd FRACtional POINt -0.49408 -0.15571 -0.28148 RANGe 5
  ROOT beta_btf
  eof 

Example command script for brute Translation function to find the position of BETA degenerate in X after the rotation function has been run

  #beta_btf_degen_x.com
  phaser << eof
  TITLe beta degenerate X
  MODE MR_BTF
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  @beta_frf.rlist
  TRANslate DEGEnerate X
  ROOT beta_btf_degen_x
  eof 

Refinement and Phasing

Example command script to refine a set of solutions

  #beta_blip_rnp.com
  phaser << eof
  TITLe beta blip rigid body refinement
  MODE MR_RNP
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  ROOT beta_blip_rnp # not the default
  HKLOut OFF # not the default
  XYZOut OFF # not the default
  @beta_blip_auto.sol
  eof 

Log-Likelihood Gain

Example command script to rescore the solutions using a different resolution range of data and a different spacegroup

  #beta_blip_llg.com
  phaser << eof
  TITLe beta blip solution 6A P3121
  MODE MR_LLG
  HKLIn beta_blip.mtz
  LABIn F=F SIGF = SIGF
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  ROOT beta_blip_llg # not the default
  RESOlution 6.0
  SPACegroup P 31 2 1
  @beta_blip_auto.sol
  eof 

Packing

Example command script for determining whether a set of molecular replacement solutions pack in the unit cell.

  #beta_blip_pak.com
  phaser << eof
  TITLe beta blip packing check
  MODE MR_PAK
  HKLIn beta_blip.mtz
  LABIn F=F SIGF=SIGF
  ENSEmble beta PDB beta.pdb IDENtity 100
  ENSEmble blip PDB blip.pdb IDENtity 100
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  ROOT beta_blip_pak # not the default
  @beta_blip_auto.sol
  eof 

Automated Experimental Phasing

Do SAD phasing of insulin. This is the minimum input, using all defaults (except the ROOT filename and specifying the wavelength explicitly).

  #insulin_auto.com
  phaser << eof
  MODE EP_AUTO
  TITLe sad phasing of insulin with intrinsic sulphurs
  HKLIn S-insulin.mtz
  COMPosition PROTein SEQ S-insulin.seq
  CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
  CRYStal insulin DATAset sad SCATtering CUKA # default: change if necessary
  LLGComplete CRYStal insulin COMPLETE ON SCATtering ELEMent S
  ATOM CRYStal insulin PDB S-insulin_hyss.pdb
  ROOT insulin_auto
  eof

Anisotropy Correction

Example command script to correct BETA-BLIP data for anisotropy

 
  #beta_blip_ano.com
  phaser << eof
  MODE ANO
  TITLe beta blip data correction
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  ROOT beta_blip_ano # not the default
  eof 

Cell Content Analysis

Example script for cell content analysis for BETA-BLIP

  #beta_cca.com
  phaser << eof
  TITLe BETA-BLIP cell content analysis
  MODE CCA
  HKLIn beta_blip.mtz
  LABIn F=Fobs SIGF=Sigma
  COMPosition PROTein SEQuence beta.seq NUM 1 #beta
  COMPosition PROTein SEQuence blip.seq NUM 1 #blip
  RESO 3.0
  ROOT beta_blip_cca # not the default
  eof 

Normal Mode Analysis

Do normal mode analysis only, write out eigenfile but not coordinates

  
  #beta_nma.com
  phaser << eof
  TITLe beta normal mode analysis
  MODE NMA
  ENSEmble beta PDB beta.pdb IDENtity 100
  XYZOut OFF
  ROOT beta_nma # not the default
  eof 

Write out pdb files perturbed in 0.5 Ångstrom rms intervals in "forward" (positive dq values) along modes 7 and 10 (and combinations of 7 and 10)

  #beta_nma_pdb.com
  phaser << eof
  TITLe beta normal mode analysis pdb file generation
  MODE NMA
  ENSEmble beta PDB beta.pdb IDENtity 100
  ROOT beta_nma_pdb # not the default
  EIGEn beta_nma.mat
  NMAPdb MODE 7 MODE 10 RMS 0.5 FORWARD
  eof