Shell Scripts and Awk Programs

  1. cartopdbq
 
Usage
: cartopdbq lig.car > lig.pdbq

Converts from Biosym InsightII ".car" format to PDBQ format

 
Usage
: check-qs lig

Needs:
 lig.pdbq

Creates:
 lig.err

Checks partial atomic charges in PDBQ file; any non-integral charges are reported.

 
Usage
: checkqs lig

Needs:
 lig.pdbq

Creates:
 lig.err

Sorts the input PDBQ file by residue number before running the result through check-qs .

 
Usage
: clamp grid.map > grid.map.NEW

Clamps any AutoGrid map values that exceed ECLAMP (normally set to 1000.0)

 
Usage
: cnvmol2topdbq lig.mol2 > lig.pdbq
Needs:
 lig.mol2
Creates:
 lig.pdbq

This converts from (fixed format) Tripos SYBYL mol2 fiormat into PDBQ format, but stores all the residues' chain-IDs specified by the SUBSTRUCTURE records in the mol2 file. These chain-IDs are then output when the PDBQ lines are written.

 
Usage
: deftors lig.mol2

Creates
: lig.pdbq
, lig.err

Sets up rotatable bonds for AutoDock . This script launches AutoTors , with the -A +15.0, -a, -h and -m flags; it also checks the charges in the output PDBQ file, with check-qs .

 
Usage
: dpf3gen lig.pdbq > lig.dpf

This is normally used by mkdpf3 , so you should not use this script by hand.

It generates a pre-cursor to a default AutoDock docking parameter file. You must edit the file before using it. This reads in the small molecule PDBQ file, detects all atom types present in the lig.pdbq; and creates a docking parameter file for AutoDock . Note the user must replace the tags <lig> and <macromol> by appropriate filename stems.

This uses equilibrium separations and well depths to define pairwise energy potentials, rather than coefficients.

 
Usage: 
get-coords lig.vol > lig.txt

This is used as part of prepare , prepare-gpf+dpf , prepare II and prepare III . It takes the .vol file created by pdb-volume and creates a line that can be used in the grid parameter file to specify the center of the maps.

 
Usage
: get-docked lig.macro.dlg
Creates
: lig.macro.dlg.pdb

This extracts the docked records from a docking log file. This is very useful when wanting to view the results of a docking in a molecular modelling program or molecular viewer. It is essentially the same as the ` dockedtopdb ' awk program.

 
Usage
: gethis lig.macro.dlg
 
Usage
: getready lig.pdb

Needs:
 	pdbinfo, pdbsplitchains, pdbwaters, pdbdewater

Creates:
 lig.info, lig_.atm.pdb, lig_.het.pdb, lig_.wat.pdb
 
Usage
: gpf3gen lig.pdbq[s] > lig.gpf

This is used by mkgpf3 , and should not be used by hand; otherwise the user must edit certain tags by hand before this can be used by AutoGrid .

This generates a precursor to a grid parameter file. It takes lig.pdbq as its input file, detects all atom types present, and creates the properly formatted parameter file for AutoGrid . It uses equilibrium separations and well depths to define pairwise energy potential. It also assigns atomic solvation parameters, based on Stouten, P.F.W., FrÖmmel, C., Nakamura, H., and Sander, C. (1993), "An effective solvation term based on atomic occupancies for use in protein simulations", Molecular Simulation , 10 , 97-120.

 
Usage
: histable lig.macro.dlg
Creates
: lig.macro.dlg.tbl

This extracts the histogram from the docking log file, and counts all the `#' symbols, writing the result in a table file. This is suitable for input to a variety of graph drawing programs and spreadsheets.

 
Usage
: job3 lig.macro > lig.macro.joblog &

Launches a single AutoDock 3.0 job. It assumes that " lig.macro.dpf " exists, and executes AutoDock using the arguments:

 
autodock3 -p lig.macro.dpf -l lig.macro.dlg

You must edit this script the first time you use it, so that the environment variables $root, $bin and $sh are correctly set equal to, respectively: the path to the root of AutoDock tree, the architecture-dependent binary subdirectory and the Unix scripts subdirectory. The file lig.macro.joblog contains the output from the job script.

 
Usage
: makebox macro.gpf >! macro.gpf.box.pdb

Creates
: macro.gpf.box.pdb

If you colour the `box molecule' by atom type, i.e. red for oxygen, green for carbon, and blue for nitrogen, then the edges of this box will be coloured-coded to indicate the Cartesian axes. R,G,B will correspond to x,y,z, respectively. Your molecule viewer must obey the CONECT records in the ` macro.gpf.box.pdb ' file, even though the corresponding bonds may appear too long, otherwise the edges of the grid box will not be displayed.

 
Usage
: mkbox macro.gpf >! macro.gpf.box.pdb

Creates
: macro.gpf.box.pdb

This is very similar to ` makebox ', except that this puts a phosphorus atom at the minimum x , minimum y and minimum z coordinates of the box. This helps to convey which directions are +x, +y and +z . Once again, if oxygen is red, carbon is green and nitrogen is blue, then R,G,B will correspond to x,y,z, respectively.

 
Usage
: mkdlgfld lig.macro.dlg

Needs
: lig.macro.dlg

Creates
: lig.macro.dlg.fld

Only needed for AVS users.

This extracts the "AVSFLD" records from an AutoDock log file, and puts them in lig.macro.dlg.fld. These "AVSFLD" descriptors must be removed before the file can be used in AVS.

 
Usage
: mkdpf3 lig.pdbq macromol.pdbqs

Needs
: dpf3gen, dpf3gen.awk (AWK program)
Creates
: lig.macro.dpf

This creates a default docking parameter file for AutoDock 3.0; it needs the ligand in PDBQ format and the macromolecule in PDBQS format. It uses the script dpf3gen , which in turn calls the awk program ` dpf3gen.awk '. The lig.macro.dpf docking parameter file is based on the atom types detected in the input lig.pdbq file. See dpf3gen above.

 
Usage
: mkgpf3 lig.pdbq macromol.pdbqs
Needs
: gpf3gen, gpf3gen.awk (AWK program), 
pdbcen (AWK program)
Creates
: macro.gpf

This creates a default grid parameter file for AutoGrid 3.0; it needs gpf3gen.awk and pdbcen , both awk programs. See gpf3gen above.

 
Usage
: mol2fftopdbq lig.mol2 > lig.pdbq

Needs:
 lig.mol2

Creates:
 lig.pdbq

Converts from free formatted SYBYL mol2 into AutoDock PDBQ format. Chain-IDs specified in the mol2 file by the SUBSTRUCTURE records are incorporated into the PDBQ file.

 
Usage
: mol2topdbq lig.mol2

Needs:
 lig.mol2

Creates:
 lig.pdbq

Converts from fixed-format SYBYL mol2 into AutoDock PDBQ format, and automatically names the output based on the stem of the input mol2 file. Do not use " mol2topdbq lig.mol2 > lig.pdbq ", because " lig.pdbq " is automatically created.

 
Usage
: mol2topdbqs lig.mol2

Needs:
 lig.mol2

Creates:
 lig.pdbqs
 
Usage
: pdbcen lig.pdb
Creates
: a "gridcenter" line in AutoGrid GPF format, holding the x,y,z coordinates of the molecule.

This calculates the center of a molecule supplied in PDB format, and outputs a line holding the x,y,z coordinates of the molecule for inclusion in an AutoGrid 3.0 grid parameter file (GPF).

 
Usage
: pdb-center [ lig.pdb | lig.pdbq ] > lig2.pdb

Calculates the center of mass of each residue; writes these coordinates out using REMARK records.

 
Usage
: pdb-center-all [ lig.pdb | lig.pdbq ] > lig2.pdb

Calculates the center of mass of each residue; writes these coordinates out using REMARK records.Also calculates the center of all the residues.

 
Usage
: pdb-distance macro.pdb

The first line of the macro.pdb file defines the center of the distance profile. It is just a copy of the line containing the atom of interest, which will be the origin for the distance calculations. However, it must have the ATOM or HETATM record replaced with a non-PDB tag, `FROM'. The x,y,z coordinates in this FROM line will then be used to calculate the distance to the center of each residue in the protein. Finally, this awk program outputs a bar chart using `#' symbols, showing the distance from this point to each residue. This can be useful to identify all the residues nearest a particular ligand atom, or near an active site.

 
Usage
: pdbdewater macro.pdb >! macro.dry.pdb

This removes any water records from a PDB file.

 
Usage
: pdbinfo macro.pdb

Builds a summary of the contents of a PDB file.

 
Usage
: pdb-volume [ lig.pdb | lig.pdbq ] > lig2.pdb

Calculates the center of mass of each residue. Writes out REMARKs showing these coordinates. Draws ASCII diagram showing volume extents of each residue.

Usage : pdbqtobnd lig
Needs: lig.pdbq
Creates: lig.bnd

Creates " lig.bnd " from the existing " lig.pdbq " ligand PDBQ file. Note this script needs just the stem of the file name. This script executes " pdbqtoatm " and " atmtobnd ": the latter is an executable, not a script, so it must be compiled for each architecture and operating system used.

 
Usage
: pdbqtopdb lig.pdbq > lig.pdb

Converts from AutoDock PDBQ to PDB format.

 
Usage
: pdbsplitchains macro.pdb

Creates separate PDB files that contain each of the chains in macro.pdb . The chain IDs are used to name the new PDB files. If there is no chain ID, the underscore character, `_' is used.

Usage : pdbtoatm lig.pdbq > lig.atm

This creates a Connolly ATM formatted file " lig.atm " from the ligand PDBQ file, " lig.pdbq ". This is used to create input for the utility program atmtobnd to generate a bond connectivity file.

 
Usage
: pdbwaters macro.pdb > macro.wet.pdb
 
Usage:
 prepare m s

where : m.pdb and s.pdbq contain the receptor and ligand respectively. Prepare performs the following eight steps. The macromolecule ` .pdb ' filename stem is represented by " m ", and the ligand ` .pdbq ' filename stem by "s":

1. Extracts all ATOM and TER records from m.pdb into m.enz;
2. Renumbers residues to avoid problems in protonate -step;
3. Adds polar hydrogens to m.enz , creating m.polH ;
4. Somewhat crudely assigns partial atomic charges to m.polH , creating m.pdbq ;
5. Checks charges in m.pdbq , all errors held in m.err ;
6. Creates s.gpf , a parameter file for AutoGrid , based on ligand file s.pdbq ;
7. Creates s.vol , a volume dimensions file; and finally,
8. Creates s.dpf , a parameter file for AutoDock, based on ligand file s.pdbq ;

Its arguments are the stem of the filename of the macromolecule ` .pdb ' file and that of the ligand PDBQ file. See the flowchart below for more details. It shows what files are created by ` prepare ', and which scripts or programs are used. Steps 1.-4. are better carried out with a reliable molecular modeling system: these steps can produce some odd results unless carefully checked.

The user must check the m.err error file to ensure there are no non-integral charges, either on any residue in the macromolecule, or on the macromolecule as a whole. If there are, then the user must repair the m.pdbq file. This problem can arise if there are atoms for which no coordinates were assigned by the crystallographer, e.g . due to ambiguous electron density. Assuming there were no problems, s.gpf and s.dpf should be successfully produced.

 
Usage
: prepare-gpf+dpf macro lig

Executes only steps 6. through 8.

 
Usage
: rem-lp lig.pdbq
or
: rem-lp lig.mol2
Creates:
 lig.pdbq
or
: lig.mol2

This removes the lone-pairs (atom name = LP) added by some molecular modelling programs, such as SYBYL, and adds their partial charges on to that of the atom to which they were attached (SG in cysteines and SD in methionines). Otherwise, AutoDock treats lone-pairs as carbon atoms. (Note: if you need lone-pairs, you can force AutoGrid to calculate a grid map for "LP" atoms, using the atom code "L" in the "types" commands of AutoGrid and AutoDock).

 
Usage
: lig.pdb > lig2.pdb
 
Usage
: lig.pdb > lig.rnm

Used by prepare to renumber residues in the macromolecule contiguously. This step is needed prior to using protonate , which may fail if there are gaps in the residue numbers.

 
Usage
: resrange lig.pdb

This is handy to summarise the range(s) of residues in a given protein PDB file.

 
Usage
: runtrj lig

Needs:
 lig.dpf
, lig.trj

Creates:
 lig.tcom
, lig.tlg
 and
 lig.tout

This creates an AutoDock command file, lig.tcom , which is then used to convert the trajectory written in state variables ( lig.trj ), into a trajectory written in cartesian coordinates. lig.trj is created by an earlier run of AutoDock , in which trjfrq was set to a non-zero value.

 
Usage
: stats columns.dat

This is a very useful, general awk program. Use it to calculate the minimum, maximum, mean and standard deviation for each column of numbers in an input file, here ` columns.dat '. Any alphanumeric columns will be ignored.