The Standardizer command-line tool reaches all the functionalities of Standardizer.
On this page you will find information on
You can standardize molecules with Standardizer using the following command:
Prepare the usage of the standardize
script or batch file as described here.
Available options will be displayed by this command:
The appearing general options:
The command line parameter --config
is mandatory. This specifies the path and filename of a configuration file or else it is the simple actionstring. We highly recommend creating the configuration file via Standardizer GUI or Standardizer Editor.
By default, the program exits in case of molecule import/export errors. If the command-line parameter -g
or --ignore-error
is specified, then errors will not stop the process. The error is written to the console, the molecule is discarded from the structure file (the resulting file will contain less molecules than the input file). With option --empty-mol-on-error
the structure is changed for an empty molecule. The molecule is presented in the original form when using the option --unstandardized-mol-on-error
. Both of these settings result in a file containing the same number of structures as the input file.
The following command lists the available Standardizer actions:
Valid actionstrings are:
Visit Standardizer Actions to find out more about the functionality of each action!
Most molecular file formats are accepted.
The input is either specified in input file(s), or else in input string(s), usually in SMILES format.
If neither the input file name(s) nor the input string(s) are specified in the command line then the standard input is read.
Standardizer writes output molecules in the format specified by the --format
option (the default format is "SMILES"). If the --output
is omitted, results are written to the standard output.
If the command line parameter --export-fields-to-smiles
is specified, then the property fields (SDF fields) of the molecules will be exported even if the output format is SMILES, SMARTS, Chemaxon Extended SMILES or Chemaxon Extended SMARTS. In case of other formats the property fields are always exported, this option has no effects.
A UNIX command that reads molecular structures from the mols.sdf
file and writes the standardized molecules to the standard output in smiles format:
A UNIX command that reads molecules given as SMILES strings from file nci10000.smiles
located in the./test/pharmacophore
directory and writes results in the file named nci10000.sdf
to be created in the same directory:
The same with transformation check and verbose output, then displaying the result in MarvinView:
Processing an SD file and displaying the standardized molecules using MarvinView:
Note that such piping does not work in Windows.
Standardization with actionstring:
Standardization with actionstring, taking input molecules as SMILES strings:
Processing tasks belonging to no groups or to task group "target":