--smiles string |
Read molecule from specified SMILES string | |
---|---|---|
--smarts string |
Read molecule from specified SMARTS string | |
f {f FIELD1 ,f FIELD2 , ... } | Import data fields from a multi-column file. The fields should be separated by tab character. The first column contains the SMILES/SMARTS strings, the second may contain the molecule name or the data field called FIELD1 , the following columns contain the other fields. | molconvert sdf "foo.smi{fname,fID}" reads the smiles string, the name and the ID from the foo.smi file and converts it to SDF format. |
d | Import with Daylight compatibility for query H. In daylight smarts, H is only considered as H atom when the atom expression has the syntax [<mass>H<charge> | |
c | Ignore fixing of double bond stereo information in small rings, also ignore fixing of aromatic bonds to aliphatic if necessary. Double bonds in small rings (ring size < 8) is imported automatically with CIS stereo information. If c options is set, the double bond stereo information is not changed to CIS during the import. By default the bond is aromatic between two aromatic atom. But this is not true e.g. in case of biphenyl where the bond connecting the two aromatic ring is single. If biphenyl is represented with the SMILES string: "c1ccc(cc1)c1ccccc1" then it is necessary to set the bond between the two rings to single. If the molecule is exported by Chemaxon tools, the single bond between two aromatic atom is always explicitly written to avoid any confusion, so fixing aromatic bonds to aliphatic can be avoided. | |
x | Do not import any stereochemical information. | molconvert smiles "C/C=C/[C@@H]1CCC@HCC1{smiles:x}" CC=CC1CCC(C)CC1 |
Z | Import compressed smiles. The compressed format must be specified explicitly, as it is not recognized by the importer automatically. |
After importing SMILES, invoking of MoleculeGraph.clearCashedInfo
method is recommended in order to remove cashed information which results increased molecule size.
Export options can be specified in the format string. The format descriptor and the options are separated by a colon.
Codename | Explanation | |
---|---|---|
a, +a, +a_gen | General aromatization. | |
a_bas | Basic aromatization. | |
a_loose | Loose aromatization. | |
a_ambig | Ambiguous aromatization. | |
-a, -a_gen | General Dearomatization. | |
-a_huckel | Huckel dearomatization. | |
-a_huckel_ex | Huckel dearomatization, throwing exception in case of failure. | |
H, +H | Add explicit Hydrogen atoms. | |
-H | Remove explicit Hydrogen atoms. | |
0 | Do not include chirality (parity) and double bond stereo (cis/trans) information | smiles:0 (not stereo)smiles:a0(aromatic, not stereo) |
q | Obsolete option. Atom equivalences are checked by default using graph invariants at double bonds. | smiles -s "C/C=C(/C)C"results CC=C(C)C |
ri | Smiles export rigorousness ( i with the following values): 1. Export the most information from the molecule to SMILES or SMARTS format. Don't check anything. 5. Atoms, bonds and the molecule is checked for SMILES, SMARTS compatibility ( default ). * 7. In addition to the checks in case of value 5, double bonds in alternating single and double bond chain are checked for correct export. | Let molecule.mrv file contain the molecule CC=CC=CC=CC where the two side double bonds are in TRANS configuration but the middle one has no CIS, TRANS information (crossed double bond, or double bond with wiggly bond).molconvert smiles:r7 m.mrv drops the exception: "Nonstereo double bond between active CIS TRANS stereo bonds. Not possible to export it correctly to SMILES" molconvert smiles m.mrv results C\C=C\C=C\C=C\C (which is incorrect in the sense that the middle bond became TRANS configuration). |
s | Write query smarts.(don't write explicit H in bracket)(See query SMARTS for details.) | |
u | Write unique smiles. This option performs aromatization and removes explicit hydrogens (also considers chirality info[2]). Note: Use this option if you want unique smiles export. | This option can help in cases in which the compounds should be unique chemically, even though the representations are different. For example aromatic compounds in Kekulé form can generate different smiles for different Kekulé forms of the same aromatic compound without this option. |
h | Convert explicit H atoms to query hydrogen count. | |
Tf1:f2:... | Export f1 , f2 ... SDF fields. The fields are separated by tab character. If '-' is given before the T option like '-Tf1:f2:...' then no header line is written. '*' character is used to export all fields (and name also) in the molecules. 'name' field is used to export molecule name (if no 'name' field in the molecule exists). | |
t | Export terminal atom with single_or_aromatic bond. | Examples: instead of [#6]-c1ccccc1 export the molecule to [#6]c1ccccc1instead of [#6]-[#6] export the molecule to [#6][#6] |
n | Export molecule name (the first line of an MDL molfile). | |
Z | Use compressed format, and compress the SMILES string. Note that the compressed format is not recognized by the import, so it should be specified explicitly. | |
x | ||
BOM | Write the UTF-8 byte order mark (BOM), if the given or the system's encoding is UTF-8. |