ChemAxon's hERG Predictor

    Version 1.0-beta-3

    This documentation gives a short introduction to ChemAxon's hERG (hERG-beta) Predictor.

    Table of Contents

    What is hERG?

    hERG potassium channels play an essential role in normal electrical activity of the heart, mediating the cardiac action potential of the heartbeat. Affecting their activity by xenobiotics can have life-threatening consequences. Accordingly, hERG is one of the most important off-targets of drug discovery.

    Optimisation to reduce the risk of inhibiting the hERG channels during discovery projects requires computational prediction in the early design throughout the pre-synthesis phase. The hERG channel inhibition capacity of a drug is measured by its hERG activity (Act).

    Experimentally, hERG activity is determined by different electrophysiological methods and measured as IC50 or Ki values. However, as with other physico-chemical properties (e.g. pKa), the negative logarithm of the measured activity (pActivity) is used in the literature:

    pActivity = -log10(Act)

    Besides quantitative hERG values it is also common to provide a two-class classification of compounds for hERG activity.

    ChemAxon's hERG models

    We built two hERG models, an activity model and a two-class (TOXIC or SAFE) classification model.

    The hERG activity model

    The activity model was built using multiple publicly available data sources, including the hERG Central DataBase, ChEMBL and various selected publications and patents. The current version of the activity hERG model encapsulates structure activity relationships (SARs) from around 2500 pActivity data points.

    We ran a test to validate the predictive capabilities of the activity model on a test set of around 270 molecules. The test resulted in a 0.80 Pearson correlation coefficient and a 0.55 RMSE. 72% of the test set molecules had delta distribution less than 0.5.

    The accuracy report is available here.

    The hERG classification model

    The classification model was built using the Honma et al. dataset. The current version contains around 204k training data points.

    The classification classes of the model were determined based on the IC50 threshold of 10 μM. TOXIC corresponds to predicted IC50 ≤ 10 μM or ≥50% inhibition at 10 μM, while SAFE corresponds to predicted IC50 > 10 μM or <50% inhibition at 10 μM.

    We ran a test to validate the predictive capabilities of the classification model on the remaining test subset of around 87k data points.

    Our results were compared to the performance of other commercial hERG models in the article. They are summarised below.

    herg_class_honma_results

    NOTE: The hERG classification model is currently ONLY available in MarvinSketch, Playground, cxcalc and Chemical Terms.

    Model creation and applicability domain

    For both models we used Random Forest and Conformal Prediction algorithms to create them and provide the applicability domain of the prediction. The applicability domain provides information about a model's performance and accuracy.

    For creating the models we used selected fingerprints (e.g. ECFP) and physico-chemical descriptors.

    The applicability domain is based on the 5 most similar compounds from the training set according to ECFP-4 fingerprints and Tanimoto metrics. The error bound of the applicability domain comes from the Conformal Prediction algorithm.

    NOTE: The hERG applicability domain is currently ONLY available in Playground and Design Hub.

    How to use the hERG Predictor?

    ChemAxon's hERG Predictor uses the activity and classification models to predict the pActivity value and the classification class.

    It is currently available in MarvinSketch, in the Playground and Design Hub web applications, in the cxcalc command line tool and the Chemical Terms language.

    MarvinSketch

    In MarvinSketch hERG prediction can be done with the hERG (Beta) Plugin, which can be found under the Calculations » ADMET menu item. The current version of the plugin returns both the hERG activity and the classification class by default.

    herg_beta_marvin_21.13

    Playground

    You can predict both the hERG activity and the classification class in Playground by selecting the relevant model under the Explore Calculations menu.

    herg_explore_calcs

    If hERG (Activity model) is selected, the predicted pActivity value with its applicability domain appears next to the drawing canvas.

    herg_playground_activity

    If hERG (Classification model) is selected, the predicted classification class with its applicability domain appears next to the drawing canvas.

    herg_playground_classification

    Design Hub

    In Design Hub predicting hERG activity can be done with the Add Property » hERG menu item. The predicted activity value with its applicability domain appears next to the drawing canvas.

    herg_design_hub

    cxcalc

    To predict hERG activity with cxcalc, use the herg-beta function. You can choose which model to use for prediction by using the -a (for the activity model) and the -c (for the classification model) options. For both options the default setting is true.

    The following examples show how to use cxcalc for prediction:

    cxcalc -N i herg-beta aspirine
    hERG_activity   hERG_class
    3.55    SAFE
    cxcalc -N i herg-beta -c false sildenafil
    hERG_activity
    5.41
    cxcalc -N i herg-beta -a false sildenafil
    hERG_class
    SAFE

    Chemical Terms

    To predict hERG with Chemical Terms use the hergBeta() or hergActivityBeta() functions for activity prediction and the hergClassBeta() function for classification class prediction.

    The following examples show how to use Chemical Terms for prediction:

    evaluate -e "hergBeta()" aspirine
    3.55
    evaluate -e "hergActivityBeta()" sildenafil
    5.41
    evaluate -e "hergClassBeta()" aspirine
    SAFE

    NOTE: Memory issues (slowdown, prediction failure) can be experienced when predicting hERG in Instant JChem (IJC). To overcome such issues, increase the heap memory available to IJC. For more information see the relevant memory management documentation page.

    Summary table

    The following table summarises the availability of the hERG prediction and applicability domain features in the above mentioned products.

    Product name hERG prediction Applicability domain
    MarvinSketch
    Playground
    Design Hub
    Cxcalc
    Chemical Terms

    Licensing

    To access and use the hERG Predictor you need a valid ChemAxon ADMET license. Please consult us for more information on licensing.

    References

    1. Du, Fang et al. (2011). hERGCentral: A large database to store, retrieve and analyse compound-human ether-à-go-go related gene channel interactions to facilitate cardiotoxicity assessment in drug development, Assay and Drug Development Technologies, 9(6), 580-588. DOI link
    2. ChemBL 27, ChemBL database
    3. Honma, Teruki et al. (2019). Support Vector Machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II, Scientific Reports (NatureResearch), 9, 12220. DOI link

    Release notes

    hERG v. 1.0-beta:

    • The default ChemAxon hERG pActivity model (hERG-beta) is built and available for prediction
    • The hERG Predictor is released in MarvinSketch, cxcalc and Chemical Terms v. 20.20.

    hERG v. 1.0-beta-2:

    • The hERG prediction is available in Playground
    • The applicability domain is implemented and available in Playground and Design Hub

    hERG v. 1.0-beta-3:

    • The ChemAxon hERG classification model is built and available for prediction in MarvinSketch, cxcalc, Chemical Terms and Playground.