Products  Physico-Chemical Laboratory 


 
Request more information
   

Overview

Product Details:
   

Key Capabilities

Input/Output

List of Features

Technical Info

Training Tools

Ref. Database

Integration

What's New

Resources:
   

Application Notes

Technical Notes

Demo Movies

Catalog

Evaluations:
   

Prediction Accuracy

Related Products:
   

Compare ACD/Labs pKa Products

ACD/LogD Sol Suite

ACD/PhysChem Batch (pKa Module)


 

ACD/pKa Prediction Accuracy vs. Experiment

A true and fair measure of the accuracy of predictive software is its ability to accurately predict diverse compound datasets. We recently took the opportunity to search the current literature for experimental pKa data. This data was collated and used to test the prediction accuracy of the ACD/pKa algorithm.

Experimental data was gathered for 277 compounds, providing 285 pKa measurements for acidic and basic ionizing groups. Of these, 284 corresponding pKa values were predicted by the ACD/pKa algorithm. For compounds with differing published experimental values (within 0.5 pKa) an average was used. For compounds where the difference in experimental pKa showed a discrepancy of more than this, further study of the data was undertaken to decide on the ionization taking place and select the appropriate pKa value. Approximated apparent pKa predictions were used in this study since these represent best the true behavior of a compound in aqueous solution, and would therefore be most likely to reflect the findings of experimentally determined pKa.

  
Figure 1: A scatter plot of experimental pKa data vs. predictions made using ACD/pKa.    Figure 2: A histogram illustrating the error range in predictions.
N =284 R2 = 0.95    Av. Error = 0.39 Std. Error = 0.76

Results of this study show that 80% of predictions (228 individual pKa values) for this diverse set of compounds were within an error of 0.5 pKa units; while 91% (260 of 285 pKa values) were predicted with an error of less than 1 pKa. No specific ionization centre or functional group was found to be predicted consistently poorly, although four of the compounds with the poorest prediction accuracy (errors of greater than 2.5 pKa units compared to the experimental value) had ionization events at extremely acidic pH (below zero). Click here to see the table of data.

Within this dataset, we can look at a particular series of compounds to examine how well the algorithm performed for that chemical space. If we choose the benzimidazoles, of which there were a total of 13 in the full dataset, we see at a quick glance that ~77% (10 out of 13) were predicted within 0.75 pKa; 3 are predicted poorly with an error greater than 2 pKa. Closer examination of this class in the desktop pKa module showed that prediction of single pKa values for this class were far more accurate, with an average error of 0.18 pKa (compared to 0.68 pKa in approximated apparent values) (see Table below).

Table 1. Experimental and predicted pKa for Benzimidazole class

ACD/pKa Predicted Values
Compound Experimental pKa Apparent pKa Single pKa
Benzimidazole 5.61 5.67 5.67
1-methylbenzimidazole 5.57 5.72 5.72
1-ethylbenzimidazole 5.62 5.80 5.80
2-methylbenzimidazole 6.10 6.29 6.29
2-ethylbenzimidazole 6.2 6.17 6.17
2-ispropylbenzimidazole 6.23 6.13 6.13
2-aminobenzimidazole 7.18 7.55 7.55
4-methylbenzimidazole 5.67 8.14 5.70
5-methylbenzimidazole 5.81 7.81 5.79
5,6-dimethylbenzimidazole 5.89 6.09 6.09
5-nitrobenzimidazole 4.17 4.40 4.40
5-aminobenzimidazole 6.11 8.20 6.11
2-chlorobenzimidazole 4.68 3.94 3.94

It is therefore worth noting that the different types of pKa predicted by the ACD/pKa algorithm should be investigated especially when looking at a particular chemical space. Single pKa values are useful in tracking protonation/deprotonation for a single ionization centre in a series of compounds to determine relative changes.

While this comparison takes into account only limited series of compounds, and is therefore not a comprehensive study of the prediction accuracy of the ACD/pKa algorithm, it reflects the high standards our customers have come to expect from ACD/Labs PhysChem software. To study the accuracy of prediction for your chemical domain, we invite you to evaluate the software.

References

  • L. Xing and R.C. Glen, J. Chem. Inf. Comput. Sci., 42: 796–805, 2002
  • U.A. Chaudry and P.L.A. Popelier, J. Org. Chem., 69: 233–41, 2004
  • F. Eckert and A. Klamt, J. Comput. Chem., 27: 11–19, 2006
  • T. N. Brown and N. Mora-Diez, J. Phys. Chem. B, 110: 9270–79, 2006
  • H. Lu, X. Chen, and C.G. Zhan, J. Phys. Chem. B, 111: 10599–605, 2007
  • M. Meloun and S. Bordovská, Anal. Bioanal. Chem., 389: 1267–81, 2007
TOP

This page was last updated 14 May 2008
 

  Products | Solutions | Support
Online Services | Resources
About Us | Downloads | Events
Site Map | Contact Us
 

 
Copyright © 1996 - 2008 Advanced Chemistry Development     All rights reserved