QSARs for Plasma Protein Binding: Source Data and Predictions

Description

The dataset has all of the information used to create and evaluate 3 independent QSAR models for the fraction of a chemical unbound by plasma protein (Fub) for environmentally relevant chemicals. In vitro plasma protein values for 1245 pharmaceuticals and 406 ToxCast chemicals were collected from the literature (Obach 2008, Zhu 2013, Wetmore 2012, Wetmore 2015). The 21 descriptors calculated by MOE that were used in the models are included, as is an acid/base/neutral/zwitterions classification based on ionization percentages calculated in ADMET Predictor. Finally, the dataset includes the in silico Fub predictions for each chemical from the constructed k-nearest neighbor, support vector machine, and random forest QSAR models, as well as a consensus (average) prediction. This dataset is associated with the following publication: Ingle, B., R. Tornero-Velez, J. Nichols, and B. Veber. Informing the Human Plasma Protein Binding of Environmental Chemicals by Machine Learning in the Pharmaceutical Space: Applicability Domain and Limits of Predictability. Journal of Chemical Information and Modeling. American Chemical Society, Washington, DC, USA, 56(11): 2243-2252, (2016).

Resources

Name Format Description Link
53 PPB_JChemInfMod_Supp_AllData.xlsx https://pasteur.epa.gov/uploads/569/PPB_JChemInfMod_Supp_AllData.xlsx

Tags

  • plasma-protein-binding
  • quantitative-structure-activity-relationship-qsar
  • domain-of-applicability
  • environmental-toxicology
  • machine-learning

Topics

Categories