article thumbnail

We Need Better Benchmarks for Machine Learning in Drug Discovery

Practical Cheminformatics

It is quite a bit more potent than the values one finds with screening hits, which typically have IC50s in the single to double-digit µM range. Clintox – This dataset consists of 1483 SMILES strings and two binary labels indicating whether a molecule is an FDA-approved drug and whether a toxicity outcome has been reported.

Drugs 93
article thumbnail

A PAINful itch

Molecular Design

The distinction between Type 1 and and Type 2 behaviors is an important and useful one to make from the perspective of drug discovery scientists who are making decisions as to which screening hits to take forward.