We Need Better Benchmarks for Machine Learning in Drug Discovery
Practical Cheminformatics
AUGUST 3, 2023
Of the 404 molecules labeled as CA, 68 are azo dyes widely known to be cytotoxic and generate assay interference. I wrote more about the problems with this dataset in a Practical Cheminformatics post in 2018. However, the many complications associated with cell assays would not make these my first choice for benchmarking.
Let's personalize your content