Practical Cheminformatics Index

7 minute read

Published:

Unified index of posts from Blogger and GitHub Pages.

Total posts: 87

Generated on 2026-04-11.

DateTitleSourceKeywords
2026-02-07AI in Drug Discovery - Please Stop Fishing in the Bathtub!GitHubAI, Drug Discovery, Benchmarking
2026-01-17Practical Cheminformatics with MarimoGitHubMarimo, Jupyter, Python, Visualization
2026-01-09Can Gemini Search the ChEMBL Database?GitHubGemini, LLM, ChEMBL, SQL
2025-11-08Performing Exploratory Data Analysis on the OpenADMET ExpansionRx Blind Challenge DatasetGitHubEDA, OpenADMET, ADMET, Data Analysis
2025-11-03We Still Haven’t Found What We’re Looking For - The Continuing Evolution of Protein-Ligand Co-Folding MethodsGitHubCofolding, Protein-Ligand, AlphaFold
2025-09-20Just Because You Published It Doesn’t Mean It’s RightGitHubReproducibility, Publication, Benchmarking
2025-09-15Time For a New AdventureGitHubCareer
2025-07-27Redoing the Boltz-1 Analysis of Orthosteric and Allosteric Ligand Cofolding with Boltz-2GitHubBoltz-1, Boltz-2, Cofolding, Allosteric
2025-07-21Three Papers Demonstrating That Cofolding Still Has a Ways to GoGitHubCofolding, Protein-Ligand
2025-05-18GNN’s can extrapolate for some properties, but there’s a trickGitHubGNN, Machine Learning, Extrapolation
2025-05-12Useful RDKit Utils - A Mötley Collection of Helpful RoutinesGitHubRDKit, Python, Utilities
2025-05-06The Trouble With TautomersGitHubTautomers, RDKit, Cheminformatics
2025-04-26Why Don’t Machine Learning Models Extrapolate?GitHubMachine Learning, Extrapolation
2025-04-26We’ve MovedBloggerGeneral
2025-03-07Even More Thoughts on ML Method ComparisonsBloggerMachine Learning, Benchmarking
2024-11-16Some Thoughts on Splitting Chemical DatasetsBloggerData Splitting, Machine Learning, Validation
2024-10-08Silly Things Large Language Models Do With MoleculesBloggerLLM, SMILES, Cheminformatics
2024-09-30Digging Deeper into Thompson Sampling - A Guest Blog Post by Patrick RileyBloggerThompson Sampling, Active Learning, Optimization
2024-05-22Generative Molecular Design Isn’t As Easy As People Make It LookBloggerGenerative Models, Molecular Design
2024-01-16AI in Drug Discovery - A Highly Opinionated Literature Review (Part III) BloggerAI, Drug Discovery, Literature Review
2024-01-08AI in Drug Discovery - A Highly Opinionated Literature Review (Part II) BloggerAI, Drug Discovery, Literature Review
2024-01-02 AI in Drug Discovery 2023 - A Highly Opinionated Literature Review (Part I)BloggerAI, Drug Discovery, Literature Review
2023-12-07Some Thoughts on Biotech vs Pharma for Computational ChemistsBloggerCareer, Biotech, Pharma
2023-11-27Comparing Classification Models - You’re Probably Doing It WrongBloggerMachine Learning, Classification, Metrics
2023-08-03We Need Better Benchmarks for Machine Learning in Drug DiscoveryBloggerBenchmarking, Machine Learning, Drug Discovery
2023-07-05A Simple Tool for Exploring Structural AlertsBloggerStructural Alerts, Toxicity, Screening
2023-06-12Getting Real with Molecular Property PredictionBloggerProperty Prediction, Machine Learning
2023-05-02Using Counterfactuals to Understand Machine Learning ModelsBloggerXAI, Counterfactuals, Machine Learning
2023-04-10Build a QSAR Model in 8 Lines of PythonBloggerQSAR, Python, Machine Learning
2023-04-03Getting Inside the Mind of the Medicinal Chemist with Machine LearningBloggerMedicinal Chemistry, Machine Learning, SAR
2023-03-12 Working With Drug Data from the ChEMBL DatabaseBloggerChEMBL, SQL, Data Mining
2023-02-01 Generative Molecular Design - We Need to Raise the BarBloggerGenerative Models, Molecular Design
2023-01-03AI in Drug Discovery 2022 - A Highly Opinionated Literature ReviewBloggerAI, Drug Discovery, Literature Review
2022-12-04Mining Ring Systems in Molecules for Fun and ProfitBloggerRing Systems, Data Mining, RDKit
2022-03-28Clustering Fragment Screening Hits With a Self-Organizing MapBloggerClustering, SOM, Fragment-based Design
2022-01-17The Solubility Forecast IndexBloggerSolubility, Physicochemical Properties
2022-01-03Useful RDKit UtilitiesBloggerRDKit, Python, Utilities
2021-11-30Picking the Highest Scoring Molecule(s) From Each ClusterBloggerClustering, Selection, Virtual Screening
2021-10-24Exploratory Data Analysis With mols2grid and Bemis-Murcko FrameworksBloggerEDA, mols2grid, Murcko Frameworks
2021-09-12Similarity Search and Some Cool Pandas Tricks BloggerSimilarity Search, Pandas, Python
2021-08-31Building a multiclass classification modelBloggerMachine Learning, Classification, Multiclass
2021-08-21Practical Cheminformatics - The DirectoryBloggerDirectory, Index
2021-07-27Viewing Clustered Chemical Structures in a Jupyter NotebookBloggerVisualization, Clustering, Jupyter
2021-07-07Automatic Analog Generation With Common R-group ReplacementsBloggerAnalog Generation, R-groups, SAR
2021-06-03Assessing Interpretable ModelsBloggerXAI, Interpretable ML
2021-03-30Fast Parallel Cheminformatics Workflows With DaskBloggerDask, Parallel Processing, Python
2021-01-18AI in Drug Discovery 2020 - A Highly Opinionated Literature ReviewBloggerAI, Drug Discovery, Literature Review
2020-11-17A Highly Opinionated List of Open Source Cheminformatics ResourcesBloggerResources, Open Source
2020-10-31What Do Molecules That Look LIke This Tend To Do? BloggerSAR, Similarity, Data Mining
2020-10-12A Collection of Things I Frequently Forget How To Do With Seaborn ScatterplotsBloggerVisualization, Seaborn, Python
2020-08-16Examining the Data From the ChEMBL SARS-CoV-2 Drug Repurposing ScreensBloggerChEMBL, SARS-CoV-2, Drug Repurposing
2020-06-22Wicked Fast Cheminformatics with NVIDIA RAPIDSBloggerRAPIDS, GPU, Parallel Processing
2020-05-24Using the Structure-Activity Landscape Index (SALI) to Analyze Data From the SARS-CoV-2 MPro ScreenBloggerSALI, SAR, SARS-CoV-2
2020-05-13Some Thoughts on Comparing Classification ModelsBloggerMachine Learning, Classification, Metrics
2020-05-04Exploring the SARS-CoV-2 Main Protease (MPro) StructuresBloggerSARS-CoV-2, MPro, Protein Structure
2020-04-27Positional Analogue ScanningBloggerSAR, Analogue Scanning, RDKit
2020-04-11Adding Chemical Structures to a Recent COVID-19 Drug Repurposing DatasetBloggerCOVID-19, Data Cleaning, SMILES
2020-03-30Building on the Fragments From the Diamond/XChem SARS-CoV-2 Main Protease (MPro) Fragment Screen (Part II) Structure-Base Evaluation of Expanded FragmentsBloggerFragment-based Design, MPro, Structure-based Design
2020-03-25Building on the Fragments From the Diamond/XChem SARS-CoV-2 Main Protease (MPro) Fragment Screen (Part I)BloggerFragment-based Design, MPro, SARS-CoV-2
2020-03-21Benchmarking “One Molecular Fingerprint to Rule Them All”BloggerFingerprints, Benchmarking
2020-02-09How (Not) to Get a Job in Science - Part 2 - The InterviewBloggerCareer, Interviewing
2020-01-21How to (Not) Get a Job in ScienceBloggerCareer, Job Search
2020-01-07Visualizing Decision TreesBloggerVisualization, Decision Trees, Machine Learning
2019-11-13Interactive Plots with Chemical StructuresBloggerVisualization, Interactive Plots, Bokeh
2019-11-01Visualizing Chemical SpaceBloggerChemical Space, Visualization, Dimensionality Reduction
2019-09-19Dissecting the Hype With CheminformaticsBloggerHype, AI, Cheminformatics
2019-07-28How Good Could (Should) My Models Be? BloggerMachine Learning, Performance, Error Bar
2019-06-02Using Reaction Transforms to Understand SARBloggerSAR, Reaction Transforms, RDKit
2019-05-03Where’s the code? BloggerOpen Source, GitHub
2019-04-22Clustering 2.1 Million Compounds for $5 With a Little Help From Amazon & FacebookBloggerClustering, AWS, Large Datasets
2019-03-31Multiple Comparisons, Non-Parametric Statistics, and Post-Hoc TestsBloggerStatistics, Multiple Comparisons
2019-03-03Plotting DistributionsBloggerVisualization, Statistics, Seaborn
2019-02-19Some Thoughts on Evaluating Predictive ModelsBloggerMachine Learning, Evaluation, Metrics
2019-01-17My Response to Peter Kenny’s Comments on “AI in Drug Discovery - A Practical View From the Trenches”BloggerAI, Drug Discovery, Debate
2019-01-11K-means ClusteringBloggerClustering, Machine Learning
2018-11-16AI in Drug Discovery - A Practical View From the TrenchesBloggerAI, Drug Discovery, Implementation
2018-10-30Self-Organizing Maps - 90s Fad or Useful Tool? (Part 1)BloggerSOM, Clustering, Machine Learning
2018-10-30Self-Organizing Maps - The Code (Part 2)BloggerSOM, Python, Implementation
2018-10-06My Science/Programming JourneyBloggerCareer, Personal
2018-09-29Assigning Bond Orders to PDB Ligands - The Easy WayBloggerPDB, Bond Orders, RDKit
2018-09-24Some Notes From the 2018 RDKit UGM BloggerRDKit, Conference
2018-09-17A Few Updates to Free-WilsonBloggerFree-Wilson, SAR
2018-09-05Predicting Aqueous Solubility - It’s Harder Than It LooksBloggerSolubility, Property Prediction
2018-08-20Scaffold Hopping? It’s ComplicatedBloggerScaffold Hopping, SAR
2018-08-08Filtering Chemical LibrariesBloggerLibrary Filtering, Drug-likeness
2018-06-08Cheating at Word Cookies with PythonBloggerPython, Fun
2018-05-30Free Wilson AnalysisBloggerFree-Wilson, SAR