| 2026-02-07 | AI in Drug Discovery - Please Stop Fishing in the Bathtub! | GitHub | AI, Drug Discovery, Benchmarking |
| 2026-01-17 | Practical Cheminformatics with Marimo | GitHub | Marimo, Jupyter, Python, Visualization |
| 2026-01-09 | Can Gemini Search the ChEMBL Database? | GitHub | Gemini, LLM, ChEMBL, SQL |
| 2025-11-08 | Performing Exploratory Data Analysis on the OpenADMET ExpansionRx Blind Challenge Dataset | GitHub | EDA, OpenADMET, ADMET, Data Analysis |
| 2025-11-03 | We Still Haven’t Found What We’re Looking For - The Continuing Evolution of Protein-Ligand Co-Folding Methods | GitHub | Cofolding, Protein-Ligand, AlphaFold |
| 2025-09-20 | Just Because You Published It Doesn’t Mean It’s Right | GitHub | Reproducibility, Publication, Benchmarking |
| 2025-09-15 | Time For a New Adventure | GitHub | Career |
| 2025-07-27 | Redoing the Boltz-1 Analysis of Orthosteric and Allosteric Ligand Cofolding with Boltz-2 | GitHub | Boltz-1, Boltz-2, Cofolding, Allosteric |
| 2025-07-21 | Three Papers Demonstrating That Cofolding Still Has a Ways to Go | GitHub | Cofolding, Protein-Ligand |
| 2025-05-18 | GNN’s can extrapolate for some properties, but there’s a trick | GitHub | GNN, Machine Learning, Extrapolation |
| 2025-05-12 | Useful RDKit Utils - A Mötley Collection of Helpful Routines | GitHub | RDKit, Python, Utilities |
| 2025-05-06 | The Trouble With Tautomers | GitHub | Tautomers, RDKit, Cheminformatics |
| 2025-04-26 | Why Don’t Machine Learning Models Extrapolate? | GitHub | Machine Learning, Extrapolation |
| 2025-04-26 | We’ve Moved | Blogger | General |
| 2025-03-07 | Even More Thoughts on ML Method Comparisons | Blogger | Machine Learning, Benchmarking |
| 2024-11-16 | Some Thoughts on Splitting Chemical Datasets | Blogger | Data Splitting, Machine Learning, Validation |
| 2024-10-08 | Silly Things Large Language Models Do With Molecules | Blogger | LLM, SMILES, Cheminformatics |
| 2024-09-30 | Digging Deeper into Thompson Sampling - A Guest Blog Post by Patrick Riley | Blogger | Thompson Sampling, Active Learning, Optimization |
| 2024-05-22 | Generative Molecular Design Isn’t As Easy As People Make It Look | Blogger | Generative Models, Molecular Design |
| 2024-01-16 | AI in Drug Discovery - A Highly Opinionated Literature Review (Part III) | Blogger | AI, Drug Discovery, Literature Review |
| 2024-01-08 | AI in Drug Discovery - A Highly Opinionated Literature Review (Part II) | Blogger | AI, Drug Discovery, Literature Review |
| 2024-01-02 | AI in Drug Discovery 2023 - A Highly Opinionated Literature Review (Part I) | Blogger | AI, Drug Discovery, Literature Review |
| 2023-12-07 | Some Thoughts on Biotech vs Pharma for Computational Chemists | Blogger | Career, Biotech, Pharma |
| 2023-11-27 | Comparing Classification Models - You’re Probably Doing It Wrong | Blogger | Machine Learning, Classification, Metrics |
| 2023-08-03 | We Need Better Benchmarks for Machine Learning in Drug Discovery | Blogger | Benchmarking, Machine Learning, Drug Discovery |
| 2023-07-05 | A Simple Tool for Exploring Structural Alerts | Blogger | Structural Alerts, Toxicity, Screening |
| 2023-06-12 | Getting Real with Molecular Property Prediction | Blogger | Property Prediction, Machine Learning |
| 2023-05-02 | Using Counterfactuals to Understand Machine Learning Models | Blogger | XAI, Counterfactuals, Machine Learning |
| 2023-04-10 | Build a QSAR Model in 8 Lines of Python | Blogger | QSAR, Python, Machine Learning |
| 2023-04-03 | Getting Inside the Mind of the Medicinal Chemist with Machine Learning | Blogger | Medicinal Chemistry, Machine Learning, SAR |
| 2023-03-12 | Working With Drug Data from the ChEMBL Database | Blogger | ChEMBL, SQL, Data Mining |
| 2023-02-01 | Generative Molecular Design - We Need to Raise the Bar | Blogger | Generative Models, Molecular Design |
| 2023-01-03 | AI in Drug Discovery 2022 - A Highly Opinionated Literature Review | Blogger | AI, Drug Discovery, Literature Review |
| 2022-12-04 | Mining Ring Systems in Molecules for Fun and Profit | Blogger | Ring Systems, Data Mining, RDKit |
| 2022-03-28 | Clustering Fragment Screening Hits With a Self-Organizing Map | Blogger | Clustering, SOM, Fragment-based Design |
| 2022-01-17 | The Solubility Forecast Index | Blogger | Solubility, Physicochemical Properties |
| 2022-01-03 | Useful RDKit Utilities | Blogger | RDKit, Python, Utilities |
| 2021-11-30 | Picking the Highest Scoring Molecule(s) From Each Cluster | Blogger | Clustering, Selection, Virtual Screening |
| 2021-10-24 | Exploratory Data Analysis With mols2grid and Bemis-Murcko Frameworks | Blogger | EDA, mols2grid, Murcko Frameworks |
| 2021-09-12 | Similarity Search and Some Cool Pandas Tricks | Blogger | Similarity Search, Pandas, Python |
| 2021-08-31 | Building a multiclass classification model | Blogger | Machine Learning, Classification, Multiclass |
| 2021-08-21 | Practical Cheminformatics - The Directory | Blogger | Directory, Index |
| 2021-07-27 | Viewing Clustered Chemical Structures in a Jupyter Notebook | Blogger | Visualization, Clustering, Jupyter |
| 2021-07-07 | Automatic Analog Generation With Common R-group Replacements | Blogger | Analog Generation, R-groups, SAR |
| 2021-06-03 | Assessing Interpretable Models | Blogger | XAI, Interpretable ML |
| 2021-03-30 | Fast Parallel Cheminformatics Workflows With Dask | Blogger | Dask, Parallel Processing, Python |
| 2021-01-18 | AI in Drug Discovery 2020 - A Highly Opinionated Literature Review | Blogger | AI, Drug Discovery, Literature Review |
| 2020-11-17 | A Highly Opinionated List of Open Source Cheminformatics Resources | Blogger | Resources, Open Source |
| 2020-10-31 | What Do Molecules That Look LIke This Tend To Do? | Blogger | SAR, Similarity, Data Mining |
| 2020-10-12 | A Collection of Things I Frequently Forget How To Do With Seaborn Scatterplots | Blogger | Visualization, Seaborn, Python |
| 2020-08-16 | Examining the Data From the ChEMBL SARS-CoV-2 Drug Repurposing Screens | Blogger | ChEMBL, SARS-CoV-2, Drug Repurposing |
| 2020-06-22 | Wicked Fast Cheminformatics with NVIDIA RAPIDS | Blogger | RAPIDS, GPU, Parallel Processing |
| 2020-05-24 | Using the Structure-Activity Landscape Index (SALI) to Analyze Data From the SARS-CoV-2 MPro Screen | Blogger | SALI, SAR, SARS-CoV-2 |
| 2020-05-13 | Some Thoughts on Comparing Classification Models | Blogger | Machine Learning, Classification, Metrics |
| 2020-05-04 | Exploring the SARS-CoV-2 Main Protease (MPro) Structures | Blogger | SARS-CoV-2, MPro, Protein Structure |
| 2020-04-27 | Positional Analogue Scanning | Blogger | SAR, Analogue Scanning, RDKit |
| 2020-04-11 | Adding Chemical Structures to a Recent COVID-19 Drug Repurposing Dataset | Blogger | COVID-19, Data Cleaning, SMILES |
| 2020-03-30 | Building on the Fragments From the Diamond/XChem SARS-CoV-2 Main Protease (MPro) Fragment Screen (Part II) Structure-Base Evaluation of Expanded Fragments | Blogger | Fragment-based Design, MPro, Structure-based Design |
| 2020-03-25 | Building on the Fragments From the Diamond/XChem SARS-CoV-2 Main Protease (MPro) Fragment Screen (Part I) | Blogger | Fragment-based Design, MPro, SARS-CoV-2 |
| 2020-03-21 | Benchmarking “One Molecular Fingerprint to Rule Them All” | Blogger | Fingerprints, Benchmarking |
| 2020-02-09 | How (Not) to Get a Job in Science - Part 2 - The Interview | Blogger | Career, Interviewing |
| 2020-01-21 | How to (Not) Get a Job in Science | Blogger | Career, Job Search |
| 2020-01-07 | Visualizing Decision Trees | Blogger | Visualization, Decision Trees, Machine Learning |
| 2019-11-13 | Interactive Plots with Chemical Structures | Blogger | Visualization, Interactive Plots, Bokeh |
| 2019-11-01 | Visualizing Chemical Space | Blogger | Chemical Space, Visualization, Dimensionality Reduction |
| 2019-09-19 | Dissecting the Hype With Cheminformatics | Blogger | Hype, AI, Cheminformatics |
| 2019-07-28 | How Good Could (Should) My Models Be? | Blogger | Machine Learning, Performance, Error Bar |
| 2019-06-02 | Using Reaction Transforms to Understand SAR | Blogger | SAR, Reaction Transforms, RDKit |
| 2019-05-03 | Where’s the code? | Blogger | Open Source, GitHub |
| 2019-04-22 | Clustering 2.1 Million Compounds for $5 With a Little Help From Amazon & Facebook | Blogger | Clustering, AWS, Large Datasets |
| 2019-03-31 | Multiple Comparisons, Non-Parametric Statistics, and Post-Hoc Tests | Blogger | Statistics, Multiple Comparisons |
| 2019-03-03 | Plotting Distributions | Blogger | Visualization, Statistics, Seaborn |
| 2019-02-19 | Some Thoughts on Evaluating Predictive Models | Blogger | Machine Learning, Evaluation, Metrics |
| 2019-01-17 | My Response to Peter Kenny’s Comments on “AI in Drug Discovery - A Practical View From the Trenches” | Blogger | AI, Drug Discovery, Debate |
| 2019-01-11 | K-means Clustering | Blogger | Clustering, Machine Learning |
| 2018-11-16 | AI in Drug Discovery - A Practical View From the Trenches | Blogger | AI, Drug Discovery, Implementation |
| 2018-10-30 | Self-Organizing Maps - 90s Fad or Useful Tool? (Part 1) | Blogger | SOM, Clustering, Machine Learning |
| 2018-10-30 | Self-Organizing Maps - The Code (Part 2) | Blogger | SOM, Python, Implementation |
| 2018-10-06 | My Science/Programming Journey | Blogger | Career, Personal |
| 2018-09-29 | Assigning Bond Orders to PDB Ligands - The Easy Way | Blogger | PDB, Bond Orders, RDKit |
| 2018-09-24 | Some Notes From the 2018 RDKit UGM | Blogger | RDKit, Conference |
| 2018-09-17 | A Few Updates to Free-Wilson | Blogger | Free-Wilson, SAR |
| 2018-09-05 | Predicting Aqueous Solubility - It’s Harder Than It Looks | Blogger | Solubility, Property Prediction |
| 2018-08-20 | Scaffold Hopping? It’s Complicated | Blogger | Scaffold Hopping, SAR |
| 2018-08-08 | Filtering Chemical Libraries | Blogger | Library Filtering, Drug-likeness |
| 2018-06-08 | Cheating at Word Cookies with Python | Blogger | Python, Fun |
| 2018-05-30 | Free Wilson Analysis | Blogger | Free-Wilson, SAR |