Posts by Category

Research

An Overview of Machine Learning Models for NCAA and PTM modeling

7 minute read

Published:

AlphaFold 2 [1] has largely solved the problem of predicting the 3D structure of proteins from their amino acid sequences, though there is still debate over the extent to which this problem is truly solved — a view I share. AlphaFold 3 [2] and a number of subsequent models [3,4,5] have extended this success to protein complexes and proteins with small-molecule binders. However, all of these are primarily designed for canonical amino acids (CAAs) and do not account for non-canonical amino acids (NCAAs). Given that NCAAs play an increasingly important role in drug discovery and protein engineering, it is crucial to review the current state of machine learning models for predicting the structure of proteins containing NCAAs and post-translational modifications (PTMs). In this post, I review the recent progress in this field, with a critical view on the extent to which these models offer fruitful solutions.