TERMs (short for TERtiary Motifs) are compact modules that recur in protein structures and capture the effective degeneracy of secondary, tertiary, and quaternary structural environments in proteins. In Zheng et al. [1] we suggest that PDB-based TERM statistics represent fundamental relationships between sequence and structure. In Mackensie et al. [2], we show that the protein structural universe is highly degenerate ("quantized" so to speak) at the level of TERMs, and a surprisingly small number of such motifs describes the majority of known structure space.
[1] F. Zheng, J. Zhang, G. Grigoryan, "Tertiary Structural Propensities Reveal Fundamental Sequence/Structure Relationships", Structure, 23(5): 961-971, 2015.
[2] C. O. Mackenzie, J. Zhou, G. Grigoryan, "Tertiary Alphabet for the Observable Protein Structural Universe", Proceedings of the National Academy of Sciences, 113(47): E7438-E7447, 2016.
The following contains more information about specific TERM-related projects and results:
- TERMANAL — a structure analysis method based on TERMs
- Universal TERMs — click here to download the set of universal TERMs identified in [Mackenzie et al., PNAS, 113:2016], along with their structural matches and other information.
- ConFind — a tool for identifying contacting position pairs in proteins that we use in much of our TERM-based work.
- TERM-ddG — a method for computing stability changes in proteins upon mutations, published in [Zheng and Grigoryan, PLoS One, 12:2017].
- dTERMen — design with TERM-based energies; a general-purpose method for computational protein design based on the concept that sequence-structure mappings can be deduced (in a quantitative fashion) from structural matches to the TERMs comprising the target structure.