Journal of Physical Chemistry B, Vol.120, No.6, 1060-1068, 2016
Characterization of the Free State Ensemble of the CoRNR Box Motif by Molecular Dynamics Simulations
Intrinsically disordered proteins (IDPs) and regions are highly prevalent in eukaryotic proteomes, and like folded proteins, they perform essential biological functions. Interaction sites in folded proteins are generally formed by tertiary structures, whereas IDPs use short segments called linear motifs (LMs). Despite their short length and lack of stable structure, LMs may have considerable structural propensities, which often resemble bound-state conformations with targets. Structural data is crucial for understanding the molecular basis of protein interactions and development of targeted pharmaceuticals, but IDPs present considerable challenges to experimental techniques. As a result, IDPs are largely under-represented in the Protein Data Bank. In the face of experimental challenges, molecular dynamics (MD) simulations have proven to be a useful tool for structural characterization of IDPs. Here, the free state ensemble of the nuclear receptor corepressor 1 (NCOR1) CoRNR box 3 motif, which is important for binding to nuclear receptors to control gene expression, is studied using MD simulations of a total of 8 mu s. Transitions between disordered and alpha-helical conformations resembling a bound-state structure were observed throughout the trajectory, indicating that the motif may have a natural conformational bias toward bound-state structures. The data shows that the disordered and folded populations are separated by a low energy (4-6 kJ/mol) barrier, and the presence of off-pathway intermediates, leading to a C-terminally folded species that cannot efficiently transition into a completely folded conformation. Structural transitions and folding pathways within the free state ensemble were well-described by principal component analysis (PCA) of the peptide backbone dihedral angles, with the analysis providing insight for increasing structural homogeneity of the ensemble.