TY - JOUR
T1 - AlphaFold-predicted protein structures and small-angle X-ray scattering
T2 - insights from an extended examination of selected data in the Small-Angle Scattering Biological Data Bank
AU - Brookes, Emre
AU - Rocco, Mattia
AU - Vachette, Patrice
AU - Trewhella, Jill
AU - Meneau, F.
N1 - © Emre Brookes et al. 2023.
PY - 2023/8/1
Y1 - 2023/8/1
N2 - By providing predicted protein structures from nearly all known protein sequences, the artificial intelligence program AlphaFold (AF) is having a major impact on structural biology. While a stunning accuracy has been achieved for many folding units, predicted unstructured regions and the arrangement of potentially flexible linkers connecting structured domains present challenges. Focusing on single-chain structures without prosthetic groups, an earlier comparison of features derived from small-angle X-ray scattering (SAXS) data taken from the Small-Angle Scattering Biological Data Bank (SASBDB) is extended to those calculated using the corresponding AF-predicted structures. Selected SASBDB entries were carefully examined to ensure that they represented data from monodisperse protein solutions and had sufficient statistical precision and q resolution for reliable structural evaluation. Three examples were identified where there is clear evidence that the single AF-predicted structure cannot account for the experimental SAXS data. Instead, excellent agreement is found with ensemble models generated by allowing for flexible linkers between high-confidence predicted structured domains. A pool of representative structures was generated using a Monte Carlo method that adjusts backbone dihedral allowed angles along potentially flexible regions. A fast ensemble modelling method was employed that optimizes the fit of pair distance distribution functions [P(r) versus r] and intensity profiles [I(q) versus q] computed from the pool to their experimental counterparts. These results highlight the complementarity between AF prediction, solution SAXS and molecular dynamics/conformational sampling for structural modelling of proteins having both structured and flexible regions.
AB - By providing predicted protein structures from nearly all known protein sequences, the artificial intelligence program AlphaFold (AF) is having a major impact on structural biology. While a stunning accuracy has been achieved for many folding units, predicted unstructured regions and the arrangement of potentially flexible linkers connecting structured domains present challenges. Focusing on single-chain structures without prosthetic groups, an earlier comparison of features derived from small-angle X-ray scattering (SAXS) data taken from the Small-Angle Scattering Biological Data Bank (SASBDB) is extended to those calculated using the corresponding AF-predicted structures. Selected SASBDB entries were carefully examined to ensure that they represented data from monodisperse protein solutions and had sufficient statistical precision and q resolution for reliable structural evaluation. Three examples were identified where there is clear evidence that the single AF-predicted structure cannot account for the experimental SAXS data. Instead, excellent agreement is found with ensemble models generated by allowing for flexible linkers between high-confidence predicted structured domains. A pool of representative structures was generated using a Monte Carlo method that adjusts backbone dihedral allowed angles along potentially flexible regions. A fast ensemble modelling method was employed that optimizes the fit of pair distance distribution functions [P(r) versus r] and intensity profiles [I(q) versus q] computed from the pool to their experimental counterparts. These results highlight the complementarity between AF prediction, solution SAXS and molecular dynamics/conformational sampling for structural modelling of proteins having both structured and flexible regions.
KW - AlphaFold
KW - SAXS
KW - ensemble modelling
KW - small-angle X-ray scattering
KW - structural flexibility
UR - http://www.scopus.com/inward/record.url?scp=85168081854&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/7200957f-6d31-305b-9f27-1828bb711482/
U2 - 10.1107/S1600576723005344
DO - 10.1107/S1600576723005344
M3 - Article
C2 - 37555230
AN - SCOPUS:85168081854
SN - 0021-8898
VL - 56
SP - 910
EP - 926
JO - Journal of Applied Crystallography
JF - Journal of Applied Crystallography
IS - Pt 4
ER -