Abstract
Electron cryo-microscopy is a fast advancing biophysical technique to derive three-dimensional structures of large protein complexes. Using this technique, many density maps have been generated at intermediate resolution such as 610 Å resolution. Although it is challenging to derive the backbone of the protein directly from such density maps, secondary structure elements such as helices and β-sheets can be computationally detected. Our work in this paper provides an approach to enumerate the top-ranked possible topologies instead of enumerating the entire population of the topologies. This approach is particularly practical for large proteins. We developed a directed weighted graph, the topology graph, to represent the secondary structure assignment problem. We prove that the problem of finding the valid topology with the minimum cost is NP hard. We developed an O(N2 2N) dynamic programming algorithm to identify the topology with the minimum cost. The test of 15 proteins suggests that our dynamic programming approach is feasible to work with proteins of much larger size than we could before. The largest protein in the test contains 18 helical sticks detected from the density map out of 33 helices in the protein.
Original language | English (US) |
---|---|
Pages (from-to) | 415-430 |
Number of pages | 16 |
Journal | Journal of Bioinformatics and Computational Biology |
Volume | 9 |
Issue number | 3 |
DOIs | |
State | Published - Jun 2011 |
Externally published | Yes |
Keywords
- Constraint graph
- electron cryo-microscopy
- secondary structure
- shortest path
- topology
ASJC Scopus subject areas
- Biochemistry
- Molecular Biology
- Computer Science Applications