[Figure: H-bond between two peptide chains; atom labels H, O, N, R]
Protein Structures
Proteins consist of a long chain of amino acids, the primary structure
[Figure: amino acid chain labeled Gly, Leu, Ser, Pro]
Phase 1: Generate Starting Configurations
Phase 2: Global Optimization
Secondary Structure Predictions in Phase 1
Sequence: SKIGIDGFGRIGRLVLRAALSCGAQ
Type:     CBBBBBCCCAAAAAAACCCBBBBBC
Weight:   1135522356789992888566733
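The three aligned strings above can be split into contiguous predicted segments. The following is a minimal sketch, not the group's actual tooling. It assumes that `A`, `B`, and `C` denote helix, strand, and coil, and that each weight digit is a per-residue confidence; neither reading is stated on the slide. The function name `predicted_segments` is hypothetical. Whitespace inside the strings is stripped before alignment.

```python
from itertools import groupby

# Strings copied from the slide; whitespace is stripped before use.
sequence = "SKIGIDGFGRIGRLVLRAALSCGAQ"
types = "CBBBBBCCCAAAAAAACCCBBBBBC"
weights = "1135522356789992888566733"


def predicted_segments(sequence, types, weights):
    """Group residues into runs of identical predicted type.

    Returns a list of (label, start, end, residues, mean_weight) tuples,
    where start/end are 0-based, end-exclusive residue indices.
    """
    sequence = "".join(sequence.split())
    types = "".join(types.split())
    weights = "".join(weights.split())
    if not (len(sequence) == len(types) == len(weights)):
        raise ValueError("sequence, types and weights must have equal length")

    segments = []
    pos = 0
    for label, run in groupby(types):
        length = len(list(run))
        digits = [int(c) for c in weights[pos:pos + length]]
        mean_weight = sum(digits) / length
        segments.append((label, pos, pos + length,
                         sequence[pos:pos + length], mean_weight))
        pos += length
    return segments


segments = predicted_segments(sequence, types, weights)
# Assumption: "B" marks a predicted beta-strand.
strands = [seg for seg in segments if seg[0] == "B"]
for label, start, end, residues, mean_weight in segments:
    print(f"{label} {start:2d}-{end:2d} {residues:8s} mean weight {mean_weight:.1f}")
```

Under the assumed labeling, this fragment contains two `B` segments, which are the units that the strand-matching step would have to pair.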
Matching the predicted strands is a combinatorial problem
Which orientation?
[Figure: candidate strand pairings, labeled "parallel", "anti-parallel", "odd", "even"]
There are $n!\,2^{n-2}$ possible n-stranded motifs
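To give a feel for how quickly the number of n-stranded motifs grows, here is a short sketch that evaluates n! 2^(n-2) and cross-checks it by brute force. The brute-force accounting is an assumption, not something the slide spells out: it enumerates every strand ordering and every per-strand direction, then treats a configuration as identical to its mirror image (reversed order) and to its copy with all directions flipped. The function names are hypothetical.

```python
from itertools import permutations, product
from math import factorial


def motif_count(n):
    """Closed-form count from the slide: n! * 2^(n-2), for n >= 2."""
    if n < 2:
        raise ValueError("a sheet motif needs at least two strands")
    return factorial(n) * 2 ** (n - 2)


def brute_force_count(n):
    """Count distinct motifs by enumeration (assumed symmetry model).

    A configuration is a tuple of (strand, direction) pairs in sheet order.
    Reversing the order and flipping every direction are both treated as
    giving the same motif, so each motif has four equivalent encodings.
    """
    seen = set()
    for order in permutations(range(n)):
        for dirs in product((0, 1), repeat=n):
            config = tuple(zip(order, dirs))
            flipped = tuple((s, 1 - d) for s, d in config)
            variants = (config, config[::-1], flipped, flipped[::-1])
            seen.add(min(variants))  # canonical representative
    return len(seen)


for n in range(2, 6):
    print(n, motif_count(n), brute_force_count(n))
```

For two strands the count is 2 (parallel or anti-parallel), and for three strands it is already 12. Each extra strand multiplies the count by 2n, which is why exhaustive construction of starting configurations becomes impractical.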
It takes weeks to create some of these configurations using constrained local minimizations!
Direct Manipulation
[Figure labels: Initial Configurations, Final Configuration]
CASP4 Competition (before ProteinShop)
• Our group predicted 8 proteins
• Largest protein had 240 aa
• Most complex fold had 2 β-strands
[Figure: two-phase pipeline. Labels: Amino Acid Sequence; Phase 1; Initial Configurations; Phase 2: Global Optimization (Subspace Selection, Subspace Optimization, Candidate Selection); Final Configuration]
Takes months to converge using hundreds of processors on Seaborg!
Phase 2 with ProteinShop