C1orf41 Results

From MDWiki
Revision as of 23:18, 11 June 2009 by Hana Hamzah (talk | contribs)
Jump to navigationJump to search

Structure

Protein Structure

PDB ID: 1TVG Information from the PDB stated that x-ray diffraction was ueed to solve the structure of this protein with an R value of 0.215 and at 1.6Å. Two ligands were present in the crystal structure, a calcium (II) ion and a samarium (III) ion. This protein has 153 residues and its secondary structure consists of two helices and nine beta sheet strands.


Scop result

Scop.png


Figure 1: Representation of c1orf41 secondary structure as presented in PDBsum.


Tertiary structure

Figure 2: This protein is monomeric, consisting of a single domain. The fold of the protein is a jelly roll barrel where four pairs of anti-parallel beta strands are organised to form a barrel-like structure.


Figure 3: Surface representation of c1orf41. The Ca (II) ion is located within a pocket of the protein. Sm (III) interacts with Asp92 on the protein but its presence was probably due to the method used in solving the phase problem during structure solution using x-ray crystallogrophy.


Figure 4: Electrostatic charge distribution on the surface of c1orf41.




Structural analysis

1)Structural similarities

DALI was used to find proteins that are similar in structure to c1orf41. The results showed that sialidases, alpha-N-acetylglucosaminidases and galactose oxidases have similar structures to this protein.

Figure 5: DALI result showing the first 35 proteins that have similar structures to c1orf41. Even though the proteins are similar in structure they have very little sequence identity to c1orf41. 1xpw is c1orf41 but the structure was solved by NMR.


2)Domain Classification

Figure 6: Pfam result indicated that our protein has a F5/8 type C domain, also known as the discoidin domain which is apart of galactose binding domain super family.


3)Possible ligand binding sites

Observations and comparisons of several proteins from the DALI result with our protein showed that there is a similar position on each proteins where a metal ion is located.


Figure 7: This figure shows the position where calcium (II) ion (yellow) is located in within a loop of c1orf41 structure. This loop where metal ion is coordinated to was also observed in several proteins from the DALI result.


1gog Na.png
Figure 8:Shown at the top is the discoidin domain of galactose oxidase with a sodium ion (purple) located within a loop. The bottom figure is bacterial sialidase discoidin domain with a sodium ion (purple) and a beta-D-galactose molecule. The sodium ion is also located i within a loop.


Figure 9:CastP also identified a surface cleft that is located at the loop position.



1tvg ca res2.PNG

Figure 10: Pymol figure displaying possible metal binding position on c1orf41. Orange is the residues that may be involved in coordinating the metal and yellow is the calcium ion. The residues are based on CastP result.


Nest analysis by Profunc suggested two other ligand binding sites. Nests are structural motifs that commonly found in functionally important regions of protein structures.

Profunc nest.PNG

Figure 11: Nest 1 and has scores higher than 2. So, they are more likely to be functional. His54 and Lys55 NH atoms are accessible from a large surface cleft of the protein. The cleft is also deep indicating that the nest is functionally important. The residues of Nest 2 are part of the loop that coordinates the Ca (II)ion.


1tvg nests2.png

Figure 12: Pymol representation of Nest 1 and 2.


Surface nest3.PNG

Figure 13: Arrows indicating the pockets of the nests that may be accessible to ligands.



Evolution

Results from Blast showed that c1orf41 have high homology to small heat shock proteins which gave us a hint that the protein may be a heat shock protein (Figure 14). However, initial multiple sequence alignment showed many gaps and few conserved regions (Figure 15).

Figure 14: Homologues of c1orf41 from BLAST.


Figure 15: Initial multiple sequence alignment of the 27 proteins obtained from Blast


The sequences from platypus, urchin_156, urchin_159 and g.alga_fap were excluded from further investigation. This was due to poor alignment of platypus, urchin_156 and urchin_159 with the other organisms. In addition, the sequence for g.alga_fap was removed as it was a truncated sequence of g.alga_itp25.


One of the characteristic of a small heat shock protein is that they have a conserved alpha B crystallin domain which can also be observed in c1orf41 as shown in figure 16 below:


Figure 16: Conserved alpha B crystallin domain in c1orf41

The conserved residues found in c1orf41 and its homologues fall in the alpha B crystallin domain. But more conserved regions are observed in higher terrestrial eukaryotes. With this, we are more confident that c1orf41 is a heat shock protein.


The residues of the possible binding site were highlighted in black in figure 17 below. However, these residues were only conserved in terrestrial eukaryotes. During evolution, the protein has evolve and adopt different function in marine eukaryotes, amphibian and terrestrial eukaryotes. The evolutionary relationship of this protein is shown in figure 18.


Figure 17: Residues involved in the possible binding site of c1orf41
Figure 18: Unrooted phylogenetic tree of c1orf41 and its homologues. The tree showed that the protein evolved from marine eukaryotes (red) to amphibians (blue) and to terrestrial eukaryotes (green)

To ensure that the reliability of the evolutionary relationship of c1orf41 and its homologues, the tree was bootstrap with 100 replicates (Figure 19). Bootstrap value of more than 75 indicated that we are confident in the branch whereas bootstrap value of less than 50 indicated no confidence in the branch over an alternative


Figure 19: Phylogenetic tree with its respective bootstrap value


Back to main page