Talk:Protein Function
Information From 8th May 2007
MIF4G is the middle domain of eukaryotic initiation factor 4G (eIF4G). It also occurs in NMD2p (non-sense mediated mRNA decay protein - it is involved in the non-sense mediated decay of mRNAs containing premature stop codons) and in CBP80 (Cap binding protein).
The protein binds eIF4A, eIF3, RNA and DNA Therefore part of function is to bind to RNA
Possibly located in the cytoplasm - See link to LOCATE. Mouse protein of similar seuqence in this location.
MIF4G starts residue 28 Ends 240 (mouse)
It is soluble and non-secreted.
PA74324.2 Riken cDNA 2310075612 Rik Protein - AAH26740, AAH55812(mouse), AAH33759(human)
AAH55812 - Rik Protein Mouse. Present in the cerebellum, Striatum, Eye, Wholebrain, Liver, Hippocampus, Hematopoietic Stem Cells and Kidney Accession No: BC055812.1
Performed a MultiLoc prediction that determines location of the protein based on Amino Acid sequence and the presence etc of a N-termial targeting sequence. There is a 0.93 Probability that the protein is cytoplasmic. Now I have to find specific location, what the protein binds to and the structure of what it binds to. If i can identify the structure of the binding domain then I can predict to some extent the structure or a very small peice of the structure ie active site and can use this to perform function based analysis?
Obtained Sequences
Human - Protein Sequ
mgepsreeyk iqsfdaetqq llktalkvac fetedgeysv cqrsysncsr lmpsrcntqy
rdpgavdlek vanvivdhsl qdcvfskeag rmcyaiiqae skqagqsvfr rgllnrlqqe
yqareqlrar slqgwvcyvt ficnifdylr vnnmpmmalv npvydclfrl aqpdslskee
evdclvlqlh rvgeqlekmn gqrmdelfvl irdgfllptg lsslaqllll eiiefraagw
kttpaahkyy ysevsd
>AAH26740 ARM repeat, position: 13-208 (Mouse)
SFDAQTQQLLKTALKDPGAVDLERVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQKEYDAREQ
LRACSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFQLAQPESLSREEEVDCLVLQLHRVGEQLEKMNGQRMDE
LFILIRDGFLLPTDLSSLARLLLLEMIEFRAAGWK
Mouse - Protein
mseasrddyk iqsfdaetqq llktalkdps avdlervanv ivdhslqdcv fskeagrmcy
aiiqaeskqa gqsvfrrgll nrlqkeydar eqlracslqg wvcyvtficn ifdylrvnnm
pmmalvnpvy dclfqlaqpe slsreeevdc lvlqlhrvge qlekmngqrm delfilirdg
fllptdlssl arllllemie fraagwkttp aahkyyysev sd
FASTA - Human
>gi|21707112|gb|AAH33759.1| MIF4G domain containing [Homo sapiens]
MGEPSREEYKIQSFDAETQQLLKTALKVACFETEDGEYSVCQRSYSNCSRLMPSRCNTQYRDPGAVDLEK
VANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQEYQAREQLRARSLQGWVCYVT
FICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVL
IRDGFLLPTGLSSLAQLLLLEIIEFRAAGWKTTPAAHKYYYSEVSD