Information report for A4A49_14084
Gene Details
Functional Annotation
- Encodes a ubiquitin-specific protease that catalyzes deubiquitination of histone H2B and is required for heterochromatin silencing. Loss-of-function mutations display autonomous endosperm development and embryo arrest. Loss of function also results in increased expression of the PcG complex target gene PHE1.
Homologous Genes
- Arabidopsis thaliana — AT3G49600
Gene Resources
- Pfam: PF00443
- UniProt: A0A1J6IGV1
- EMBL: MJEQ01037187
- AlphaFoldDB: A0A1J6IGV1
- KEGG: nau:109225654
- OMA: GSCEEMP
- InterPro: IPR001394, IPR006615, IPR018200, IPR028889, IPR029071, IPR033841, IPR035927, IPR038765, IPR044743, IPR050164
- PANTHER: PTHR24006, PTHR24006:SF722
- SUPFAM: SSF143791, SSF54001, SSF54236
- PROSITE: PS00972, PS00973, PS50235, PS51283
- Gene3D: 3.30.2230.10, 3.90.70.10
- OrthoDB: A0A1J6IGV1
- CDD: cd01795, cd02668
- STRING: 49451.A0A1J6IGV1
Sequences
CDS Sequence
- >A4A49_14084
ATGGGGCATCATCGTCCGAATACTCGCAGTAAAAATAAAAGGAATAGACCGGATGATTCTGCTGAGGCTACTGCGGAAATTTTCAGAAACATTCTATCTACCAGCCAAGTGACGGAAGATGATGTTAATCAGCTATACATGATATCGAAACCTGCTTGCCAAGGATGTCGTGTCAACACCAAGGATAATCCTAATTGCTTTTGTGGATTGATTCCACCACCAAATGGTAGTCGCAAATCTGGGTTATGGCAGAAGACATCTGAAGTAGTCAATGCTCTTGGTCCTGATCCTTCAGATGATCGTCGTGCATCCCCAGAGACACCTGCTGGCTTGACAAATCTGGGTGCGACATGCTATGCTAACAGTATTCTGCAATGCCTGTACATGAATAAGTCATTCAGGAAGGGTGTATTCTCTATTGAACCAGATGTTTTGAAACAACAACCTGTGTTAGACCAACTAGCACGACTATTTGCAAAGTTACATTTGAGCAAAATGGCTTATGTTGATTCTGCTCCATTTATCCAAACTCTAGAGCTAGATAATGGTGTTCAGCAGGATAGTCATGAGTTTCTGACCTTGCTCTTCTCTCTGCTTGAGCGATGTTTGAGCCGGTCTAGCGTGTTGAAGGCCAGAACAATTGTCCAAGATCTTTTCCGCGGAGGTGTGTCACATGTGACTAAGTGCTCAAAATGTGGAAATGAATCTGAAGCTTCATCAAAAATTGAAGACTTCTATGAACTGGAGTTGAATGTCAAGGGTATGAAGAGTTTAGACGAGAGTCTGGACGACTATCTTAGTGTGGAGGAGCTTCAAGGAGATAATCAATATTATTGTGATTCATGTGCCACCCGAGTTGATGCTACCCGTAGCATTAAACTGCGCTCTCTGCCTGCAGTCCTAAATTTCCAGCTCAAGCGTTGCATTTTCCTTCCAAATACTACAACGAGGAAGAAAATCACATCTGCATTTTGTTTTCCTGAAGAATTGAATATGACACGGAGGATATCTGAGCATTTCCAATCAGAACTAATATATGACTTGTCAGCCATATTGATCCACAAAGGCTCTGCTGCAAATAGTGGCCACTATGTCGCGCACATTAAAAATGAGAATACACAGCAGTGGTGGGAATTTGATGATGAACATGTTTCGAATTTAGGCTGTCAGCCATTTGGTAAAGGTTCTTCACATTCTGCTGTCAAGCCTTCTCAAACTGAGCCACTTGACCACTCTTCTTCTGATGCAATTAATATCCTCGAGAATGGAAATGGGCCTGCTGCTGGTGGGCAGCAGCAAGCCTCAAATACTGATGTCACTGAGGTGAAGACCTTCTCGTCTTGTGATGCTTATATGCTGATGTACGTCCTTAGACATACAAAGAGTGGTGATAAAATGTCAATTGACTCTAGTGATGATAAGGCAAAAAAAGAGGCTTGTACATCTTCAGAAGCTGATAGTCATCTTCCATCCCACCTTTATGAGGAGGTAGAAACATTGAATGACTCATATATAGACTCATGTAACCGATACAAATCTAGGAAGGAGTCTGAACTGAACTGCATCACTGAGCGGAGGCAGGAGGTACGTTCAATCCTTTCTAAAGCTGCAGTCCAATCACCTGAAAAATCTTACTTTTGGATATCTATGGACTGGCTGCGTCAATGGGCGGACAACATCATGCCATCAATCATTGACAATACTTCCTTACAATGCACACATGGGAAAGTACCAGTTTCAAAGATTGGCTCTATGAAGCGGTTGTCTGATGAAGCTTGGACCATGTTATTCTCTAAGTACGGTGGAGGACCAATGCTGGCCAAAGATGATTACTGCATTGACTGCCTCTTTGAAGTGGCTCGGTCGATGGCCCGTGCAGATAACTACCGGGATCGAAGAACATTAATGAAAGAACTTGCAGAAGCAGCACTTGCAGGGGTTTGTCTAGATGGGAAGTTGTACTACATATCAAAGACATGGTTACAGCAGTGGCTCCGACG
GAAGAATGTAGACTCTCCTTGTGATGCTGATGCTGGACCAACAGCTTCAATAAGGTGTCCACATGGACAGCTGATGCCTGAACAGGCTGCTGGGGCTAGGCGTGTGCTAATACCCGAGAGTCTTTGGAATTTTATTCGTGAGATTGCTATGGCAGTAAAACCTGATGACGCTGTGGGTTGTTCAACTTTCATTTTAGACTCTGAGCCCTGTGCTCAATGCAACAGTCAACTCACCGAAGTTGCATGCTTAGAAGATACTCTAAGGGGGTTCAAGCTCAAGCAACGGCAAAGTCATGAGAGATTAGCAATGGGTAAAAGCATACCAATTCTTCCTGGTTCCAAATACTTCTTGTTACCATCTTCTTGGTTGTCTAAATGGAAAAGCTACTCTAATGCAAGTGGCAAAAGTGCTCCTTGTGCTGAACCTGAAACTTTGGATGCCGTCATTGATTTGCTGATGTGTGAAAAGCATTCAAGACTTCTTGAAAGGCCTCCTGATCTGGTCTGCAAACGTGGAAGTATTCTCCAAAAGTCACCTGCTACAGATGCATTGGCAATTATCACCGACAACGATTGGAAATTGTTTTGTGAAGATTGGGGTGGTACAGAGGCAAAAGGCATTACAGCTGAAATTGATTGTTTGGGGAATGATTTCCTTGGATCTAGTGAAGATATGGCAATTTCTGAGGAGCATATGAATTTGAGCGACGAATCGAATGCTGGGTCTGAGTCTAGAAAACCCATCATTAAGATTTCACCAGAGGTGTGTGAGGAGTGCATTGATGAAAGGAAAAGCTGTGAATTAATGAGGAAGCTCAATTATTCCGATGAGGATATATGTGTCTGTTTCATTCGTGGCAAGGAACCTCCAAAATCAGTTCTAGAAGCATCAGTGAACAGCTTGGAACCAAATCGACGGACTTCAAAGCGTTCCAGGAAAACAGCATTTGGAAACTCAGTAAACTTGAATGTGTCTGGGTCCACATCTGTCTACCAGCTGAAGATGATGATATGGGAAGCTTTTGGGATTGTCAAGGAAAATCAGGTACTTCACAAAGGTTCTCTGGTTATTGATAGTGAATCTGCTTGTCTTGCTGACATGAATATATTCCCCGGAGATGTATTATGGGTTACAGATTCGGAGATTCATGAGCATCGTGATATTGCAGATGAGCTTTCTGGCCAGAAAACGGAGGCACAAAACACTGAAGAAGGTTTTCGTGGAACACTTCTGTCCTCGAGTATCTCATCCCACTTTGTCTCTGAAGCATCTGCATGCCTAAATTAA
Protein Sequence
- >A4A49_14084
MGHHRPNTRSKNKRNRPDDSAEATAEIFRNILSTSQVTEDDVNQLYMISKPACQGCRVNTKDNPNCFCGLIPPPNGSRKSGLWQKTSEVVNALGPDPSDDRRASPETPAGLTNLGATCYANSILQCLYMNKSFRKGVFSIEPDVLKQQPVLDQLARLFAKLHLSKMAYVDSAPFIQTLELDNGVQQDSHEFLTLLFSLLERCLSRSSVLKARTIVQDLFRGGVSHVTKCSKCGNESEASSKIEDFYELELNVKGMKSLDESLDDYLSVEELQGDNQYYCDSCATRVDATRSIKLRSLPAVLNFQLKRCIFLPNTTTRKKITSAFCFPEELNMTRRISEHFQSELIYDLSAILIHKGSAANSGHYVAHIKNENTQQWWEFDDEHVSNLGCQPFGKGSSHSAVKPSQTEPLDHSSSDAINILENGNGPAAGGQQQASNTDVTEVKTFSSCDAYMLMYVLRHTKSGDKMSIDSSDDKAKKEACTSSEADSHLPSHLYEEVETLNDSYIDSCNRYKSRKESELNCITERRQEVRSILSKAAVQSPEKSYFWISMDWLRQWADNIMPSIIDNTSLQCTHGKVPVSKIGSMKRLSDEAWTMLFSKYGGGPMLAKDDYCIDCLFEVARSMARADNYRDRRTLMKELAEAALAGVCLDGKLYYISKTWLQQWLRRKNVDSPCDADAGPTASIRCPHGQLMPEQAAGARRVLIPESLWNFIREIAMAVKPDDAVGCSTFILDSEPCAQCNSQLTEVACLEDTLRGFKLKQRQSHERLAMGKSIPILPGSKYFLLPSSWLSKWKSYSNASGKSAPCAEPETLDAVIDLLMCEKHSRLLERPPDLVCKRGSILQKSPATDALAIITDNDWKLFCEDWGGTEAKGITAEIDCLGNDFLGSSEDMAISEEHMNLSDESNAGSESRKPIIKISPEVCEECIDERKSCELMRKLNYSDEDICVCFIRGKEPPKSVLEASVNSLEPNRRTSKRSRKTAFGNSVNLNVSGSTSVYQLKMMIWEAFGIVKENQVLHKGSLVIDSESACLADMNIFPGDVLWVTDSEIHEHRDIADELSGQKTEAQNTEEGFRGTLLSSSISSHFVSEASACLN
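Checking the Sequences entries
The protein sequence above should be the conceptual translation of the CDS. The following is a minimal sketch of how that relationship could be checked, assuming the standard nuclear genetic code applies to this gene. The names CODON_TABLE and translate are illustrative and are not part of this report or of any database tool. Because the CDS is wrapped across lines in the report, the function strips whitespace before reading codons.

```python
# Standard genetic code, built from the conventional compact layout
# (first, second and third base each cycling through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace (including line breaks inside the sequence) is removed first.
    Codons containing characters other than A/C/G/T are reported as 'X'.
    A trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGGGGCATCATCGTCCGAATACTCGCAGT"))
```

Pasting the full CDS into translate and comparing the result with the listed protein sequence is one way to confirm that the two entries agree. The final codon of the CDS, TAA, is a stop codon and does not appear in the protein.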