Information report for 64602
Gene Details
Functional Annotation
- Refseq: XP_002506434.1 — predicted protein
- TrEMBL: C1EIJ8 — C1EIJ8_MICCC; Uncharacterized protein
- STRING: XP_002506434.1 — (Micromonas sp. RCC299)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature and News
Gene Resources
Homologs
- Hordeum vulgare: MLOC_81003.6, MLOC_81003.5, MLOC_81003.2, MLOC_81003.4, MLOC_81003.3, MLOC_81003.1
- Musa acuminata: GSMUA_Achr1P15640_001
- Panicum virgatum: Pavir.3KG530600.2.p, Pavir.3KG530600.1.p
- Triticum aestivum: Traes_6AS_F1CEB89EE.2, Traes_6DS_273430303.2
Sequences
CDS Sequence:
- >64602|Micromonas_sp._RCC299|bZIP|64602
ATGAAGTTTGGGTTCGATGCGTCGTCTCGGGATGTGCACCGGGTCGTTCAGAGTTCGGATCGCGCCCCGAACGAGCGCGGAGAGCACTCGGGTTTCCGCACCGACGTACCCATGACGAAAGCGTCGGTCGCTGTCGCGGCGCCCGCGAAGCCCACCGGCGCGGCGAAAGCGCCAAAGGCGAAAGCCGACGGCACGAAGCAGGACCCTCAAGCTGGAACGTTATCGAGGGAAGATATCGCCAACTTCAAGTTGGAGGATGACGGCGGCAGCAGCGACGCTCACAGCGACGACGGCAACGTCGAGGAGAACGGAGACCTGGACGAGACCAAACGAGTCCGGCGCATGCTGTCCAACCGCGAGTCCGCGCGTCGTTCTCGGCGTCGCAAGCAGGCTCACCTGGGCGAGCTGCAGCTGAAGGTTAACCAGCTCCAGAGCGAGAATCAGGACCTGCTCAACAAGCTGCACCAACTCCACGTCTCGTTCAACTCGATGATGGCGCGCAACAGGATGATCAAGGACAACATGGCGTACCTTCGCATGCAGATCATGAGCGGCGAGCCGTTGTCGCGGGAGGCGCTCGTCGCCGCGACGCAGGCCGTCGCCGCCGGGGCGCAGACGAACTTGCAGGATCTCAACCATCTCGTCGTCGGCGGACCCGGAGCTGCTGGGATGAACACGATGCTGATGAGCTCCGGCGGTCAGATGGGCGTTCCTTACGTCGGCGGGATGGGCGGGATGGTGCAGGGCGGCGCGGGGGGGATGAATGGGCTCGTCGCCCCGCAGGGTGTGGGCTCGATGGCCGGAATGGGCTCGATGGGCTCGATGGGCTCGATGGGTTCGATGAGGGCCTACAACCTCCAGATGTCGCAGGCGATGAACCGCGGCGTGAACGGCGTCAATACCGGAATGAGCGGGATGTTCGGCGGCGCCATGGGCGGCACCGCGCAAAACATGGCTGCGCAAAACGGCCTCGGCAACTTCAAGCTCGAAGCCGGCGGCGGCGTGAGCGAGTTTAGCAACGCGTCGTCGCTCACCCAGCAGGGATCCATGGGCGCCGGCGACGGCCCGTCCATCCGCGGCGTCAAAGGCCCGAGCTGCATCTCCGACGCCGACCCCGACGCGTTCATCGACGGCGGCGGCGGCCCAATCGCCGAGCGCGAAGCCGCGCTCGCCGCCGCCGTCGCCGCGCAACTCCGGATGGGTACCCCGATTGGATCTGGCGAAGTTCTCCACCAGCTCGCCGACCAGGAGGCTGCCCTCGCCGCCCAGCTGAAGACGTTCGCCGCGGGCGCCGCCGGCGACGGGGCGATGCGGGCGGGCGCGATGCAGGCGGGCGCGATGCAGGGCGCCGGCGGCGTCGTCGGCGGTCGGAAGATTTCCGGCGACGAGGATTTCCGACTTCCCGACGGCGCGGCGGCGGCGACGGCGGCGGCGGCGCGCGCGGCTGCTCAGGGCCGAAACGGGCGCCAGGCGTCTGACGGATCGTGCGCGGACGCGCGTCGCAAGCAGCGCGCGGGGAGCGGGGATTCGAGCGACGCGCGCGCCGGCGGCGGGAGCGGCAGCGGGAGCGGCAACAACAGCGGCGACGGTGGAGATTCGGAGTTGGCCGGAAAGGGTCAACAGCAGCAGTCGCGCGGCGCGCTGAGCCCCCTGGACCACGACGCGTCCCCCCTCGCAGCGCCCATCGCCGAGAAGCCGCCGCACATCGTGGACCAGGTTGGCGTGGACGCGGCGGACTCGACGGTGCTAGGCTTCCTATCCTCGCCCATCCTGGAGGGTGAGGTTGTTGACGGGTGGCGCCACGGCGGCGTCGATTTCGACGACTGGGCCTCCACCGTGATTGACGGCGGCGGCGGCATGGGCACCGAAACCAAGGAGCCGGGGACGCTCGGGAAGAACGCGCGCACCGCGAGCATGAACAGGGTGGCGAGTTTGGAGCACATGGCGAAGAGGGCGCAAGCCGCCGGCGGTCCGGTGTCGTGA ATGCAAGCCATGAACCAGTACGATCCGACGTCGTCTCACGACGATTTCCTCGACCAGATTCTCTCTTCCGTTCCTTCTTCTTCACCTTGTTGGCCAGACCTTTCCAAATCCTGGGACCCACACCACCACCACCTCTCTCCGCCGCTGCCGCCTAACCCTAGCTCCGGCGATGACCACCAGCCTCCTCCTTCTAATCCTCTTCAGCATTTCCAATATGACGACCAGTCTTCTTCTTTCCTTGCTGCCAAGCTCCGCCAGCACCAGATCACTGGTGGCGGTGGCGGTGGTGGCGGCACTGTAGCAGCTGCTAAAGCACTTTTGCTCCAGCAACAACTTTTACTTTCCAGAACACTCGCCGGAAACGGCCTTAGGTCTCCGACTGGAGCCTCCGGTGACAACGGCTTCCTTAACATGCTCGGTAATGGTGACCAAAACGACGGCGTCGGAAATCCGGCTAATGACAGTTCCGTTCAAGCTCTTTTCAACGGATTCACTGGATCTCTTGGTCAAAACTCAAGTCAACCTCAACATTTTCATCATCCTCAGGGAGGAACGATGCAGGCGCAGAGTTTCGGAGCTACGGCAACGGTGCCGGCGATGAATCAAACTCCGGCAGCAAGTGGTTCAGCTGTTGGAGGTACAACGCCGGCGGCACAGCCAAAGCAACAGCGAGTGAGAGCTCGTAGAGGACAAGCAACTGATCCTCACAGTATTGCTGAACGATTACGTAGAGAGAGAATTGCAGAGAGAATGAAGGCTTTGCAGGAGCTGGTACCCAATGCCAATAAGACGGACAAGGCTTCAATGCTGGATGAGATCATCGACTATGTCAAATTCCTACAGCTCCAAGTCAAAGTTCTGAGTATGAGTAGATTGGGTGGTGCTGCAGCTGTTGCTCCCCTAGTTGCTGATATGTCCTCTGAGGGAAGAGGAGAAGGAAATGGAGGAAGGGGAGGAAACGGAACGGCGTCGTCCTCAAACAATGACAGTATGACGGTAACGGAGCACCAGGTGGCTAAACTAATGGAGGAGGATATGGGTTCAGCAATGCAATATCTGCAAGGGAAAGGCTTATGCCTAATGCCAATTTCCTTAGCTACAGCTATTTCAACTGCCACGTGTCACTCCAGGAACCCCATGATCCCTAACGGTAACGGTGGCAACAACCCACTACTCGGCGGAGGAGGAGGCCCCGCCACCAACGGCGGTGGTGGCGAGGCTGGTGGACCATCCTCTCCTAACTTGTCGGTTTTGACTGTCCAGTCAGCCACGATTGGTAACGGCGGAATTGATCCCTCCGTTAAAGACGCTACTTCTGTTTCTGAAGCTTAA ATGGAGAAAAGGAAGCTACAGAAAAGGGTCTTTATTGGTGAAGATGATGTATCCTCTCTCTTACAAAGGTATACAACAATGACTGTGTTGACGCTGTTGCAAGAAGTGTCACAAGTTCAAGATGTGAAAATTGACTGGAATGAATTGGTGAAGAAGACCACAACTGGGATTACAAATGCCAGAGAGTACCAGATGCTGTGGCGCCATTTGGCTTATCACCATGGTTTGCTTGACAAATTTGATGATGATTCTGCTCAACCTCTGGATGATGATAGTGATCTAGAATATGAATTGGAAGCTTTTCCTCCCGTAAGCAGTGAGGCTTCAGCAGAGGCTACAGCATATGTGAAGGTATTAATTGCTTCTGGGGTTCCATGTGATTCACATATGTCAAATGGAAATACTATTGAAGCTCCATTGACTATAAGCATACCCAACGGGCAAACATCTGGAACCGCTACAGAAAATTCACTCCATGGTATTTCAGCTTATGGGATGAAAATTACGGTCCCGGTTTCTGTGGAAAAACAACCACTTCCATCTGTCACGGCTGCTGAGGTTGTGGACACCAATGGACCAGCTAATTCTAACTTTCCTCCTCGGCGAAGACGAAAGCCTTGGTCTGCAGCAGAGGATATGGAACTCATTGCTGCAGTACAAAAGTGTGGTGAAGGAAATTGGGCAAATATATTGAAAGGGGACTTTAAGGGAGACAGAACAGCGTCACAACTCTCTCAGAGATGGGCAATTATTAGAAAGCGACATGGGAACATGGCGGGGAATGGCTCACAGCTGTCGGAGGCTCAGCTTGCAGCTCGTCATGCAGTGTCCCTGGCTCTCGGGGATAACTTGAGAGCAGCTTGTCCAATCAGCACTAATGTGGGGCCAAATTCAGGCAGTGCACCAGGTAATTCTTCGCATTTTGCTGCTGCCAATAATGCATCTGGTGGTCCAAAATCCGAGCATCAGCAAGATTTAGTACCTTCAAAACCTCGAGTACTCCCAAAAAATCCATTGCCAAAACCTGCTATTAACCCAGATTCAATGGTCAAAACTGCTGCTATGGCTGCAGGCTCCCGAATTCCTACCCCTTCAGGTGCTGCTGCCTCACTGCAGAAGGCTGCAGAGTCTAAAAAAGTCGTCCAAATCATGCCCGGAGGAACTCCAGCAGTCAAATCATCTGTGCCAGGTAGCACTAATGGTTTGCCCAGTAATGTGCACTTCATTCGAAATGGCCTGGTTTCCCGTCCCCCAGGTTGTAGTCCGTCAAATGCCTCACAATCTGCAGTTCGACCTAACCCAGGGGCTGTCCCATCTAGAACAAATGTGTCATCTGGGGTTCCCGCTGCCCCTACATCTTCTTCAGCAAGGGTACTTGAAGCGAAAACTAAAGCAGTCGCTATCCAAGAAAATCCAACTGATATTCTGTCTAATGCACGGATAGAGAAAGTTGAAGTTGATCGAGCTGCTTCAATGGATGCACAACAGCAGGTCCGAAAAGGTCAAACTTTTGGTTCAAGCAACTTGTTGACAGAGAAAATTGAAGGTCAAACTTCAATTTTTGGTCGCATAGTGAAAGAGCATGGAGGAGAGAGTAAAGCTTCAAGAGTTCGGGCTCAAGAAAAGCTAATCCCAAGTCTAGAGACTGCCAGCGATAATAGCATCAGGAACCAGGGTGAGAATAGCAATAAGAATGAGAACACTAAAATAGTTTTGGCACTTGAAAGTAGTGGTGTTCAAGGTCCTGCAACCGAGACATGTAATAAAGTCTTCAAAGGGAGTTGA ATGCATGAAATGAATTCGGATTGGGTGCTCTCCTTTCAGAGCCACAACCAAGATTTTGCCATTCGACAATTTTGTGCTGCTGCCCGAATGAACGATCAACAACAACAACAACAACATTATTCACAAAATAGTATAAGTGAGCTGTACACTTATACTAGCACCCCTGATCACTCAGCCTCTTCCCAGGAGGAGCAAGAATTTGTTGATAGCTTCTTCAACATTGATTCTTACGACAAAGATCCTCTTGATCACCAAGTAAATCTTGATAATGAGGAGTTTGGAGGCTTCACATCAATGATACTCAATGAATACAGTTACGAGAATCTATCATTGCTGACCGATGAAGAACCAAGGTATACCGTGCTGGAGGCTTGTAGCTCAAGCGATGATCTTACTGGAGCGGAAACAATTAGCCATGACCATGTGGAGGCTGAGGTGGATCAGGGGCTTCAGCTTGTACATTTGCTCTTGGCTTGTGCTGAGGCTGTAGGTTGTAGGGACACTCGACTTTCTGATTCATTGCTTACTCAAATCTGGCCTGCAGTCACCCCTTTTGGCGACTCTCTCCAAAGGGTCTCCTATTGCTTTGCTTTGGGACTTCAATCCAGACTAATGCTTCTTCAACATAGAAATAATGCAGGTGCAACATTAATACAGGACCAGTTAGCAGCTTCGTCTAGAGCGGAGAAAAGAGAGGCCTATCATTGGTTGCACCAAATCACTCCCTACATGGCTTTTGGTTTCATGGCTGCAAATGATGCTATCTGCAAAGCTTCCAGAGGGAAGGAATGCTTGCATATCATTGATCTTGGCATGCTCCATATTTTTCAATGGCAATCCATAATTAGAACGTTAACTGCACAACCAGACTCACCTCCTCCAAGACTAGTTCGCATCACAGGCATTCTTGAAGATACACACGATTTTATGGAGCTTGAACTAGGATTGAAGAGCCTTATGGATGAGGTTGCTGCTACTTCTTCGGGCAGTATTCGTCTAGAGTTCCAAGTGATCAAGGAGCCAGTATCAGCATCGCTCTTCACCAGGGAAAAGCTTGATGTAAGAGAAGGGGAGGCTTTAATTGTGAACAGCGTCATGCAGTTGCACAAGTATGTGAAAGAGAGCAGAGGATCTCTCAAGACCATTCTTCAAGCCATCAAGAAATTAGGCCCTGCTCTGGTTACTGTGGTGGAACAGGACGCCAATCACAATGGTCCTTTCTTTCTAGGAAGATTCTTGGAATCACTTCACTATTACTCTGCCATTTTTGATTCGCTTGAAGCTAGCCTCCCACGGGACTCCACCACAAGGATTAAGATAGAGAAGTGTCATTATGCAGAGGAGATTCGAAACATTGTGGCATATGAAGGATCGAATAGGATCGAGAGGCATGAAAGAGTAGACCAGTGGCGCAAGCAACTTGGACGTGCTGGATTTCAGGTAGTAGGATTGAAGTGTATGAGCCAGGCTCGGATGATGCTTTCAGTCTATGGTTCCGATGGTTATACTCTAGCCAATGAGAAGGGCTGCCTTCTGCTGGGATGGAAAGGAAGGCCAATAATGTTTGCATCTGCTTGGGAAGTCCACAATGTTGCATCCTCCTGA ATGATGAAGGAGCCGTTAGAATTCTCTGAAAGCTCTTCTAAAAATTCTACTTCTTTATCCATTGAAAATAATACTCCTATGAAAGATCATAGCAATAAATCTGGAGGGGTAAGACCATATGTTAGATCCAAAATGCCAAGACTCCGATGGACGCAAGATCTTCATCGCAGATTTGTGCATGCTGTTGAGACACTTGGTGGAGAAGATCGAGCAACACCAAAGATGGTGCTACAGTTGATGGATGTGAAGGGCCTGACAATATCTCACGTAAAAAGTCACCTTCAGATGTACAGAAGCATGAAGCATGAACAGATGATACAAGCTGAAGCAGAAGCTGCAAATGGGAGTAAAAGGAATAGAATGGATATTGGTGCAGCCAATGGGAACTATCATCACTACTACAATATTAATGACAAAGCCTTCTTTGGTGCTCATCCTTTGAGTAATGTAACCAGTAATTATGCGGAGCTTGCCTCTATTTTGCCTCCTGCATGGAAACATATGCAAGAGAGCAAGGAAAATAAGATCATGGGATTGGAAGGAAAGTCTAATTATTCCATCATGTTCAGGGATTTCTTCAATGGCTGCAGTGTTCAAGATATTGGCAACAGGAATAAAGTGGTAGGAGAAGCTAGCAGTTTGTCAAATAAGAGTCCATCTGCAGAAGAGGAAGATATCTCCAGCAGCACCATGTCCTTAGAGCCATCTGCCAGTTTTGATGTCAATAATCTCTCTCTTGACCTCACCCTTGCTTAA
Protein Sequence:
- >64602|Micromonas_sp._RCC299|bZIP|64602
MKFGFDASSRDVHRVVQSSDRAPNERGEHSGFRTDVPMTKASVAVAAPAKPTGAAKAPKAKADGTKQDPQAGTLSREDIANFKLEDDGGSSDAHSDDGNVEENGDLDETKRVRRMLSNRESARRSRRRKQAHLGELQLKVNQLQSENQDLLNKLHQLHVSFNSMMARNRMIKDNMAYLRMQIMSGEPLSREALVAATQAVAAGAQTNLQDLNHLVVGGPGAAGMNTMLMSSGGQMGVPYVGGMGGMVQGGAGGMNGLVAPQGVGSMAGMGSMGSMGSMGSMRAYNLQMSQAMNRGVNGVNTGMSGMFGGAMGGTAQNMAAQNGLGNFKLEAGGGVSEFSNASSLTQQGSMGAGDGPSIRGVKGPSCISDADPDAFIDGGGGPIAEREAALAAAVAAQLRMGTPIGSGEVLHQLADQEAALAAQLKTFAAGAAGDGAMRAGAMQAGAMQGAGGVVGGRKISGDEDFRLPDGAAAATAAAARAAAQGRNGRQASDGSCADARRKQRAGSGDSSDARAGGGSGSGSGNNSGDGGDSELAGKGQQQQSRGALSPLDHDASPLAAPIAEKPPHIVDQVGVDAADSTVLGFLSSPILEGEVVDGWRHGGVDFDDWASTVIDGGGGMGTETKEPGTLGKNARTASMNRVASLEHMAKRAQAAGGPVS*