Information report for Lus10042057
Gene Details
|
|
Functional Annotation
- Refseq: XP_012084427.1 — transcription factor RF2a
- TrEMBL: A0A067K743 — A0A067K743_JATCU; Uncharacterized protein
- STRING: Lus10042057 — (Linum usitatissimum)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature and News
Gene Resources
Homologs
- Citrus sinensis: orange1.1g007579m
- Gossypium arboreum: Cotton_A_32367_BGI-A2_v1.0, Cotton_A_00688_BGI-A2_v1.0
- Gossypium hirsutum: Gh_D02G2195, Gh_A13G2204, Gh_D13G2509, Gh_A03G1762
- Juglans regia: WALNUT_00012916-RA, WALNUT_00003890-RA
- Manihot esculenta: Manes.04G082600.1.p, Manes.11G081300.1.p
- Nicotiana benthamiana: Niben101Scf01852g04031.1
- Populus trichocarpa: Potri.004G163800.1, Potri.009G125400.1
- Ricinus communis: 30131.m007145
Sequences
CDS Sequence:
- >Lus10042057|Linum_usitatissimum|bZIP|Lus10042057
ATGGGTGATACGGAAGAAGGGAGCACTGATGTGATGCAGCGATTGCAGTCGTCGTTCGGGACAAATCAGTCATCGACGTCGTCTTCAATGCTGAAGCAGCTGCCGTTTTCATCATCACCAAGGCAGATTGATATCCCACCTTTGAGTCAAAATCAAATGCGAGCTAGGCATTTTGCTCACTTTGCTCAACAGCAAGGTTTTAGCGGCGGCGGTGGTGATAGCAACAGCAATAATAACAAAAGAGCTGGGATTCCGCCTTCACATCCGAACCAGATCACACCGATTTCTCCCTTCTCTCAGATCCCGGTGTCTACCAGGCCAGTCAACCACCACCAAATGGGGTCTTCTCAGAGTTTCAATACTACCAACAAGCCTGGCCATTCTCGATCTTTATCACAACCATCCTCGTTTTTCTCGCTTGATTCTCTTCCACCTTTAAGTCCTGCATCATTCAGAGACCATTCACCAACTTCTGAAACTGATGCTTCCATGGAGGTGGATAGGGATGGGAACTATCATTCCGTGTTGCCACCGTCTCCTTATAGTAGGTCCTCCTCAATTGCTCCTCGTGTTGGTGAGAGTTTACCACCGAGGAAGGCTCATAGACGGTCCAATAGTGATATACCGTTTGGGTTTAGTACTGTGATGCAGTCTTCTCCACCACTGAAATTTGGTAACAAGCTAGCTCAGGTGGTTAAAAATGAAGAAGGGATGGGTGAGAGGAAACCCGAAGGTGAAGCCATGGATGATTTGTTCTCGGCGTATATGAATCTGGATAACTTCGACTCGTTGAATTCGTCAGGGACTGATGACAAGAATGGGAACGAGAATCGCGAGGATTTGGATAGCAGGGCGAGTGGAACCAAGACGAATGGTGGTGATAGCAGTGATAATGAGGCGGAGAGCAGCATTAACGAGAGTGGGAGTAGCATTCCAAGGAGGGAAGGGACTAAAAGGAGTGCTGAGGGGGATATTGCTCCAACTTCAAGACATCACAGAAGTGTTTCCTTGGATAGTTTCATGGGGAAGTTGAACTTTGGAGGTGATGAGTCCCCAAAGCTTCCCCCTTCGCCTGGTCCTCGTCCTGGACAGTTGTCTCCGACTAGCTCGATCGATGGGAGTGCCTTTAGCTTGGATTTTGGGAATGGGGAGTTCAACAGTGTTGAGCTGAAGAAAATTATGGCAAATGAGAAACTCACTGAGATTGCTTTAACTGATCCAAAGCGTGCAAAGAGAATTTTGGCGAACCGTCAGTCTGCTGCTCGGTCAAAAGAAAGGAAGATGAGATACATATCGGAACTTGAACACAAGGTTCAGACTCTTCAGACTGAAGCTACTACATTGTCTGCCCAGCTTACTCTTTTACAAAGAGATTCACTTGGACTCACGAATCAGAATAACGAACTAAAATTTCGTCTACAAGCCATGGAGCAACAGGCACAACTTCGGGATGCTCTGAATGAAGCTTTGACTGCTGAGGTTAGACGACTGAAGATAGCAACAGCGGAAATTAGCGAGAACAGTGCAGATCCATCAAAAGGTAGCCAGCACCATACTTCTGCAAACTCTCAAATGTTCCAGCAGCAGCAGCAAATGTCATCTCAATGCAACATGAACCAATTGCAGCAGCAGCAGCATGAACAACGAAACGGAACAAACTCGAAAGCAGAAGCAAACCAGTGA
Protein Sequence:
- >Lus10042057|Linum_usitatissimum|bZIP|Lus10042057
MGDTEEGSTDVMQRLQSSFGTNQSSTSSSMLKQLPFSSSPRQIDIPPLSQNQMRARHFAHFAQQQGFSGGGGDSNSNNNKRAGIPPSHPNQITPISPFSQIPVSTRPVNHHQMGSSQSFNTTNKPGHSRSLSQPSSFFSLDSLPPLSPASFRDHSPTSETDASMEVDRDGNYHSVLPPSPYSRSSSIAPRVGESLPPRKAHRRSNSDIPFGFSTVMQSSPPLKFGNKLAQVVKNEEGMGERKPEGEAMDDLFSAYMNLDNFDSLNSSGTDDKNGNENREDLDSRASGTKTNGGDSSDNEAESSINESGSSIPRREGTKRSAEGDIAPTSRHHRSVSLDSFMGKLNFGGDESPKLPPSPGPRPGQLSPTSSIDGSAFSLDFGNGEFNSVELKKIMANEKLTEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSLGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVRRLKIATAEISENSADPSKGSQHHTSANSQMFQQQQQMSSQCNMNQLQQQQHEQRNGTNSKAEANQ*