Information report for Lus10041475
Gene Details
|
|
Functional Annotation
- Refseq: XP_002311449.2 — transcription factor TGA2.2 isoform X1
- Swissprot: Q41558 — HBP1C_WHEAT; Transcription factor HBP-1b(c1) (Fragment)
- TrEMBL: B9HJ06 — B9HJ06_POPTR; Uncharacterized protein
- STRING: Lus10041475 — (Linum usitatissimum)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature and News
Gene Resources
Homologs
- Gossypium arboreum: Cotton_A_28674_BGI-A2_v1.0
- Gossypium hirsutum: Gh_D12G1679, Gh_A12G1545, Gohir.A12G169200, Gohir.D12G172500
- Manihot esculenta: Manes.14G099100.1.p, MANES_14G099100
- Populus trichocarpa: Potri.008G118300.1, POPTR_008G118300v3, Potri.008G118300.2
- Prunus mume: XP_008243193.1
- Prunus persica: Prupe.1G307300.2.p
- Ricinus communis: 28883.m000741
- Vitis vinifera: GSVIVT01011929001, VIT_01s0011g03230
Sequences
CDS Sequence:
- >Lus10041475|Linum_usitatissimum|bZIP|Lus10041475
ATGCAAACATCATCATCTCATCATCATCATCATCCCATCCCTTCTGCTATGTTCCGGTCCTCGGAAATGTACAGCAGTAGTACTCCTCTACTTCCACCATCTTTCTTTTTCAGAGGAGCAGCAGCACAAGAAGAAGAAGAGGGTAGTAGAGTCCAAACGAGGTTTGGTGATATTGGAGAGCTCGATCATCATCATCATCATCCTCCTCAACCAACTTTCCATCACAACCATGCTTTTGATTTAAGCCCAAGCTCCTCCTCCATGTTCAGCCTGAAATCTGGGAATGTTGTCAATGCTAATATTGTTCCAAGCAGCAGCAGCAGCTTGCTTCTCCCTTCTCCATTCAACACGAGTATTGTTGGGTGCTTGGACACAGGGCAGCAGCAGTACATGTATGGGAAAGGGACAAGTAGCTTTGGCAATATTGACAACAACAACAACTGGGGAGACTCTAATTCGGGCGGGATGGCTGCAGATACTACTGACACTTCCACAGATGTTGACACTACTGATGATCGACACCACCATCACCAGCAGCAACTTGGTGGCCAGCATGGCTCAGTTGTGGTGGTGGATTCCATTGATCAATCCAACTGCAAAGTTGATGACCAAAAGACAAGTCGAAGGTTGGCTCAAAATCGCGAAGCAGCACGAAAGAGCCGGCTTAGGAAGAAAGCTTATGTCCAGCAACTTGAGAACAGTCGTCTGAAGCTTACCCAACTTGAGCAAGACCTACAACGTGCTCGCCAACAGGGAAATTTCACGGGTGATCATGGCCATTCAGTAGCTGGAAATGATGCATTGGCATTTGACATGGACTATCAACGCTGGCTCGGAGAACATCAGAGGCTGATTAACGACCTAAGATCAGCAGCGAATTATCTGACAAGCGACGACAAGCTGCGCATTCTCGTAGATGGAGTGATGAGCCACTATGCCGAGATATTCAGGCTAAAGAGCATTGCTGCAAAGGCTGACGTGTTTCACATTCTGTCAGGCTTATGGAAGACTCGTGCTGAGCGATGTTTCATGTGGATTGGTGGATTTCGCTCTTCCGAACTTCTCAAGATAGTAGTGAACCAACTCAAACCGTTGACCGATCAACAGTTGGTAGGGATATGCAATCTGCAGCAGTCATCCCAGCAGGCAGAAGATGCATTATCACAAGGTATGGAAGCATTGCAACAGTCACTTGTGGACTCACTTTCTTCGACCTCTCTCCGTCCCGGTGCGGCTCCTGGTTGTCGTGTCAATGTGGCTGATTACATGGCCCAGATGGCTATTGCAATGGGCAAACTCACCACCCTTGAGAACTTCCTTCATCAGGCTGACCTTCTGCGGCAGCAGACATTGCAACAGCTGCATCGAATACTGACGACACGCCAGGCTGCAAATGCTCTTCTTGTTATCAGTGACTACACTTCTCGTCTCAGGGCCCTTAGCTCGCTATGGTTGGCTCGACCTAGGAACCTGAACGATCGATCCACAAATGTTGTGTAA
Protein Sequence:
- >Lus10041475|Linum_usitatissimum|bZIP|Lus10041475
MQTSSSHHHHHPIPSAMFRSSEMYSSSTPLLPPSFFFRGAAAQEEEEGSRVQTRFGDIGELDHHHHHPPQPTFHHNHAFDLSPSSSSMFSLKSGNVVNANIVPSSSSSLLLPSPFNTSIVGCLDTGQQQYMYGKGTSSFGNIDNNNNWGDSNSGGMAADTTDTSTDVDTTDDRHHHHQQQLGGQHGSVVVVDSIDQSNCKVDDQKTSRRLAQNREAARKSRLRKKAYVQQLENSRLKLTQLEQDLQRARQQGNFTGDHGHSVAGNDALAFDMDYQRWLGEHQRLINDLRSAANYLTSDDKLRILVDGVMSHYAEIFRLKSIAAKADVFHILSGLWKTRAERCFMWIGGFRSSELLKIVVNQLKPLTDQQLVGICNLQQSSQQAEDALSQGMEALQQSLVDSLSSTSLRPGAAPGCRVNVADYMAQMAIAMGKLTTLENFLHQADLLRQQTLQQLHRILTTRQAANALLVISDYTSRLRALSSLWLARPRNLNDRSTNVV*