Information report for Ahy011307
Gene Details
Functional Annotation
- Refseq: XP_025642427.1 — probable transcription factor PosF21
- TrEMBL: A0A445E305 — A0A445E305_ARAHY; Uncharacterized protein
- STRING: XP_007131742.1 — (Phaseolus vulgaris)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0003700 — Molecular Function — transcription factor activity, sequence-specific DNA binding
- GO:0043565 — Molecular Function — sequence-specific DNA binding
Family Introduction
- The bZIP domain consists of two structural features located on a contiguous alpha-helix: first, a basic region of ~ 16 amino acid residues containing a nuclear localization signal followed by an invariant N-x7-R/K motif that contacts the DNA; and, second, a heptad repeat of leucines or other bulky hydrophobic amino acids positioned exactly nine amino acids towards the C-terminus, creating an amphipathic helix. To bind DNA, two subunits adhere via interactions between the hydrophobic sides of their helices, which creates a superimposing coiled-coil structure. The ability to form homo- and heterodimers is influenced by the electrostatic attraction and repulsion of polar residues flanking the hydrophobic interaction surface of the helices.
- Plant bZIP proteins preferentially bind to DNA sequences with an ACGT core. Binding specificity is regulated by flanking nucleotides. Plant bZIPs preferentially bind to the A-box (TACGTA), C-box (GACGTC) and G-box (CACGTG), but there are also examples of nonpalindromic binding sites.
Literature and News
Gene Resources
Homologs
- Cajanus cajan: C.cajan_21275
- Cicer arietinum: XP_004494977.1
- Citrus sinensis: orange1.1g007579m
- Glycine max: Glyma.11G110400.1.p, Glyma.12G036400.1.p
- Gossypium arboreum: Cotton_A_32367_BGI-A2_v1.0
- Juglans regia: WALNUT_00012916-RA, WALNUT_00003890-RA
- Lotus japonicus: Lj3g3v3406920.1
- Manihot esculenta: Manes.11G081300.1.p, Manes.04G082600.1.p
- Medicago truncatula: Medtr4g072090.1
- Populus trichocarpa: Potri.009G125400.1
- Ziziphus jujuba: XP_015892467.1
Sequences
CDS Sequence:
- >Ahy011307|Arachis_hypogaea|bZIP|gnl|UG|Ahy#S59545770
ATGGGTGACAATGAAGACTCCAACAACAATATTGACATGATGCAAAGGCTTCAATCCTCATTCGGAACGACGTCGTCTTCGTCGTCTTCGTTTCTCAAACAGACTCTTACCATGGAGCAGCTCACAATACCCCAATTCCAACAAATGCGTGCTAATAACAATAATCAGCAACAACATTTCTACGGTGGAGATGGAATGAAGCGTGCGGGTATACCACCGTCTCACCCGCACCAGATCCCGCCCATTTCCCCATACTCTCAGATCCCGCGCTCCACACTCACTCACCACCAAATGGGTTCACCTACACCCACCCACACCCGATCCCTATCCCAACCATCGTTCTTCTCACTCGACTCTCTCCCACCGCTAAGCCCCTCTCCCTTCCGCGGCGCTGACTCCTCAACTTCAATCTCCGACCAAGCCTCCGCTGACGTCTCCATGGAAGATCGTGACGTCAGCAGCCACCAACAGCAACAGCAACAACACTCTCTCCTCCCTCCGTCTTCGCCTTTCTCATCTGCCCGGGGCACCGGCAACCCGCTGCCCCCTCCTCGGAAAACGCACCGTCGCTCCAACAGCGATATACCGTTTGGATTCTCCACCATTTTGCAGTCGTCGCCGCCGCTGATTCCGTTGAAGAAGCCTGCGCAGCTTGTTAAGAAGGAATCTGGTTCCTGGAGTAGCGCTGATCCCAATGCGGAGGGATCAGGGGAGAAGAAGCTCTCCCCTGAAGGTGAAGTAGTTGATGATCTTTTCTCTGCTTACATGAACTTGGATAGCATCAATGACGCGTTTAACAATAATTCATCTGATGAGAAGAACGGTAACGAGATTATTATAAATAATAACAATAGTAATCGTGATGATTTGGATAGTAGAGCTAGTGGAACCAAGACTAGTAACAACAATGGTGGTGGGGATAGTAGTGATAATGAAGCTGAGAGTAGCGGCAACTCAATGATGAGTCAGAGTTGTGGGGAGAAGAGGGATGGCGTTAAGAAGAGGAGTGCTGCTGGTGAGGTTGTTGGCGGCGTCGCTCCCACGAGCCGCCACTATAGGAGTGTGTCTATGGATAGTTTCATTGGCAAGTTGAATTTCAATGATGAGTCGCCAAGGTTGCCGCCATCGCCTGGGTCTGCTCGGCCGCCTTCCTCGAATGGAATTGATGGGAACACTGGTGCTTTCAGCTTGGAGTTTGGGAATGGTGAGTTTAGTGGCCCTGAGTTGAAGAAGATCATGGCCAATGAGAAGCTTGCTGAGATTGCTATGATGGATCCTAAGCGTGCAAAGAGGATTCTGGCCAATCGGCAGTCAGCTGCACGATCCAAAGAAAGGAAGATGCGATACATTTCTGAGCTGGAACATAAGGTCCAGACCCTACAAACTGAGGCCACCACACTCTCTGCACAGCTTACTCTGTTGCAGAGGGATTCTGTTGGACTCACTAACCAAAATAGTGAGCTCAAATTTCGCCTTCAATCCATGGAACAACAGGCAAAACTCCGAGATGCTCTAAACGAGGCTCTCACTGCCGAGGTTCAACGACTAAAGCTCGCTACGGCTGAGCTGAACGGGGAGTCACACTCATCCGGTTGCTTGATTCCGCAACATTCTGTCAACCCTCTGATGTTTCAGCAGCAGCAGCAAGCTTCTACTGCATCTCAACAAAACATTCATCTTCAACAACAACAGCGGCAGAATGGTAATACCAACTCACAAACCGATCTAAAACAATAA
Protein Sequence:
- >Ahy011307|Arachis_hypogaea|bZIP|gnl|UG|Ahy#S59545770
MGDNEDSNNNIDMMQRLQSSFGTTSSSSSSFLKQTLTMEQLTIPQFQQMRANNNNQQQHFYGGDGMKRAGIPPSHPHQIPPISPYSQIPRSTLTHHQMGSPTPTHTRSLSQPSFFSLDSLPPLSPSPFRGADSSTSISDQASADVSMEDRDVSSHQQQQQQHSLLPPSSPFSSARGTGNPLPPPRKTHRRSNSDIPFGFSTILQSSPPLIPLKKPAQLVKKESGSWSSADPNAEGSGEKKLSPEGEVVDDLFSAYMNLDSINDAFNNNSSDEKNGNEIIINNNNSNRDDLDSRASGTKTSNNNGGGDSSDNEAESSGNSMMSQSCGEKRDGVKKRSAAGEVVGGVAPTSRHYRSVSMDSFIGKLNFNDESPRLPPSPGSARPPSSNGIDGNTGAFSLEFGNGEFSGPELKKIMANEKLAEIAMMDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNSELKFRLQSMEQQAKLRDALNEALTAEVQRLKLATAELNGESHSSGCLIPQHSVNPLMFQQQQQASTASQQNIHLQQQQRQNGNTNSQTDLKQ