Information report for C.cajan_43444
Gene Details
Functional Annotation
- Refseq: XP_020206203.1 — uncharacterized protein LOC109791325
- TrEMBL: A0A151QZ37 — A0A151QZ37_CAJCA; Uncharacterized protein
- STRING: XP_007152366.1 — (Phaseolus vulgaris)
Family Introduction
- GT factors constitute a plant-specific transcription factor family whose DNA-binding domain binds GT elements. This domain is rich in basic and acidic amino acids as well as proline and glutamine residues, and it adopts a typical trihelix (helix-loop-helix-loop-helix) structure that determines specific binding to GT elements; GT factors are therefore also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interactions between GT factors and GT elements have been implicated in the complex transcriptional regulation of many plant genes.
Homologs
- Actinidia chinensis: Achn354421
- Capsicum annuum: CA03g16650
- Cicer arietinum: XP_012572837.1
- Glycine max: Glyma.09G149700.1.p, Glyma.16G201100.1.p
- Malus domestica: MDP0000129923
- Medicago truncatula: Medtr0007s0220.1
- Petunia inflata: Peinf101Scf01452g04008.1
- Prunus mume: XP_016649806.1
- Prunus persica: Prupe.2G286400.1.p
- Ricinus communis: 30147.m014143
- Solanum lycopersicum: Solyc03g006900.1.1
- Solanum melongena: Sme2.5_03635.1_g00005.1
- Vitis vinifera: GSVIVT01018456001
Sequences
CDS Sequence:
- >C.cajan_43444|Cajanus_cajan|Trihelix|C.cajan_43444
ATGGGCGACAAGGGTGACACATCCAAGAAACAACAACAACCACCACAGCAGCAACAACCTCAATCTCTCCATCACCACCAACAGCAACAACAAACTCAACAACAACAACCACAAACACAGCAAATTTCATCTTCTCCTCAAGCCCCACGTGAGGAATTACCACCCATCACAGAAGTTAGGTCAATCCACCACCAACAACAACAGACACCAGTTGTGGTTGTAACTGGAGCACCCCCTTTCATTTCAAGTCCTCTATATGTTTCCAGTGGTGCAAGTTCCTCACCCTTTGACCCTCAGTTTGAGTCACTCAACCCCAAGAGGCCTAGATACAGTGGACAATGGAAGCAACTTCTACCATCTCCATCTGCAAAACAAAAACAAAAGGGGAAGTACGTGAGCCCAGTTTGGAAGCCCAACGAGATGCTGTGGCTAGCAAGGGCTTGGAAGGAGCAATACCAAGGTGGTGGTGGTGGATCTGAATCTTCTTCAAGAGCAGAACAACAAGGAGAATTGGGAATCTCGAGAGGGAAAACCAGGGCTGATAAAGATAGAGAAGTTGCCGAATTTCTCAGGAGGCATGGGGTGAATAGAGATGCCAAAACTGCAGGGACGAAATGGGATAACATGTTGGGAGAGTTTAGGAAAGTGTATGAATGGGAAAGAGGGGGTGAGAGGGAACAAATTGGGAAGAGCTATTTCCGGCTTTCACAGTATGAGAGAAAGTTGCATAGGCTCCCAGCTTCTTTTGATGAAGATGTTTTCGAAGAGCTTTCTCAGTTCATGGGGTCTAGAATGAGATCCTCTCATGGCAGAGCTGGTTCCTCTTTTGTGTCAGGAGATGAAGCTAGATCGGCTCTTGCTGCTACAAGAGCTCTTCTACCTCCTCCTCGATCTCTCAAAGACGATGAAGCAACCCCTCATTCAGGAAGGACAAAGCAATTGGCTTTGACAAGTGGGGGTGAACCTTTCTTTCAGGGCTCAAGAGGGGGTTTATTAGGGTTAGAGTCTCTCTTGGACGTTTCAGGCTCTTCTTCATCATCGAGGGAACCACTCCGAAGAATGGGGAAGATAAGAATGACGTGGGAGGAATCAGTGAGTTTGTGGGCAGAAGAAGGTGAAGTTCACAGAGGGAGAGTGAAGCTTCAATCTTCGAGTTTTCTGAATGCAGATGAACTCACTTGCTTTGACGATGCCATGGTGCCTTGTCCCATGGAATATTTCGAAGATGGTCCTTTGAAGGGTTTCTCTGTTGACAGATTCGTTTCGGGACAGCAAGTTAAAGTTTTTGGCAGAAGAAAGGCTTCCCCGGCCTCGGCTTCTTCTGGTTTAGCTGAAAGAGTCCAACTTCCCTCCAAAGCACTTCCCATAAGATCCATTGGCACATTGGATTTCCGAGACCCAACAGAATACTACATGGAATGTCTCCTACGTGCATCATCCCCGCAAACGCTCCCAACCCTCTTCGACCTAAGGCGCCACCTCCAAGACCCGCCACCGGAGGAGCTCCGCTTTCCCCTCCGCCGCGAGGTCTACGACGACCTCCCTCAGGGAAAAGAACTCTTCTTCACTTCCGCCACCGAGCCCTTGGATTGCAGAACCCTAATGCACGACATCGTGGGCCCCATCATCCGCACCCACCCTAGCCCCACAATCCCCACTTCCCGCGACTCCTTCATCCCTCTCTGGGACGATTGCGTCAACCGAGTCATCGCGAGATTCTGCCCCGAAGAAATGAAAATAATCCGAAAACCCTCGTTAAAGGACACCGCCAAAACCCTAATCCAAGACCAGTGGCCCAACGTAACGGGATTCGTGAACAACTTCTGCTTATGGCGCGGCGAGGAAACGGATCGGTTGAAAGAGTCGCAACCGGATCCATCGAGCACGCTGGTCGGAAAATTACTCTGGAGTTACATGGATCTCCCTTATGTTCTAGGTTATTACGCAGTGGCCAATACGGTGACGTTTTGCGCGTTGAGCAGGTCGCACGAGGGGGTGAA
TCTAATCAACCGTACGGATTTGTTGGAGTTGAATTTGAGTAATCCGATGGAGAGGTTGAAGTGTTTGGTTCCGTGTTTCAGGATTGGGGTGTTGTTGGCGATGTTGAGTAAGCATTGTGCGAAGGGGGGTATATATAGCGATTTCGAGAGGGTTAGTTACGGTAACGGAGTGGTGACGGAGTTGACGCCGAACACGTGCACTCGGGTGTTTTGGGAGAAGCGGAAGTGGATGGCCGTTAAGGAGGTGTACGAGATTCTCGACCACAGGATTCCGCACGCGGAGATTCTGGTGGGGAGTTCGGAGGGGGAGCTGACGTTGTCGTTTCGGCCGCGGGGGTGTCGTTTGAGGCCGGGGAGTTTTGAGGAGCTGGTGGAGGCGTTGAAGTGCGTGACGAAGGCGCTGGTGGCGCTGCACGACTTGTCGTTTATGCATAGGGACTTGAAGTGGGAGAAGGTGATGCGGCGCGTGGACGGGGAGGAGTGGTTTGTGTGCGGGTTCGAGGAGGCGGGGGGGGCGCCGGAGCTGAAGGGGCACGTGGCGGGGGCGCGTGAGGGCCACGCGCCGGAGATGGAGAGGGGGTTGCATGGGGTGAAGGTGGACGTGTGGGGGGTGGGGTACTTGATTAGGACGTGTGGGTTGGGAGGGGTGCCGAAGATGCTTCGGGAGCTGCAGGGGAGGTGCATGGAGCAGAGCCCAGAGCAGAGGCCCACAGCGGCCGACTGCTACCACCACCTGCTGCAGATGCAGTCGTCGCTCGCTGCCGCTGCCGCCGCCACCGGGGGTGTCATCATGATGTGA
Protein Sequence:
- >C.cajan_43444|Cajanus_cajan|Trihelix|C.cajan_43444
MGDKGDTSKKQQQPPQQQQPQSLHHHQQQQQTQQQQPQTQQISSSPQAPREELPPITEVRSIHHQQQQTPVVVVTGAPPFISSPLYVSSGASSSPFDPQFESLNPKRPRYSGQWKQLLPSPSAKQKQKGKYVSPVWKPNEMLWLARAWKEQYQGGGGGSESSSRAEQQGELGISRGKTRADKDREVAEFLRRHGVNRDAKTAGTKWDNMLGEFRKVYEWERGGEREQIGKSYFRLSQYERKLHRLPASFDEDVFEELSQFMGSRMRSSHGRAGSSFVSGDEARSALAATRALLPPPRSLKDDEATPHSGRTKQLALTSGGEPFFQGSRGGLLGLESLLDVSGSSSSSREPLRRMGKIRMTWEESVSLWAEEGEVHRGRVKLQSSSFLNADELTCFDDAMVPCPMEYFEDGPLKGFSVDRFVSGQQVKVFGRRKASPASASSGLAERVQLPSKALPIRSIGTLDFRDPTEYYMECLLRASSPQTLPTLFDLRRHLQDPPPEELRFPLRREVYDDLPQGKELFFTSATEPLDCRTLMHDIVGPIIRTHPSPTIPTSRDSFIPLWDDCVNRVIARFCPEEMKIIRKPSLKDTAKTLIQDQWPNVTGFVNNFCLWRGEETDRLKESQPDPSSTLVGKLLWSYMDLPYVLGYYAVANTVTFCALSRSHEGVNLINRTDLLELNLSNPMERLKCLVPCFRIGVLLAMLSKHCAKGGIYSDFERVSYGNGVVTELTPNTCTRVFWEKRKWMAVKEVYEILDHRIPHAEILVGSSEGELTLSFRPRGCRLRPGSFEELVEALKCVTKALVALHDLSFMHRDLKWEKVMRRVDGEEWFVCGFEEAGGAPELKGHVAGAREGHAPEMERGLHGVKVDVWGVGYLIRTCGLGGVPKMLRELQGRCMEQSPEQRPTAADCYHHLLQMQSSLAAAAAATGGVIMM
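
The CDS and protein records above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database. It translates only the first 30 and last 15 nucleotides of the CDS shown above.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence that is
    wrapped over several lines can be passed in directly. Codons that
    contain ambiguous bases are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGGGCGACAAGGGTGACACATCCAAGAAA"
cds_end = "GTCATCATGATGTGA"

print(translate(cds_start))  # MGDKGDTSKK, the first ten residues of the protein
print(translate(cds_end))    # VIMM, the last four residues before the stop codon
```

Passing the full CDS (both wrapped lines) to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.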