Gene Details:
- Gene ID: Gh_D07G0523
- Gene Family: Trihelix Family
- Description: Trihelix Family protein
- Species: Gossypium hirsutum
- Source: Trihelix family gene from PlantTFDB
Protein Features:
Annotation Proteins:
- Refseq: XP_012468135.1 — PREDICTED: uncharacterized protein LOC105786301
- Refseq: XP_012468148.1 — PREDICTED: uncharacterized protein LOC105786301
- Swissprot: Q84W56 — RNJ_ARATH; Ribonuclease J
- TrEMBL: A0A0D2N648 — A0A0D2N648_GOSRA; Uncharacterized protein
- STRING: Gorai.001G059600.1 — (Gossypium raimondii)
Gene Ontology:
- GO:0003677 — Molecular Function — DNA binding
Family Introduction:
- GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.
Literature:
- Systematic analysis of GT factor family of rice reveals a novel subfamily involved in stress responses. DOI: 10.1007/s00438-009-0507-x ; PMID: 20039179
Sequences:
CDS Sequence:
- >Gh_D07G0523|Gossypium_hirsutum|Trihelix|Gh_D07G0523
ATGGCTGCTTCCACTGCTCTCTCACTTTGTCCGTACATACTCTCTCGCCGGCCAACCCCAAGAAAGCGTCTCTTTTCTTGCTCCGTCGGCTCCACTACTCCTATAGGTACACGAAGAACTAACGTACCGCGTAGAAGTTCAGGAAGATTGGATGGGGCTAGAAAAAGTATGGAAGACTCTGTCCAACGCAAGATGGAGCAGTTCTATGAAGGGACTGCTGGACCACCTCTTCGGGTTCTTCCAATTGGTGGTTTGGGTGAAATTGGAATGAATTGCATGCTTGTTGGTAATTATGATCGCTATATTCTAATTGATGCTGGTGTGATGTTTCCAGACTATGATGAGCTTGGAGTCCAAAAGATTATACCTGATACAACATTTATTAAGAAATGGAGCCACAAAATTGAAGCAGTAGTGATAACACATGGCCATGAAGATCACATTGGTGCGTTGCCTTGGGTTATCCCAGCATTGGATCCTCACACTCCAATATATGCTTCATCCTTTACCATGGAGCTGATCAAAAAGCGTCTGAAGGAGAATGGGATTTTTGTCCCATCTAGGCTTAAGGTATTTAAAATGAGAAAGAGATTTACGGCCGGGCCATTTGAAATAGAGCCTCTCAGGGTGACACATTCTATTCCCGACTGTTGTGGATTAGTTCTTCGCTGTGCTGATGGTACAATTCTTCACACTGGGGACTGGAAGATTGATGAATCACCATTGGATGGGAATATTTTCGACCGGCAGTTTCTAGAGGATCTGTCAAAGGAAGGAGTAACACTGATGATGAGTGACTCTACTAATGTATTGTCACCCGGAAGGACAACTAGTGAGCGTGTAGTAGCAGATGCATTGTTGAGGCATATATCAAATGCTAAAGGAAGGATTATTACTACCCAGTTTGCATCAAACATACACCGACTTGGAAGTGTAAAAGTTGCTGCAGATTTAACTGGTAGAAAGCTGGTATTTGTTGGCATGTCATTAAGGACTTATCTAGATGCAGCTTGGAAGGATGGAAAAGCACCAATTGATCCATCAACTCTGGTGAAAGCAGAAGATATTGATGCCTATGCTCCGAAGGATTTAATAATTGTCACAACTGGATCCCAAGCAGAGCCACGGGCAGCCTTGAATCTTGCATCCTATGGAAGTAGTCATTCTTTCAAACTGAACAAGGAAGATGTGATTCTCTATTCAGCTAAGGTAATCCCTGGTAATGAATCCCGGGTAATGAAGATGCTAAACCGTATATCAGAGATTGGATCAACTATAGTGATGGGTAGGAATGAGGGGCTACACACTTCTGGTCATGGCTATCGTGGAGAGCTGGAGGAAGTACTTAAAATTGTGAAGCCGCAGCATTTTTTACCCATACATGGAGAGCTCGTGTTCTTGAAAGAGCATGAGCTACTTGGGAAATCAACCGGCGTTCGACACACCACTGTTATAAAGAATGGAGAGATGCTTGGGGTTTCTCATTTGAGGAATAGAAAAGTTCTGTCTAATGGCTTTAGTTCCCTTGGGAAGGAGAATTTACAGTTAATGTACAGTGATGGTGATAAGGCATTTGGCACATCAACTGAACTTTGTATTGATGAGAGACTAAGAATTGCATCTGATGGCATTATAGTGGTCAGCATGGAAATTTTACGCCCCCAAAAGATTGATGGCATAATTGAGAATAGCTTAAAAGGGAAGATAAGAATCACTACACGCTGCTTATGGCTAGACAAGGGCAAGCTTTTAGATGCACTCCATAAAGCTGCTCATGCTGCACTGTCGAGCTGTCCTGTGAATTGCCCCCTAGCTCACATGGAAAGAACTGTTTCTGAGGTATTGAGGAAGATGGTAAGGAAGTACAGTGGTAAGAGGCCTGAAGTCATTGCGATTGCATTGGAGAACCCTGCAGGAGTTCTCTCTGATGAGCTAAATGAAAAGCTATCTGGCAACTCCAATGTTGGTTTTGGGATACCGGCAGTGAGAAAAGTAATGGATGGACATCCAAAAAGGAGAGAACCAAACAAGATAAAAGCAGAAAATGACAGTAATCTGCATATAGAGAATACTTCAGAACAAAATTTGATAGTTGGCAATGATGTTGAAACGTTCTTACCCGAGGAAGTGACCACTAGTTCAAGTCCTGACCATGCAGAAAGGCATACACGCAGTACTGAGGATTCTGATGAATTCTGGAAACCATTCATCAAATCATCTTCACCTATTGACAATTTGGAAAATGATAACAATGGGTTTATCCCAATAGAGGAACATAAGTCAGAACTTAAGAGTGATGATGCCGCAAGCAGTGGAGATGTCTCAGAACTGCTCAGCTCTCAACTGAAGTCGTCAAAGCCTGCCAAACGGAACAAATGGACATCTGAAGAGGTCAAGAAGCTAATTAAAATGCGGGGGGAATTACATAGCCGATTTCAGGTTGTGAAAGGGAGAATGGCCCTTTGGGAAGAAATATCTGCTAGCCTGTTGGCTGATGGAATTAGTCGAAGCCCTGTGCAGTGTAAATCCAGATGGGCATCTCTGGTTCAGAAATATGAGGAAATCAGGAGTGAGAAGAAAAGCCATAAAGATTGGCCCTATTTTGAGGAAATGAATAAAATTTTATCTGATGATTTTGAGGCAGCAGCTACATAA
Protein Sequence:
- >Gh_D07G0523|Gossypium_hirsutum|Trihelix|Gh_D07G0523
MAASTALSLCPYILSRRPTPRKRLFSCSVGSTTPIGTRRTNVPRRSSGRLDGARKSMEDSVQRKMEQFYEGTAGPPLRVLPIGGLGEIGMNCMLVGNYDRYILIDAGVMFPDYDELGVQKIIPDTTFIKKWSHKIEAVVITHGHEDHIGALPWVIPALDPHTPIYASSFTMELIKKRLKENGIFVPSRLKVFKMRKRFTAGPFEIEPLRVTHSIPDCCGLVLRCADGTILHTGDWKIDESPLDGNIFDRQFLEDLSKEGVTLMMSDSTNVLSPGRTTSERVVADALLRHISNAKGRIITTQFASNIHRLGSVKVAADLTGRKLVFVGMSLRTYLDAAWKDGKAPIDPSTLVKAEDIDAYAPKDLIIVTTGSQAEPRAALNLASYGSSHSFKLNKEDVILYSAKVIPGNESRVMKMLNRISEIGSTIVMGRNEGLHTSGHGYRGELEEVLKIVKPQHFLPIHGELVFLKEHELLGKSTGVRHTTVIKNGEMLGVSHLRNRKVLSNGFSSLGKENLQLMYSDGDKAFGTSTELCIDERLRIASDGIIVVSMEILRPQKIDGIIENSLKGKIRITTRCLWLDKGKLLDALHKAAHAALSSCPVNCPLAHMERTVSEVLRKMVRKYSGKRPEVIAIALENPAGVLSDELNEKLSGNSNVGFGIPAVRKVMDGHPKRREPNKIKAENDSNLHIENTSEQNLIVGNDVETFLPEEVTTSSSPDHAERHTRSTEDSDEFWKPFIKSSSPIDNLENDNNGFIPIEEHKSELKSDDAASSGDVSELLSSQLKSSKPAKRNKWTSEEVKKLIKMRGELHSRFQVVKGRMALWEEISASLLADGISRSPVQCKSRWASLVQKYEEIRSEKKSHKDWPYFEEMNKILSDDFEAAAT