Information report for XP_010109060.1
Gene Details
Functional Annotation
- Refseq: XP_024029684.1 — uncharacterized protein LOC21396125
- TrEMBL: W9SQI9 — W9SQI9_9ROSA; Uncharacterized protein
- STRING: XP_010109060.1 — (Morus notabilis)
Family Introduction
- GT factors constitute a plant-specific transcription factor family whose DNA-binding domain binds GT elements. This domain, rich in basic and acidic amino acids as well as proline and glutamine residues, adopts a typical trihelix (helix-loop-helix-loop-helix) structure that determines the binding specificity for GT elements; GT factors are therefore also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interactions between GT factors and GT elements have been implicated in the complex transcriptional regulation of many plant genes.
Sequences
CDS Sequence:
- >XP_010109060.1|Morus_notabilis|Trihelix|XP_010109060.1
ATGGGTGGTGACAACAAGGGAGAAACAACAAAGAAGCCGCAACAATCAAAACCATCACAACCATCACAACAACAACAACAACAACAACAACAGCAGCAAAATCCTCAGCCGCAACAACAACAACAACAACAGCAACAGCAACAACAACAAATCTCAATATCTTCTCCCAAAGATCCAATATTGGAAGAGTCAACAGATCAAACCAGACTAATATCAGCTCCTTTGTTCATCCCCGGCGGTGCAACTTCCTCTTCACCGTTTGATCATCAGCAATTCGATCAAGCGGTGACCAACCCTAAGAGACCAAGATACACCTCCGCAACCGGCCAATGGAAGCTTCTACCGTCTCCTTCTTCTCAGCAGAAAGTGGTTCAGACCCAAATGACGGTTTTGACCTCCGAGTCAAGCCCATCGCCAACCGCCTACCCTCCAGCCTCAAGCATAGTCCCTGCGAATATATCCCAACCCCGAGCGCTCCACAGCGCCGCCGCTTCCTCCTCCGATACGGCGTCGTCTCCTTCCCACTCCTTGCCGTCCGGCCAAGACACGAGCAAACCGGAGGGGGAAAACCAGGTTCACCAGCAATTCAGGAAGGGAAAGTACGTGAGCCCGGTGTGGAAACCTAACGAGATGCTGTGGCTAGCCAGGGCTTGGCGGGTTCAGTACCAAAGCGGCGGTGGGTCTAATGGGTCCGGAACGTCTTCGAGAGTGGCGGAGCCCACAGAACATGGTGGCGCCGCCCTAACCAAGGGCAAAACTAGAGCCGATAAGGATAAAGAAGTGGCCGAGTTTCTCCAACGGCATGGGGTTAACAGAGACGCGAAAACCGCAGGGACGAAGTGGGACAACATGTTGGGGGAGTTTCGGAAGGTTTATGAGTGGGAGAGAGGAGGGGAGAGAGAGCAAGTTGGGAAAAGCTATTTTCGGCTCTCGCCGTATGAGAGGAAGCTTCATAGATTGCCTGCTTCGTTTGATGAGGAAGTTTTTGAGGAGCTTTCGCAGTTCATGGGGTCCCGAATGAGGACCCCACAGAGCAGAGCAGCCTCTGGAATTCTCGTCGCCGCCGGTGATGATTCTAGGATAAGAGCAAGCCTCCCAGCTCCCGCTCCTTTCAAGGAAGACGAACAATTCCCTCTCCCAGCTAGGACAAAGCAGTTGATAATTGGAAGTGGGGGTGAAACATTTTACTATGGAGGAAGTGGGAGCGGGAGAGGAGGGGCTTTAATGGGGTTTGAGTCCTCAGGAGACATGGCGGGCCCATCACCACCATCGTCATCTTCTTCAAAGGAGCTTCGTCGGATCGGTAAGGTTCGAATGACATGGGAAGAATCTGTGAGTTTGTGGGCTGAAGAAGGCGAGCACCATAGAGGAAGAGTGAAGCTCCAAGGAGGGTCGTTGAGCTTCTTGAACGCTGATGAACTCACTTACTTTGACGACGCCATGGTCGCTTGCACCCTGGAAGGGTTTGAAGAAGGCCCTCTTAGAGGATTCTCCGTCGACAGATTCGTTTCCGGACAGCAAGTCAAAGTCTTTGGCAGGAAAAAGCCTTTCTCAGTCTCTGCTGCTCCTGGCTTCCAAGAGAAAGTTCAACAGCAGCTTCCATTCACGGAAGCCTCCATAAGATCAATTCCTCCATGGGAATTTCAGGATGCAAGCGATTACTACGTAGGATGTCTTAGAGTTCCACCACCATCAATCCCAAGCCTATTCGAGCTCTCATGGTACTTACAAGAACCGCCGCCGGAGGAACTTCGCATCCCGCTTCGCAAAGACGTTTACCGAGACTTGCCTCAAGGGAAAGATCTATTCTTCACAATGTCCACCGAATTACTAGACTGTAGAGCCATAACATACGATATACTAAGCCCCATCATCAGGCCAACTCCTAGTCTCACTTCTTCGACCGCCACAAGTAGAGAATCATTCGTTGGCCTTTGGGACGATTGCATTAACAGAATCGTCTCAAAGTTTTGCTCAGTCGAGATGGTCACCATTCGTAA
ACCCACTTCAACATCATCAACCGAAGTTTTGCAAGATCAATGGCCGAACTTGACAGGCTTCGTAAGGAACTTTTGTTTGTGGAGAGGAGAGGAAACCGACCAACTAAGGGAAGGGCAAATTGATCCGTCAAATTCCATTGTGGAAAAGCTTCTATGGACCTATGGAGATGTTCCATACATATTAGGTTACTACGTGATTGGCTTTTTGGTAACTTTCTGCGCGCTGAGCCGCGCGCAGGACCGCATAATCCGAACCGATCTACAGACGCTAGACTTGTCCTCACCATCGGATAGGTTAAAAGCCCTAGTCCCATGCTACAGAATCGCGGGTCTATTGCCATTACTAGCGGACCGATGCTTCATCAACGTCAACTCCAAAACTCTCATCTACAGTGACTTCGAGAGAGTCGACAATGGTAATGGAAACGTGACGGAGATGACACCGAACACCGCAACGAGATTCTTTCCCAACAGGAGAAAGTGGGCGGCAGTGAAAGAAATCTACGATTTTCTAGACCACCGAATCCCGCACGCCGAGTTCGTCCACCGATCTTCGGAGAAAGATCTCTCTCTTGTGTTCAAGCCGAGAGGGATCAAAACCAAGCCGGCGAACTGCGATCAGCTTGTGGAGGCGCTCAAGTACGTGACGAAGGCGCTCGTGGCTCTACACGACTTGTCGTTCATGCACAGGGATTTGAGCTGGGATAAAGTGATGATGAGGAGGGCCGAGAGAGGGGATCAGATCATGGGGGCGGAGTGGTTCGTGTGCGGCTTCGAAGAGGCTGTCGGAGCGCCGCAAATATACCCGCACAGTAGTAGCTCGGCGGCGCGTGGGGCACACGCACCGGAGATGGAGAGGGGGTTGCATGGGGTGAAGGTGGACGTTTGGGGGATGGGGAATCTGGTGCGGACTTGTGGGGTGGTTGGAGGAGTGCCCAAGATGCTGCGGGAGCTGCAGAATCGGTGCCTTGATCAGAATCCGGAGTTGCGCCCCACCGCCGCCGACTGCTACCACCATTTGCTTCAGCTCCAGTCCTCTCTTCAGACCGCCTCCGCCGCTGCTTCTGGCGGTGTATTGATGTGA
Protein Sequence:
- >XP_010109060.1|Morus_notabilis|Trihelix|XP_010109060.1
MGGDNKGETTKKPQQSKPSQPSQQQQQQQQQQQNPQPQQQQQQQQQQQQQISISSPKDPILEESTDQTRLISAPLFIPGGATSSSPFDHQQFDQAVTNPKRPRYTSATGQWKLLPSPSSQQKVVQTQMTVLTSESSPSPTAYPPASSIVPANISQPRALHSAAASSSDTASSPSHSLPSGQDTSKPEGENQVHQQFRKGKYVSPVWKPNEMLWLARAWRVQYQSGGGSNGSGTSSRVAEPTEHGGAALTKGKTRADKDKEVAEFLQRHGVNRDAKTAGTKWDNMLGEFRKVYEWERGGEREQVGKSYFRLSPYERKLHRLPASFDEEVFEELSQFMGSRMRTPQSRAASGILVAAGDDSRIRASLPAPAPFKEDEQFPLPARTKQLIIGSGGETFYYGGSGSGRGGALMGFESSGDMAGPSPPSSSSSKELRRIGKVRMTWEESVSLWAEEGEHHRGRVKLQGGSLSFLNADELTYFDDAMVACTLEGFEEGPLRGFSVDRFVSGQQVKVFGRKKPFSVSAAPGFQEKVQQQLPFTEASIRSIPPWEFQDASDYYVGCLRVPPPSIPSLFELSWYLQEPPPEELRIPLRKDVYRDLPQGKDLFFTMSTELLDCRAITYDILSPIIRPTPSLTSSTATSRESFVGLWDDCINRIVSKFCSVEMVTIRKPTSTSSTEVLQDQWPNLTGFVRNFCLWRGEETDQLREGQIDPSNSIVEKLLWTYGDVPYILGYYVIGFLVTFCALSRAQDRIIRTDLQTLDLSSPSDRLKALVPCYRIAGLLPLLADRCFINVNSKTLIYSDFERVDNGNGNVTEMTPNTATRFFPNRRKWAAVKEIYDFLDHRIPHAEFVHRSSEKDLSLVFKPRGIKTKPANCDQLVEALKYVTKALVALHDLSFMHRDLSWDKVMMRRAERGDQIMGAEWFVCGFEEAVGAPQIYPHSSSSAARGAHAPEMERGLHGVKVDVWGMGNLVRTCGVVGGVPKMLRELQNRCLDQNPELRPTAADCYHHLLQLQSSLQTASAAASGGVLM
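Consistency check (illustrative):
The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the original record. The helper names `join_fasta_lines` and `translate` are hypothetical, and it assumes the CDS uses the standard nuclear code and starts in frame at the first base.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_fasta_lines(text):
    """Join the wrapped sequence lines of one FASTA record (hypothetical helper).

    Header lines (anything containing '>') and blank lines are skipped.
    """
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and ">" not in line
    ).upper()


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons that are not in the table (for example, ones containing N)
    are reported as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS record above.
print(translate("ATGGGTGGTGACAACAAGGGAGAAACAACAAAGAAGCCG"))  # MGGDNKGETTKKP
```

To check the full record, pass the complete CDS text through `join_fasta_lines` first, since the sequence is wrapped across lines, and then compare the result of `translate` with the protein sequence.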