Information report for Lus10016758
Gene Details
|
|
Functional Annotation
- Refseq: XP_002511207.2 — uncharacterized protein LOC8266322 isoform X1
- Swissprot: Q84W56 — RNJ_ARATH; Ribonuclease J
- TrEMBL: B9RAI2 — B9RAI2_RICCO; Uncharacterized protein
- STRING: Lus10016758 — (Linum usitatissimum)
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- GT factors constitute a plant-specific transcription factor family with a DNA-binding domain that binds GT elements. The DNA-binding domain of GT factor, rich in basic and acidic amino acids and proline and glutamine residues, features a typical trihelix (helix-loop-helix-loop-helix) structure that determines the specific binding of GT elements; thus GT factors are also called trihelix transcription factors. GT elements are highly degenerate cis-elements with A/T-rich core sequences (Villain et al. 1996; Wang et al. 2004). Interaction between GT factors and GT elements has been implicated in the complex transcriptional regulation of many plant genes.
Literature and News
Homologs
- Gossypium arboreum: Cotton_A_16240_BGI-A2_v1.0
- Gossypium hirsutum: Gh_D03G1080, Gh_A03G0459, Gh_D07G0523, Gh_A07G0459
- Manihot esculenta: Manes.14G052700.2.p, Manes.14G052700.1.p
- Populus trichocarpa: Potri.012G095800.1, Potri.015G093600.1, Potri.015G093600.2
- Prunus persica: Prupe.5G152800.1.p
- Pyrus bretschneideri: Pbr036910.1, Pbr017747.1, Pbr017741.1
- Ricinus communis: 30147.m014528
- Sesamum indicum: XP_011073024.1, XP_011073026.1
- Vitis vinifera: GSVIVT01008515001
Sequences
CDS Sequence:
- >Lus10016758|Linum_usitatissimum|Trihelix|Lus10016758
ATGTCCACTTTTAGTCTCACTGCTCCCTCGCTCTGCCCTTATTACCGCCCTGGTCTTGCCAAGTTCTCTGTTTCTTGCTGTACTGGTTCCCCGACCAAGATAGGAAGCAGGCGGGTATCAAAAGCACCACCACGTAAAAGACCAAGTAGAAGAATGGAAGGGGCAGGGAAAAGTATGGAGGATTCTGTTAAACGCAAAATGGAACAGTTCTATGAAGGAACTGATGGCCCACCTCTACGTATTGTTCCAATTGGTGGACTTGGTGAGATTGGAATGAATTGCATGCTGGTCGGGAATTACGATCGCTATATTCTAATTGACGCTGGTGTTATGTTTCCTGACGACGAAGATCTCGGAGTTCAAAAGATTTTGCCAGACACCACGTTTATCAAAAGATGGAGTCACAAAATTGAGGCAGTTGTTATAACTCATGGCCATGAAGATCACATCGGTGCGTTGCCATGGGTGATCCCGGCTTTGGATGCAAATACCCCGATATTCGCTTCATCGTTTACAATGGAGCTGATAAAAAAACGTCTGAAGGAGCATGGTTTCTTTCTTCCTTCCAGACTAAAAGTTTTCAGAACAAGGAAGAGATTCACTGCTGGGCCTTTTGAAATAGAGCCAATCACAGTGACCCACTCGATTCCTGATTGTAGTGGTCTCATTCTTCGTTGTTCTGATGGTATAATCCTTCATACTGGTGATTGGAAGATAGATGAATCTCCATTGGATGGCAAACCATTTGATCGTGAAGCTCTAGAAGAACTCTCAAAGGAAGGAGTGACATTGATGATGAGCGATTCAACAAATGTGTTATCGCCAGGAAGGACAACTAGTGAAACTGTTGTAGCGGATTCACTGTTGAGACATATTTCTGCTGCTAAAGGAAGGGTTATTACTACTCAATTTGCATCAAATATATGGCGGCTAGGAAGTGTGAAAGCTGCTGCTGATTTGACAGGAAGAAAACTTGTTTTTGTTGGCATGTCCTTAAGGACATACTTGGATGCTGCTTGGAAGGATGGAAAAGCACCTATTGACCCTGCCACTCTGGTGAAAGCAGAAGATATTGATCAATACGCTCCTAAAGATTTATTGATTGTGACAACTGGGTCACAAGCAGAACCTCGAGCTGCACTAAATCTTGCATCGTATGGAACTAGCTATGCTTTCAAACTGAAAAAGGAAGATATAATTCTTTATTCAGCTAAGGTTATCCCTGGTAATGAATCAAGAGTAATGAAAATGATGAATCGCATAACAGAAATTGGGTCAACCATAGTAATGGGTAGAAACGAGCAACTGCACACTTCTGGCCATGGGTATCGTGGAGAGCTGGAGGAGGTACTTAAAATCGTGAAACCGCAGCACTTTCTCCCTATACATGGAGAACTTTTGTTCTTGAAAGAACATGAATTGCTTGGAAAGTCAACTGGAGTGCATCATACTACTGTCATTAAGAACGGGGAGATGCTCGGCGTATCACATTTAAGGAATAGACGAGTGTTATCTAATGGTTTCATTTCTCTTGGAAAGGAGAATTTACAGCTAATGTATAGTGATGGTGATAAAGCATTTGGCACAGCAACTGAGCTTTGCATCGAGGAGAGGCTAAGAATTGCAACCGATGGAATCATTGTGGTCAGCATGGAAATCTTGCGACCTCAGGGCGTAGATAGTGTGAGCGAAAATAACATAAAAGGAAGAATAAGAATCACAACACGGTGCTTGTGGTTGGACAAGGGGAAGCTTTTAGATGCACTCCACAAAGCTGCCCACGCCGCCCTTTCAAGTTGCCCTTTAAACTGTCCTTTAGCGCACATGGAAAGAACAGTAGCCGAGGTATTAAGGAAGATGGTGAGGAAGTATAGTGGCAAAAGGCCTGAGATGATTGTTATTGCCATGGAAAACCCAGCAGGGGTTCTTTCTGAAGAACTGTCTGCAAAGCTTGCTGGCAAATCAGAAATGGGGTTTGGGATATCAGCATTGAGAAAAGTAATCGACAAACATCCTGAAAGAAAGAACAAGCCACAAATTGACGAAAATGGATATGGTTACATTGAGGATGCACCACTGGAGGATTCTGAAGGAGAGAATGCAGTTGAGGAAGATAACAGTAGTTCAAGTGAGAGGTTGGATGGAAGGAGTGTGGAGGATGATAATTTTTGGCGTTCAATGATGTCATCACTGCCTGGTGACCCTTCAGAAGAAGAAGCTAATGTGAGAGGTGGTGGTAATGACAGTAGTGAAGATAATGACGCAGAGAAAACTAGGCAGAAATCTGGTAAGCGTAATAAGTGGAAACCGGAGGAGATCAAGAAGCTAATTAAAATGAGAGGGATTTTGCATAGCAGGTTTATCAGTGTAAAGGGAGGAAGGATGGCCCTCTGGGAAGATATATCTAGTAGCTTGATGGAGGAAGGGATCGAACGCACTCCAGGACAATGCAAATCACTGTGGGCATCTCTGGTACAAAAATACGAGGAAAGCAAAAACGGGCCTGAGAGTGGAAAAGAATGGCAATATTTTGAACAAGTGAAGAGCATTCTATCTGATCATGAGCCTGAGCCAACTGCAGCGGCTAAATGA
Protein Sequence:
- >Lus10016758|Linum_usitatissimum|Trihelix|Lus10016758
MSTFSLTAPSLCPYYRPGLAKFSVSCCTGSPTKIGSRRVSKAPPRKRPSRRMEGAGKSMEDSVKRKMEQFYEGTDGPPLRIVPIGGLGEIGMNCMLVGNYDRYILIDAGVMFPDDEDLGVQKILPDTTFIKRWSHKIEAVVITHGHEDHIGALPWVIPALDANTPIFASSFTMELIKKRLKEHGFFLPSRLKVFRTRKRFTAGPFEIEPITVTHSIPDCSGLILRCSDGIILHTGDWKIDESPLDGKPFDREALEELSKEGVTLMMSDSTNVLSPGRTTSETVVADSLLRHISAAKGRVITTQFASNIWRLGSVKAAADLTGRKLVFVGMSLRTYLDAAWKDGKAPIDPATLVKAEDIDQYAPKDLLIVTTGSQAEPRAALNLASYGTSYAFKLKKEDIILYSAKVIPGNESRVMKMMNRITEIGSTIVMGRNEQLHTSGHGYRGELEEVLKIVKPQHFLPIHGELLFLKEHELLGKSTGVHHTTVIKNGEMLGVSHLRNRRVLSNGFISLGKENLQLMYSDGDKAFGTATELCIEERLRIATDGIIVVSMEILRPQGVDSVSENNIKGRIRITTRCLWLDKGKLLDALHKAAHAALSSCPLNCPLAHMERTVAEVLRKMVRKYSGKRPEMIVIAMENPAGVLSEELSAKLAGKSEMGFGISALRKVIDKHPERKNKPQIDENGYGYIEDAPLEDSEGENAVEEDNSSSSERLDGRSVEDDNFWRSMMSSLPGDPSEEEANVRGGGNDSSEDNDAEKTRQKSGKRNKWKPEEIKKLIKMRGILHSRFISVKGGRMALWEDISSSLMEEGIERTPGQCKSLWASLVQKYEESKNGPESGKEWQYFEQVKSILSDHEPEPTAAAK*