Information report for Thecc1EG000519t1
Gene Details
|
|
Functional Annotation
- TrEMBL: A0A061DMT4 — A0A061DMT4_THECC; Uncharacterized protein
- STRING: EOX91273 — (Theobroma cacao)
- GO:0006355 — Biological Process — regulation of transcription, DNA-templated
- GO:0005634 — Cellular Component — nucleus
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- The plant-specific B3 superfamily encompasses well-characterized families, such as the auxin response factor (ARF) family and the LAV family, as well as less well understood families, such as RAV and REM.
- All members of the B3 superfamily contain an ~ 110 amino acid region called the B3 domain. This domain was initially named because it was the third basic domain in the maize gene VIVIPAROUS1 (VP1).
- The first and second basic domains (B1 and B2) are specific to the VP1-like proteins, but genes that contain the B3 domain are widespread in plant genomes. The B3 domain of VP1 encodes a sequence-specific DNA binding activity.
Literature and News
Gene Resources
Homologs
- Gossypium arboreum: Cotton_A_34095_BGI-A2_v1.0, Cotton_A_34097_BGI-A2_v1.0
- Gossypium hirsutum: Gh_D13G2207, Gh_A12G0630, Gh_D12G0647
- Manihot esculenta: Manes.10G031300.1.p
- Nicotiana tabacum: XP_016490744.1, XP_016490738.1, XP_016490759.1, XP_016488301.1, XP_016490750.1, XP_016488300.1, XP_016454455.1
- Petunia inflata: Peinf101Scf00925g17001.1
- Pyrus bretschneideri: Pbr025044.1
- Ziziphus jujuba: XP_015890081.1
Sequences
CDS Sequence:
- >Thecc1EG000519t1|Theobroma_cacao|B3|Thecc1EG000519t1
ATGACTTCTCATCAACGTAGATCAGACAATGAGCATTCCATGTTTACGTCCAAAACACCCCATTTTTTTAAAGTTATTCTTGACGAAACTTTCCGAGACGGGAAACTTGGGATTCCTACAAATTTTGTAAGAAAATATGGAAGGCAATTGTCAAGTCCTATTCGTCTTGAGGTCCCAAGTGGTGCAGTATGGCAGGTTGAACTGACGAAGTGTGATGAAAGGGTGTGGTTGCAAAAGGGTTGGCAAGAATTTGCAGAGCATAACTCACTGGAATATGGATATTTTCTGGTTTTTAGATATGAAGGCAATGCTCATTTTCATGTACTTATTTTTGATACGAGTGCTTCAGAGATAGAATATCCACACACTAACACTACAGAAGAAGATGATGGCTTTGATAATGCTTTAGTTTGCAAGAAAAGTAAAGGGAAGTCAGATATCCCATATCCCCAACCTCATAAAGAAATGAAAGTTGATTCACCCAATGAAATTGGGACACATTTGAAGTCAAAGATTTCAGCTCCAGCTGCAATGGGAGGTGGAGTTTCGGGTCAGAGAAGCCCACAAATCGAGGTTCTCGAGACGGTTGGACATTTGACAGCCGATGAGAAAACTAAAGCTCTTCAGAAAGCCAGTGGTTTCAAAACTAAAAATCCATTTTTCATGGTGGTTATGCAACCATCATATGTTAGCTTCTCATATAGAATGAGTGTACCGGACGGCTTTGCTAGGAAATATTTCAAGATGACACAGGGCAATGTGATCCTACGTATTTCTAGTGGGCAAAGTTGGCCTGCAAAGTACTATTGCAGGCCAAACATTGATAATCCAAGAGCACAACTCCGCGATGGTTGGCAGGAGTTTGCAAAGCATAATGCTCTGGAAGTCGGTGATGTCTGCGTTTTTGAGCTAACTAGAACGAGCCCTGAAATCTTGTTGAAAGTAGTCATTTGTAAACGGTTTTTTGAAGATGCCATTGCAGCCAGGCCACTGGCTGGTGGGAGCATAGCCTATCGAGTAAAAAAGCGGCGCTTGTTCAGTGATACTGAAACTAACTGTCTGCAAAATCAACCTGCCATCAGAGAATATAGAGTGCCAAAGACTGAGCAAAACGAGAATATCCATACATCTATTGAAATATTGGATGACTTTCCACTAAACCAAATAACGAAGAAGAAATTGCCATTCCCAGGTTTTCAGCCCTGTAGGATGATGAAAACCAATCCAAGTCAAGTTAAAGGGATCGAGCTTGGGAAACAAAAGACGAGCTTGGATTTCCAATATTCAACGAACGAACTTGGGGGTGAGTTTAAATTTTCTGGGAAAGATGAAAGTGTAGGGATGTCAGGTGCTCAGAGATGTTCGAAACCTGACTTTCTGGGCAGAATGCAACCATTGACTACAACGGAGAAAAAAATAGCTCTTAAACGGGCTATGGCTTTCAAATCTGCAAACCCCTCTTTCACTGTTGTGATGCAGCCATCATATGTTCTTCCTGGTGGTTCTCTGAGCATACCATCCCAATTCGTCAAAAGGTATTTCAAGAAAAATGGTGAAGTCACCCTGAGAGTTTCAGATGGGAGGACTTGGATTGTTGACTACAATGGAGAAGGAGATGGTCAATGTCCAAAAGGCAAATTTCGTAGTAGGAGTTGGAGAGCATTTGTGCTGGACAATAACTTGAAGGTGGGAGACGTCTGTGTCTTTGAGCTGATAAAGGCAAATGGAAATTCCTTCGACGTTGTCATTTTTCCAGATGCTAATATTGCAAGTTGTTCCTCATCAAAATTAGATTCAAGATATCAATGTAAAGAAGCTGAAGATGAAGGATCCATTGAAATCCTGGAGTGCACTGCCCCATGCCAGAAAACCAGAGAGAAATCATCAATCCAGTGCCCTCGGCCTCAGAAGATGATGAAGATCAATATGATTAACAAAACTGAAAAGATACTGGAGTCCGAATATATAGATCCACGTTTCAGACCTTTTTGCAACAAAGCTTGCGGAATTAAGCTTGAAGAACCCAAGGGAAGTACAAGCTCAAGCTGTTGCAAGCAGGAAGTTGGACTTAAGCCTGCTACAAGGACAGGAACAAGCACTGAAAAGGGGTGGGAATGTCCAGAACAAGCTGAGATTCTGAGGTCGCAAAAATTAACAGCCAAAGTAAAGGCTAAAACTCTGCGGATAGCTAAAGCATTCAACTCAAAAAATCCTTTCTTTTTGGTTGTAATTCAACCATCACACATAAGCCGTAATTATAAAATGTGCATACCAAGTAACTTTGCTAGGAAATATTTTACAAAGACGCATGGAGGTGAGACAGTCCTTTGCCTTTCAGATGGAAAATCCTGGTCTGTCAAGTACTACCGCAGAGGTGACGATGGAAACCCACGAGGACAATTTTCAGGTGGTTGGAAGAAATTTGCTCTGGATAATAATCTGGTTGTTGGTGATGTTTGTGTATTTGAGTTGCTTAAAGGTGCTGATATCTCATTTAAAGTCTTAAGAGACATTCATGCTCGTCCTCGCCAGCTCCTCCTGCAGACCTTCGACTCTTCAGAAGAATCGTGCAGCCGTCTCATCTGTATCTTGGCCGTGTGGACATACCATTTAAGTTTATCGAGAAATATTTTAAGCCAGATACAAAAATGTAATCCTTCGAGTTGCGAGTAG
Protein Sequence:
- >Thecc1EG000519t1|Theobroma_cacao|B3|Thecc1EG000519t1
MTSHQRRSDNEHSMFTSKTPHFFKVILDETFRDGKLGIPTNFVRKYGRQLSSPIRLEVPSGAVWQVELTKCDERVWLQKGWQEFAEHNSLEYGYFLVFRYEGNAHFHVLIFDTSASEIEYPHTNTTEEDDGFDNALVCKKSKGKSDIPYPQPHKEMKVDSPNEIGTHLKSKISAPAAMGGGVSGQRSPQIEVLETVGHLTADEKTKALQKASGFKTKNPFFMVVMQPSYVSFSYRMSVPDGFARKYFKMTQGNVILRISSGQSWPAKYYCRPNIDNPRAQLRDGWQEFAKHNALEVGDVCVFELTRTSPEILLKVVICKRFFEDAIAARPLAGGSIAYRVKKRRLFSDTETNCLQNQPAIREYRVPKTEQNENIHTSIEILDDFPLNQITKKKLPFPGFQPCRMMKTNPSQVKGIELGKQKTSLDFQYSTNELGGEFKFSGKDESVGMSGAQRCSKPDFLGRMQPLTTTEKKIALKRAMAFKSANPSFTVVMQPSYVLPGGSLSIPSQFVKRYFKKNGEVTLRVSDGRTWIVDYNGEGDGQCPKGKFRSRSWRAFVLDNNLKVGDVCVFELIKANGNSFDVVIFPDANIASCSSSKLDSRYQCKEAEDEGSIEILECTAPCQKTREKSSIQCPRPQKMMKINMINKTEKILESEYIDPRFRPFCNKACGIKLEEPKGSTSSSCCKQEVGLKPATRTGTSTEKGWECPEQAEILRSQKLTAKVKAKTLRIAKAFNSKNPFFLVVIQPSHISRNYKMCIPSNFARKYFTKTHGGETVLCLSDGKSWSVKYYRRGDDGNPRGQFSGGWKKFALDNNLVVGDVCVFELLKGADISFKVLRDIHARPRQLLLQTFDSSEESCSRLICILAVWTYHLSLSRNILSQIQKCNPSSCE*