Information report for Thecc1EG026624t1
Gene Details
|
|
Functional Annotation
- Refseq: XP_017976648.1 — PREDICTED: protein ALWAYS EARLY 2 isoform X2
- TrEMBL: A0A061F320 — A0A061F320_THECC; DIRP,Myb-like DNA-binding domain, putative isoform 1
- STRING: EOY11452 — (Theobroma cacao)
- GO:0005654 — Cellular Component — nucleoplasm
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- A novel myb-like gene (AtmybL2) was isolated from an Arabidopsis thaliana cDNA library. The single copy gene was localised on chromosome I. A gene specific transcript is preferentially found in leaves. The predicted gene product consists of a conservative N-terminal myb-domain known to be involved in DNA-binding and a unique proline-rich C-terminal part. Remarkably, the myb-domain includes only one of the typical two or three tryptophan repeats found in other myb-like proteins.
Literature and News
Gene Resources
Homologs
- Citrus sinensis: orange1.1g044351m
- Gossypium arboreum: Cotton_A_10975_BGI-A2_v1.0, Cotton_A_27916_BGI-A2_v1.0
- Gossypium hirsutum: Gh_A11G2639, Gh_A13G0889, Gh_Sca005054G02, Gh_D13G1130
- Ziziphus jujuba: XP_015875085.1, XP_015875086.1
Sequences
CDS Sequence:
- >Thecc1EG026624t1|Theobroma_cacao|MYB_related|Thecc1EG026624t1
ATGGACTGCTGTATCTACGCTATGAAGAAGAAATTAGCTGACAAGTTAGGATCTCAATGGAGCAAGGAAGAAATTGAGCGTTTTTATAAAGCTTATCGAGAGTATGGGAAAGATTGGAAGAAGGTGGCTGCTGCTGTGCATAATAGATCCACTGAAATGGTGGAGGCCCTTTACCTTATGAACCGGGCATATTTATCCCTGCCAGATGGAACAGCTTCTGTGATTGGCCTTATAGCAATGATGACCGATCATTATAGTGTCCTGAGAGGGAGCGATTGTGAAAGAGAGAGTAATGAGCCTTCTGAAATACCCCAGAAAGCTCAAAAGCGCAAGCGGGCAAAGGTTCATCTTGGTACCTCAAAAGAAGGTGTTGTACAGCCTCAATCAATTGCATCTAGTCAGGGATGCCTATCTTTGTTGAAGAGGGCAGGCCTTAATGGTATTCATCCTCATGCAGTAAGGAAAAGAACTCCTCGGGTTCCTGTTTCATATTCATATAGGAGAAATGACACTGAAAGTTACATTCCACCAAACAAAAGAGTTAAGAAGTCAGACGCTGATGATAATGATGCTGAACATGTTGCCGCATTGACGTTGACTGGGGCATTGCAAAGGGGAGGCTCCCCTCAGGTTTCTCAGACACCTTACAAAAGAGCTGAATGCAGAAGATCCTCACCCGTTCAGAGCTATGATAGGACGTCACCACAACCGGAGACCACTAAGGCCAAGCTTGATGATTCTTCCTACGAATGCTGGATGGAAGGCAGGCCTAGGGGCACGGAACCTGTAATTGGAACTCATGCCAGAGATGCAGACCCCTTGATGGATATGGAAGTTGTTGGTACCATTGAAGGTCATCGAAAGGGAAAGAAATTTTACAGGAAGAAAATGAAAGTTGAAGAAACCAAGAACAATCTCTCTGATGATGGTGGAGAAGCGTGTAGTGGCACTGAAGAAAGAATTAGAGGTAGTACTCTCAAAGGGAAAGTTGATATGGAGATTACCAGTGCAAAAAGTGAACAACTTTCACCATGGAGTCAGAGGAAGAGAAGTAACAAGAAGCTTGTCTTCGGAGGTCTGAACTTAAGATCTAGCATTGAATTTGATAGCGCATACAAGCTCAACATCGTTTTTTTATTCTTTTTTCGATATGTAGATGAAAGCTCTTCCATTGATGCTCTGCTGACATTAGCCAATTTGTCAACGTCAATGTTGCCAACATCAATAATGGAATCTGAATCATCTGTCAAATTGAAAGAGAATAGAATTACACTTGAATCTGTTGACAAGTCTAGTGCCCCTGAAGCTGCATCTACAAGTCATCACAGAGATAATATTAAGCACCTACGGCCAAACGAAAAGGTGCTCGACTCAATCACTGGTGCAGAGGAAGCTACCACTAGGAAACTAAAAGTTGGAAGGAATTCAGCTATGGATGATAATGTTGTTTCTGAGGCAAAACAAAAGCCAGAACCTACCAATAACTCTTGGAAAAGAAAACGCAAATCCTTCAGTTCAAAGATTTCTAATGCAGAAGCTTCAATGGATTCTCATCTCCAACAATCTTTTGATAATGAGGACATGGGTGAAGAAGACAATAAATATCTCACTAAAGGTAAATGTGGTGCTCAATCTTCTGTTCAATCAAGACAATGGAAGTCATTCAGAGTGTCAGAGGATTCCTCTACTAATGATGATCCAAAAATGGCTGGAATTGATTCAGTGGTGTTGACTTCACAAGTTCCTGCACCAAACCCTGTTAGCGTACCACCTAAGCATCAAAGTAGACGTAAAATGAACCTGAGGAGAGCCTTCCTTTCAACAGATAGAAGTTCTTCCAAGTGCACATTGAAAAATCAACCAATCAAGCAGTCTGTCACACAAGACAGACTAAAGGAACAGCTCTCTTCCTGCCTATCATCTAATCTGGCACGAAGATGGTGCAGTTTTGAATGGTTTTACAGTGCTATTGATTATGCTTGGTTTGCTAAAAGGGAGTTTGTTGAGTACCTAAATCATGTCGGACTGGGTCATGTTCCAAGGCTTACTCGTGTTGAGTGGGGTGTCATAAGAAGTTCCCTTGGCAAACCTCGGAGGTTTTCTGAACGCTTTTTACATGAAGAAAGGGAAAAACTTAAACATTATCGGGAGTCTGTGAGACAACATTATTCTCAGCTTCGCGTTGGTGCTAGGGAAGGACTTCCAACGGATCTGGCATATCCTTTATCAGTTGGACAACAAGTAATTGCCATTCATCCCAAAACGAGGGAAGCTCATGATGGAAAAGTACTTACTGTGGACCATGATAGGTGCAGGGTTCAGTTTGATAGTCCTGAACTAGGGGTTGAATTTGTCATGGATATTGATTGCATGCCATTAAATCCGTTGGAAAATATGCCGGAAGCACTTAGGAGACAGAACCTTGCTTTTGATAAATTCTCTGTGACACCTAAACCGTCTCAAGTGAATAGCCATTCAGATTTTGGTGGGTCCACGGTATTCACTTCAAGTGGGCATCTGGAGAATGGAACCAGCCCTGTGAACATGTCGGCAAATCAGATAAAGGTGGATGCCAACCGTAACATTTTGCATGCTGAGGCAGCTGTTCCTTATGTTGTTAGTGCACATCAAGCAGCCTATGGTCAACCACTTACCATGGCACATATCAAAGGGAGGGAAACTGATACACGAGCTATGTCTGAATTGAACGGTGCTCTTGACAAAAAGGAAGCTTTATTGATGGAGCTCAGAAACACGAACAATGACATATCAGAAAATCAAAATGGAGAAAGTTGTTTAAAAGATTCTGAACCTTTCAAGAAGCATATTGCCACGGCTTCTTCTGCTTTAGTTAACTTGAGACAACGAAATGCTTACCCAGCAAACCCCCTGTCACCTTGGCAGAAACCCCCAACCAATTCCAACTTCTTTGGTGGCTTGAAAAGTTATGTTGACAGTTCTCTTGTCTCACCAGAATCAGGATCTGGTGTGGGTGAAATTGTTCAAGGCTCAAGACTAAAGGCGCATGCTATGGTGGATGCTGCTATGAAGGCCATGTCATCAATGAAGGAAGGCGAAGATGCATTTATGAGGATTGGAGAAGCTTTGGACTCTTTAGATAAACGGCAATTCACATATGACATTAGGATGCCGGTGATCAAGTCACGAGAGCAGGAGAATGGCAGTATGGATTATCGCAATCACTTGGTTTCCTGTACATCAAAACCGGTGGCTGCCGGTTGGGCAACTAATCCCAAGTCGCAGGAGGCTTCTGACAAAAACGAGGAACAAGGTCCTTCAGAGCTGATCGCATCATGTGTTGCTACTTTGCTCATGATACAGACATGTACAGAGCGACAATATCCGCCAGCAGACGTGGCTCAAATAATCGATTCAGCTGTTACAAGCTTGCATCCATGTTTTCCCCAGAACCTGCCAATTTACCGAGAAATACAAATGTGCATGGGGAGGATTAAGACTCAAATATTAGCTTTGATACCCACTTGA
Protein Sequence:
- >Thecc1EG026624t1|Theobroma_cacao|MYB_related|Thecc1EG026624t1
MDCCIYAMKKKLADKLGSQWSKEEIERFYKAYREYGKDWKKVAAAVHNRSTEMVEALYLMNRAYLSLPDGTASVIGLIAMMTDHYSVLRGSDCERESNEPSEIPQKAQKRKRAKVHLGTSKEGVVQPQSIASSQGCLSLLKRAGLNGIHPHAVRKRTPRVPVSYSYRRNDTESYIPPNKRVKKSDADDNDAEHVAALTLTGALQRGGSPQVSQTPYKRAECRRSSPVQSYDRTSPQPETTKAKLDDSSYECWMEGRPRGTEPVIGTHARDADPLMDMEVVGTIEGHRKGKKFYRKKMKVEETKNNLSDDGGEACSGTEERIRGSTLKGKVDMEITSAKSEQLSPWSQRKRSNKKLVFGGLNLRSSIEFDSAYKLNIVFLFFFRYVDESSSIDALLTLANLSTSMLPTSIMESESSVKLKENRITLESVDKSSAPEAASTSHHRDNIKHLRPNEKVLDSITGAEEATTRKLKVGRNSAMDDNVVSEAKQKPEPTNNSWKRKRKSFSSKISNAEASMDSHLQQSFDNEDMGEEDNKYLTKGKCGAQSSVQSRQWKSFRVSEDSSTNDDPKMAGIDSVVLTSQVPAPNPVSVPPKHQSRRKMNLRRAFLSTDRSSSKCTLKNQPIKQSVTQDRLKEQLSSCLSSNLARRWCSFEWFYSAIDYAWFAKREFVEYLNHVGLGHVPRLTRVEWGVIRSSLGKPRRFSERFLHEEREKLKHYRESVRQHYSQLRVGAREGLPTDLAYPLSVGQQVIAIHPKTREAHDGKVLTVDHDRCRVQFDSPELGVEFVMDIDCMPLNPLENMPEALRRQNLAFDKFSVTPKPSQVNSHSDFGGSTVFTSSGHLENGTSPVNMSANQIKVDANRNILHAEAAVPYVVSAHQAAYGQPLTMAHIKGRETDTRAMSELNGALDKKEALLMELRNTNNDISENQNGESCLKDSEPFKKHIATASSALVNLRQRNAYPANPLSPWQKPPTNSNFFGGLKSYVDSSLVSPESGSGVGEIVQGSRLKAHAMVDAAMKAMSSMKEGEDAFMRIGEALDSLDKRQFTYDIRMPVIKSREQENGSMDYRNHLVSCTSKPVAAGWATNPKSQEASDKNEEQGPSELIASCVATLLMIQTCTERQYPPADVAQIIDSAVTSLHPCFPQNLPIYREIQMCMGRIKTQILALIPT*