Information report for Thhalv10003596m
Gene Details
Functional Annotation
- Refseq: XP_006394955.1 — protein ALWAYS EARLY 1 isoform X1
- Swissprot: Q6A331 — ALY1_ARATH; Protein ALWAYS EARLY 1
- TrEMBL: V4MLC3 — V4MLC3_EUTSA; Uncharacterized protein
- STRING: XP_006394955.1 — (Eutrema salsugineum)
- GO:0003677 — Molecular Function — DNA binding
Family Introduction
- A novel myb-like gene (AtmybL2) was isolated from an Arabidopsis thaliana cDNA library. The single-copy gene was localised on chromosome I. A gene-specific transcript is preferentially found in leaves. The predicted gene product consists of a conserved N-terminal myb-domain known to be involved in DNA-binding and a unique proline-rich C-terminal part. Remarkably, the myb-domain includes only one of the typical two or three tryptophan repeats found in other myb-like proteins.
Literature and News
Gene Resources
Homologs
- Arabidopsis thaliana: AT3G05380
- Brassica napus: GSBRNA2T00144207001, GSBRNA2T00133776001, GSBRNA2T00028858001, GSBRNA2T00120517001, GSBRNA2T00038579001, GSBRNA2T00119961001
- Brassica oleracea: XP_013598393.1, XP_013616256.1
- Brassica rapa: XP_009151196.1, XP_009151195.1, XP_009129869.1, XP_009147247.1
- Raphanus raphanistrum: RrC2295_p2
Sequences
CDS Sequence:
- >Thhalv10003596m|Eutrema_salsugineum|MYB_related|Thhalv10003596m
ATGGCGCCCAGTAGGAAGTCGAAGAGTGTGAACAAGCGCTTCACCAATGAAGCTTCTCCAGAAATAGATTATAGGAACTCGAGCAAAACCAAGCAACGAAAGAAGAAATTAGCTGACAAGCTGGGACCTCAGTGGACGAGAGGAGAGCTGGAGCGTTTCTATGACGCCTATCGGAAGCATGGGGGAAAATGGAAAAAGGTAGCTGCTGCAGTGAGGAATAACCGGTCTGTTGAGATGGTGGAAGCCCTTTTTTATATGAATCGGGCATATTTATCCCTTCCAGAGGGAACTGCGTCTGTAGCTGGCCTCATTGCAATGATGACTGATCATTACAGCGTCATAGAGGGGAGTGAAAGTGAAGGAGAAGGCCATGTTGCTTCGGGAGTACCGAGGAAATATCTGAAGCGCAAACGTGCTAAAGTCTCGCCTAGTAATTTTCGAGAAGAAGCTAGTACACCACATTCAATTGCATCAGCAGAAGGATGCCTCCCATTTTTAAAGCTGACACAAGCTTATGGAATTGAGAGGCGTGCCACTGCGAAACGTACACCTCGGTTTCCTGTACCAATTGCAGACAAGAGGGATGATAGAGAAGATTCTACTCCGCGAAATAAAAGAGCCAAGAAACAACTTGATGATGGTGATGATGATGATGATGATGAGACGCTAGCTTTAACATTGGCAAATGCATCGAGAAGGGGAGGAGGGTCTCCATATAGAAGAACAAAACTCCATGACAGCACACCAAGTGGAAAAATGTCACAATCAAAGGAAGCTCAAGCCAAGCTCCGTGCTAGCTCCATGTTTGACAATGGGGTGACAATAAGCCGAGATAGGAGGCATATAAAGGGATCTCGTGATAGAGATGGTGCCTTGTTGATGGATATGGGAGGGCTTGGTACCGTGGAGATTCCTCAGAAGGAGAAAAATGTGAGATTTGAAGAAGCAGAAAGAGATGCTTCTGATGACAGCGGAGAAGCATGCAATGCCAATGAGAGACGTAAACCTAAACCGCAGAAAAGAGTGGTAGAGACTGAAGGCTCAAGAGATGAACTAGAAGCTCTGTATGCATTGGCTGAATTGTCAGCTTCACTTACTCCGGCTGGTTTGATGAAATTAGAATCATCTGCACAGTTGCAAAAAGAAAGAGTAGCTAACAACGTGGATGAGAAATCTAACACTCCGGAAACCGTATCTACAAGCCATACCAGAGAAAAAGCAAAACAAGCAGGACCAGAAGATAGGCTTCTACATGCGGTTTCCGCTACTGATAATAGAAAACTTACGTCTGCGCAGGAACTCGTCGATGGTAATGACGTCTCCATAGGGGAACTTGGCACCTCAAGAAGAAAACGTAAACCTCTACATAATAAGGAATTGGCTGATGTTTATCTGAAGACTCTGGTCAAAGAGAGACGTGCTGGTCAAGGACCAGCAAAACAGCTGAAAACCGCAAAGAACTCGGAAGAATTTTCTTCAACTAGCGATAAGAAAATAACTGGACCGGATGCAGTAGTGTCAGCTACACAAGTCTCAGGTTCGGGTCCAGCGAGTTTGCCGCAGAAACCACCAAACAGGCGTAAGATGAGTTTGAAGAAAAGTTTGCAAGAAAGAGCTAAAACTTACGAAACCACTCCTGAAAAGCCACATAGTTACGAAACATTTTCAGAACATGAATTATTAAAGAAGGTATCGACTTGTCTGTCATATCCATTGGTACGCCGAAGGTGCATATTTGAATGGTTCTACAGTGCTATTGACTATCCCTGGTTTGCAAAAATGGAGTTCACTGATTATCTAAATCATGTGGGACTTGGTCACGTTCCAAGACTTACTCGTCTTGAGTGGAGCGTCATCAAAAGTTCTCTTGGTAGACCTCGGAGATTCTCTGAGAGATTCATACAGGAAGAGCGGGATAAACTCAAACAATATCGTGAATCTGTGAGAAAGCATTATACAGAGCTCCGTGCAGGTGCTAGGGAAGGGCTTCCTACAGATTT
GGCCCAGCCTTTATCAGTCGGGAATAGAGTCATTGCCATCCATCCTAAAACACGGGAAATTCGTGATGGCAAGATTCTTACTGTGGATCATGACAAATGCAACGTTCTATTTGATGAAACGGGCTTTGAATTAGTTATGGACATTGATTGCATGCCTTTAAATCCATTGGAATACATGCCAGAGGGTCTGAGGAGGCAAATTGATAACTGCTTGTCAATAAGCAAAGAATCACAGCTTAATAGAAACCCAAATTCTGATGCATCTGTTCTGTTCCCACCTTCTGTGCTTGAAAATATCGACTTTTCCATGACTCCTCCCGTGAAACAGGATGATAGGGACAGGCCAGTCTCTACTGATCAATCATATAACACAAGTAATAGCAAAGCAAGAAGAGCTGAAATTCAACGAGCTCTGATGCTGAAGCATAGTTCAGATGCAGAGGAAATGGAGCCAGAAATGCTTGGAATAGTCAGTGGTTCAAGGTCAATAGCACAAGCAATGGTGGATGCAGCTATGAAGGCTGCATCTTCGGTGAAGGGTGAAGAAGACGCAGGGAACATGGTGAAACAAGCTGTAGGCTCCATTGGCGAACACCAGCCATTAGATAACTTTGTAGAGACTACCAATGGCAGCTTGGATCATCATCACCAAAGCCGGTCTCCCTCAATCACAGCAGAGCCGATGACTAAAGGATTGATCGGATCAGGAAAAAACGAAACGCAAATGGCTTCAGAGCTTATCAGCTCTTGTGTTTCAACTTGGCTCATGATCCAGATGTGCACAGAGAAGCAGTACCCACCAGCTGACGTGGCTCAGGTGATGGAGACAGCAGTGAGTAGCTTGCAGCCACGGTGTCCCCAGAACATGCCGATCTACAGAGAAATTCAGACTTGTATGGGATGGATCAAGAATCAAATCATGGCTCTTGTAAGAACATGA
Protein Sequence:
- >Thhalv10003596m|Eutrema_salsugineum|MYB_related|Thhalv10003596m
MAPSRKSKSVNKRFTNEASPEIDYRNSSKTKQRKKKLADKLGPQWTRGELERFYDAYRKHGGKWKKVAAAVRNNRSVEMVEALFYMNRAYLSLPEGTASVAGLIAMMTDHYSVIEGSESEGEGHVASGVPRKYLKRKRAKVSPSNFREEASTPHSIASAEGCLPFLKLTQAYGIERRATAKRTPRFPVPIADKRDDREDSTPRNKRAKKQLDDGDDDDDDETLALTLANASRRGGGSPYRRTKLHDSTPSGKMSQSKEAQAKLRASSMFDNGVTISRDRRHIKGSRDRDGALLMDMGGLGTVEIPQKEKNVRFEEAERDASDDSGEACNANERRKPKPQKRVVETEGSRDELEALYALAELSASLTPAGLMKLESSAQLQKERVANNVDEKSNTPETVSTSHTREKAKQAGPEDRLLHAVSATDNRKLTSAQELVDGNDVSIGELGTSRRKRKPLHNKELADVYLKTLVKERRAGQGPAKQLKTAKNSEEFSSTSDKKITGPDAVVSATQVSGSGPASLPQKPPNRRKMSLKKSLQERAKTYETTPEKPHSYETFSEHELLKKVSTCLSYPLVRRRCIFEWFYSAIDYPWFAKMEFTDYLNHVGLGHVPRLTRLEWSVIKSSLGRPRRFSERFIQEERDKLKQYRESVRKHYTELRAGAREGLPTDLAQPLSVGNRVIAIHPKTREIRDGKILTVDHDKCNVLFDETGFELVMDIDCMPLNPLEYMPEGLRRQIDNCLSISKESQLNRNPNSDASVLFPPSVLENIDFSMTPPVKQDDRDRPVSTDQSYNTSNSKARRAEIQRALMLKHSSDAEEMEPEMLGIVSGSRSIAQAMVDAAMKAASSVKGEEDAGNMVKQAVGSIGEHQPLDNFVETTNGSLDHHHQSRSPSITAEPMTKGLIGSGKNETQMASELISSCVSTWLMIQMCTEKQYPPADVAQVMETAVSSLQPRCPQNMPIYREIQTCMGWIKNQIMALVRT*
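The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal Python sketch, not part of the original record. The function names `parse_header` and `translate` are hypothetical, and so are the field names assigned to the four pipe-separated parts of the FASTA header (the record itself does not label them). The sketch assumes the standard nuclear genetic code and ignores any line breaks inside the sequence.

```python
# Standard genetic code, ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def parse_header(line):
    """Split a pipe-delimited FASTA header into labeled fields.

    The field names are assumptions made for illustration only.
    """
    text = line.strip().lstrip("- ").lstrip(">")
    gene_id, species, family, transcript_id = text.split("|")
    return {
        "gene_id": gene_id,
        "species": species,
        "family": family,
        "transcript_id": transcript_id,
    }


def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop whitespace and line breaks
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


if __name__ == "__main__":
    header = ">Thhalv10003596m|Eutrema_salsugineum|MYB_related|Thhalv10003596m"
    print(parse_header(header)["family"])       # MYB_related
    print(translate("ATGGCGCCCAGTAGGAAGTCG"))   # MAPSRKS
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence gives a quick consistency check; the trailing `*` in the protein sequence corresponds to the terminal stop codon.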