Tomato leaf curl Arusha virus
Basic Information
Genus |
Begomovirus
|
NCBI Assembly |
GCF_000868885.1 |
Isolate |
Tanzania:Kilimanjaro |
Release date |
2015/2/13 |
Submitter |
Shih,S.L., Tsai,W.S., Green,S.K., Lee,L.M. |
Host |
|
Download |
Genome
|GFF3
|PEP
|CDS |
Genomic Organization
JBrowse
Genome
ACCGGATGGCCGCGCCCCCGAAAAAGCATGGACCCCCTTGAAATATACGGAGCCAATCAGATTGCAGCCTCAATGCTTAGTTAATTTTTTTTTTGTCTTTATATACTTGGCTGTTAAGTATTAAACGCCGTCATTATGTGGGATCCGTTGGTAAATGAGTTTCCGGAGTCTGTTCACGGGTTTCATTGTATGCTTGCCATAAAATATTTGCAGGCCGTTGAAGAGTCTTACGAGCCCAATACATTGGGCCACGATTTAATTAGAGATTTAATCTCTGTAGTTAGAGCCCGGGATTATGTCGAAGCGACCCGCCGATATAATCATTTCCACGCCCGCCTCGAAGGTTCGTCGAAGGCTGAACTTCGACAGCCCTTATTCCAGCCGTGCTGCTGTCCCCATTGTCCCAGGCACAAGCAAACGCAGGTCATGGACGTACAGGCCCATGTATCGCAAGCCCAGAATGTATCGCATGTTCCGTAGTCCAGATGTTCCTCGGGGATGTGAGGGTCCCTGTAAGGTTCAGTCTTATGAGCAGAGGGATGATGTGAAGCACACCGGTATTGTTCGTTGTGTTAGTGATGTAACTAGAGGTAATGGAATTACTCATGGAGTAGGAAAACGGTTCTGCATTAAGTCCATATACATTTTAGGAAAAATATGGATGGATGAGAATATCAAGAAGCAGAATCATACTAATCAGGTCATGTTTTTTTTGGTCCGTGATAGAAGGCCCTATGGCCCAAGCCCAATGGATTTTGGGCAGGTGTTTAACATGTTTGATAATGAGCCCAGTACAGCCACTGTGAAGAATGATCTCCGAGACAGATATCAAGTTTTGCGGAAATTTCATGCAACTGTTGTCGGTGTCCCCTCTGGGATGAAAGAGCAGGCGTTACTTAAAAGATTTTTTAGAATTAATAATCATGTAGTTTATAATCATCAGGAGACTGCTAAGTATGAGAATCATACTGAGAATGCTCTGTTGTTGTATATGGCATGTACTCATGCCTCTAACCCTGTGTATGCTACGTTGAAAATACGTATCTATTTCTATGATTCAGTTGGGAATTAATAAAGTTTGAATTTTATATCATAATTTTGTTCCACCCATAAAGTGCCATTGATTACATCAAACAATACATATTCTATTGCTCTAATTACATTATTAATTGAAATTACACCAAGATTGTCTAAATATTTCCTAACTTGAGTCTTAAAGACTCTTAAGAAAAGACCAGTCTGAGGCTGTAAGGTTGTCCAGATCTTGAAGGCCATGAAACACTTGTGAATCCCCAGTTCCTTCCTTAGGTTGTGGTTGAATCGGATTTGTACTGTGATGATGTCGTGGTTGTAGTTGAACGGTCTCTTTGAGTGTTCCGTGATGCTGAAATATAGGGGATTGGCGATTTCCCAGGTATAGACGCCACTCTGTGCCTGATGCACAGTGATGAGTTCCCCGGTGCGTAAATCCATGGTTGCGACAGTTGAGCGACAAGTAGTACGAGCACCCGCAATTAAGGTCTATCCTCTTCCTCCGCTGAAGCCTCTGTTTGGCTGCTCTGTGTTGGACCTTGATGGGAACTTGAGTACAATGGCTGTTGGATGGTGAAGAAGACCGCATTTTTAATTGCCCAGGCCTTTAATGGTGCGTTCTTTTCCTCATCCAAGTACTCTTTATATGATGAAGTGGGTCCTGGATTGCAGAGGAAGATTGCCGGGATACCTCCTTTAATTTGAATTGGTTTCCCGTACTTTGTGTTGCTTTGCCAGTCCCTTTGTGCGCCCATGAATTCTTTAAAGTGCTTTAGATAATGAGGATCGACGTCATCAATGACGTTGTACCAAGCATTATTACTGTACACCTTTGGACTGAGGTCAAGATGACCACACAAATAATTGTGTGGTCCTAATGACCTAGCCCACATTGTCTTGCCGGTACGACTATCCCCTTCAATCACAATACTCATGGGTCTCAATGGCCGCGCAGCGGCAGTGACAACATTCTGTGCCGCCCACTCTTCAAGTTCCTCTGGAACTTGATCGAACGAAGAACATAAGAAAGGTGAAACATATTCCTCCAACGGAGGTGTAAAAATCCTATCTAAATTACTTTTCAAATTATGATACTGAAAAATAAAATCTTTAGGGAGTTTCTCCCTAATAATAGCCAGAGCGGCTTCAGCGGACCCTGCGTTTAATGCCTCGGCGCATGCGTCGTTAGCATTATGGCAGCCTCCTCTAGCACTTCTGCCGTCGATCTGGAATTCCCCCCATTCGAGTGCGTCTCCATCCTTGTCGATGTAGGACTTGACGTCGGAGCTTGATTTAGCTCCCTGAATGTTCGGATGGAAATGTGTTGACCTGGTTGGGGATACCAGGTCGAAGAATCTGTTATTTTTGCAGTTGAATTTTCCTTCGAACTGAATAAGCACGTGGAGATGAGGCTCCCCATCTTCGTGTAGTTCTCTGCAAATTTTGATGAATTTTTTATTTGTTGGAGTATCAGTATTTAATAATTGAGATAGTGCTTCTTCTTTATTTAGAGAGCATTTGGGATAAGTGAGGAAATAATTTTTGGAATTTATTTGGAAACGCTTAGGAGGAGGCATGTTGGTCAATGGGTACCGATTGACTCACTTGGAATGCTTCTCCTGGTATATCGGTACCCAATATATAGTGGGTACCGAATGCCAGTATTGTAATAACAAAAAGTTACTCTACCCTTATTGTCAAATTGTTAAAGCGGTCATCCGTCTAATATT
Gene Information
NCBI Accession
|
YP_001040008.1
|
Location
|
136-480 |
Gene Name
|
V2 |
Protein Name
|
pre-coat protein |
Coding Region
|
ATGTGGGATCCGTTGGTAAATGAGTTTCCGGAGTCTGTTCACGGGTTTCATTGTATGCTTGCCATAAAATATTTGCAGGCCGTTGAAGAGTCTTACGAGCCCAATACATTGGGCCACGATTTAATTAGAGATTTAATCTCTGTAGTTAGAGCCCGGGATTATGTCGAAGCGACCCGCCGATATAATCATTTCCACGCCCGCCTCGAAGGTTCGTCGAAGGCTGAACTTCGACAGCCCTTATTCCAGCCGTGCTGCTGTCCCCATTGTCCCAGGCACAAGCAAACGCAGGTCATGGACGTACAGGCCCATGTATCGCAAGCCCAGAATGTATCGCATGTTCCGTAG |
Protein Sequence
|
MWDPLVNEFPESVHGFHCMLAIKYLQAVEESYEPNTLGHDLIRDLISVVRARDYVEATRRYNHFHARLEGSSKAELRQPLFQPCCCPHCPRHKQTQVMDVQAHVSQAQNVSHVP |
NCBI Accession
|
YP_001040009.1
|
Location
|
296-1072 |
Gene Name
|
V1 |
Protein Name
|
coat protein |
Coding Region
|
ATGTCGAAGCGACCCGCCGATATAATCATTTCCACGCCCGCCTCGAAGGTTCGTCGAAGGCTGAACTTCGACAGCCCTTATTCCAGCCGTGCTGCTGTCCCCATTGTCCCAGGCACAAGCAAACGCAGGTCATGGACGTACAGGCCCATGTATCGCAAGCCCAGAATGTATCGCATGTTCCGTAGTCCAGATGTTCCTCGGGGATGTGAGGGTCCCTGTAAGGTTCAGTCTTATGAGCAGAGGGATGATGTGAAGCACACCGGTATTGTTCGTTGTGTTAGTGATGTAACTAGAGGTAATGGAATTACTCATGGAGTAGGAAAACGGTTCTGCATTAAGTCCATATACATTTTAGGAAAAATATGGATGGATGAGAATATCAAGAAGCAGAATCATACTAATCAGGTCATGTTTTTTTTGGTCCGTGATAGAAGGCCCTATGGCCCAAGCCCAATGGATTTTGGGCAGGTGTTTAACATGTTTGATAATGAGCCCAGTACAGCCACTGTGAAGAATGATCTCCGAGACAGATATCAAGTTTTGCGGAAATTTCATGCAACTGTTGTCGGTGTCCCCTCTGGGATGAAAGAGCAGGCGTTACTTAAAAGATTTTTTAGAATTAATAATCATGTAGTTTATAATCATCAGGAGACTGCTAAGTATGAGAATCATACTGAGAATGCTCTGTTGTTGTATATGGCATGTACTCATGCCTCTAACCCTGTGTATGCTACGTTGAAAATACGTATCTATTTCTATGATTCAGTTGGGAATTAA |
Protein Sequence
|
MSKRPADIIISTPASKVRRRLNFDSPYSSRAAVPIVPGTSKRRSWTYRPMYRKPRMYRMFRSPDVPRGCEGPCKVQSYEQRDDVKHTGIVRCVSDVTRGNGITHGVGKRFCIKSIYILGKIWMDENIKKQNHTNQVMFFLVRDRRPYGPSPMDFGQVFNMFDNEPSTATVKNDLRDRYQVLRKFHATVVGVPSGMKEQALLKRFFRINNHVVYNHQETAKYENHTENALLLYMACTHASNPVYATLKIRIYFYDSVGN |
NCBI Accession
|
YP_001040010.1
|
Location
|
1069-1473 |
Gene Name
|
C3 |
Protein Name
|
replication enhancement protein |
Coding Region
|
ATGGATTTACGCACCGGGGAACTCATCACTGTGCATCAGGCACAGAGTGGCGTCTATACCTGGGAAATCGCCAATCCCCTATATTTCAGCATCACGGAACACTCAAAGAGACCGTTCAACTACAACCACGACATCATCACAGTACAAATCCGATTCAACCACAACCTAAGGAAGGAACTGGGGATTCACAAGTGTTTCATGGCCTTCAAGATCTGGACAACCTTACAGCCTCAGACTGGTCTTTTCTTAAGAGTCTTTAAGACTCAAGTTAGGAAATATTTAGACAATCTTGGTGTAATTTCAATTAATAATGTAATTAGAGCAATAGAATATGTATTGTTTGATGTAATCAATGGCACTTTATGGGTGGAACAAAATTATGATATAAAATTCAAACTTTATTAA |
Protein Sequence
|
MDLRTGELITVHQAQSGVYTWEIANPLYFSITEHSKRPFNYNHDIITVQIRFNHNLRKELGIHKCFMAFKIWTTLQPQTGLFLRVFKTQVRKYLDNLGVISINNVIRAIEYVLFDVINGTLWVEQNYDIKFKLY |
NCBI Accession
|
YP_001040011.1
|
Location
|
1214-1621 |
Gene Name
|
C2 |
Protein Name
|
transcriptional activation protein |
Coding Region
|
ATGCGGTCTTCTTCACCATCCAACAGCCATTGTACTCAAGTTCCCATCAAGGTCCAACACAGAGCAGCCAAACAGAGGCTTCAGCGGAGGAAGAGGATAGACCTTAATTGCGGGTGCTCGTACTACTTGTCGCTCAACTGTCGCAACCATGGATTTACGCACCGGGGAACTCATCACTGTGCATCAGGCACAGAGTGGCGTCTATACCTGGGAAATCGCCAATCCCCTATATTTCAGCATCACGGAACACTCAAAGAGACCGTTCAACTACAACCACGACATCATCACAGTACAAATCCGATTCAACCACAACCTAAGGAAGGAACTGGGGATTCACAAGTGTTTCATGGCCTTCAAGATCTGGACAACCTTACAGCCTCAGACTGGTCTTTTCTTAAGAGTCTTTAA |
Protein Sequence
|
MRSSSPSNSHCTQVPIKVQHRAAKQRLQRRKRIDLNCGCSYYLSLNCRNHGFTHRGTHHCASGTEWRLYLGNRQSPIFQHHGTLKETVQLQPRHHHSTNPIQPQPKEGTGDSQVFHGLQDLDNLTASDWSFLKSL |
NCBI Accession
|
YP_001040012.1
|
Location
|
1515-2609 |
Gene Name
|
C1 |
Protein Name
|
replication-associated protein |
Coding Region
|
ATGCCTCCTCCTAAGCGTTTCCAAATAAATTCCAAAAATTATTTCCTCACTTATCCCAAATGCTCTCTAAATAAAGAAGAAGCACTATCTCAATTATTAAATACTGATACTCCAACAAATAAAAAATTCATCAAAATTTGCAGAGAACTACACGAAGATGGGGAGCCTCATCTCCACGTGCTTATTCAGTTCGAAGGAAAATTCAACTGCAAAAATAACAGATTCTTCGACCTGGTATCCCCAACCAGGTCAACACATTTCCATCCGAACATTCAGGGAGCTAAATCAAGCTCCGACGTCAAGTCCTACATCGACAAGGATGGAGACGCACTCGAATGGGGGGAATTCCAGATCGACGGCAGAAGTGCTAGAGGAGGCTGCCATAATGCTAACGACGCATGCGCCGAGGCATTAAACGCAGGGTCCGCTGAAGCCGCTCTGGCTATTATTAGGGAGAAACTCCCTAAAGATTTTATTTTTCAGTATCATAATTTGAAAAGTAATTTAGATAGGATTTTTACACCTCCGTTGGAGGAATATGTTTCACCTTTCTTATGTTCTTCGTTCGATCAAGTTCCAGAGGAACTTGAAGAGTGGGCGGCACAGAATGTTGTCACTGCCGCTGCGCGGCCATTGAGACCCATGAGTATTGTGATTGAAGGGGATAGTCGTACCGGCAAGACAATGTGGGCTAGGTCATTAGGACCACACAATTATTTGTGTGGTCATCTTGACCTCAGTCCAAAGGTGTACAGTAATAATGCTTGGTACAACGTCATTGATGACGTCGATCCTCATTATCTAAAGCACTTTAAAGAATTCATGGGCGCACAAAGGGACTGGCAAAGCAACACAAAGTACGGGAAACCAATTCAAATTAAAGGAGGTATCCCGGCAATCTTCCTCTGCAATCCAGGACCCACTTCATCATATAAAGAGTACTTGGATGAGGAAAAGAACGCACCATTAAAGGCCTGGGCAATTAAAAATGCGGTCTTCTTCACCATCCAACAGCCATTGTACTCAAGTTCCCATCAAGGTCCAACACAGAGCAGCCAAACAGAGGCTTCAGCGGAGGAAGAGGATAGACCTTAA |
Protein Sequence
|
MPPPKRFQINSKNYFLTYPKCSLNKEEALSQLLNTDTPTNKKFIKICRELHEDGEPHLHVLIQFEGKFNCKNNRFFDLVSPTRSTHFHPNIQGAKSSSDVKSYIDKDGDALEWGEFQIDGRSARGGCHNANDACAEALNAGSAEAALAIIREKLPKDFIFQYHNLKSNLDRIFTPPLEEYVSPFLCSSFDQVPEELEEWAAQNVVTAAARPLRPMSIVIEGDSRTGKTMWARSLGPHNYLCGHLDLSPKVYSNNAWYNVIDDVDPHYLKHFKEFMGAQRDWQSNTKYGKPIQIKGGIPAIFLCNPGPTSSYKEYLDEEKNAPLKAWAIKNAVFFTIQQPLYSSSHQGPTQSSQTEASAEEEDRP |
NCBI Accession
|
YP_001040013.1
|
Location
|
2195-2452 |
Gene Name
|
C4 |
Protein Name
|
C4 protein |
Coding Region
|
ATGGGGAGCCTCATCTCCACGTGCTTATTCAGTTCGAAGGAAAATTCAACTGCAAAAATAACAGATTCTTCGACCTGGTATCCCCAACCAGGTCAACACATTTCCATCCGAACATTCAGGGAGCTAAATCAAGCTCCGACGTCAAGTCCTACATCGACAAGGATGGAGACGCACTCGAATGGGGGGAATTCCAGATCGACGGCAGAAGTGCTAGAGGAGGCTGCCATAATGCTAACGACGCATGCGCCGAGGCATTAA |
Protein Sequence
|
MGSLISTCLFSSKENSTAKITDSSTWYPQPGQHISIRTFRELNQAPTSSPTSTRMETHSNGGNSRSTAEVLEEAAIMLTTHAPRH |