Annotated TAIR ID | ATMG00750.1 |
Annotated Gene Name | GAG/POL/ENV polyprotein |
Annotated Gene Symbol(s) | ORF119 |
Annotation Type | Ortholog |
Chromosome Position | chrM:220830-221189 | Forward length=119 |
E-Value | 4.3E-21 |
Replicate 1 | 1.5491368099556 |
Replicate 2 | 1.8900577545153 |
Replicate 3 | 2.3699067784528 |
Replicate 1 | 2.36558420871 |
Replicate 2 | 3.5750730116202 |
Replicate 3 | 2.5869315325101 |
Replicate 1 | 2.8670659825656 |
Replicate 2 | 2.9994390938494 |
Replicate 3 | 3.1995015143228 |
>S_Transcript_178539.p1 GENE.S_Transcript_178539~~S_Transcript_178539.p1 ORF type:complete len:1821 (-),score=207.02,RVT_1|PF00078.23|3.8e+03,RVT_1|PF00078.23|2e-24,rve|PF00665.22|4.7e-18,Retrotrans_gag|PF03732.13|1e-16,gag-asp_proteas|PF13975.2|1e+04,gag-asp_proteas|PF13975.2|1.3e-08,Asp_protease_2|PF13650.2|1.7e-05 S_Transcript_178539:122-5584(-)
ATGATCACAAGAAGCCAAGGGCAACAAGATCTTATTCCCTTGAATCTTGAGATTGAAGCTGAAGCCCGGCGACTTAGCTCTAGCCAAAGGAGACGAAGAAGGAGACAGTTAGCGCAACAACAAGCTATGGCCGACCAACCTATGAGGAATTACGGCGTCCCTAGTCTTACTTCAACAGGGAACGCTATATCTCGGCAAACTACTGAGAATCACACCTTTGAATTGAAACCAGCTCTTATTAATCTTATCAAAGATGATCGATTCTCCGGCCTGCCTCATGAAATGCCTTTGGATCATCTAAACACTTTCGTTGAAAAATGTGATACCATCAAAATAAATGGAATCACTGATGATATGATCCGGTTAAAGTTATTCCCTTTCTCTCTCATGGGGAATGCGAGGACTTGGTTGCAAAACAAGCCCCCGAACACCTATGCTGATTGGGAATCGCTGGCCAAGGCATTCCTTGAGAAGTTTTACCCTCCCACGAAGACCGAGAGGTATCGTGAGGAAATTCTGCAATTCCGCCAACAACCGACCGAGACCTTGAGTGAGGCTTGGGAAAGATTTAAGGAGCTCCTACGAAAATGCCCCCATCATGGACTACCAGATTTCCTTATCCTTTCCAGCTTCACCAAACATCTTAGCGCGGATGACAAGAGGGCACTGGACTCGGCATCAAACGGATCATTTATGGGGAAAACTTATGCTCAAGGCAAAGCACTCATAGAAGAGCTAGCAACAAGCTCATTTTCTTGGAGCGAGAGGAACACTCCCCCTCCTAAGGGAGGAGGTAACATTGATTTAGATTCTATTCAAATACTTGCTGAAAAGTTAGATTCTTTAACTTTGAAAGTCGAAAGACTACAATCTATTTCAACCCCCTCACAAGAAAGTGTGAAAATGTTGCGTAATGTACAATCATATGGACATGCCAATCCCTCTTCCCCATTTGATCCCAATATTTCTCAAGTTAATGCTGTACAAAATGCCCCCATGATGACCCCCTTGTGTGAAATATGTGGTATTTATGGTCATTTTCCTGCTGAATGTAATCTTGTTGTTAGTGAGAATGTGAATGCTGTTACTCAATACCAAGGTAGACCCCAAGGGAACCCCTACCCCAACTCTTTCAACACACAATACAGAGGGAATCCCCAAGGGTACAACTACCAATCACAACAACAACAATACCATCGACCCCCACCATACCCTCAACAACCCACATATTATCAACAACCTCCACCACAGCAAATACCCAGCCAACAAACTCCTCCCGGTTTCCAACAAACGAGACAAAATCTCCCATCTTCTTCAAACTCTCGAAGAGACAATTCTTTAGACCAAAAGCTCGAGAGTATATTGGAAACCCTTAAGCAAATGCAAACACATAGCAAAATGGTGGATAACCAAATTGCTCAACTTGCTCAACCCTCTAGACCTCAAGCACTCCCAGCTCAACCTACTCAACCCAATAACGAACATGTGAATGCGGTAACGCTTAGGAGTGGAAAGGCCCTTGAAGATCCTAAGTCTAAACCTAGACCCTCTAGTGCTTTACCCAATTCACCTAATGAGGAGCCTAGCGTGATTGAGGAAGATTCTAATGTTGAGGGGAACCTTAAGGAGAGTCATAACCTTTCTAGTCCCCCTAGAGAACTAGCACCCTCTTTGCCTTTCCCTCATAGACAGGTTGACACCAAGGTTGACAAGCAATTTAGTCGATTTGTGGAAATTCTAAAAAAGTTGCAGTTAAATGTGCCTTTCACTGATATGTTCACCCAAATGCCCACCTATGCTAAATTTTTGAAAGACATTCTCACTAGAAAACGTAGTTTAGAGGCAGTTGAGACAGTAGCCTTCGCTGAACAATGTAGCAACATCCTCTTAAACAAGGCCCCTCCTAAGTTGAAAGACCCAGGCACTTTTGCTATCCCATGTGTTTTGGGTGATTTTAGAGTTGAGAATGCTTTGTGTGATTTGGGTGCTAGTGTTAGTGTTATGCCACTTTCAATTTGTAAGAAGCTTGACCTTGGAGAGATTAAGGTCACCTCCATGACCCTCCAAATGGCGGATCGCTCCGTCAAACACCCTATAGGAGTCTTGGAGGACGTACCCGTCCGTGTAGGGAAATTCTACTTTCCCGTAGACTTTGTGGTCTTAGAGATGGATGGGGATTCTCAAATCCCCATAATTCTTGGACGACCTTTCTTGGCTACAGCTGGGGCCTTGATTGATGTGAGAAATGGTAAAATCTCCTTGCAAGTGGGTGATGAAAAGCTTGAATTCTTACTACCTAATGCTATGCAACACCCCTCTTCCCTTGATTGTTGTTATAGGATTGATGTGCTTGATGAAGTATTACAGGAATTTGAACCCTTGCATATATGTGGCGAGCCGTTGAACCCCTTCTTCAACAATGAAACAATCAAGGAGAGTGATGAATATATTTGTCATGTAGCACAGGTAATGGATTCTGCACCCATTGAGAAGGATGACTCACCAATGGAACAACCTGTGGAAGACCCTATTCCACAAAGACAAGGTAACACTTCCTCTCTTGAACTTAAGCCCCTTCCCCCTTCTCTCAAGTATGCGTATTTGGATTCCGAGCATAAATTCCCAGTTATCGTTAACGCTAAACTCAATAGCCCCGAGCTTTCTAAGTTGTTGTCGGTTCTCCGAAAACACAACAAAGTTATTGGGTATAGCATCGACGATCTCACGGGAATAAGCCCCTCTTTATGTATGCATCGTATCTCTTTGGAAGACTGTTCTAAGTCTTCCATTGAGCATCAAAGGCGGCTCAATCCTAATATGCAAGAAGTGGTCCGTAAAGAAATTTTGAAACTACTTGAAGCTGGAATCATCTATCCTATCTCGGATAGTAAATGGGTGAGTCCCGTTCAAGTAGTTCCTAAAAAGGGAGGAATGACGGTGATTCAAGGGGATAACAACACATTGATCCCAACTCGTCTTGTTACGGGGTGGAGGATGTGTATTGATTATCGCAAGTTGAATTCCGTCACCCGCAAGGACCACTATCCTCTTCCATTCATTGATCAAATGCTTGAACGACTGGCTAAGCACTCTCATTTTTGCTATCTTGATGGTTACTCGGGATTTTTCCAAATCCCAATACATCCCGAGGACCAAGAGAAGACGACATTTACTTGCCCTTATGGCACCTTCGCTTATCGAAGGATGCCATTCGGGTTATGCAATGCACCCGCGACATTTCAACGGTGTATGATGTCTATCTTTTCTGATTTTGTTGAGAAAATCATGGAAGTCTTCATGGACGACTTTAGCGTCTATGGGACTAATTTTGATGAATGTTTAGCCAATCTTTCACGAGTCCTTGAACGATGTGAGGAGGTAAACTTGGTTTTGAATTGGGAGAAGTGTCATTTTATGGTTCAAGAGGGAGTCGTGTTGGGTCATATTGTGTCTAGTCGGGGTATCGAAGTCGATAGAGCTAAGGTTGAGGTGATCGAAAAACTTCCACCTCCCACTAATGTCAAAGGGGTGAGAAGTTTCCTTGGTCACGCCGGTTTTTACCGACGTTTTATCAAGGACTTTTCAAAGATCGCAAAGCCTCTAACCCAGCTCCTCGTCAAAGATACTCCTTTCATCTTTTCTAATGATTGTCTTGAAGCTTTTTCCAGTTTAAAGAAGGCCCTTGTGTCTGCGCCGATTGTACAACCGCCCGATTGGAATCTACCTTTTGAGATTATGTGTGATGCCAGCGATTTTGCAATTGGGTCGGTGTTAGGGCAACGTAAAGATAAAAAGCTTCACGCCATTTATTATGCCAGCAAAACTTTGGATGGGACACAGGTTAACTACACCACAACCGAGAAGGAACTTCTAGCAGTTGTTTATTCATTGGATAAATTTCGGCCTTATCTTATTGGGTCGAAGGTTATTGTTCATACAGACCATGCTGCTCTCAAGTATCTACTGTCCAAAAAGGAATCCAAACCTCGGTTAATCCGTTGGATACTTGCTCTTCAAGAATTCAACCTTGAAATCAAAGACAAGTGTGGAGCCGAAAATGTGGTTGCAGATCATCTATCAAGACTTCCTTTCCCCGGTTCTTCTAACGACAGTCCCATCAATGATTCAATACCGGGAGAACATCTTCTTTCCATTCACTCTAATCAAATTCCATGGTATGCGGATATCGCCAATTATCTTGCTTGTGGCGCCCAACCTATTGGCTATTCCTACCAACAACGCAAGAAATTTTTCCATGATGTTCGTCACTACTTATGGGATGACCCTCTACTCTTCAAGCGGTGTAGTGATGGGATTATTAGAAGATGTGTGCCCGAATTTGAGGTCCCGAGTATCCTATCACATTGCCACGAGCAACCTTGCGGAGGTCATATGGGCACCTCAAAAACATGTGCTAAGATCCTCCAATGTGGATTTTATTGGCCAACTCTATTTCGGGACGCTAATACCTTCGTCAAGAGTTGCGATCGTTGTCAACGAGTTGGCAATATCACTAAACGGCATGAGATGCCCCTTACTAACATACTTGAGGTTGAAATCTTTGATGTTTGGGGCATCGACTTCATGGGACCCTTTCCCTCATCTTTTGGAAACAAATTTATTTTGGTAGCCATTGATTACGTGTCTAAGTGGGCGGAGGCCATGGCATCTCCGACCAACGACTCACGGGTAGTGGCTAAACTTTTCAAGAAGATTATATTTCCAAGATTCGGGGTTCCACGAGTCGTTATTAGTGATGGTGGGTCTCACTTTCGAGAAAAGAGTCTTGAAAAACTGTTAGCAAAATATGAGGTGAATCACCGAATTGGCCTTGCTTATCATCCCCAAACGAGTGGCCAAGTTGAAGTTACAAATAGAGAATTGAAACACATTCTTGAGAAAACGGTGCAAACAAGAAAAGATTGGTCTCTATGTTTGGATGATGCCCTATGGGCTTATCGGACGGCATTTAAAACACCTATAGGAACCACACCATACCGGCTCCTATATGGTAAATCATGTCATTTGCCGGTCGAGTTGGAGCACAAGGCTTATTGGGCGATAAAGGCCCTGAACCTTAACCTAAAAGATGCGGGAGCCAAAAGGCTTTTGGACCTTAACGAGCTTGATGAAATCAGATTTGACGCTTATGAAAATGCCAAACTCTACAAGGAGAAGACTAAGAAATGGCATGATCAAAGGATCACCTCCCGGGAGTTCAAGGAGGGTGACCAAGTCCTTCTCTATAACTCTAGACTCAAGCTCTTTCCCGGGAAGCTCAAATCCCGATGGTCCGGCCCCTTCCTTGTACACAAAGTTTACCCTCATGGAGCCATTGCTATTGGAAAGGAAGGTAGCGACGTCTTTAAGGTTAATGGCCATCGCCTCAAGCATTATCATGTTGGCACCCCATTGGGCCGAGTGACTACGGTGTCACTCCTTGACCCCCCTCCAATTGTCTAA
Click here to download the sequence:
Link to NCBI Nucleotide BLAST
>S_Transcript_178539.p1 GENE.S_Transcript_178539~~S_Transcript_178539.p1 ORF type:complete len:1821 (-),score=207.02,RVT_1|PF00078.23|3.8e+03,RVT_1|PF00078.23|2e-24,rve|PF00665.22|4.7e-18,Retrotrans_gag|PF03732.13|1e-16,gag-asp_proteas|PF13975.2|1e+04,gag-asp_proteas|PF13975.2|1.3e-08,Asp_protease_2|PF13650.2|1.7e-05 S_Transcript_178539:122-5584(-)
MITRSQGQQDLIPLNLEIEAEARRLSSSQRRRRRRQLAQQQAMADQPMRNYGVPSLTSTGNAISRQTTENHTFELKPALINLIKDDRFSGLPHEMPLDHLNTFVEKCDTIKINGITDDMIRLKLFPFSLMGNARTWLQNKPPNTYADWESLAKAFLEKFYPPTKTERYREEILQFRQQPTETLSEAWERFKELLRKCPHHGLPDFLILSSFTKHLSADDKRALDSASNGSFMGKTYAQGKALIEELATSSFSWSERNTPPPKGGGNIDLDSIQILAEKLDSLTLKVERLQSISTPSQESVKMLRNVQSYGHANPSSPFDPNISQVNAVQNAPMMTPLCEICGIYGHFPAECNLVVSENVNAVTQYQGRPQGNPYPNSFNTQYRGNPQGYNYQSQQQQYHRPPPYPQQPTYYQQPPPQQIPSQQTPPGFQQTRQNLPSSSNSRRDNSLDQKLESILETLKQMQTHSKMVDNQIAQLAQPSRPQALPAQPTQPNNEHVNAVTLRSGKALEDPKSKPRPSSALPNSPNEEPSVIEEDSNVEGNLKESHNLSSPPRELAPSLPFPHRQVDTKVDKQFSRFVEILKKLQLNVPFTDMFTQMPTYAKFLKDILTRKRSLEAVETVAFAEQCSNILLNKAPPKLKDPGTFAIPCVLGDFRVENALCDLGASVSVMPLSICKKLDLGEIKVTSMTLQMADRSVKHPIGVLEDVPVRVGKFYFPVDFVVLEMDGDSQIPIILGRPFLATAGALIDVRNGKISLQVGDEKLEFLLPNAMQHPSSLDCCYRIDVLDEVLQEFEPLHICGEPLNPFFNNETIKESDEYICHVAQVMDSAPIEKDDSPMEQPVEDPIPQRQGNTSSLELKPLPPSLKYAYLDSEHKFPVIVNAKLNSPELSKLLSVLRKHNKVIGYSIDDLTGISPSLCMHRISLEDCSKSSIEHQRRLNPNMQEVVRKEILKLLEAGIIYPISDSKWVSPVQVVPKKGGMTVIQGDNNTLIPTRLVTGWRMCIDYRKLNSVTRKDHYPLPFIDQMLERLAKHSHFCYLDGYSGFFQIPIHPEDQEKTTFTCPYGTFAYRRMPFGLCNAPATFQRCMMSIFSDFVEKIMEVFMDDFSVYGTNFDECLANLSRVLERCEEVNLVLNWEKCHFMVQEGVVLGHIVSSRGIEVDRAKVEVIEKLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLTQLLVKDTPFIFSNDCLEAFSSLKKALVSAPIVQPPDWNLPFEIMCDASDFAIGSVLGQRKDKKLHAIYYASKTLDGTQVNYTTTEKELLAVVYSLDKFRPYLIGSKVIVHTDHAALKYLLSKKESKPRLIRWILALQEFNLEIKDKCGAENVVADHLSRLPFPGSSNDSPINDSIPGEHLLSIHSNQIPWYADIANYLACGAQPIGYSYQQRKKFFHDVRHYLWDDPLLFKRCSDGIIRRCVPEFEVPSILSHCHEQPCGGHMGTSKTCAKILQCGFYWPTLFRDANTFVKSCDRCQRVGNITKRHEMPLTNILEVEIFDVWGIDFMGPFPSSFGNKFILVAIDYVSKWAEAMASPTNDSRVVAKLFKKIIFPRFGVPRVVISDGGSHFREKSLEKLLAKYEVNHRIGLAYHPQTSGQVEVTNRELKHILEKTVQTRKDWSLCLDDALWAYRTAFKTPIGTTPYRLLYGKSCHLPVELEHKAYWAIKALNLNLKDAGAKRLLDLNELDEIRFDAYENAKLYKEKTKKWHDQRITSREFKEGDQVLLYNSRLKLFPGKLKSRWSGPFLVHKVYPHGAIAIGKEGSDVFKVNGHRLKHYHVGTPLGRVTTVSLLDPPPIV*
Click here to download the sequence:
Link to NCBI Protein BLAST