Submission Results
Sequence Name: Not Available
GenBank Accession Number: UVIR01000003.1
GenInfo (GI) Number: 0000000000
Download Results: UVIR01000003.1.PHASTER.zip
gi|00000000|ref|UVIR01000003.1| Burkholderia pseudomallei strain A-281-02-2-07 genome assembly, 374054, gc%: 68.32%
Download summary as .txt file: summary.txt
Total: 1 prophage region has been identified, of which 0 are intact, 1 is incomplete, and 0 are questionable.
Region | Region Length | Completeness | Score | # Total Proteins | Region Position | Most Common Phage | GC % |
---|---|---|---|---|---|---|---|
1 | 27.3Kb | incomplete | 30 | 7 | 95942-123324 | PHAGE_Bacill_vB_BanS_Tsamsa_NC_023007(1) | 65.09% |
>1 95942-123324 CAACGCACGCGCCGCCGTCCCGACAGGGCCCCAAAAGTCTCAGCCCGCGCGTTCCAATGAATGGAACGCA CGGGCTGAATGGACTGCGAGGCGAATGGGCGCGCGCCGCGCTCGCGTTCCGCGACGCGCGACGGTCGGCC GCGCCGCTCAACGCCCCTTGTAGACCGGAGCGCCGTCGGCGATGCGCTTGACCTTGTTCGGCGAGCCCGA TTCGGCCGCGGCCGCCGGCTGCGCGCCGTAGCCGCTCGTATCGGCTTGCGCGGCGACGTTTTGCTGGCCG TGGATCTTGGCTTCGGCGGCCTGGATTTCCTCGGGGTAGTTCACGCGGTCGGAAGCCGGGTTGTAGCCTG CTTGCTCGAGCTGGATGAGCTCGGCCCTGACTTGCGCGCGGGTCACCGGGCCATTCGATTGCTGGGCGAA CGAAACGGCCGGAACGGCGAGCGCGGCGGCGACTGCGACGGTAGCGATAAGCGATTTCATGATCAAACCT CCGTGATTTTTTGCTTCGCCGCAAGCACCATGCGAGCAGCGATTGATCACAGTCTAGTCACGGCGGCACT TAGGGAAAACCATGTATCAACGAAAACACTGTTTCGCGACCGATAACAATCGCCGTTACATCCCTAGAAA AACCCTACTAATAACGTAGGTGAGCGCAACAATTGCCGCGCTCGGCGGTCACTCGGCCCAACTCGCGTGG AAACTGCCCGGCTTGTCCGTGCGCTCGAACGTATGCGCGCCGAAGAAATCGCGCTGCGCCTGCACGAGGT TCGCGGGCAGACGCGCGGACCGGTAGCTGTCGAAGTACGCGACCGCCGACGCGAACGCCGGCACGGGCAC GCCCGCCTTCACCGCGGCGATCACGACGTCGCGCAGCGCGGACTGATAGTTCGCCGCGATGTCCTGGAAA TACGGGTCGAGCAGCAGGTTCGCGAGCGCGGCGTTCTTCGCATACGCATCGGTGATCTTCTGCAGGAAAC GCGCGCGGATGATGCAGCCCGCGCGGAAGATCTTCGCGATCGTGCCGAGATCGAGGTTCCAGCCGTATTC CTTCGACGCCGTATCGAGCTGCGCGAAGCCCTGCGCGTACGAGATCACCTTCGACAGATACAGCGCGCGG CGCACCGACTCGACGAACGCGGCGCGATCGCCTTCGAGCGGCGCGGGCGACGGCCCGCTCAACACCTCGC TCGCCGCGACGCGCTGCGCCTTCAGCGACGACAGCACGCGCGCGAACACCGATTCGGTGATGAGCGGCAA CGGCACGCCGAGGTCGAGCGCGTTCTGGCTCGTCCACTTGCCGGTGCCCTTCTGCGCGGCGCGGTCGAGG ATCACGTCGACGAGATGCTCGCCCGTTTCGTCGTCCTTCTTGCCGAAGATCTTCGCGGTGATCTCGATCA GGTAGCTGTCGAGTTCGCCCTGGTTCCATTCGGCGTAGACCGCGCCCAGCTCGTCGTTCGTCAGGCCCGC GACCTGCTTGAGCACCGCGTAGCTCTCGGCGATCAGTTGCATGTCGCCGTACTCGATCCCGTTGTGGACC ATCTTCACGTAATGGCCCGCGCCGTCCGGCCCCATGTACGCGACGCACGGCTCCCCGTCCGCCGGCGCCT TCGCGGCGATCTGCTCGAGGATCGGCTCGACGAGATCGTAGGCGTCGCGCTGGCCGCCCGGCATGATCGA CGGCCCGCGCAGTGCGCCCTCCTCGCCCCCCGACACGCCCGTGCCGATGAAATGCAGCCCCGCCTGCGCG AGCTCCTGGTTGCGGCGAATCGTGTCGGTGAAATGCGTGTTGCCGCCGTCGATCAGCACGTCGCCCTTGT CGAGCAGCGGCTTGAGCGCCGCGATCGTCGCGTCGGTCGCTTCGCCCGCCTTCACCATCAGGAGGATGCG 
CCGCGGCGTCTCGAGCGACGCGACGAACGCCTCGAGCGTATGGGCCGGCACGAGCTTGCGATCGGGAAAT TCGGCGATCAGTTCGTCGGTCTTCTCGCGGCTGCGGTTGTAGACCGACACCGCATAGCCACGGCTTTCGA TATTCAAGGCCAGATTGCGCCCCATCACCGCGAGACCGACCACACCGATTGCTTGTTTGCCCATTGTTCA ATTCTCCGGAAAAAACAGGGCGCGACCCACCGGGGTCGCGCGCATAGGCGCAAGGATAAGGGAAACCACG CGAGCGCGCCGCGAATGAAAGACGCTCGAAGGCGATTGGACGGCGGTTCGGTTTGACGTCCGACGGCCGG TCAATGTGGATTCGGCGACGGATCGACGGCATTCGCCGTCGAGGCGCACGAGTTTGGCCCGGGTGTAAGC GGCAAGCGGAAAAAACGTCGATCCGGCGGCGTTCCTGCGTGGATTCGGGATGCCGCGCAAGCGGTGCCCG TCGATCGGATTGCGCCGCCGGTGCGGCGAGCGCGGTTTGTGCGTGTCGCAGGGGTTGCGACCACGGGTTC GGCTCGTACGTTTCGTGCGTTTCGTGCGCTTCGTGCGTCGAATGCCAATCAACGCCCGCGCGAGCGCCGC GCATGCCCCGTCCGTCGGGCGCGCCCGCCTACGCGCGCTTTCGGAACACGACGCTCAGATTGTTCGCCGG CATCTCGACGACCTCGTCGAGCGCGAGGCCCGACGCGCCGCCGAGTTCGACCACCGCTTCCAGATCGCGC ACGCCCCACGCCGGATTGCGGCGGCGCAATTGCGCATCGAACTGCGCGTTCGATTCAGTGGTGTGCGCGC CGCCTCGGCGATACGGCCCGTACAGGTACAACACGCCGCCGTCGGGCAACGCGCGCGCCGCGCCGACGAA GAGCGCGCACACAGCTGCCCACGGCGCGATGTGGATCATGTTGACGCAGACAATCGCGTCGGCGGCGGCG AGCGGCCACGGCTCGACACACACGTCGAGCGCGAGCGGCGCGCGCAGGTTCGGCAGGGCGGCGTCGGCCA CCCACGCGGCGATCGACTCGCGCGCGGCGGCGTCGACGTCGGTCGGCTGCCAGACGAGATCGGGCAGCGC GGCCGCGAAGTGAACCGCATGCTGGCCGGTGCCGCTCGCGATCTCGAGCACCGCGCCGCGCGCGGGCAGT ACGCGCTTCAGCACGGCGAGAATCGGCTCGCGATTGCGTTCTGCGGCGGGCGCCCATTGGCGCGAAGCGG AATCGATCGGCATGACGAAATGAATCGGGAAGATCGAATCAGACGTGAGGCGAGCCGTGCGCGTCGCGCT GCGCAAGCCCGAGACGTGCGATGAGCGCGGCGAGCGCCTGCCCGCACGACGCCTCGACCTTCAGCGTCAG CAGCGGATCGGCGCGGGTGTGCCCGAGGTTGATTGCGACGATCGGCTTGCGCTGGTCGTTCGCCCACACG CAGAACCGATAGCCGGAATACACCATCAGCGACGAGCCGACGACGAGCATCGCGTCGGCGTCGTCGAGCG CGCGCGCGGCGGCCGCGACGCGCTCGCGCGGCACGTTCTCGCCGAAAAATACGACGGCCGGCTTCAGCAG CCCGCCGCACGCGGGGCATGCGGGCACGCGAAAATGGTCGAGCGCGCTCCATTCGAGATGCGCGTCGCCA TCGGCGGCGGGCTGCGCTCGAGCGCCGAGCAGCTCGGGGTTCTCCGCTTCGAGAACGTGCTGGATCGACG CGCGCGCGTGATGCGCACCGCAATCGAGACATGTGACACCGCCGATGCCGCCGTGCAGCTCGATGACGTC GACGCTGCCCGCGCGCTGATGCAGCCCGTCGACGTTTTGCGTGACGAGCCGCCCGATGCGTGCCGCCGCG CCGAGCCGCGCGAGCGCATGATGCGACGCGTTCGGCTGCGCGCGCCCGACCACGGGCCAGCCGAGCATGC 
TGCGCGCCCAGTAGCGGCGGCGCGCATGATCGGAATCGAGAAACTCGCGATACTGAATCGGCTGCGAGCG CATCCATGCGCCGTTGCGGTCGCGATAGCCGGGAATGCCGGAATCGGTACTGATGCCCGCGCCCGTGAGC ACGAACAGGCGCGGATGCCGCTCGACGAACGCATGCAGCGCATCGAGCGCGCGCGGATCGGCAAACGGCG CCGCGTCCGGCGACGACGATGACGACGGTGGCTGCGGCGAAACAACGGCGGAAACGGTCATGGCAATGAG GGCTCGTCCGGCAGGATCAGCCGCTCGGGGACATGCGCGGTCTGCCTGCGTGCCGTCACATAATACAGAA CGCCGGGCACGACGAGGCCGATGATCCACGAGATGTCGGTGCCGCCGAGCTTCGCGACGAGCGGCCCGGT ATAGAACTCGGTCGCGATGAACGGCATCTGCACGAGCACGCCGATCGCGTAGATCGATACGCCCGCGACG TTCCAGCGGCCGTAGCGGCCGTCGGGATCGTAGAGCGCGGGCACGTCGTAGCGCTCCCTCGTCACCCAGT AGAAATCGACCAGGTTGATCGCGCTCCACGGCGTGAAAAATGCGAGCAGGAACAGGATGAACGCGGAGAA ATCCTTCAGGAATGCGTGACGACCGACGAGCGCGAGCCACGCGACCGCGGCGACCATCGCGAGCACATAG GCCATTCGCGCGCGCTGCGTGATCTGCGCGCGGCCCGAGAAGCCGGTGACGATCGTCGCGACCGACATCA CGCTGCCGTATGCGTTGAGCGTCGTGATCGTCAGCTTGCCGAACGCGATCGCGAAATAGAGCAGCGCGGC CGTCGCGCCGCTCGCGCCGAGGCTCACGATGAACTGCACCTCGTGCCCGGCGAACTGCTTGCCCGCGAGC GCCGCCGCGAACACGCCGAACGCCATCGACGCCTGCGCGCCGAGCACCGAGCCAAGACCGATCGCCCAGA ACGTGCGCCGCGCCGACGTCGAGCGCGGCAGATAACGAGAATAGTCGGCGACGTACGGCCCGTACGCGAT CTGCCACGAGGCGGACAGCGAGATCGCGAGCAGGAAGCTCGACAGCGAGAAATGCCGGATCGCGAGCAGC GCGCCCACGTCGTGGCCGGCGAGGAGCCGCACGAACATGTAGACGAACGCGACGACGCCGACGATGCTCG ATACGCGGCCGAGCGCGTGAATCGTCCGATAGCCGAACACCGCGAGCACGACGACGACCGCGATGAACGC GAAGATGCCGACCGGATCGGCGACGCCGAACAGTTGCGCGAGCGCCTGCCCGGCGAGCACGGTGCCGCTC GCGGAGAAGCCGACGTACATCAGGCAGACCAGCACGAGCGGAATCACCGCGCCGTAGATGCCGAACTGCA CGCGGCTCGAGATCATCTGCGGCAAGCCGAGTTGCGGCCCCTGCGCGCCGTGCAGTGCCATCACGGCTGC GCCCAGCAGTTGACCGAGCGCGAGGCCGACGAGCGACCAGGACACGTCGCCGCCCAGCACGACCGCGAGC GCGCCGGTGACGATCGCCGTCACCTGCAGGTTCGCGCCGAGCCATAGCGTGAACTGGCTCGAGAGCTTGC CGTGGCGTTCGTGGTCGGGGATGTAGTCGATCGTGCGGCGTTCGCGCAGCGCTCGGCGTTGGGCGTCGGT GGCGTTCGGCATGGCGGCGGATCCGGAAATGTCTGAGGTTCGGAGTGCTCACCCTAGCCGCTCAAAATTG TATAGACAACTGTTTGCGGGATGCGGGTTTGTACCGACATCCAATTAGCGGCCAATAGCCGGCGAATCCG CCCGTCAATGAAGCAATCGGGTCGGGAAGGATAAATCTTCAATATCTTCGATGGCTATCAATCAAGATTA CAAATTAATCAAAAACGGGAAATATGGCCGATAGCGCCATTTGACATACATTTCGACTCACATTCATCAC 
ACACTGCAAGCGATCATGATCAACAAACCGCCCCATCACAATCTCTACCAATCAGATTCATTACAAACAA ATCGATCAACAGATTACGGAAACCCCCGATAAACAATACCAAAACGGTAAATAGAAACACCCATAAAGCA AAAAAATCGCACTATCCTCATAGAAAATTTCGGCACCACCTCCCTCTCTGCCCCAACCTCTCGTCCGGCT AATTTAGAATTTCCGCACCCGTCTCGACAGGAAAAAACATGCTCGAACAAACAGCGGAACATTGGGGACA GAGTTATTTCTCGGAAACATTCAATCGCCGGGAATGGCAAGCGCACCCCCTTTCCATCCAGCGCCAGTAT GAACTGCAGGGCAATCTCATGCGTGAGGAGTGGCTTTTTTCACGTTATCTCAACAGAAAGCGCGTTCAGC GAGCCGCCAGCCTAGGAGCGGGGAGAGCCGAAACAGAAATCGCCCTTCTCGAATTGGGCGCAGTCGAGCA TTTTGATCTCTTTGATGTCAGCTCAGTTGGAATAGAATATGCAAAATCGATAGCTGAAGAAAAAGGATTT GGTCATAAGGTTACATGTCACGTGGGACCAATCGGTTGCGCCGAGCTGAACGAGAACACCTACGACCTGA TCACGTTCGTCGCGTCGCTCCATCATATGGAACCGCTCGCCGAAACCCTTGAGCGTGCCAATCGAGCACT AACCCAACGCGGAATCATCTGGTGCGCCAACGAATATATCGGACCGGACCGATTCGACTATCCAGTTGCA CATGCCGCGATTGCAAAATCCTTCTTTCAGCAAATTCCACCCGGCTTACGTAACCACTGGCATCGAGAGC TTCAGTTCCCGACCCCACAGGAAGTCGCGGAGGTAGATCCAACAGAAGCGCCCTGCTCATCGAAGATCGA ACCCACGATGCGTGACATGTTTCCAGCCCTGGAGATCATGCCACTATATGGGGCATTCGCTTTCATGGTG TTTTGGGGATTGAACCACGATGCGCTGTACGAGACTCCCGAAGGAACGGAACTAACTCGGTTCATTCTAG GAATGGATAAAGCGTTAACCGATGCGGGAATCTTACCGACATACTTTGCGCATATCATAGCCAGAAAAAA TAACGCCCCTCGACAACAGATCCACCGCCCAGGCCGCCACTCAAGCACCCCTCTTTATAGATATGCACAA AGAGCAGCATCCATACTCCGCCATCTAAAAATTAGATAAATAGAATCCAAAATGAAACATAATAAAAATC TGGGGCTGGAGGGGTTACGCGGACTCGCCTGCATGGCGGTCATGATAGGCCATTTTTCTTATGTTTTCCA CCCCTATCTCGCATCGCTATTCCGACCCATACCATTCTCCATTAAACCGTCCGCTTTTGAACGCTTGGTT TCAATTCCACCATTAACACTAGCTTATAGTGCCGATGCGGCCGTCAGCATATTTTTTGTCATGAGTGGTT GCGTACTCACGACTAAGTTTTTCATGGCAAAAGAGATCCCCGCCCTTCAATCAGCCGCCGCAAAGCGGTA CATACGTCTCGGCTTGCCAAGCTTCGCGTCGATATTCCTCGCGTGGATACTATGGCGCTCCGGCGCAATC TACACGGCACATGCAAATGAAATCGGCGTCGCCGGATGGGTGTCGGGGTGGTACGCAGGGCCATTCATGT TCATTGACGTATTCATCGACGGCCTCGCCGGTGCTCCACTATTTGCTCGTACCGCACTCAATCCAGCACT CTGGACGATCCAAGTCGAATTGATCGGCTCAATCGTCCTGTTCGCCATCATTTCCTTATTCGGAAAAAGA CCGATTCTCTTGATTCTATGGTTTATGTTCTTCGCGAACATATTCGGCCTCCAATCACCAAACGTTCTCT TTTATATCTCATTCCTAGCTGGCGCGATTCTTAATATTGCTCAACCATGGTTGAATCGCAACCAAGTAGC 
GTCACTCGTCGCATTCTCGCTCGGAATATTTTGCGTTGCCTATAACCAGCAAGCCATGTTCGATATACTG CGAATGCTCCCATTGCCCAACTTGAAGCCTGTCGGCCCGGATTTCAACGACAATCCGGATTTGCTGTGGC ACACAGTAGGATCAATTCTGTTAGTCGCCGGCGTAATCGGCTCGCGGCGCCTCTCAGCAATCATAGGCTC GCGCATTCCGGTATTCCTTGGAAAGATTTCATTTTCGATCTACGTGATTCACGTGCCAATCCTGGCGTCC GTTGGGCTGCGTGTAGCCGCGGCCGCGCAACTCGCCGGGCTGAAGTATGAACAAAGCGTAGCGTTGGCAT TTGTGGCATACGCAATGGCAGTCGTCGGAGGTGCCATCCTTTTCGAACGATGGGTCGACGCACCATCAAT TCGTTTAGCCAACCGTATGGCGAATAGGACGGCGAGAGTGGAAAAATTGCAAGGTCAGTCGATCGCACCA TGGCAATGATCCCCCGCGCAATATCCGCCCCTCTCCTCCGCGACAAACCCGCAAAGTACTACCTTGCACG TACCGCAACAGAATTGTGACCTGTCCATCGAAGCGGATCCCGGACAGGTTCATAACATGACCGAGTTCTG GTTCTGAACCGGCCTACGCGCTTCCGAAATCTACGAATTGGAGTGGCCGCAAGCCGATATCGCAAGTGGG ACGCTCCTGATAGCAAAGGCATTCGTCCGCCACGAACAAATGGATCGGCCAAAAACCAAGGTGGCTCGGC TGGTGATTGTGAATAGTCGCGCGCTTGCAGCGCTACAGCGCCAACGCGCATTTACCCAAATGGCGAATGG CCGCGTTTTTCATGGCCCTCGATATAACAAAGGATGGGCCACGAGGATGCCTTCAGACGGACGGACTGGG AGCCCATGCTCAAGCGCCTGTGATCCGCTATCGACGCCCATACAACATGACGCACTGTTACGCGACGGCG ATGCTCATGGCGGGAATGACGCCGGCATTCTGCGCGAAGCAGCTCGGCCATAGCGTGGAGATGTTCCTGA CGACCTACTCAAAGTGGATCGATGGGGATCAGAACGATTTGGAGATGGCGCTGGAAAGGGCTATTTCCTC CCCAGGACCTCCTCAGATGAAGGCAAAATCAACATAAATCGTTGATTTTCAATAGAAAAATATGGTGGGC CCGGCGGGTTTCGAACCCGCGACCAATCGATTATGAGTCGACTGCTCTAACCCCTGAGCTACAGGCCCTA AAAACGAACGCGGCGCAAAAAAAGACCCGGTAATGCTGGCCGGGTCGGCATTGTAACAACTAAGTAAGGG AATCGCCGCCGGATTTTTGCGCCCCGGCGGCGATCGACCGGAGTCGGACGATCAGTTCCCCTCGAGGAAC GACTTCAGCTTGTCCGAACGGCTCGGATGACGCAGCTTGCGCAGCGCCTTCGCCTCGATCTGACGAATCC GCTCGCGCGTGACGTCGAACTGCTTGCCGACTTCCTCGAGCGTGTGATCGGTGCTCATCTCGATGCCGAA GCGCATCCGCAGCACCTTCGCCTCGCGCGGCGTCAGCGAATCGAGCACGTCCTTCACGACGTCGCGCATG CTCGCATGCAGCGCGGCATCCGCCGGCGCGACCGTATTGGTGTCCTCGATGAAGTCGCCGAGATGGGAAT CGTCGTCGTCGCCGATCGGCGTTTCCATCGAGATCGGCTCCTTCGCGATCTTCATGATCTTGCGGATCTT GTCCTCCGGCATCTCCATCTTCTCGGCGAGCGTCGCCGGATCCGGCTCGAGGCCGGTTTCCTGCAGGATC TGCCGCGAGATGCGGTTCATCTTGTTGATCGTCTCGATCATGTGAACCGGAATCCGAATCGTGCGCGCCT GGTCCGCGATTGAGCGCGTGATGGCCTGGCGGATCCACCACGTCGCATACGTCGAGAACTTGTAGCCGCG 
ACGGTATTCGAACTTGTCCACCGCCTTCATCAGGCCGATGTTGCCCTCCTGAATCAGGTCCAGGAACTGC AGGCCGCGGTTCGTGTACTTCTTCGCGATCGAGATCACGAGGCGCAGGTTCGCCTCGGTCATCTCGCGCT TCGCCTGGCGCGCCTTCAGTTCGCCCGCCGCCATCTGGCGGTTGGTTTCCTTCAGGTCCTTGAGCGGCAG CACGACACGCGCCTGCAGGTCGAGCAGGCGCTGCTGCTGTTCGCGGATCGCCGGAATGTTGCGCGACAGG ATCGCCCCATACGAATGGCCTTCGGCCGCGACCTTCTCGGCCCAGTCGAGATCGGTCTCGCTGCCCGGGA AGCGCGCGATGAACTCCGAGCGCGGCATCCCGCACTTGTCGACGACGATGTGCAGGATCTGACGCTCGAC CTGGCGCACTTCGTCGACCTGCGCGCGCAGCGTATCGCACAGGCGCTCGACGGTGCGCGCGGTGAAGCGG ATCGACATCAGCTCGTTCTGGATCGTTTCCTGTGCCTTCAGGTACGACTTCGACTTGTAGCCTTCCTTCT CGAACGCGCGGCGCATCTTGTCGAACCATTCGCTGATGAGCGCGAACTTCTCGAGCGACGCGCGCTTGAG CGCTTCGAGCTGGGCCGCGTTGGCGGTCGCCTGCGCGGCGCCGTCGTCGTCCTCTTCTTCCTCTTCGTCG TCCTCTTCGGCTTCCGCGTCCTCGTTCTCGATTTCCTCGGCGTCCTTCTCCGAGAAGCCGTCGGTATCGG CGGCGTTCGGATCGATCAGGCCGTCGACGAGCTCGTCGACGCGGCTCTCCTCGTTCGCCACGCGCTCGGC CATCGCGAGGATGTCGGCGATCGTCGTCGGGCACGCGGAAATCGCCATCACCATGTGGCGCAGGCCGTCC TCGATCCGCTTCGCGATCTCGATTTCGCCTTCGCGCGTGAGCAGCTCGACCGTGCCCATCTCGCGCATGT ACATCCGCACCGGATCGGTCGTGCGGCCGAATTCGGAATCGACGGTCGACAGCGCGACTTCCGCTTCCTC CTCGACTTCATCGTCGGACGACGCGGCGGGCGCATTGTCGTTCAACAGCAGCGTCTCGGCGTCGGGCGCC TGCTCGTACACCGCGACGCCCATGTCGTTGAACGTGCCGATGATGCCTTCGAGCGCTTCCGTTTCCGTGA AGTTGTCCGGCAGATGGTCGTTGATTTCGGCGTACGTGAGGAAGCCGCGCTCCTTGCCGAGCTTGATCAG CGCGCGCAGCTTCGTACGGCGCTCTTCGAGCTCCTCGGCCGTGCCGGGCGTGCTCGTCGCGAACGCTTCC TTCAGCAGCGCCTTTTCCTTGGCGCGACGGTCGCGCGCCTTGGCCTTTTCGCCTTTGCCCGACGCAGCGG TAGGCTGGTCGTCGCTCTGGTTTGCGTCGTCATCGACGGATACTTCGTTCAGCTTTTTCGTCATGGAGTT CGCCGTACCGGCTAAATCGACTCGCGGCTGCTGGACTACAGCCGGTTGAACCGTGGATGCTCGAGTAGCT GCCGTTTCGTCCGGCGCGGCAGCCGCTCCCCTTGCGCTTTTCGCACTCGCCCGCGGCGAAGCCGGCTTCG CCGCCTTCGCCTCGGACCTCACGACGACCCGCGATGCGCTGGCCGTCGGCGCCGGCGCAGCCTTCGCGGT GCCGGCCGCGGCTCTCTTCCTCTCAGGGTGCGCCGCGCGGGCCGAAGACGTCTTTGCGGAAGCCGCGGAG CGGGCCGGCGCGACCTTCGTGGTCGATTCAGTCGCCTTGCCCGTCGCTTTTTTTCCGCCGGTAGTCTTTG CCATCGCGATTCTCGCCTCTGCTAGGAAAACAAACCGCTGAAAACCTTATATTATAGCACGTTGAGGCAG ACCCCTCCCCGGCCGGTTCACAGCCCGAGCCTCCGTTTCATGTCTGCCAGGTTCTGATGCAAGTCCTTGA 
TTTCCGCCAATTCCTCGGAAGAGGGCGCCGAGCGCTTCGACAGGCTGTTGAGCCGGTCGCAGCACGCGTC ATAGCGCATCTTGAGCACCGCTGCCTGAAGCTCCTCGCCTGCGATCCGTTCGCGCTCGCGCAGCCCTTCC TGCGCCGCGTCGCTTTCCGGATTCTGCTGCATCAAGTCGCGGACATTTTCATCATAGACCAGAATTTCCC GGAAAATTTCGTCGTAGGTCGGCGCGTTGGCCGAATTGCGTAGCACGTCGGACAAAAGCCGGAATTCCGC CCCGTCCCCAAGCGCGCGCGCGTGGCCGATCACCTCGTCGAACAGTTCCCCGATCCTGGGCAGCGCCCGC AGCGTCGCGACGCTTTCGTCGTCGATCGCCGCCGCGATGCGCGGGTGCATCACCAGATTGCGCAATGCGC GCTTTTCGCTCTCCGTCACGCGCCGCCGCTCGTTGCGGGCGGGCGCCTGGCGGGCCGGCGCGGCGATCCG CGCGTCCACTTCGGCAAGCGCGGTGACTTCGTCGAACGGAATCTGCAGCCGATCGGCGAACATGTGCATG ATCTGAGCGCGCAACGCGTTCGCGGGCAGCGCCTGCAGCAGCGGCTTCGCATCGAACAGCGCCTTCGCGC GCCCTTCCGGCTGGTCAAGCTCCTTACCTGCCGTGGCCTCGTTCAGCAGGAACTGGGACAGCGGCATCGC CCGCTTCACCTGTTCCGAAAACGCATCCGTCCCGAATTCCCGCACATAGCTGTCCGGATCGTGCTCCGGG GGCAGGAACAGGAAACGGATCGTCCGGTTGTCCGCCGCGTGCGGCAGGCACGCCTCCAGCGCGCGCCGCG CGGCGCGCCGGCCGGCCGCGTCGCCGTCGAAACTGAAGACGACCGTGTCCGTCTGCCGCAACAGCTTCTG CACGTGGATCGGCGTGCACGCGGTGCCGAGCGTCGCCACCGCGTTCGGAAAGCCGAGTTGCGCGAGCGCG ACGACGTCCATGTACCCCTCGACGACGAGCGCGTACTTGTGCTCGCGAATCGCGAGCCGCGCCTCGAACA GGCCGTACAGCTCGCTGCCCTTGTTGAACAGCGGCGTCTCCGGCGAATTCAGGTATTTGGGCTCGCCGCC GTCCAGCACCCGGCCGCCGAAGCCGATGACCTGCCCCTTCACGTTGCGGATCGGGAACATGATCCGCTCG CGGAACCGGTCGTAGCGGCGCGCGACGCCGTTCGCGTCGGTCTTCTCGCTGACGATCACGAGGCCCGCGT CGACGAGCGAATCGTGGCGGTAGTCGTCGAACGCCGCCTCGAGGTTCTGCCAGCCGTCCGGCGCGTAGCC GAGCCCGAAGCGCAGCGCGATCTCGCCCGTCAGGCCGCGCCGCTTCAGGTACTGGATCGCGTTCGGCGCG CCGCGCAACTGCTTGCGGTAATAGTCACAGGCGGTCTGCATCACGTCGGACAGCGCGGTCGTGACGGATT TCGACGCCGCGGGCGGCGCATGGCCGTCGCTCGAGCCGCCGCCTCCGCGCATCGGCGACGGCTCGTGCGG CACCGTGAGCCCGACCGATTGCGCAAGCTCCTGCACCGCCTCGGGAAACGTGAGCCCCGCGTGCTCCATC AGGAAGCCGATCGCGGTGCCATGCGCGCCGCAACCGAAGCAGTGATAGAACTGCTTCGTCGGGCTGACCG TAAACGACGGGCTCTTCTCGTTGTGAAACGGGCACAGCCCCATGAAGTTGGCGCCGCCCTTCTTGAGCTG CACGTACCGGCCGACCACGTCGACGATGTCGACGCGGTTCAGCAAATCCTGCAGGAACGAATGCGGAATC ACCGTTGGATCGATCGAAAGACGTGCCGGCCGACGCGCGCCGCAGCGGCCGGCGCAGGCCGCCTGAACGA CGCTCGCGCGCCGCGCCGGCGGGAAGTTACTTCGAGAGCGCCGCCTTGACGAGCGCGGACACGGCCGTCA 
TGTCGGCGCGGCCGGCGAGCTTGCCCTTGAGCACGCCCATCACCTTGCCCATGTCCTGCGGGCCCGCCGC GCCCGTTTGCGCGACGGCGGCCTGCACTTCGGCCGCGACCTCGGCATCCGAGAGCTGCGCCGGCAAGTAC GCGCTCAGCACCGCGACTTCCGCCTGCTCCTTCTCGACGAGATCGGTGCGCCCGGCGGCCTCGAACTGGC TGATCGAATCCTTGCGCTGCTTGATCATCTTGTCGACCACCGCGGTGACGCCCGCGTCGTCGAGCGTCAC GCGCTCGTCGACCTCGCGCTGCTTGATCGCGGCGAGCAGCAGGCGGATCGTCGCGAGCCGCTCGCTTTCC TTCGCGCGCATCGCGGCCTTCATGTCTTCACTGATCTGGTCTCTCAGACTCATCGTATTTCCGGTATCAA AATTTCCATGAGGCGCCGCCTAGCGGCGTCAACACAAAAACCCGCTTGAGACGGTTTCTCAAGCGGGTTT GCTCGAATTCGGCATCGACGACCTCGCTCGTGGAGCCCGCCGGTGCGCCGCCTAGCGAACCACGCGGCGC GGACAACGCTCAGTAGAGCTTCTTCGGCAGCATCTGGCTGCGGATGCGCTTGTAGTGGCGCTTCACCGCA GCGGCCTTCTTGCGCTTACGCTCCGCAGTCGGCTTCTCGTAGAACTCGCGCGCGCGCAACTCCGTCAGCA GACCGTTCTTTTCGATCGTGCGCTTGAAGCGGCGCATCGCGACTTCGAAAGGCTCGTTTTCCTTAACGCG GATGATCGTCATGACCCGTCACGTAAAAAAAGAAAGTAAAGGTATTGCGGAGAAAGGTTTTCGAGTATAG CAGACGCTTTGCACAAACGCACTAACCCGTCAAGCGCCGCGTGCCGTCGGCGGCCGCGCGCCCGGCCGCG ACGCCCGACGCCCATGCCCACTGGAAATTGTAGCCGCCGAGCCAACCCGTCACGTCGAGCGCCTCGCCGA CGAAGAACAGGCCCGGCACGCGCTCGCTCATCATCGTCGCCGACGACAGCTCGCGCGTGTCGACCCCGCC TTTCGTCACCTCGGCCTTCCGGTAGCCCTCGGTGCCGCTCGGCGTGAGCGTCCAGCGCGACAGCGTCTCG CCGATGCGGCGCAGCGTCTTGTCCGGCAGGTCGGCAAGCCGGGCGTCCGCGTGCACCCGCTCCGCCTCGA GCCACGCGTGCGCGAGCCGCGCGGGCACCCATTCGGCGAGCAGCGTGCCGATCTGGCGGCGCGAGCCGGC CTTCGCGCGAATCAGCTCGTTCGCCGCGTCGATTTGCGGCGCGAGATCGACGTGAATCGGCTCGCCCGAG CGCCAGTAGCTCGAAATCTGCAGGATGCCCGGCCCGGACAGTCCGCGATGGGTGAGCAGGATGTCCTCGA CGAACTCACCGCCGCCCCGCTTCGCGCCGGTCGACACCCGCGCCTCGAGCGACACGCCCGACAGCGCGGC GAACGGCGCCCACGCATCGGCCGAAAAGGTCAGCGGCACGAGCGCCGGCCGCGTATCGACGAGCTTGTGG CCGAACTGTTTCGCGATCCGGTAAGCGAAATCGGTCGCGCCGATCTTCGGGATCGACAATCCGCCCGTCG CGACGACGAGCGCCTGCGCGCCGATCGCGCCCGCGCTCGTATCGAGCGCGAAGCCGTCCGCCTGCGCGTG CCGCACCGCGTCGACCGTCACCGGCCGCCGCCATGCGACGCCGCCCGCGTCGCACTCGCTTTTCAGCACG TCGATGATCGCGTCGCTCGAATGGTCGCAAAAGAGCTGGCCCTTGTGCTTCTCGTGCCACGTGACGCGAT GGTTCTTCAGGAGCCCGAGAAAATCGCGCGGCGTGTAGCGCGCGAGCGCCGAGCGGCAGAAATGCGGATT CGCGGACAGATAGTTGTCCGGCCCCGCGTACAGATTCGTGAAATTGCAGCGCCCGCCGCCCGAGATGCGG 
ATCTTCTCCGCGAGCCGTGGCGAGTGATCGAGCAGCACGACGCGCCGCCCGAGCTGCCCCGCCACCGCAG CCGTCATCATGCCGGCCGCGCCCGCGCCGATCACCGCGATATCGAATTTTTCCATGGCGCGGATTGTACC CGCGTCGCGTGCCCCTTCGCCCGCGCGCCGGCTGGCGAAGCGGCCGCGCTGTTATACTTTTCTGTTACGT TGATTTTCCGGCCGTGTGCGCACGCGCGCGTCGATGCGGCCGCCGCACACCATCATGCTCGTTCTCGGCA TCGAAAGCTCCTGCGACGAAACCGGCCTCGCGCTCTACGACACCGAGCGCGGCCTGCTCGCGCACGCGCT TCACTCGCAGATCGCGATGCACCGCGAATACGGCGGTGTCGTTCCCGAGCTCGCGTCGCGCGACCACATT CGCCGCGCGCTGCCGCTGCTCGAAGAGGTGCTCGCCGCAAGCGGCGCGCGCCGCGACGACATCGACGCGA TCGCGTTCACGCAGGGGCCCGGCCTCGCGGGCGCGCTGCTCGTCGGCGCGAGCATCGCGAACGCGCTCGC GTTCGCGTGGGACAAGCCGACCATCGGCATCCACCACCTCGAAGGGCATCTGCTGTCGCCGCTGCTCGTC GCCGAGCCGCCGCCGTTTCCGTTCGTCGCGCTGCTCGTGTCGGGCGGCCATACGCAACTGATGCGCGTGA GCGACGTCGGCGTCTACGAGACGCTCGGCGAGACGCTCGACGATGCCGCCGGCGAAGCGTTCGACAAGAC CGCGAAGCTGCTCGGCCTCGGCTATCCGGGCGGGCCGGAGGTATCGAGGCTCGCGGAAGCCGGCACCCCG GGCGCGGTCGTGCTGCCGCGGCCGATGCTTCATTCGGGGGATCTCGACTTCAGCTTCAGCGGGCTGAAGA CCGCCGTGCTCACGCAAATGAAGAAGCTCGAAGCGGCGCACGCGGGCGGCGCCGTGCTCGAACGAGCGAA GGCGGATTTCGCGCGCGGCTTCGTCGACGCGGCCGTCGACGTGCTCGTCGCGAAGTCGCTCGCCGCGTTG AAGGCGACGCGGCTCAAGCGGCTCGTCGTCGCCGGCGGCGTGGGCGCGAACCGGCAATTGCGCGCGGCGC TGTCGGCCGCCGCCCAAAAGCGCGGCTTCGACGTCCATTATCCCGATCTCGCGCTCTGCACCGACAACGG CGCGATGATCGCGCTCGCGGGCGCGCTGCGGCTCGCGCGCTGGCCGTCGCAGGCGAGCCGCGATTACGCG TTCACGGTGAAGCCGCGCTGGGATCTCGCGTCGCTCGCGCGATAGCGCGCGAAGCACGACGAGCCCGCCA TACGGAGCCGGCGCGGCGTGCGGCGGAGCCGAGGCCGCAACGCGCGCGTGCCTGCTCTCGGGCCCGCCCG ATGCCGGGCACGAAGGCGAGCGCGCGCAAACGGCGCGAGCGCGCCACGCGGCCGCGTGATCCGTTCGGCA CTTGCCGCCGGACATCATGCTCGCCCGACACGCGAGTCGGCGACGATCGTCGCCGAACGCGGTCGCCGGC CGCGCAGTCAATTGCCTGACACACCCAAACTGCAACCGGGCCGCGGCAGCCGCGCCGAACCCACACGCAC CGCCGCCACGCCCAAACGAAAAAGGCCGTTCGATTCGAACGGCCCCTTCCTCGAATACGGCGAACGCCGC CGCGCGACAGCGGCGCTCACGCGCCTCGCCGCTTGTCTCGCTCGATCAGCGCGTACGCGCTGTGATTGTG GATCGACTCGAAATTCTCCGCCTCGAGCACATAGGCGACGATCCGCTCGTCCGCATCGAGCCGCCGCGCG ACGTCGCGCACCAGATCCTCGACGAACTTCGGATTCTCGTACGCGCGCTCGGTGACGAACTTCTCGTCGG GACGCTTCAGGAGGCCCCATAGCTCGCACGAGGCCTCTTCCTCCGCGATCCGGATCAGATCTTCGACCGG 
CACGTCGGCCGCGAGCTCCGCGTCGATCGTCACGTGCGAGCGCTGGTTGTGCGCGCCGTACTGGGAAATC TTCTTCGAGCACGGGCAAAGGCTCGTCACCGGCACGAGCACCTTCGCGAACACGCGTGTGAGCCCGTCGC GCACGTCGCCCGTCAGCGTCACCTCGTAATCGAGCAGGCTGCGCACGCCCGACACGGGCGCCGTCTTGTT CACGAAATACGGAAACGACACCTCGATGCGCCCCGCCCGCGCCTCGAGCTTCTCGAGCATCGTCGCCAGC ATCGTGCGGAACGCATCGGCGGTCAGCGGCCCGCCGCGCTCCTCAAGCAGCGCGACGAAGCGCGACATGT GCGTGCCCTTCTGATCGGCGGGCAGATGCACGTCGAGATTCCAGGTGCCGACGGTGGCCTGCGTCTCGCC TTCCGCGGTGCGCACCGTCAGCGGATGCCGGACCGCGCGCACGCCGACGCGCTGGATCGGCATCTGCCGG GTATCGACGGTGCTCTGCACGTCGGGCATCGCGAATTCGGGATTCATCAGGTTCATCTTGTTCGATCCTT GTAGGGTCATGCGGCGCGACATCGCCCTATGCCGCATGGCCTGCCGGAAATGCAACATGGGCCCGCAGGC CCATGCTTTCGGGGGAAAGCCGTCGCGCGCCCGCCGCGCGGCGGCCCGATGCGACGTGAAGCGACGTTAT GCGACGCGCTTCGCCGGGCTCGCGACGTCCGCCGCGTGGCTCAGGAAGCGTTCGCGAATCGATTTGGCGA TGCCGTCGCCGTCGAGGCCGCACAGCGACAGCAGCTTCGCCGGATCGCCGTGATCGACGAAACGGTCCGG CAAGCCGAGCTGCAGCACCGGGCGGACGACGCCGCTTTCCATCATCGCCTCGACGCACGCGGAACCCGCG CCGCCCATCACGCAGCCTTCCTCGACGGTGACGAGGTAGTCGTGCGTCCGCGCGAGCGCTTGCACGAGCT CGGCGTCGATCGGCTTCACGAAGCGCATGTTCGCGACGGTCGCGTCAAGCGCGTCGGCCGCCGCGAGCGA CGGCGCCACCATCGTGCCGAACGCGAGAATCGCGATGCGCTTGCCGTCCGGCTGCGACGTGCGGCGGCGC ACTTCGCCCTTGCCGAGCGGGATCTCGGTAAACGCCTTGACGGTGGCAACGCCCGTGCCCGCGCCGCGCG GATAACGCACGGCGGTCGGGTTCGGCTGTTGCAGCGCGGTATGCAGCATCTGGCGGCACTCGTTCTCGTC CGACGCGGCCATCACCGTCATGTTCGGAATGCAGCGCAGGAACGCGAGATCGTACGCGCCCGCGTGCGTC GCGCCGTCCGCGCCGACGAGGCCCGCGCGGTCGATCGCGAACACGACGGGCAGGTTCTGCAGCGCGACGT CGTGAATCAGCTGATCATAAGCGCGCTGCAGGAACGTCGAATAGATCGCCACGACAGGCTTGAGCCCTTC CGTCGCGAGGCCGCCCGCGAACGTCACCGCGTGCTGCTCGGCGATGCCGACGTCGTAGTAGCGCTCCGGG AAGCGCTTCTCGAACTCGACCATGCCCGAGCCTTCGCGCATCGCGGGCGTGATGCCGACCACGCGCGCAT CGAGTTCCGCCGCGTCGCACAGCCATTCGCCGAACACTTGCGTGTAGGTCTTCTTCGCGGGCGTCGTCGA CGGTTTGATGCCTTCCGCCGGGTTGAACTTGCCGGGCCCGTGATAGAGCACCGGGTCGGCCTCGGCGAGC TTGTAGCCCTGGCCCTTCTTCGTCACGACGTGCAGGAACTGCGGCCCGCGCAGCTCCTTGATGTTCTGCA GCGTCGGGATCAGCGAATCGAGATCGTGGCCGTCGATCGGCCCGATGTAGTTGAAGCCGAATTCCTCGAA GAGGGTGGCCGGCACGACCATGCCCTTCGCATGCTCCTCGAGCTTGCGCGCGAGCTCCAGCACGGGCGGC 
GCGACGCTCAGCACGCGCTCCACGCCCGCACGCGCGGCCGCGTAGAAGCGGCCCGACATCAGGCGCGCGA GATGGCGATTCAGCGCGCCGACGGGCGGCGAGATCGACATGTCGTTGTCGTTCAGGATCACGAGCAGCTT CGCGTCCTCCGACACGCCCGCGTTGTTCATCGCCTCGAACGCCATGCCGGCCGTCATCGCGCCGTCGCCG ATCACCGCGATCGAGAAGCGGTCGTCGCCGTTCAGCTTGCTGCCGATCGCCATCCCGAGCGCGGCCGAGA TCGACGTGCTCGAGTGCGCGGTGCCGAACGTGTCGTATTCGGATTCCGAGCGGCGCGGAAAGCCCGAGAT GCCGTCGAACTGGCGCAGCGTCTTCATCCCGTCGCGGCGGCCCGTCAGGATCTTGTGCGGATAGGTCTGG TGACCGACGTCCCACACGATCCGGTCGTCCGGCGTATTGAACACGTAATGCAGCGCGATCGTCAGCTCGA CCGTGCCGAGATTGGACGACAAATGGCCGCCCGTCTTCGACACGCTGTCGAGGACGAACGCGCGCAGCTC GTCGGCGAGCGGCTGCAGTTGGCGACGATCGAGACGGCGCAGGTCCGCCGGATCGTCGATGGTTTTCAGC AAGTCGTACATCGTCGTTCCATTGTAGGAAAACAAACGCGCCCGCACTTCTTGCGCGCCCACCCGTGGCG TGGGCGCGGCGGCGGGCTTTCGCGTCAGCTGACCCGGTTCACCACCAGGTCCGCCAGTTCGGCCAGGCGC TGCGCGCGCGCGCCGAACGGCTCGAGCGCGGCGTGCGCATCGGCGCGCAATTGCGCGGCAAGCGCGCGCG ACGCGTCGAGACCGATGATCGACACGTAGGTCGGCTTGTCGTTCGCCGCGTCCTTGCCGGCCGTCTTGCC GAGCGTCGCGGAATCGGTGGTGACGTCGAGAATGTCGTCCACCACCTGGAACGCCAGGCCGACGGCCGCC GCATAGGCGTCGAGCGCGCGCATCGCGTCGCCGGACGGCGTCTCGCCCGCAAGCGCGCCCATGCGCACCG ACGCGCGCAACAGCGCGCCCGTCTTCATCCGATGCATCTTCTCCAGCGCGTCGCGCGTGAGCGCCACGCC GACGCTCGCGAGATCGATCGCCTGGCCGCCCGCCATGCCGATCGAGCCGCTCGCGAGCGCGAGCTCGCGC ACGAGCGCCGCCTGCCGGTCGGCCGGCAGCGCGGCGGCGTCGGTCAGCGCGACGAAAGCCTGCGACTGCA GCGCGTCGCCGACGAGCAGCGCGGTCGGCTCGTCGTACTTGACGTGGACGGTCGGCTTGCCGCGACGCAG CGCGTCGTCATCCATGCACGGCATATCATCATGCACGAGCGAATACACGTGAATCATCTCGAGCGCAGCC GACGCCGCGTCACGCGCCGCCTCCGACGCGCCCGTGAGCTCGCCCGCCGCATGGCACAGCAGCGGACGCA CGCGTTTGCCGCCCCCGAGCACCGCATAGCGCATCGCTTCGTGCAGTTGCGCGGGCGCCACAGTCCCGGC CGGTAAATAATGGCCGAGGGCCGTCTCGACACGCTCGAGCACCGAGCGCATCCATTGGTCGAATGTCATA GATCGTCGTCTTCGCCGTCCGCGGCGGCCGTTCCTGCGGAAAGCGGCTTGAGCGAGGCGCCGTCGAGCAC ACGCACCTGCTGCTCCGCTTTTTCCAATTGTTGTTGGCAAAATGCAACGAGCGCCGCGCCGCGGCGATAC GCCGCAAGCGAATCCTCGAGGCTCAGCGTCCCGCCCTCCATTCGGGCGACGAGCGCCTCCAGTTCCGCGA GCGCCATCTCGTAGTTTTCAGGCAGCGGCCCGGAACCGGGGTCCGACGCGCACGCGCCTGGGGTTGCGGT TTTCGCCATGGACGGGGTGTCAAATTTCAAAACAAGTCGGACATTCTACGGCAAAAGAGAATTTTCCGAC 
CTGCCGACTTGGCTTCGCGGGGCTTGGCCGCCGCGCGCCACGTGTGCAATTCGCACGACGCGCGTCCATA TTCCGGGAAAAATTTGACACAAATCAGTCACTTACCCCACCCCAAGAATGTCGATCCGGGTATAATCGCC GGTTCCCCTAAATCGAATTTTCGATGGTTGGGTTGTTCACTGCTTTCACGCTTTCACCGGGAGTGGGAAT GTCCAATCTGAGCGACGCACTGCAGCTGAAGTCGGCACATAGCCAGCTTCCCGTCACCGCTTATTTCGAT GAGGCGCTCCTCGCGCGCGAAATCGAAACACTTTTCAAGAAAGGACCTCGCTATGTCGGGCACGAATTGA TGGTGCCCGAAGCAGGAGATTATTTTGCGCTGCCTTCCGAAGACGAAGGCCGCGTGCTGGTGCGCAACCA GGCTTCGCAGATCGAGCTGCTGTCGAACGTGTGCCGCCACCGCCAGGCGATCATGCTGAACGGCCGCGGG CGTACGCAGAACATCGTGTGCCCGCTGCATCGCTGGACCTACGATCTCGAAGGCCAGTTGCTCGGCGCGC CGCACTTTCCGGACAAGCCCTGCCTGAACCTGCACGCGACGCCGCTGCAGCACTGGCAAGGGCTGCTGTT CGAGGCCGAGGGCCGCGATGTCGCGCACGATCTCGCGCAACTCGGCACGAAGCACCATTTCGACTTTTCG GACTACCTGTTCGATCACGTCGAGATCCACGAGTGCAATTACAACTGGAAGACCTTCATCGAGGTCTACC TCGAGGACTACCACGTCGTGCCGTTCCATCCGGGCCTCGGCAGCTTCGTGTCGTGCGACGACCTGAAGTG GGAATTCGGCGACTGGTACAGCGTGCAGACGGTGGGCGTGCACAACGCTCTCGCGAAGCCGGGCAGCCCG ACGTACCAGAAGTGGCACGATCAGGTGCTCCGTTATCGCAACGGCGTGCCGCCGGAGTTCGGCGCGATCT GGATGGTCTATTACCCGGGCCTCATGATCGAGTGGTATCCGCACGTGCTCGTGGTGTCGTGGCTGATTCC GCGCGGCCCGCAGAAGACGACGAACATCGTCGAGTTCTACTACCCCGAGGAAATCGCGCTGTTCGAGCGC GAGTTCGTCGAGGCGGAGCGCGCCGCCTATATGGAGACCGCGATCGAGGACGACGAGATCGCATGGCGCA TGGACGCCGGCCGCCGCGCGCTGATGGAGCGCGGCGAATCGCAGGTCGGCCCGTATCAGAGCCCGATGGA AGACGGCATGCAGCACTTCCACGAGTTCCTGCGCCGGCAACTCGGCGCGATCTGAGCGCAGCGGCGGCGT GCGCCGCGCGACGTTCGCCGAAATCTTGAGCGATCCGGACTTGCGCCGGGCCGACACGGAAAAGACGGGC TGCAGCCCGTCTTTTTTTGTTTAGACTTAGAGCATCTGGCCCGTATCTCCAAGGGAGGCGTCATGCCGCA CACTCACTACACCACGCTCATCTCGGCGGACAATCTCGCCGAACGTCTCGCGGCCGCCCCCGGCAGCGTG CTCGTGTTCGATTGCCGTTTCGATCTCGCCGATACCGACCTCGGCGAGCAGGCCTACGCGGCCGGCCACA TTCCGGGCGCGCATTACCTGCATCTGGACCGCGATCTGTCGGGCGCGAAAACCGGCACGAACGGTCGTCA CCCGCTGCCCGCGCGCGATGCGCTCGTCGATACGCTGAAGGCTCACGGGCTCAGGCAGAATCAGCAGATC GTCGCATACGATGCGCAGGGCGGCATGTACGCCGCGCGCCTGTGGTGGCTGCTGCGCTGGCTCGGCCACG ATTCCGTCGCGGTGCTCGACGGCGGCCTGCAGGCGTGGCAATCGGAAGGGCACGCGGTATCGCAGGATAC GCCGCCGAAGTCGATGGGCGATTTCCGCGCGGGCGCGCCGCTCGCGACGGTCGTCGACGCGCAGACGGTG 
CTCGCAAACCTCGCGACGAAAGACCGGCTCGTGATCGACGCGCGCGCGGCCGACCGCTACCGCGGCGAGA ACGAGACGATCGATCGCGTCGGCGGACACATTCCCGGCGCGCTCAATCGCTTCTTCAAGGACAACCTGAC GGCCGACGGGCGCTTCAAGAGCGGACACGAGCTGCGCGAGGCGTTCAGCCCGCTGCTCGGCGCGACGGAG CCGAAGAAAGTGATCCTGCAATGCGGCTCGGGCGTCACCGCATGCCACAACGCGCTCGCGATGGAAATCG CCGGCCTGCACGACCCGGCGCTGTACGCGGGCTCGTGGAGCGAATGGAGCGCGGACCCGTCCCGCCCCGT CGCGACGGGCGCGAACCCGTAAGCGCGCCAGCCGCGCCTTGCCGCGTGCGCCGCCCACCCGTGGCGGCGC ACGCGCGGCATCACATCACGCCGTGATGCCGAAACCATGCGATAGCGCGCTTCCAGCCGTCGTCGGCATC GGCCTTCACGTAACTCGGGCGGTAATCGGCGAAGAACGCGTGGCCCGCATCCGGATACACGACGATTTCC GATTCGCGCGCGGCCTGCGTGCCGGCCGCCGCGAGACGCTCGCGCATCCGCGCGAGCGAGCGCTGCGAAA TGCTGTCGTCCTTGCCGCCGTAGAGGCCGAGCGTCGGCGCCTTCAGTTGCGCCGCATGATCGTCAGGATT GAACGGCGTCATCTCGTTCGTCTCGCCGGCCACCTTGCCGTACCACGCGACAGCGGCGCGCACGTGCGGA TTGTGCTCGGCGAAAAGCCACGTCTGCCGGCCGCCCCAACAAAAACCCGTCACGCCGAGCCGCGACAGGT CCCCGCCGTTCTTGCCCGCCCATCGCACCGTCGCGTCGAGATCCTCGATCACCTGACGGTCGGGCACCTT GCTGACCACCTGATCGATCAGTTCCTGGATCGACGCATGCTTCGACGGATCGCCCTGCCGCGCATACAGA TCCGGCGCGATCGCCAGATAGCCGAGCTTCGCGAAGCGCCGGCAGATGTCGGCGATGTGCGCATGCACGC CGAACACCTCGTGGATCACGACGATCACCGGCAGATTGCTCTTGCCGTCCGGCTGCGCGCGATAGGCGGG CACGCTCGCGTCGCCCGAGCGGATCTGGACGGTGTCGACGTCTAGGCCGGCGGCATCGGTGGTGATCGTC TGCGCGGACACGGGCAGCACCGCCGCGGCGAACGTGCCGCCGAGCGCCGCCTGGACGAACTTGCGGCGGC TGAACGGAACGTGCGGAACTAGACTGTCGACTTCGGGTTTCAACATGGATGCGCTCCTCTCTTCTCGCTG TCGCGCACACGCGGCACACGCGTCGCCGACATGAATTCGAATCACGGCGGCTGGCGCGATGCAGTCGCGC CGCGCGCCGCCGCCCGCCCCGCCGAGGCGGAACCCGCACACGATGCCCAACAATCCAGGGACGCGTCAAC CTTTTGTCAGAGGGAAAGCGGCGGCGACGCCCGATCGTTCCGTCGATTGGCCCGGCTTCGAATGCCCCGC CGCGGCGTGCAAGCCGAAGCGCGTACGGTTCGGGCGCGCCGCAACGAGGCGGCGCGATGCGCGCGCATAC GCGCAAGCCGCCCGCGAAACGCCCCCCCGTGCGCGGCATGGGCGCGTTGCGCGCATCGGGCGACGCGGCT GTCGCCATGCAGGCGATCGCATGGGCCGATTTCGCGCCGCTTCAGAGGCCGGCCTGCATGCGCCCACTTC ATTCACAGTCCTACGAATTGCACGCGCCGCTTTTCATTTTCCTTTGCATACCGAGATAAGTCTTCTTCGA TACGTGCATTCCGGGAAAACATTGGATTGCCAATCATCCGGATTGACCGCATGATGCACTGGCGCGATAC GCGCGCAGCCGGCGACATCGCGCGATGTCGGCCGGCCGGATGCGCCGCGGACGACCACACAGGCCCGATG 
ACGCCGCTTGACCGCGCATCGCGTCGACAACGACACGCAACGCCGACGAAGCACTACGAGGTTGGAATGA ACGAGCCGACACGCCAGACCACCGGGGACCTTTCCGAGCGTCGCGGGCCGAATCGACGGCGCCGCGCGCC GCGCTCGCCGGCGCGCCAGCGCGGCTCGCTCGCCATCATCGCGGCAATCGCGATCGGCGTCGTGATCGCC GCGCTCGGCGCGGTCGACCTCGGCAACCTGTTCTATCAGCGCCGCGCGCTGCAAAGCATCGCCGACCTCG CCGCGCTCGCGGCCGCGCAGACGATGGACGACGGCTGCGCGAAGCCGGCCGCCACCGCGCAATCGGCCGC GCTCGGCAACGGCTTCGACAGCACCGCGTCGGGGCAATCGATGACGGTCGTCTGCGGCAGATGGGACGTG AAGGACAACGTCGGCCCGAGCTTCTTCGCGGGTTCGGCATCGGGCGCCGCGGCCGGCAGCGACGCGCAGC TCAACGCGGTTCAAGTGACGGTCACGCGCGCCGTGCCTTACTACTTCCTCGGCGCGCAGCGCACGATCGC GGCGACCAGCACCGCAGAGGCGACCAACGTCGGCGCCTACTCGATCGGCACGACGCTCGCGCAACTGCAA GGCGGCGTCGTGAACGCGCTGCTCAACGGGCTGCTCGGCGCGAATCTGAATCTGTCGGTGTTGTCGTATC AAGGCCTTGCCAATGCGCGAATCAGGATCAAGGACCTGATGGCCGCCGCGAACGTCGGCACCGTGAGCGC GCTGCTGAGCACGCAGACGACCGTCCCGCAGCTCGCGAACTGGATGCTGAGCGCGCTGTCGCAGACCTCG GTCGCGAATGCCGACTTGCAGACGAGCATCGGCGCGCTACAGACGATCGTCAGCGCGAACATTCCGGGCG GCCGGACTTTCACGATCGGCAACACCGCGAATTCGGCGGGCATCTTCTCGATCGGCCTGTCCAATCCGCA GGCCGCGCTCGACGCGACGTTCAGCCCGTTCGACGCACTTCTCGTCGCGGCCGAGATCGCGACCGGGCAA ACGGCGTTCTCGCTCGCGAACGGGCTGAACATCGGCGGGCTGAACGCGAATCTGCAAGTGCAGATCATCC AGCCGCCCGTGCTCGGCATCGGCGAAGCGGGCATCGACCCCGTCACGAAAACGTGGCGCACGATCGCACG CACCGCGCAGGTGCGGCTCTATCTGAACATCGGACTCGGCACGGCGAACCTGCCGCTCGGGCTGCTCGGC GCGCTCCTGCCGGTGCAGGTGAATCTGCCGCTATCGCTGCAGATCGCGCCGGGCCAGGCGTGGCTGCAAT CGGCGAGCTGCACGGCGTCGCCGTCGACTTGCACCTCGGCCATCGGCGTGCAGACGGGCCTCACGAATCT GTGCATCGGCGACACGCCGGCCAACATGTCCGCGTCGCTGCCGTTCACCTGCTCGACGCCCGCGACGCTC GTCAACGTCGCGAACCTCGTGACGATCAAGTCGCTCGTGTCGTTCCCGGCCGACGTGCCCGCGAGCCAGA CGCCGACGCTCACGTTCTACGGCACGACGGGCGGCTATCAGAGCACGAACTCGAACGGCGTCGGCAGCGT GCTCGGCAATGCGCTGTCCGGCCTCGGCGCATCGCTGCAGCAGACGCAGATCTCGCTGTTCGGCATCAGC CTGCCGCTCGGCCCGATCCAGACCGCGCTCAATGCGTTCCTGGGCGGCGTGCTGCCGCCGCTGCTGTCGG GGCTCGACGCCGCGATCGTGCCGCTGCTGCAACTGCTAGGCGTGCAGGTCGGCGAAAGCACGATTCACGA CATGTCGCTGACTTGCGGGGTGTCGCAGCTCGTCTATTGACGATGCGCCGATCGAACGGCGAACGGGCAA CGCGTTACGGACGATGAGCGACGACGACCGACGGGAAGCGGCGCGGCCGCCGGCTTCGCGCCGCTCGTCC 
AGGCAATGGTGAC
Region | 1 |
Region Length | 27.3Kb |
Completeness(score) | incomplete(30) |
Specific Keyword | integrase,protease |
Region Position | 95942-123324 |
# tRNA | 1 |
# Total Proteins | 7 |
# Phage Hit Proteins | 6 |
# Hypothetical Proteins | 0 |
Phage + Hypothetical Protein % | 85.7% |
# Bacterial Proteins | 1 |
Attachment Site | yes |
# Phage Species | 6 |
Most Common Phage Name(hit genes count) | PHAGE_Bacill_vB_BanS_Tsamsa_NC_023007(1) PHAGE_Salisa_1_NC_017983(1) PHAGE_Vibrio_KVP40_NC_005083(1) PHAGE_Cyanop_PSS2_NC_013021(1) PHAGE_Cyanop_KBS_S_2A_NC_020854(1) PHAGE_Helico_1961P_NC_019512(1) PHAGE_Synech_S_CBS1_NC_016164(1) PHAGE_Vibrio_ValKK3_NC_028829(1) PHAGE_Pseudo_MD8_NC_031091(1) PHAGE_Synech_S_CBS3_NC_015465(1) PHAGE_Brevib_Sundance_NC_028749(1) PHAGE_Bacill_JBP901_NC_027352(1) PHAGE_Helico_KHP30_NC_019928(1) PHAGE_Bacill_G_NC_023719(1) PHAGE_Bordet_BPP_1_NC_005357(1) PHAGE_Pelagi_HTVC008M_NC_020484(1) PHAGE_Psychr_Psymv2_NC_023734(1) PHAGE_Helico_phiHP33_NC_016568(1) PHAGE_Helico_KHP40_NC_019931(1) PHAGE_Pseudo_JBD44_NC_030929(1) |
First Most Common Phage # | 1 |
First Most Common Phage % | 14.28% |
GC % | 65.09% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains an intact, questionable, or incomplete prophage, based on the scoring criteria below. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
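Several of the fields defined above are simple arithmetic on the region's data. The sketch below shows how they could be recomputed by hand. The function names are hypothetical, the region coordinates are assumed to be inclusive, and ambiguous bases (anything other than A, C, G, T) are assumed to be ignored when computing GC %; the tool's own handling of these details is not stated in this report.

```python
def region_length_bp(position: str) -> int:
    """Length in bp of a 'start-end' region, assuming inclusive coordinates."""
    start, end = (int(part) for part in position.split("-"))
    return end - start + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Whitespace and ambiguous bases are skipped (an assumption, see above).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


def phage_plus_hypothetical_percent(phage_hits: int, hypothetical: int, total: int) -> float:
    """Combined percentage of phage-hit and hypothetical proteins in a region."""
    return 100.0 * (phage_hits + hypothetical) / total


if __name__ == "__main__":
    # Values taken from the Region 1 table in this report.
    print(region_length_bp("95942-123324"))                       # 27383
    print(round(phage_plus_hypothetical_percent(6, 0, 7), 1))     # 85.7
```

With the Region 1 values (6 phage hit proteins, 0 hypothetical proteins, 7 total proteins), the combined percentage comes out to the 85.7% shown in the table.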
Intact (score > 90)
Questionable (score 70-90)
Incomplete (score < 70)
Score: | The score of the region based on the above criteria. |
Criteria for scoring prophage regions (as intact, questionable, or incomplete):
Method 1:
- If the number of CDS assigned to a given phage organism in this table is greater than or equal to 100% of the total number of CDS in the region, the region is given a total score of 150. If it is less than 100%, methods 2 and 3 are used.
- If the number of CDS assigned to a given phage organism in this table is more than 50% of the total number of CDS in the region, that phage organism is considered the major potential phage for the region; the percentage of the region's total proteins that belong to that phage organism in this table is calculated and multiplied by 100; the percentage of the region's length accounted for by that phage organism in this table is calculated and multiplied by 50 (the encapsulation capacity of the phage head is taken into account).
- If any of the specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin') are present, the score will be increased by 10 for each keyword found.
- If the size of the region is greater than 30 Kb, the score will be increased by 10.
- If there are at least 40 proteins in the region, the score will be increased by 10.
- If all of the phage-related proteins and hypothetical proteins constitute more than 70% of the total number of proteins in the region, the score will be increased by 10.
If the region's total score is less than 70, it is marked as incomplete; if between 70 and 90, it is marked as questionable; if greater than 90, it is marked as intact.
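The additive bullets and the final thresholds above can be sketched as follows. This is only an illustration under stated assumptions: keywords are detected by plain substring matching on protein names, 30 Kb is taken as 30,000 bp, and the percentage-based bullets (the 100% and 50% rules) are not modelled. All function names are hypothetical.

```python
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber",
    "coat", "transposase", "portal", "terminase", "protease", "lysin",
)


def keyword_hits(protein_names):
    """Distinct phage keywords found in the protein names (substring match, assumed)."""
    text = " ".join(protein_names).lower()
    return {kw for kw in PHAGE_KEYWORDS if kw in text}


def additive_score(protein_names, region_bp, n_proteins, n_phage_or_hypothetical):
    """Sum of the +10 bonuses from the keyword, size, and protein-count bullets."""
    score = 10 * len(keyword_hits(protein_names))
    if region_bp > 30_000:          # region larger than 30 Kb (assumed 30,000 bp)
        score += 10
    if n_proteins >= 40:            # at least 40 proteins in the region
        score += 10
    if n_proteins and n_phage_or_hypothetical / n_proteins > 0.70:
        score += 10                 # phage + hypothetical proteins exceed 70%
    return score


def completeness(score):
    """Map a total score to the completeness label using the stated thresholds."""
    if score < 70:
        return "incomplete"
    if score <= 90:
        return "questionable"
    return "intact"


# Protein names read off the Region 1 hit table (illustrative input).
names = [
    "phage integrase",
    "group 2 RNA polymerase sigma factor",
    "DNA primase",
    "gp509",
    "30S ribosomal protein S21",
    "FAD-dependent oxidoreductase",
    "glycation-binding protein, predicted protease/chaperone",
]
total = additive_score(names, region_bp=27383, n_proteins=7, n_phage_or_hypothetical=6)
print(total, completeness(total))  # 30 incomplete
```

With these inputs (two keywords, a region under 30 Kb, 7 proteins, 6 of them phage hits), the sketch gives 30, which agrees with the score reported for Region 1. Whether the tool counts its inputs exactly this way is an assumption.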
Download details as .txt file: detail.txt
Hits against Bacterial Database or GenBank File
Region 1, total 10 CDS
# | CDS Position | BLAST Hit | E-Value |
---|---|---|---|
1 | 95942..95954 | attL | 0.0 |
2 | 104668..104868 | PHAGE_Psychr_Psymv2_NC_023734: phage integrase; SAMEA2065998_00926; phage(gi593779738) | 1.38e-12 |
3 | complement(104894..104969) | tRNA | 0.0 |
4 | complement(105093..107135) | PHAGE_Synech_S_CBS3_NC_015465: group 2 RNA polymerase sigma factor; SAMEA2065998_00928; phage(gi331028036) | 4.68e-37 |
5 | complement(107579..109453) | PHAGE_Helico_phiHP33_NC_016568: DNA primase; SAMEA2065998_00929; phage(gi371671350) | 7.44e-55 |
6 | complement(109548..109994) | PHAGE_Bacill_G_NC_023719: gp509; SAMEA2065998_00930; phage(gi593777964) | 6.44e-17 |
7 | complement(110161..110373) | PHAGE_Pelagi_HTVC008M_NC_020484: 30S ribosomal protein S21; SAMEA2065998_00931; phage(gi460042545) | 1.95e-05 |
8 | complement(110453..111676) | FAD-dependent oxidoreductase; SAMEA2065998_00932 | 0.0 |
9 | 111795..112856 | PROPHAGE_Escher_MG1655: glycation-binding protein, predicted protease/chaperone; essential for genome maintenance; SAMEA2065998_00933; phage(gi16130960) | 9.22e-133 |
10 | 123324..123336 | attR | 0.0 |
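The CDS Position column uses two forms: `start..end` for the forward strand and `complement(start..end)` for the reverse strand. A minimal parsing sketch, assuming inclusive 1-based coordinates and only these two forms; the function names are hypothetical and not part of the tool.

```python
import re

# Matches either "complement(start..end)" or plain "start..end".
_POSITION = re.compile(r"^(?:complement\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))$")


def parse_cds_position(text):
    """Return (start, end, strand), with strand '+' (forward) or '-' (complement)."""
    match = _POSITION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised CDS position: {text!r}")
    if match.group(1) is not None:
        return int(match.group(1)), int(match.group(2)), "-"
    return int(match.group(3)), int(match.group(4)), "+"


def cds_length(text):
    """Length in bp of a CDS, assuming inclusive coordinates."""
    start, end, _strand = parse_cds_position(text)
    return end - start + 1


print(parse_cds_position("complement(104894..104969)"))  # (104894, 104969, '-')
print(cds_length("104668..104868"))                      # 201
```

For row 2, 201 bp corresponds to 67 codons, which is consistent with the 66-residue protein listed for that CDS plus a stop codon.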
>95942..95954
CAACGCACGCGCC
>104668..104868 Burkholderia pseudomallei
MTHCYATAMLMAGMTPAFCAKQLGHSVEMFLTTYSKWIDGDQNDLEMALERAISSPGPPQMKAKST
>complement(104894..104969)
GGGCCTGTAGCTCAGGGGTTAGAGCAGTCGACTCATAATCGATTGGTCGCGGGTTCGAAACCCGCCGGGCCCACCA
>complement(105093..107135) Burkholderia pseudomallei
MTKKLNEVSVDDDANQSDDQPTAASGKGEKAKARDRRAKEKALLKEAFATSTPGTAEELEERRTKLRALIKLGKERGFLTYAEINDHLPDNFTETEALEGIIGTFNDMGVAVYEQAPDAETLLLNDNAPAASSDDEVEEEAEVALSTVDSEFGRTTDPVRMYMREMGTVELLTREGEIEIAKRIEDGLRHMVMAISACPTTIADILAMAERVANEESRVDELVDGLIDPNAADTDGFSEKDAEEIENEDAEAEEDDEEEEEDDDGAAQATANAAQLEALKRASLEKFALISEWFDKMRRAFEKEGYKSKSYLKAQETIQNELMSIRFTARTVERLCDTLRAQVDEVRQVERQILHIVVDKCGMPRSEFIARFPGSETDLDWAEKVAAEGHSYGAILSRNIPAIREQQQRLLDLQARVVLPLKDLKETNRQMAAGELKARQAKREMTEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGLEPDPATLAEKMEMPEDKIRKIMKIAKEPISMETPIGDDDDSHLGDFIEDTNTVAPADAALHASMRDVVKDVLDSLTPREAKVLRMRFGIEMSTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDKLKSFLEGN
>complement(107579..109453) Burkholderia pseudomallei
MIPHSFLQDLLNRVDIVDVVGRYVQLKKGGANFMGLCPFHNEKSPSFTVSPTKQFYHCFGCGAHGTAIGFLMEHAGLTFPEAVQELAQSVGLTVPHEPSPMRGGGGSSDGHAPPAASKSVTTALSDVMQTACDYYRKQLRGAPNAIQYLKRRGLTGEIALRFGLGYAPDGWQNLEAAFDDYRHDSLVDAGLVIVSEKTDANGVARRYDRFRERIMFPIRNVKGQVIGFGGRVLDGGEPKYLNSPETPLFNKGSELYGLFEARLAIREHKYALVVEGYMDVVALAQLGFPNAVATLGTACTPIHVQKLLRQTDTVVFSFDGDAAGRRAARRALEACLPHAADNRTIRFLFLPPEHDPDSYVREFGTDAFSEQVKRAMPLSQFLLNEATAGKELDQPEGRAKALFDAKPLLQALPANALRAQIMHMFADRLQIPFDEVTALAEVDARIAAPARQAPARNERRRVTESEKRALRNLVMHPRIAAAIDDESVATLRALPRIGELFDEVIGHARALGDGAEFRLLSDVLRNSANAPTYDEIFREILVYDENVRDLMQQNPESDAAQEGLRERERIAGEELQAAVLKMRYDACCDRLNSLSKRSAPSSEELAEIKDLHQNLADMKRRLGL
>complement(109548..109994) Burkholderia pseudomallei
MSLRDQISEDMKAAMRAKESERLATIRLLLAAIKQREVDERVTLDDAGVTAVVDKMIKQRKDSISQFEAAGRTDLVEKEQAEVAVLSAYLPAQLSDAEVAAEVQAAVAQTGAAGPQDMGKVMGVLKGKLAGRADMTAVSALVKAALSK
>complement(110161..110373) Burkholderia pseudomallei
MTIIRVKENEPFEVAMRRFKRTIEKNGLLTELRAREFYEKPTAERKRKKAAAVKRHYKRIRSQMLPKKLY
>complement(110453..111676) Burkholderia pseudomallei
MEKFDIAVIGAGAAGMMTAAVAGQLGRRVVLLDHSPRLAEKIRISGGGRCNFTNLYAGPDNYLSANPHFCRSALARYTPRDFLGLLKNHRVTWHEKHKGQLFCDHSSDAIIDVLKSECDAGGVAWRRPVTVDAVRHAQADGFALDTSAGAIGAQALVVATGGLSIPKIGATDFAYRIAKQFGHKLVDTRPALVPLTFSADAWAPFAALSGVSLEARVSTGAKRGGGEFVEDILLTHRGLSGPGILQISSYWRSGEPIHVDLAPQIDAANELIRAKAGSRRQIGTLLAEWVPARLAHAWLEAERVHADARLADLPDKTLRRIGETLSRWTLTPSGTEGYRKAEVTKGGVDTRELSSATMMSERVPGLFFVGEALDVTGWLGGYNFQWAWASGVAAGRAAADGTRRLTG
>111795..112856 Burkholderia pseudomallei
MRPPHTIMLVLGIESSCDETGLALYDTERGLLAHALHSQIAMHREYGGVVPELASRDHIRRALPLLEEVLAASGARRDDIDAIAFTQGPGLAGALLVGASIANALAFAWDKPTIGIHHLEGHLLSPLLVAEPPPFPFVALLVSGGHTQLMRVSDVGVYETLGETLDDAAGEAFDKTAKLLGLGYPGGPEVSRLAEAGTPGAVVLPRPMLHSGDLDFSFSGLKTAVLTQMKKLEAAHAGGAVLERAKADFARGFVDAAVDVLVAKSLAALKATRLKRLVVAGGVGANRQLRAALSAAAQKRGFDVHYPDLALCTDNGAMIALAGALRLARWPSQASRDYAFTVKPRWDLASLAR
>123324..123336
CAACGCACGCGCC
ORF Start: 95942
ORF Stop: 95954
Strand: Forward
Nucleotide Sequence: CAACGCACGCGCC
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attL
Homolog/Ortholog E-Value: N/A
ORF Start: 104668
ORF Stop: 104868
Strand: Forward
Protein Sequence: MTHCYATAMLMAGMTPAFCAKQLGHSVEMFLTTYSKWIDGDQNDLEMALERAISSPGPPQMKAKST
Homolog/Ortholog Species: Integrase
Homolog/Ortholog Protein: SAMEA2065998_00926, phage integrase, phage(gi593779738), PHAGE_Psychr_Psymv2_NC_023734
Homolog/Ortholog E-Value: 1.38e-12
ORF Start: 104894
ORF Stop: 104969
Strand: Backward
Protein Sequence: GPVAQGLEQSTHNRLVAGSKPAGPTA
Homolog/Ortholog Species: tRNA
Homolog/Ortholog Protein: tRNA
Homolog/Ortholog E-Value: N/A
ORF Start: 105093
ORF Stop: 107135
Strand: Backward
Protein Sequence: MTKKLNEVSVDDDANQSDDQPTAASGKGEKAKARDRRAKEKALLKEAFATSTPGTAEELEERRTKLRALIKLGKERGFLTYAEINDHLPDNFTETEALEGIIGTFNDMGVAVYEQAPDAETLLLNDNAPAASSDDEVEEEAEVALSTVDSEFGRTTDPVRMYMREMGTVELLTREGEIEIAKRIEDGLRHMVMAISACPTTIADILAMAERVANEESRVDELVDGLIDPNAADTDGFSEKDAEEIENEDAEAEEDDEEEEEDDDGAAQATANAAQLEALKRASLEKFALISEWFDKMRRAFEKEGYKSKSYLKAQETIQNELMSIRFTARTVERLCDTLRAQVDEVRQVERQILHIVVDKCGMPRSEFIARFPGSETDLDWAEKVAAEGHSYGAILSRNIPAIREQQQRLLDLQARVVLPLKDLKETNRQMAAGELKARQAKREMTEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGLEPDPATLAEKMEMPEDKIRKIMKIAKEPISMETPIGDDDDSHLGDFIEDTNTVAPADAALHASMRDVVKDVLDSLTPREAKVLRMRFGIEMSTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDKLKSFLEGN
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: SAMEA2065998_00928, group 2 RNA polymerase sigma factor, phage(gi331028036), PHAGE_Synech_S_CBS3_NC_015465
Homolog/Ortholog E-Value: 4.68e-37
ORF Start: 107579
ORF Stop: 109453
Strand: Backward
Protein Sequence: MIPHSFLQDLLNRVDIVDVVGRYVQLKKGGANFMGLCPFHNEKSPSFTVSPTKQFYHCFGCGAHGTAIGFLMEHAGLTFPEAVQELAQSVGLTVPHEPSPMRGGGGSSDGHAPPAASKSVTTALSDVMQTACDYYRKQLRGAPNAIQYLKRRGLTGEIALRFGLGYAPDGWQNLEAAFDDYRHDSLVDAGLVIVSEKTDANGVARRYDRFRERIMFPIRNVKGQVIGFGGRVLDGGEPKYLNSPETPLFNKGSELYGLFEARLAIREHKYALVVEGYMDVVALAQLGFPNAVATLGTACTPIHVQKLLRQTDTVVFSFDGDAAGRRAARRALEACLPHAADNRTIRFLFLPPEHDPDSYVREFGTDAFSEQVKRAMPLSQFLLNEATAGKELDQPEGRAKALFDAKPLLQALPANALRAQIMHMFADRLQIPFDEVTALAEVDARIAAPARQAPARNERRRVTESEKRALRNLVMHPRIAAAIDDESVATLRALPRIGELFDEVIGHARALGDGAEFRLLSDVLRNSANAPTYDEIFREILVYDENVRDLMQQNPESDAAQEGLRERERIAGEELQAAVLKMRYDACCDRLNSLSKRSAPSSEELAEIKDLHQNLADMKRRLGL
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: SAMEA2065998_00929, DNA primase, phage(gi371671350), PHAGE_Helico_phiHP33_NC_016568
Homolog/Ortholog E-Value: 7.44e-55
ORF Start: 109548
ORF Stop: 109994
Strand: Backward
Protein Sequence: MSLRDQISEDMKAAMRAKESERLATIRLLLAAIKQREVDERVTLDDAGVTAVVDKMIKQRKDSISQFEAAGRTDLVEKEQAEVAVLSAYLPAQLSDAEVAAEVQAAVAQTGAAGPQDMGKVMGVLKGKLAGRADMTAVSALVKAALSK
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: SAMEA2065998_00930, gp509, phage(gi593777964), PHAGE_Bacill_G_NC_023719
Homolog/Ortholog E-Value: 6.44e-17
ORF Start: 110161
ORF Stop: 110373
Strand: Backward
Protein Sequence: MTIIRVKENEPFEVAMRRFKRTIEKNGLLTELRAREFYEKPTAERKRKKAAAVKRHYKRIRSQMLPKKLY
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: SAMEA2065998_00931, 30S ribosomal protein S21, phage(gi460042545), PHAGE_Pelagi_HTVC008M_NC_020484
Homolog/Ortholog E-Value: 1.95e-05
ORF Start: 110453
ORF Stop: 111676
Strand: Backward
Protein Sequence: MEKFDIAVIGAGAAGMMTAAVAGQLGRRVVLLDHSPRLAEKIRISGGGRCNFTNLYAGPDNYLSANPHFCRSALARYTPRDFLGLLKNHRVTWHEKHKGQLFCDHSSDAIIDVLKSECDAGGVAWRRPVTVDAVRHAQADGFALDTSAGAIGAQALVVATGGLSIPKIGATDFAYRIAKQFGHKLVDTRPALVPLTFSADAWAPFAALSGVSLEARVSTGAKRGGGEFVEDILLTHRGLSGPGILQISSYWRSGEPIHVDLAPQIDAANELIRAKAGSRRQIGTLLAEWVPARLAHAWLEAERVHADARLADLPDKTLRRIGETLSRWTLTPSGTEGYRKAEVTKGGVDTRELSSATMMSERVPGLFFVGEALDVTGWLGGYNFQWAWASGVAAGRAAADGTRRLTG
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: SAMEA2065998_00932, FAD-dependent oxidoreductase
Homolog/Ortholog E-Value: N/A
ORF Start: 111795
ORF Stop: 112856
Strand: Forward
Protein Sequence: MRPPHTIMLVLGIESSCDETGLALYDTERGLLAHALHSQIAMHREYGGVVPELASRDHIRRALPLLEEVLAASGARRDDIDAIAFTQGPGLAGALLVGASIANALAFAWDKPTIGIHHLEGHLLSPLLVAEPPPFPFVALLVSGGHTQLMRVSDVGVYETLGETLDDAAGEAFDKTAKLLGLGYPGGPEVSRLAEAGTPGAVVLPRPMLHSGDLDFSFSGLKTAVLTQMKKLEAAHAGGAVLERAKADFARGFVDAAVDVLVAKSLAALKATRLKRLVVAGGVGANRQLRAALSAAAQKRGFDVHYPDLALCTDNGAMIALAGALRLARWPSQASRDYAFTVKPRWDLASLAR
Homolog/Ortholog Species: Protease
Homolog/Ortholog Protein: SAMEA2065998_00933, glycation-binding protein, predicted protease/chaperone, essential for genome maintenance, phage(gi16130960), PROPHAGE_Escher_MG1655
Homolog/Ortholog E-Value: 9.22e-133
ORF Start: 123324
ORF Stop: 123336
Strand: Forward
Nucleotide Sequence: CAACGCACGCGCC
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attR
Homolog/Ortholog E-Value: N/A
Download data as .txt file: png_input