Submission Results
Sequence Name: Escherichia coli KTE44 acEod-supercont1.2.C11, whole genome shotgun
GenBank Accession Number: ANTA01000011.1
GenInfo (GI) Number: 430944337
Download Results: ANTA01000011.1.PHASTER.zip
gi|430944337|ref|ANTA01000011.1| Escherichia coli KTE44 acEod-supercont1.2.C11, whole genome .126399, gc%: 50.28%
Download summary as .txt file: summary.txt
Total: 1 prophage region has been identified, of which 0 are intact, 0 are incomplete, and 1 is questionable.
Region | Region Length | Completeness | Score | # Total Proteins | Region Position | Most Common Phage | GC % |
---|---|---|---|---|---|---|---|
1 | 44.3 Kb | questionable | 80 | 36 | 58370-102703 | PHAGE_Entero_mEp460_NC_019716(15) | 48.69% |
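
The reported region length can be cross-checked against the region position. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, not something the results page states); the helper name `region_length` is hypothetical and used only for illustration.

```python
def region_length(position: str) -> int:
    """Return the length in bp of a 'start-end' range.

    Assumes 1-based, inclusive coordinates (an assumption about the
    report's convention, not confirmed by the page itself).
    """
    start_text, end_text = position.split("-")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError(f"end before start in {position!r}")
    return end - start + 1


# Region 1 coordinates as listed in the table.
length_bp = region_length("58370-102703")
length_kb = round(length_bp / 1000, 1)
print(f"{length_bp} bp, about {length_kb} Kb")
```

Under that assumption the range spans 44,334 bp, which rounds to the 44.3 Kb shown in the Region Length column.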
>1 58370-102703 TTGCGAGATATGTTTGAGAATACCACTTTATCCCGCGTCAGGGAGAGGCAGTGCGTAAAAAGACGCGGAC TCATGTGAAATACTGGTTTTTAGTGCGCCAGATCTCTATAATCTCGCGCAACCTATTTTCCCCTCGAACA CTTTTTAAGCCGTAGATAAACAGGCTGGGACACTTCACATGAGCGAAAAATACATCGTCACCTGGGACAT GTTGCAGATCCATGCACGTAAACTCGCAAGCCGACTGATGCCTTCTGAACAATGGAAAGGCATTATTGCC GTAAGCCGTGGCGGTCTGGTACCGGGTGCGTTACTGGCGCGTGAACTGGGTATTCGTCATGTCGATACCG TTTGTATTTCCAGCTACGATCACGACAACCAGCGCGAGCTTAAAGTGCTGAAACGCGCAGAAGGCGATGG CGAAGGCTTCATCGTTATTGATGACCTGGTGGATACCGGTGGTACTGCGGTTGCGATTCGTGAAATGTAT CCAAAAGCGCACTTTGTCACCATCTTCGCAAAACCGGCTGGTCGTCCGCTGGTTGATGACTATGTTGTTG ATATCCCGCAAGATACCTGGATTGAACAGCCGTGGGATATGGGCGTCGTATTCGTCCCGCCAATCTCCGG TCGCTAATCTTTTCAACGCCTGGCACTGCCGGGCGTTGTTCTTTTTAACTTCAGGCGGGTTACAATAGTT TCCAGTAAGTATTCTGGAGGCTGCATCCATGACACAGGCAAACCTGAGCGAAACCCTGTTCAAACCCCGC TTTAAACATCCTGAAACCTCGACGCTAGTCCGCCGCTTTAATCACGGCGCACAACCGCCTGTGCAGTCGG CCCTTGATGGTAAAACCATCCCTCACTGGTATCGCATGATTAACCGTCTGATGTGGATCTGGCGCGGCAT TGACCCACGCGAAATCCTCGACGTCCAGGCACGTATTGTGATGAGCGATGCCGAACGTACCGACGATGAT TTATACGATACGGTGATTGGCTACCGTGGCGGCAACTGGATTTATGAGTGGGCCACCCAGGCGATGGTGT GGCAACAAAAAGCCTGTGCGGAAGACGATCCGCAACTCAGTGGTCGTCACTGGCTGCATGCGGCTACGTT GTACAACATTGCCGCCTATCCTCATCTGAAAGGAGATGACCTGGCCGAGCAAGCGCAGGCTTTGTCAAAC CGCGCCTATGAAGAGGCCGCTCAGCGTCTACCGGGCACGATGCGGCAGATGGAGTTTACCGTACCCGGCG GTGCGCCCATCACCGGCTTTTTGCATATGCCGAAAGGCGATGGCCCGTTCCCGACGGTATTAATGTGTGG TGGTCTGGATGCGATGCAGACGGACTATTACAGCCTGTATGAACGTTATTTTGCGCCGCGCGGCATTGCG ATGCTGACTATTGATATGCCGTCGGTGGGCTTTTCTTCAAAATGGAAGCTCACCCAGGACTCCAGCCTGT TGCATCAGCACGTCTTAAAGGCGCTGCCTAACGTACCGTGGGTGGATCACACTCGCGTCGCGGCCTTTGG TTTCCGTTTCGGCGCTAACGTTGCCGTGCGTCTGGCATACCTTGAATCGCCGCGTCTGAAAGCGGTTGCC TGTCTTGGTCCGGTAGTTCATACCCTGTTGAGTGATTTTAAGTGCCAGCAACAGGTGCCGGAAATGTATC TTGACGTTCTGGCCAGTCGTTTGGGGATGCATGATGCTTCCGATGAAGCGTTGCGCGTGGAGCTGAATCG CTATTCATTAAAAGTGCAAGGATTGCTGGGACGTCGCTGCCCAACGCCAATGTTATCAGGCTACTGGAAG AACGATCCGTTCAGCCCGGAAGAGGACTCACGCTTAATCACCTCATCATCTGCTGACGGTAAATTATTAG 
AGATCCCATTTAACCCGGTGTATCGGAATTTTGACAAAGGTCTTCAGGAAATCACCGACTGGATCGAAAA ACGCTTGTGTTAAAAATTTGCTAAATTTTGCCAATTTGGTAAAACAGTTGCATCACAACAGGAGATAGCA ATGACGTTACCGAGTGGACACCCGAAGAGCAGATTGATCAAAAAATTTACCGCACTAGGCCCGTATATTC GTGAAGGTAAGTGCGAAGATAATCGATTCTTTTTCGATTGTCTGGCTGTATGCGTCAACGTGAAACCGGC ACCGGAAGTGCGTGAATTCTGGGGCTGGTGGATGGAGCTGGAAGCGCAGGAATCCCGTTTTACCTACAGT TACCAGTTTGGTCTGTTCGATAAAGCAGGCGACTGGAAGAGTGTTCCGGTAAAAGACACTGAAGTGGTTG AACGACTGGAGCACACCCTGCGTGAGTTTCACGAGAAGCTGCGTGAACTGCTGACGACGCTGAATCTGAA GCTGGAACCGGCGGATGATTTTCGTGACGAGCCGGTGAAGTTAACGGCGTGAGTGAAATGTGCCGGATGC ATCACATCCGGCAATATTCATTAAAACTGATACGTCATGCCAACCGCGACAATATCATCATTATTAATAT TCAATTTGTTATCGCTATCCAGTTGGTTGATTTTATAATCAACAAACGCTGACATATTTTTGTTGAAATA ATACGTAGCACCGACGTCGATATAATTGACCAGATCTTCATCACCGATACCTTCAATATCTTTCCCTTTC GATAAGACATAACCCAGCGATGGACGCAGACCAAAGTCAAACTGGTATTGAGCGACCGCTTCAAAGTTCT GTGTCTTATTGGCAAAGCCGCCAGTTATTGGCGTCATTTTGCGTGTTTCAGAATAGAAAGTTGCCAGATA AATATTATTGGCATCGTATTTCAGACCTGTTGCCCATGCTTCTGCACGCTTGCCTGTGCCACGGCTTTGC AGGTTCTGCTCGTTGGTGCGATCTGAGTTGGTATAGGCCCCACTAATGGCGAAATCGCTGCCGCCAAAGT CATATGTCAATGACGTGCCGAAGCCATCGCCGTTTTGCTTTTTAACGTCGCGGTTTTCGTTTTTCCCTTG ATATTGCAGGGTTAAGTTCAGGCCATCGATAACGCCGAAGAAGTCGGTGTTCCGATACGTCGCCAGACCG CTGGCGCGTTTGGTCATAAAGTTGTCGGTCTGCGCCGAGGAATCGCCACCAAATTCCGGGAACATATCGG TCCAGGCTTCCACGTCATACAACGCCCCCAGGTTACGACCATAATCGAAAGAACCCAAATCTTTATATTT CAACCCGGCAAAAGCGAGACGCGTTTTTTGCTGTGCAGTATCACTCTCTGCTTTATTACCGGCAAACTCT GCTTCCCAACGACCATAACCAGTCAGTTGATCGTTAATTTGTGTTTCGCCTTTAAAACCAAAACGGATAT AACTCTGGTCGCCATCTTTACTGGCGTTATCACTCATATAATGCATGGCTTTAACTTTGCCATAGACATC CAGTTTATTACCGTCTTTATTATATATTTCTGCAGCCTGTACAGATGCAGATGCCACAATGCCCATCACC ACTAATGCCAGAGTGCTCTTTTTCATTTTCATTCCTGATTTTAATTAACGCGCGAATATTCAGCGGGAGA GTCCCGTTGAAAACAGGAAAGTTTTTAACCTGAGATTGTTAAAGATATATTACAGATTAATAATATTCTT AAAATGTGGTAATTTATTAAATCTGTAATAAAAGCGTAAACAACTGCCGCTAGGCTTGCTGATCCCGCGC AACAAAACGCCATGCTTTGCTCGCAGATGGTTGGCAACCGACGACAGTCCTGCTAAAACGTTCGTTTGAT ATCATTTTTCCTAAAATTGAATGGCAGAGAATCATGAGTGACAGCCAGACGCTGGTGGTAAAACTCGGCA 
CCAGTGTGCTAACAGGCGGATCGCGCCGTCTGAACCGTGCCCATATCGTTGAACTTGTTCGCCAGTGCGC GCAGTTACATGCCGCCGGGCATCGGATTGTTATTGTGACGTCGGGCGCGATCGCCGCCGGACGTGAGCAC CTGGGTTACCCGGAACTGCCAGCGACCATCGCCTCGAAACAACTGCTGGCGGCGGTAGGGCAGAGTCGAC TGATTCAACTGTGGGAACAGCTGTTTTCGATTTATGGCATTCACGTCGGGCAAATGCTGCTGACCCGTGC TGATATGGAAGACCGTGAACGCTTCCTGAACGCCCGCGACACCCTGCGAGCGTTGCTCGATAACAATATC GTTCCGGTAATCAATGAGAACGATGCTGTCGCTACGGCAGAGATTAAGGTCGGCGATAACGATAACCTTT CTGCGCTGGCGGCGATTCTTGCGGGTGCCGATAAACTGTTGCTGCTGACCGATCAAAAAGGTTTGTATAC CGCTGACCCGCGCAGCAATCCGCAGGCAGAACTGATTAAAGATGTTTACGGCATTGATGACGCACTGCGC GCGATTGCCGGTGACAGCGTTTCAGGCCTCGGAACTGGCGGCATGAGTACCAAATTGCAGGCCGCTGACG TGGCTTGCCGTGCGGGTATCGACACCATTATTGCCGCGGGCAGCAAGCCGGGCGTTATTGGTGATGTGAT GGAAGGCATTTCCGTCGGTACGCTGTTCCATGCCCAGGCGACTCCGCTTGAAAACCGTAAACGCTGGATT TTCGGTGCGCCGCCGGCGGGTGAAATCACGGTAGATGAAGGGGCAACTGCCGCCATTCTGGAACGCGGCA GCTCCCTGTTGCCGAAAGGCATTAAAAGCGTGACTGGCAATTTCTCGCGTGGTGAAGTCATCCGCATTTG CAACCTCGAAGGCCGCGATATCGCCCACGGCGTCAGTCGTTACAACAGCGATGCATTACGCCGTATTGCC GGACACCACTCGCAAGAAATTGATGCAATACTGGGATATGAATACGGCCCGGTTGCCGTTCACCGTGATG ACATGATTACCCGTTAAGGAGCAGGCTGATGCTGGAACAAATGGGCATTGCCGCGAAGCAAGCCTCGTAT AAATTAGCGCAACTCTCCAGCCGCGAAAAAAATCGCGTGCTGGAAAAAATCGCCGATGAACTGGAAGCAC AAAGCGAAATCATCCTCAACGCTAACGCCCAGGATGTTGCTGACGCGCGAGCCAATGGCCTTAGCGAAGC GATGCTTGACCGTCTGGCACTGACGCCCGCACGGCTGAAAGGCATTGCCGACGATGTACGTCAGGTGTGC AACCTCGCCGATCCGGTGGGGCAGGTAATCGATGGCGGCGTACTGGACAGCGGCCTGCGTCTTGAGCGTC GTCGCGTACCGCTGGGGGTTATTGGCGTGATTTATGAAGCGCGCCCGAACGTGACGGTTGATGTCGCTTC GCTGTGCCTGAAAACCGGTAATGCGGTGATCCTGCGCGGTGGCAAAGAAACGTGTCGCACTAACGCTGCA ACGGTGGCGGTGATTCAGGACGCCCTGAAATCCTGCGGCTTACCGGCGGGTGCCGTGCAGGCGATTGATA ATCCTGACCGTGCGCTGGTCAGTGAAATGCTGCGTATGGATAAATACATCGACATGCTGATCCCGCGTGG TGGCGCTGGTTTGCATAAACTGTGCCGTGAACAGTCGACAATCCCGGTGATCACAGGTGGTATAGGCGTA TGCCATATTTACGTTGATGAAAGTGTAGAGATCGCTGAAGCATTAAAAGTGATCGTCAACGCGAAAACTC AGCGTCCGAGCACATGTAATACGGTTGAAACGTTGCTGGTGAATAAAAACATCGCCGATAGCTTCCTGCC CGCATTAAGCAAACAAATGGCGGAAAGCGGCGTGACATTACACGCAGATGCAGCTGCACTGGCGCAGTTG 
CAGGCAGGCCCTGCGAAGGTGGTTGCTGTTAAAGCCGAAGAGTATGACGATGAGTTTCTGTCATTAGATT TGAACGTCAAAATCGTCAGCGATCTTGACGATGCCATCGCCCATATTCGTGAACACGGCACACAACACTC CGATGCGATCCTGACCCGCGATATGCGCAACGCCCAGCGTTTTGTTAACGAAGTGGATTCGTCCGCTGTT TACGTTAACGCCTCTACGCGTTTTACCGACGGCGGCCAGTTTGGTCTGGGTGCGGAAGTGGCGGTAAGCA CACAAAAACTCCACGCGCGTGGCCCAATGGGGCTGGAAGCACTGACCACTTACAAGTGGATCGGCATTGG TGATTACACCATTCGTGCGTAAATAAAACCGGGTGATGCAAAAGTAGCCATTTGATTCACAAGGCCATTG ACGCATCGCCCGGTTAGTTTTAACCTTGTCCACCGTGATTCACGTTCGTGAACATGTCCTTTCAGGGCCG ATATAGCTCAGTTGGTAGAGCAGCGCATTCGTAATGCGAAGGTCGTAGGTTCGACTCCTATTATCGGCAC CATTCAAACATCTCCCCAAGTCTACTAAAGTCTTTCAAAACCCCTTATAATCCGCGTATTAAAGCCCCGT TCGTCTTTTGACGTCTACTAAAGTCCCTCAAAATCTACGGTCAGATGGGGGTACTTATGGGGGTATTTGC TGTTCGGTTTAGTGGAGGTACCCCCAAGTGAAACTCAATGCCCGTCAAATAGACACAGCCAAGCCAAAAG AGAAGGCTTACAAGCTGGCCGATGGTGGCGGTTTGTATCTCTTGGTAAAACCTAATGGAGGTAAATACTG GCGACTTAAATATCGTGTAGCTGGTAAAGAGAAGCTATTGGCACTAGGTGTTTATCCTGAGGTTACTCTA GCCGATGCTCGGGCAAAACGTGAAGATGCGAAAAGAGGTATCGCTGGTGGTATCGATCCGATGGAAGCGA AACGAGAGGAAAAGATTGCCCGGGAAACGCAGTTAAACAACACCTTCAAAGATATTGCCCTTGAGTGGCA CAGCAGCAAATTAAAAAAATGGTCTGCTGGTTATGCTTCAGACATCCTCGAAGCCTTCAACAAAGATGTG TTCCCTTACATTGGCAAAAAACCCATCGCCGAAATCAAACCACTTGAACTTCTGAATGTGCTGCGTCGCA TCGAGGGGCGCGGTGCTACAGAAAAAGCTAAAAAAGTGAGGCAACGGTGCGGGGAAGTTTTCCGCTATGC AATAGTCACTGGTCGCGCAGAGTATAACCCTGCACCAGACCTTACTAGCGCGATGCAAGGCCATGAATCT AATCATTACCCTTTCCTTACAGCCAAAGAATTACCTGATTTTTTCAAGGCATTGTCCAGTTACTCAGGAA GCGCATTGGTTGTTATGGCGGCTCGTCTACTGATTATCACCGGCTTGCGGACTGGCGAACTACGTGGCGC ATTATGGGATGAAATTGATTTCAACAAGGCTATCTGGGAGATACCCGCTTCACGTATGAAAATGCGGCGG CCTCATATTGTGCCATTGTCTGAGCAGGCTCTTTCGCTTATTGGGAAGATTAGAGAAATAACAGGCAATT ACCCTCTTATGTTTCCCGGGCGCAATGATCCAAGGAAAACAATGAGCGAGGCCAGCATAAACCAAGTGTT TAAGCGCATTGGCTACGCTGGACGTGTAACTGGTCATGGGTTCCGGCACACTATGAGTACGATTTTGCAT GAGCAGGGCTATAACACCGCGTGGATAGAAACGCAGCTCGCTCACGTTGATAAGAACTCAATTCGTGGCA CATACAACCATGCGCAATATCTGGATGGAAGGCGGGAGATGCTTCAATGGTATGCCGACTATATGGATTC TCTCGAGCATGACGGAAATGTGGTGCATATGGTGTTTGAAAAACACGCATGAAACAACTGGACAAGTATA 
CAGAGGTTTACTAAAGTATAGGGGAGTAGATTGATAAAAACTCACAGAGGATTGCTATTTAACAGTTAGC CCTGGAGAAGGGCAATCGCTCTTTTACAAACGAAGGTGTATTGAGCCTTCTACGTCGCTGACAAAGCGGA ATGTTCTTTAACAGGCTTTGGCGCTGACAAAGCGCAATTGTTCTTTAAAACTACTGGATGTCACGCCTAA TAGCGTGATCCTGCCGAAATGCAGGGTAGCTCTACCCTTGTGGAGCAAACAAACCAAAACAATCTGACTA TCTTTTATAAAAGACCGCAATATGTTTTTCATATTGTGGTCTTTGGCGTATTCGACATTTGACGCAAATA AAACTCTCGCCATTCGTTGCAGGAGGGGGTTATTGCGTCATTAGTACAGGATATTAATAAATGGCTAATT ACAATTCATTAATTCGTCTCTCTGAAGTTCAAAGGCGAACTGGTTATAGCAAAGCATGGATTTATCGTCT TATTAGTCAGGGGAGGTTCCCTAAACAGGTAAAAATTGGAAGTCGGGCAATTGCTTTTGTTGAGTCTGAA ATAGATGAATGGATTGAGAAGTGTATATTAGAATCTAGAGACGAGGTGGCCTGATGGAAAAGAAAAACCG CCCATTACAGGCGGCTAATTCAGATATTCGTGTATCTGATGTTACGCCCCTTACAAAATCCCCTCAAGCA CCAAAGCGCACACCGAAAAAGCATCGTGCCAGAGTCTATATGCTGCGTACTGGTATAGAGGGATGGACAG AAAATGACATTCTTCGCTACTGCCGTCTGTCTTCTGGTCGTAACTATGCAACAGAGTTAGAACGCCAGCT TGGAATCACTCTGGAGCGTATCGACGAAAAGAATCCTGATGGTATCGGAACACACCTTCGCTACCGTTTC TCCTGCCGTGGTGATGTTCTGAAAGTGATCACTCATATTAACCATCTTGCGAACATAAATGATCACAACG GACTTTCTCAGCAGGAAATTGCCGACATTCTGAAACTCTACCCGGACGCGTTTAACGCCGCCTAACGGAG ACTGAAAATGAACATCGAAAAAAGCAGATTAATTTCTGAGGCCGCCCCTCATCTGAACGCCTCTCTGGGC ACAATTAACGGTAATGAATTTGCCGCAATTGTCCCGGTTATTCCTGGTCATATCGGTGGGCGTGAAACCA ATATTGTTAGCGCAAAAGCGTTACACAAAGCGTTGGGCGTGGGAAAAGACTTCTCTACATGGATCACTGA TCGCATCTCTGAATATGACTTCACCATTGGGCACGATTACTCAGTCCATAAAACTATTTCCCCAAATTTG GGGAAAATCCCGAATGGCGCGGCTTACAGCAAGATTAAGCAGTCTGGCAGACCCGGCAAAGACTATCTGT TAAGTGTCGGAATGGCGAAAGAACTGGCAATGATCGAACGCAATGATCAGGGGCGCGCTATCCGCCGTTA TTTCATCCAGTGCGAGGAAGAATTACAGCGTAGCGTGCCTGAAATCGCCGCCCGCTATCGTCGCCAGCTA AAAGCCCGTCTCAGTGCCGCAAACAACTTTAAGCCAATGTGCGATGCGCTGAATATGGCCCGTGCCGAGA TGGGTAAAACGACGCAGCAACACCACTACACAAACGAGAGCAATATGATTTCTCGTATCGTTCTTGGTGG GCTAACTGCTAAACAGTGGGCGCGGATAAATGGCTATTCTGGCGAACCTCGCGACCATATGAACGCAGAA CAACTTGAGCACCTCTCATATCTCGAAAGCACCAATATCACGTTAATTGATATGGGCATGGAATATGAGC AGCGCAAAGGAGAACTCACCCGACTGTCGCAACGCTGGCTCGCCAAGCGTCTGGAGGCGGTCAATGTTTA AGCCGACAGGAACACCACAACCTCAAAAACGCTACAAAGATGCCCACGGAGCACTCGTTACTGTCGAAAG 
CGTGTCTCACAACCGAGTGACGTTTTATCGCGACGGGTATCAATCGCCATGCGTACAACCGCTGGCGCGT TTCATGAAGGAGTTCGCGGAGGTTAACAAATGCTAACCGTCCAGAAGAAGACATTTTCACTGGCTGGTAT GTCGCCAAAATCCAGCAATATGACAGCAAAGTCAGGCATTAATACAGCAGATACAAGCAAAGTTTATCAT TTGCTGGTGGTAGGAGCGGATGCCTTAACTATGTCAGAAATTACGGTCGATGGTGTTAGCGTTGAGAAGG TCAGCGGCTGCGCCAGAGAATTTCTGGTCGTAGATCACCTGTTCTGCTCGTGTAGTGACTCCACAAAAAC ATTCGTCCACGTGCGTGACGTTAACGAAATGAGCGCGATGTATTGTGCATCTGGTGCTTCCAATAGTGAG TTTTCGGAGTCCATAAAAAAGAGCTTGCCGTTATGCGGCAACACGGTTTATGGTTATAAGGCACCTCATA AAACGGGTGCCGGGATTAGGACCCCGCAGGCAATCGAGGCGATACATGACGCGCCAGCGTCTTTTTTTGT ATCGGCGCACACGCACACCTTATCAATGGTGGGCTGTATGGGGCTGACTTCGGTCAGGCTGGTTTCCTTG ATTGCCAGTAGTCCTAACCCTGTACAGTCCACCGCCAGCGAGCTTAGGACCTCCGGCGGTGGCTATAAAC CATCAATCAAGGAGGCTGCCACATGCTGGCTACTACCCCTACCCAAAAACCGCAATTTATCTGGATTATC GCCGCAGTTCGCCGCGATTGCCCGACAATTACCGCCAAAATTCATCATATTGCTGCCGAGTCTGAACGCG ATGCTCGCCGTTCTCTGGTGCGCGATCACGTCTGCTTTTTTGCTGGTCGTATCCGCATGGAGGTGGCACA TGATTAAAACCTACGATGTGCATATGGATCCCCTCGAACGCACAAGCCAGATCATCACACTGACAGAAGT GATTAACGACATTCTGGTGAGCAACTCTCCCTCACGAGACGAAAGGCTGAAGGCGTTACTCGCGATATTG GATCTCGCCGTTCGTGACGTTCATTTCCTGCTGGAAGGTGGACAAATGCCAGTAAAAACGGGGGCAACCA ATGAATAACTCAATTAATACCCCTCGCCTTACGTCCGCACTTCAATTAATCGAGCAAGCAGCGGCTGTCC TGGTTGCTGTCAGTCTTTCGGCTGAAGAAATGGACGCTGCTGATGTCGTGGATGCGATTAAAGCGTGCTC ATCTTTGGTTAACGATGCCCGTGCCGAGCTGGTAATTCTTGGGGGTGAAAAATGAATATCAACTTAATTT ATCGTCATCCGTGTGAGCTGGAAATTGAATCATTGCTGGGGCGCGAAGAGCCATATCCAGACACATTCAC TCCCGCAGATTGCGCGACTGAACGGCTTACCAGAGCGCGCACAGGTCTGGTTCATGTGATGAATGAGATT GTTCCCTCGGTGGGAGGGGAACAGGCGACAGTAATCAATAGCTGGCTACAAAAAGTTACCTCCCTGATAG ATATCGGTTTAATCGATGTGGAGAGTGCGAAATGACCAACATCCAGCTCATTGAAGCGCAATGTCGCATC GAACAGGTTCAGACTGTTTTAGGGTTCTGGCTTGAAGGGGCCAGCCCCAGCAACAGAGACAAGTTAATGA TTGGCGCGGTTATGTCACTGCTCAATGGCGTACCAGAAGCTATTCAGGAAGCGGACGAATTGTTGGGCAA ATATGAGTTACAGAATCATTCAGGCGAGGCGAAACATGAATAATTTCTTAACTTTCCATGCAGAAGCAAC GCCTGACGGCGTAAACATCATGTACCGCAGCAACGATGGCATGACAGAACGCGTTGAGGCCGTCTCATAT ATTGATGCCGTAAATCGTCTGGATGCCGGGGATTATGACGATAAACCAGATGAAGGCATGTTTATACATC 
TCGCTATTGCCAGTGGCGGCAACCAGGGATATTTCGATTACACATCACAGCATCACGTGATTATGTGGCG CTGGCTGATAGCAACAGCATTCATCAATGAAATGAGAAAGGAAAACGGCACCGTCAGCATTATTGATGAC AGTGGCAATCATTCCGTGGTTTCTGTTTATTCCAATGGCATCGTCGCCATGCCGCTGTATCCAGTAGCAG AGCGCCTCGCTATGGCAAACAACATTGAGGGCGCAATGATCGAGAAATATGGTGTTGATGTCGGAACAAA GAATGCCATCATTTTTTACAGCAACATGTTCGATGTCGAACAGGGAACACTCACTTCGTTTGGGCGAGAA GTGCTTGCCGATCTTCACAACAGCTTTATTGCCGAACTAAACGAAAACGGCATCCCAGAAGCACCAGTGA CGCACTAAACGGGGGCCAGAATGCGAAACATTGATCTTATCCGTCAGGTTATCAGTGCGTCTGAAAACAA CTGGCCTCATGTGCTGGGCTGCCTGAACATAAATGTCCCTGACTCTCCGCGCCGTCATGCTCCCTGCCCT GCATGTGGGGGCAAAGATCGATTCCGGTTCGATGACAACGGGCGCGGTAGCTTCATCTGTAATCAGTGCG GCGCTGGTGATGGGCTGGATTTAATTAAACGCGTAAATAACTGCGACACAACAGAGGCGGCGCTTCTTGC CGCTGATGTTCTGGGTATTGATTACCGGACAACGGAAACACCAGAAGCCACCAGCCAGAAACGGGAACAA CTGGAAACCGAGCGCCAGCGACGCGAACAGGAGCGCCTGAAAAGGGCAGAGAAGGACGAACAACAAAGAC GGGATACGTTTTCCCGTCAGTTTGATGACATGCGCAGAAAGGCTGTAAACGGCAAATCTGATTATCTGGT TGCGAAAGGGGTAGGTGATTTTACATTCCCCGTGTTGCCCGATGGATCTCTGTTGCTGGCGCTGGTGGAT AAATCCGGCGCAGTCACAGCAGCACAGACTATTACCCCACATGGTGAAAAAAGACTCCTGACAGGTTCGG CAAAGCGGGGGGCATATCACGCCATAAACGCACAGAAACGACCTCACAGCATCATAATTGCTGAGGGGGT GGCTACCGCCCTGTCGTGCCATTTAATTCGCCCTGACGCAATGACAGTGGCAGCAATCGACGCTGGCAAC CTGTTGCCAGTAGCGGAAGTCATGCGCAGAACATATCCGCAGGCACAAATAATCATTGCCGCAGATAACG ATCACCAGCAAGGAGACTCAGAAAGTGGAGGGATCAACACGGGGAAAGATGCCGCAGAGAGGGCCGCTAT TTCCGTAGCTGGCAGGGTGTCTCTGCCACCGACTGACTATAAAGCCGACTGGAACGACTATCACCAACAA CACGGGCTGGCGGCAGCCACAGCAGCATTTAAAGATTCGATGTACCAGCCACGGGGGAAAGGGGCGCAGG TGAAAAATCACAAACAGTCAGTCGGGGCGCTGAATGAGATCAGTTCTGGCGAGGTGTTAAGCGATGATGA AATTGCTGTCCTCGAAGAAATCAACCGGACGTTTACGCATGTCACCATCGGCGGGAAACACAAGGTGGTG TCGCTAAAGCCTTCTCAAACCGGCGGTGTATCGCACGTTTTCGAGGATTTATCACAATTTCAGCATTATT TTCATCATAAACCGAGAGTCGCCAGAAAGCTGGCGGGATCGGCGTGGCTGTCATGGAGTGGGAAGAACTA CAAGCCAGGAGGCGTAGGATTTTATCCAGTACCAGACAAATGCCCTGACGATGTTTTTAATCTGTATGAG GGGCTGGCACTGGAACCAATTGAAGGAGATTGCACGGTATACCTTAATCACCTGTTGCAGGTTGTCTGTG CCGGTAATGAAGAGGCATGCCAATATCTTATCCAGTGGATGGCGCACATTATCCAAAAGCCTGATGAAAA 
ACCGTCCGTGGCAATCGTGATGAAATCTGTCCCGGGCACAGGGAAAGGCACAACGGTTAAACCGCTGCTG CAAATACTGGGGCAGTACGCCGCCCACATTAACGGGGCGGGACATATTTCAGGGCGCTTCAATTCAATAC TTGCTAACAAGCTACTGGTATTTGCTGACGAAGTGACGATCCACAAGCCGTCTGAAGCTGACAGACTTAA AGCGATTATTAGCGAACCGACGTTTAACCTTGAGCGCAAGGGAATTGATGCTGAACCAATGCCGAATTTT GCCCGGTTGATATTTGCCAGTAACAGCACACAGGTATTACAGGCAGGGATAAGAGAGCGTCGCTATCTCG TGCTTGAGCCATCTCCTGAAAAAGCACAGAGCCGGGAGTATTTTGATCGGCTGTACAGTTGGCTTAATGA TGGCGGTGCCGCAAAGTTGCTTTGGCATCTTAAAGGGGTGGATCTCTCAGGTTTTGACCCTCAACGAGCG CCACAGACCGATGCCTTACGTGAAGAAATCTTGCTGGGGCTGTCTGGCGTTGAATTGTTTCTTTATGGCG AGCTAATCAATGAACCTCCGTTTAATGGCGAAGTCCGTTTGTTTGCAAAGGATATGGTTAGTCGGTTTGT AGCGTGGTCGCTTGAGCGTGGGGAAAAGCTTAAAGAACCAGCTGCCAGATCACTACTGGGTAAATCACTT GCACAAATGGGGCTGGTGAAGCATGGAAGACCAGACCGAGGGAATGGCGTGTTCTACGAACTACCAGAGG TCGGAGTTCTACAGGCTGCGTTTGCCCGTCTGATCGGTATGGGGGGTTATGATGTTTTTTAATATAGTCT TTTTACAAGAAGATCGTTTTTACCTGTACCACCTATACCACTTTGCTAATTCAGTATTAAAAACAATGTG TTATAGTGGTACAGGTCAATATCTTACCTATACCAGACCTGTACCACCTATACCATTAACAAACGCATTT CTTCTGGCTGGCTGAGAGGGAACAGCCAATGATGAGCACACCATTTTACAAAGTGCGCCAGTTAGCTTCT TCCTCTGGCTGGCAGCTACGGTTTGAGGGGCGTTCAGACTGGCTACCCATTGCGGCATGGGCGAATGTTG AAATGTGTATTGACGGCGATACCGTTGAAATCATTATTCCCTGCGTGGCAAGTCGGGACGGCTGTATAGA GCCAACAGATATGGCAGCAGAAATCAGAGAGGTAAAACATGAACAATAACTATTGCATACCGCAGGGAAT GACCAGAACGGAGCGCGAAGAATTAAAAAGTTTCGCTACACAGTGCGGGAATGCTGGCGACATCCAAAGT CTGGAACGCACTTTAATTATGATTGCGCACTGGATGCGTCAGGGGCAAAGGGTTTCATTTACTGAATATG CCAGCCAGTGGACAGAGGCACAGCGCGAACGGAGCGACGGTAATCACTCAACACCCGAAATGGCGAAGCA ATGGCCTTTCAGTGGTAAACGCTGTATCAGTCCCGGTGGTTCAGATTATTACCCTGCTGGTGTGGGAGAT GAGCCATGTTGTGACGAGACTGAAATCCGCCACGCAGTGACAGTAATTACCGCTGAATACCCACAATTTA ACCTTGACGGACTGGCGCTCCACAACCGGAATGCGGACTGGGAAAACCCGCTTGATAACCCGTCATTTAT CGTATCGGCGAAAAGCTGCCTGAGATGGATCAGAGACAACGGGATGAGTAATGCCCAGATTGAGAGCTTC CCGCAGGATAACCCCACATCTGACACGTTGAAGCATGAAGTGGAGCGATATAACCAGATAAACCACCAGC ACAGTGATCACCCGCACTATATCCCCAACGGAGCATTTATTGCGGCGATGGTGGCAAGCGGCTACAAGGT TAAGCCAGCGGGAAGAATGAACGCATTTTTCAATATTTCAAAAAAAGGATTATGTGCTGCTATGGGTAAA 
AATTAAATAAATGGTACAGGTCTGTACAGGTGGGGAGAGGTCATTTTTACCCACCTGTACCACCTGCAGG CCGCGCTACACAAGGGTTTGTGACATGGTGGTACAGGTGGTACAGCAAAAACAGCGATTTTCTTTATAAG GTATGTTTTGCTCATCTGGCAGACGGAGGTATTCGCTGCGGTTTGGCACGTAGCCCAGAACAGCAAAGAG TTTTACGAGACTTTTGTAAAATCCAGTGTACAAGGGCGTGACGTTTTTGAAGGAGAGACCTACGAAATTT TCGTAGTTCAAAAAATAGCCTCTGAGATGTTGTTATGCCTGTTCAGCAAACCATCCGCATAGCTGATCGC GAATCAAATTACCAATGGAGCCAGAAGCAAGGGCAGAACTGGAAAGCGCCTGCCGTGAAGAGTAATGCGA CATCATAAATATAGTTTTTAGAGTCCATAAAAAACCTGTCAAAACCTGACATACAGAACGATGCCACCAG CTACTGAGCGGTGGCATTTTTTTGTGCTATTTGTTTCATATTTTGCAATCACAATGATGAATGTTGCGTT TTGTGAAACTATAATGACTGTTGTTTTATACAGTTCTAAGGGGACGTTATGGCTATTTCAGTAAAGCCAG TATTGATAAGTGAGAAGCAAATGGAAGCGATAAAGAAAATTCAGGAGGAGCAGCGTAAAAAATCAGAGGT AGGAGTTGCGCCAACGATCCACGAAATTGCTCGGGGATTAATGGATAAGGCGCTGGCTTACACTTTAACT GGACGTGGGTAAATTTATGGCTACTTGGCAACAAGGAATCAACTCAGGCGGTTTTCTGGCTGGCATCGGT ACGCAAAACGAGAATGCGCCAAAGGCAAGCGACATTAACGCAACGCTTGGTCTGATCCGCGAAAACAATG AACTGGCTCGCTCAGGTGCAAATAACGTTGCTCTGACAGGTCTTCGTGGTCTGGCTGGAGTTGCTGATAT TTATAAGCAGCAGCAACAGCAGGAGCGTAAAGCGGCATTCCAGAAAGGTTATGCAGATGCTTATGCGTCC GGTGACAGGGAGCAGATGCGTAATCTTATTACAGCATTCCCAGAAGAGTTTGAGGAAGTCCGTAAAGGGA TGAGTTATGTCGATGATGCTCAAAGGGATGATTATGGCAATCTGGCGCTCAAAGCACAGGTAGCCTCATC GCTTGGTCCGGGCGCATTTGGCAGGTTCATGATGGATAATGAGCAGGAGATGCGTCGTTTAGGTATCCCT CCAGAAACTATTGCTGAAATGCAGGTTAATGACCCGCAGGGCTTCCAGCATTTCGCAGGTAATCTGGCGC TGTTTTCTCTCGGTCATGAGAAGTATTTCGATATCAAAGATCAAATGGAAGGTCGGGATATTGAGCGTGG CAAGTTGGCAGAGACAATCCGCAGCAATCAGGCTGGCGAAGCACTTCAGGCGAGAGGGCAGGATATTAGC CGAGCAAATGCGTTAACGTCAGCATATGCACCAACAGCAGCAATGCAGAATTACAATCAGTACGCGCAAA TGTTAAAGGCGGATCCAGAGGGGGCGGCGGCATTTGCGGCAGCGGCTGGAATTAATACAAACGCCAAAAA ATTAATGAGTGTTAGAGAAAACGATGATGGCACTGTCACAAAATACTACACAGACGGAAGTGAAGAGCAG GGAAAACTAAACCAGCCAATATCTGGAGATGGGTTTCGTCCAATAGCTTTGCCAACAGCCCAAAAGATCA TGGAAAAGTCGCCAGAAGGGGCTAAAAAAGCTGCTGGATTTGCATACAGGGTTAGGGATGCTCTTGATTC AATAGATACACTGAAAGACCAGCTTAGCCCACAGCGAGTGGCGATCATTAATAATGCTTTGGGTAATGGG ACGCTGGCTAACTTAACGCTCAGCCCAGCAGAGCAGCAATACGTTGTTAATGCTAATGACGCAATAATGG 
CAATACTCCGTCAGGAAACAGGGGCATCTATCCTACCAGCTGAAATGTCCAAATATTATCAAATGTATTT CCCTCAGCCTGGTGATTCCACAAAAACCATTGATACCAAGCGCCGGAAGATGGAAAACCAGTTCAATTCG CTGAAAGCTGCTTCTGGTCGAACTTATGATGCTTTGCGGGTTATTTCAGCAGTTGACAGAGGAACTGCTT CTTCGTCGCAGACATTGCCGCAATCTGAGCAGGTATCACAGCCAGCAGCCAGCAGTAACTTTTCTTCACT ATGGGGTGATTAATGGCTAAAGCATGGAAAGATGTTATCGCCTCTCCACAGTATCAGGCGTTAGCACCAG AACAAAAAGCGCAGGCTCAGGAGCAATACTTCAATGAAGTCGTTGCCCCGAAAGCCGGAGAAAGTGTAGA GCAGGCTAAGCAGGCTTTTTATGCTGCCTATCCACTACCATCAACGAATGCAATAGACCGATCCCAATCA GCAACTCAAAATATTCAACATACATCATCTGATAATTCTCTTGCGTCAGGGTATGCAAAGTTAGCTACTC AGCAAAGAGAGGGGCTTGAGCGTTCAGCAGAACAGGGAGCCAGTCTTGGGGCTGCGATGCGCGATGCTAT AACAGGCGAAAGCCGAATGACTCCAGAGATGGAGAGACTGCAAAATGTCGCCTCTGCCCCAGAATTGAAC TCACTAAGCATGGATGCCCTAAAGGCTGGATGGTCTCAACTTTTCGGCTCTGACGCGTCTCAGGAAAAGA TTCTTCAGGGAATGGGGGCGACATTAAGGCAGGATGAGAAGGGGAATACTATCGTTTCTCTGCCATCAGG TGATTATGCCCTGAACAAGCCGGGTTTATCACCGCAAGACCTGACCTCGTTTCTTGCTAATGCGTTGGCG TTTACACCAGCGGGCAGGGCCGGAACGGTGCTGGGGGCCATAGGAAAATCAGCAGCTACAGATTTAGCAC TACAGGGAGCCACCAGCCTTGCTGGTGGAGAAGATATTGATCCGTTACAAACGGTAATTTCTGCTGGCAT TGGTGGTATTGGTAAGGGGCTGGAAAATACAGCGAGTGCGGTTTCGAGGGCTGTTCGTGGTGATATGTCG CCTGAAGCAAAGGCTGCTGTCGACTTTGCATCGGAAAGAAATCTGCCGTTAATGACCAGTGACATGCTGA AAGATAAAACCTTTATGCAGAGTCAGGCTCAGACATTAGGCGAAAGAGTTCCTTTTTTTGGAACCGGTAA GAATCGGCTGAATCAACAACAAGCACGAGAAAATTTAGTCAGAACATTTAGCGATGGTCTGGGTGGCATT TCTGATAAACAGCTTTATGAATCTGCGACTAAAGGGCAACAAAAATTCATTGAGGCAGCAGGAAAGCGAT ATAACCGCATAATTGACGCTATGGGGGATACCCCTGTCGATCTCTCAAACACGGTAAAAGCTATCGACAA TCAGATTGCCGTGTTAAGCCGCCCGGGCAAATCTCAGGATAGAGCCGCGGTAAAAGTCTTGCAGCAATTT AAAGACGATATCACCAGCGGACCAAATGACCTGCGTCTGGCGAGGGAAAACAGAACCGATCTTCGAAAGC GATTTATGGCGTCATCTGACACTGTTGATAAAGATACGCTCCAGAAAGCCAGCGATATTATCTACAAGGC ATATACGGCGGATATGAAAAAAGCCGTAGCCAAAAATCTTGGAGCAGACGAAGCCATTAATATGGCAAGG GTTGATCGCTCATGGTCTAAATTCAATGACATGATGGGAAGAACGCGCGTTCAAAAGGCAATAGCCAGCG GCAAGGCCACGCCTGAGGATGTAACAAAACTCGTTTTTAGCCAAAGCCCATCAGAACGTTCTCAGCTTTA CAGGCTTCTGGATGACAATGGTAGGCAAAACGCACGAGCAGCCATAGTTCAGAATGCTGTAGATAAGGCG 
ACTGATCCGTCTGGAAATATCAGTGTTGAGAAGTTTATTAATGCGTTACACCGGAACAGGAAGCAATCAG CAACTTTCTTTAAAGGCGTACATGGAAAGGAACTGGACGGCGTTATTAAATACCTCAACGATACAAGACA CGCGGCAAAAGCGAACGTTCAAAACTTAAATGGTCAGCAGCTTTATGGATTGTTAGTTGGTGGTGGCATC ATAAACGCAGCAGTATTAGCGGGGATGCTAAAAACGGCTGCGTTTGTTGTTCCTGCTGCTGGTGCCGTAG GCGGAGCAGCGAAGGCATACGAAAGCCCTGTTATACGAAATGCCTTGTTACGTCTGGCAAATACGCCAAA AGGTAGCACAGCATATGACAGAGCGATCAGTACGGTCACACAATCGCTCACCAGAGTCGCACAGGCATCA CAAAAAGAATCTCAATAACTGGTTAGCCATGGATGGCTAATTTTTGTTCTTTGCTCTCATCCATAGGTAA AGAAGGGAAAGAACAAGACAAAAAACAGAGAATAGATAACCTACCTCATATGGTGCGCCAAAAAAATTAG CTATTGATACGGGCAAAAAAGCTGAAGCGATCAAAATTGAAAGGCAAAAAATAAGCCCAGTAATATAGTT TATTAAATTCTTAAATGAGAAACGTTTTGCTTGATTTACGGTCTTAACGAACGATCGCTTAACGACGGAT AAAAGCAAATAGATGGCGACAGTTGCTAATGCCCCTTTCCACCAGTCTGGATATAGTTTTGCGACAATCA GACCAAGTAAAACCATGATGACAGCCTGAGCATTCACACCAACCCTCCCTTTAGCTTTGTTTAGAATGGC AATCATCATATATCCAAGACATGGCATTTGGTACATAGCTTGGGTCGGGATTTCAAATCCGCTCCCAGTT GATGGGGCCGTTACAGTACGCGCATATCCGCTTTATGTGAAACAGGAAGATTTTTCTGATATCACTGCCA CAGCAAAACACCAATTTCGAGACAAGTACGCGCGCGAGGGGTATCAGTTGCAAAAAATTTTGCAACTACG TCCGACCAGTACGCGCACGATGGAGGGGAGTACCATAAAACTGTTCTCTACATAAGTCAGAGTAGTGGTG ATGTGTTGATTATCGGCGTATATGCGCACAGGACGCGCTATGACGATGTTTTATGCGGGTACACCAATCA CATGTATCTGGTGTGATAAATCGCGTTACATGACCTCTCATGTTGTGCTGGTGGTTATCACGACTTTCTG AATCTGGCCTTTGCCTGATATTTTGCGACAAGTACGCGCGCGTAGCATCGAAATTATCGGATGGCGTATG CTGCTGGCAATATGAAGACCGATAGTATTAACCGAAAAGCGATTAAGGAGTGCCAACTTTTCAGTAGAGA AACTAGCCAGCATAGGAAAATGTAGAAATTACCCAATTAATAGGTAAAAGTTATTGACACAGCCTGTAGT GGATAACGATATGAATAAAAACCACATATAGTGGTTTTTCGTGAGTGATGATAGTACCTATGTTAGTTTC CTAAGTAACTGTGCATTGACATGTGGATAACTTCGGATAGAGTTACACTCAACTGAGCCCGGCCCCCTGG CTTTATCTTGTCTTAGATGAGAAATTGTTATGGCCGGGCCTTCTTGTATCTGGAGGTAGGAAACCTTTTG TATGGCTGTTCCGGTTAAGGTTCATAAAGAATATGATGAACTTGTCGATTTATTGCTGTCTCGGGGTATG GATGTGCCAGATCGCGAACATGCTATCAAGAAAATCTCACAAGTAGGCTACTACCGATTATCAGGCTTTT GGTACCCCTGCCGAATCCCCCATATAACAACAGACAATATTAGAACCAGACTCGATCAGGTTCGTCCAGG TACTAATTTTCGCGCAGTATATGACCTCTATTTATTCGATAAAAATCTGCGTTTGCAGATGATGAATGCG 
TTAGAACGCATTGAGGTTTATGTTCGGTCAGTTATTGCTCATGAGCTTGGTAAGATATCTCCTTTAGCAT ATCTTGATGACTCGCTAATAAATCCAAAACACTTTAGACCACGTTCCCATGGTCGCCCAAGCGCCAGAGA AGAATGGATAAATAAACATAATGCCGAGATTGCTAAAAGTCGCGAAGACTTTATCAAATGGCATGAAAAC AAGTACGAAGGTCTTCCTTTCTGGGTGGTTATTGAAGTGTGGGACTTCGGCTTAATGTCAAAGTATTATG CGATGCTAAAAGACAGCTATCGCAACAGGATACTATCTAGGCTAGGGATAGCGGCAGGCAATGGAGCTAT CTTCCAAAACTGGCTTAGCGCAATGAATGTGCTAAGGAATCGGTGCGCTCACCATTCACGCATCTGGAAC AAGGTCAACGAACCAAGATTAATGCCTTTGCCAAATGAGCCGTACTTTGAACAATTAAATATGAATGATG ATGCGTACGAAAGAATGTACGGAATGATCGCTGTGTTATGGTTTTTAGTGAAAAAAATTGGACCGGGATC TGACTGGATAAAAAAAGTTGCCAACCTTGTAGATAACAAGCCTAACTTACCAGGATGTAACCTGACAGCC ATGGGATTACCTGATAATAACGGTTTCCCAAGGCATTTATTTGATATTGAATAAATTATAGCGATGGCGC TGAATGTAGCGCCACTGATAACTTTTCAGCATTGCTTCCCTGACGCTTCCACGCGCTGTAAATATCTTTA TCCCATGCCTTACCAGCTTTTGTTTGATAGCCTGCTTCATTGATCCGTTCAGCTATCACGCGCCCATTAT CAAATCCCTGTCGGATGGTATCGGCAATGATCTTTATAACTGCCGTTTCGTTATATGGCTTTGGTGGGAT GTTCGGCTTTCCGGCAATAAGCGACGCAGCGGCTTTTTCCATGCGCTCCACCAGCCCAAGCATTCGTGAA TCAATAGACTTTTCCGGCTGTCCCAGCTTCATGCGAATCGCATCAACAAGCCAGGAGGTTTTATCCCCTC CAGAAGCGAGAACAGACCGATTAAACTCATCCTGCAATTCAGATGGGATGCGGAACGCGACCAGATTAGA TTTGCTCATTGCAAAGACCATATCAGTGAGTAATATCCATACAGTATACCACTGTATAACACTGTTATAC GCTCAAATATCAGCCAGAAAATAACGGACCAATCACAAAGCTATCGCTGTGTTGTTGGTTCAGAGCTGGG CTGTGTTGGTTCAAAATAATCTGAGAGGCAGAAATGATAAGGGACAGGAAAGCTGAAGAGCTGGAGTCAA AAGGGCTATACCGGAGAGCTGCCGCACGATGGATGGAAGTCATGCTGTTATGCACCGAGGACGATGATCG GGAATGGATAAAGCGCCGCCGTGAAACGTGTCTGGAGAACGTGAAGCGCCCGCCCGTGAAGGTTGAGGAC TTTGGCGACCTGCATAAAGCTGTTACCGAAACGCAACACCGCATGGGGATAGCGCAACCGAACGGTAACG CCTTCCGGTTAAATGGCGGCAAGAGGCAAAGGTAGACCACCAGAGGGAAATCATCCTCTGGCTGGCGGTT TCTGGTATTTCGGGACAAGTACGCGCGCGCGTAGCACCCAATCGGTGAGGAAATAGCGGCGACCTGAATG GTCAAAATCACAACGCAGCTATCCCCGTAAGCAGCGCTTAACGGAATAGCAGTAACCTGAATCCAGTGGG GAGGGGGTAGTCAAATCTCTACAGCCCTGACTATCCGGGACTGCCCGCCCCATCGTTTTTTTATACCCGC GAAAAATGAAATTTAATCCGGGCGTGTTTCATCCTTCCAGAGGGTATGACGTATGACACCACGCGAAATA GCCTTATTGACCATCGCCAAACTTGAGCACGGAGGCCACCAGCTTACACAGGCAGATCAACGGGAGATAG 
AACGATCAGTTAATGCCGATATAGCCCGGCGCGACAGGTTCCGCGAAATGATGCGAGCACCTGCCTACCA GTGGAAGAAGCCAGCGCCGCGCAGGTAGATATGGGATTATTCCACATGTAACTTATGCGTATTAAGCAAA TTTCCTTTATTCACCTTTACAGAATTGTGAGTACGTCAGTATCGTTATCCTTCCGATTCGCATTTATTCA GGATTTGCATGTACCAGCTAAAGATAACCATCCGAGACAGCAAACCACCTATATGGCGGCGTGTGCTGGT TCCAGAGCAAATCCCCTTTAGTAAACTTCATGCCGTGATCCAGTTAGCTTTTGGCTGGAATGACGAACAC CTGTATATGTTCGAGAAAGGGCGTAAAGGTGATCCAGGTAGCGAGTATCGCGTATGGGGTGAGGATGAAA GCATGGGTAACGCGGCAATCACGCCACTATGGGCGGCGCTCCAGAATGAGGGTGACAAGCTGGTTTACAC GTATGACTTTGGCGACTGGTGGGATTGCGTCATTGTGCTGGAGAAGCAAACACACGATACGAGTAATCAG CCCATTAGCTGCCTGCGTGGGAAAGGCACCACCCCGGCGGAAAACTCAGGCGGCTTGCATGGTTACAATG AATTGCTATTACAGGCCAGAGAGTCTGATAATCCTGAGCAGGCCGAGATCCACAATTTCCTGATGCTGGA TATTGAACGCCGGGTTTACGACCTGAGCCGCATCAACGACAGATTGCAGGCTATTTACTGACCCGCATGT ATTCACCTGATATTGCGTTGTCAGCGCCGCCAGTATCGCGCATAACAGCGCACTGACCGGGTAAACCGTA GCGCACGACCGTTGCGTTAGTATCACCGCCGCGCCGTATCTGGCGCGATTCATACACTGTCACCCGCTCA AGCAGTCCACCAGCAACCATGCTTTCCAGTGTGCGCCGGGTTGATTCGAGCTGGTGACGCTTATCGAACG ACACCATGCCATGAAGCAGGTAGGCCACACCCGACACATCGAACGGCGGCGCACCAATCTCACCAGTCAC CCATTCGAGGTTATCCGGTTCAAAGTAGCTAAGTATCTCTTTTTTGCGACTGGTCATTCTCATGGCTGGC TGATTCCTTATTGTGGGATAGCACTATCATACAATAAGTGGGTTAAAGGGAGAGCGTTACCGCCTGATAT GCTGTATGAATAACAACCAAATTTCAGGGGCGGAAAAAGATATGGGGGTACTTTTGGGGGTATCTATAAA AAATGAACAATAAAAAAGGCAACAAAAACAAAGATTATCGCCTATGTGTATTGTTCCTATTATCCATTTA AATCAATAAGTTACACATCATTAGTACCTTCCTTATTTTTTGACTGGGACAAATTTGGGACCGATGGGTT CAGGATCGAGTCTATTTGCCGTGCGTGTTCGGTAAGGTGATTAGGTGCAAGGTGAGCATATCGACGAACC ATTTCGATAGACTCCCAGCCTCCCATTTCCTGTAACACTGACAACGGGACTCCGGCTTGAACCAGCCAGC TTGCCCAGGTGTGTCTCAAGTCGTGAAATCTGAAATCATCAATACCAGCCCGTCTCAGCGCCGCTTTCCA GGCTGTGTTTGCGTCATACCGCATCTTCCTGACCGTTGGCGCTTTCGTTCCGTCTGGTTTGGTACAGCTT TCCTTGTACACAAATACCCAACGGTGATGATTCCCGATTTGTTTTTTCAATACGCGACATGCAGTATCAT TCAGCGCAACGCCAATTGCGCGGTTTGATTTACTCTCTTCCGGGTTTATCCATGCCACCCGGCGCTGCAT GTCTATTTGTTGCCATTCAAGGTTGATGATGTTCGAGCGTCTTAAGCCTGTTGCCAGTGCAAATTCAACA ACAGACTTTAATGGCTCCGGACATTCATCAATCAGCCTTTGTGCTTCATGGGGCTCCAGCCAGCGGATCC 
GTTTATTCTTTGGTTGAGGCACTTTAATAATTGGTGCCTTATCCAGCATTTTCCATTCACGCTCTGCGGC TCTTAGTAGGGCCTTTATAAATGAAAGATGCGTAGCCTTTGTTGCAACGGACGCTGGTTTTGGCATGTAT TCTGGAACAGGTTTCCCTTTTTTTCTGCATGCTTCTGCCCTGAGTTTCCAGTTTTCCTCATGACGCCGGT TCGTCATTTTCTGCATTGCTGAATAAATTTTTGATTCAGTAATGTCTCTTAGTTGCATTCCTGCGAAATG TTGAAGCCAGAATCCGATCCGGCTTTTGTCATCGTCCAGTGATTTTTTATGTGCTTTCTCTTCGAGCCAC CTGATACACGCCTCCTCGAACGTCATATCAGGTATTTCACCAAGTTTGCTGACCCGCCATGCTTCAGCCT TTAGCTTGTCATGGAGCTCTGTCGCCTGCCTTTTGTCCTTTGTTCCAAGAGACTGTTTAAATCTTTTACC GTTCGGCAATGTGAAACTGGCGTACCATATTTCACCTCTGCGGAAGAGTGACATTTTCTTTCCTCTGTTA TGCCATCACCCGCGCTCACCAGGACAGTATGCAGCGGAGACTGAAGCGCCGCAATGCAGGCTTGTCGTGT TGTGAGGTAAGGAGATTTTAGTTTAGTGGGGTCTTTGCGTGTTGCCTGTAGGCGGCCTGTTCGTATCCAG TTGGTAGCGGTAGGTCTGGATATCTTGAGAAACTGACAGGCCTCATCGAGTGTGAGGCTGTATGGCTCCA TTATTTCACCTCTTGCTGTGTCATTGTTGAAAAATGGATACCAGCTCGTTGCTGCCAGACGATCCAACCG AGAGTCATATCCCATGCCATGTATTCGTTATCGCCGTTTTTTGCTCTCCGACGATCTACTAAGTCACCGA AACGCTTTTCCATGAATAATTCATAGGCTTCGCGTTCATCTGGCTCTACTTCCAGAGATACGAGTGCGAT TTCATAAGCACGGCGCTCAATATCGTCTCGAACCTCTAGGCTGCTGATTCGTTCTTTGATTTCTTTAATC AGTTCTTTATTGGTAAATGTGGTCATTATGCTCCAGCCTCCGGTGCTTTTGGCATTACTGCCCAGTGAGT GATATTGACGTTTTCAAGGTCCCCGACCTGAAATGTCCACTGCCATTCTCCGGTTTCTTTTTGTCCCCAG GTGTACCAGAGAGAACGCCAGCCAATCAGCCAGCCTTCTCCATTAGCATCAAATAACAGAACACTTTCAT TTGCTGGTGGCAGTTCAGCTGACACTGGTATTATTTTGTTTTCCAGTGCCGCACATTTAGCTTCAAGCGC GTCGAATTTACGTACCAGGTACTCAGCATTTGTTTCGTTCACTTTCAGATCTCGCGGTACACATTTCCCG CGAAGAAACCCTTCCATTTCGAAAACATTCATGCGCATTTGCGTAACTCCGATAAATCGTTAAAACGTTC CATAAACATCCCGTAGGCATGACCCGGTGCCAGTGGAATCACGTTGAACATCTCTGTTGCCGGGATGCCT TCCAGTACAGGCCAGAAAGAGCCATCATCAAGCCCGAGATCGCGGCGTTCGGTTGCCAGCATGATGAGAT CGGCATATTTCACGGGTGTACTCATAACTGGGGGTAACCCGTATTTCTCACGGATTACGGCGTCTATTTT TTCTTCCATCCGTTTATAGTCAGGAAGAAGGCGTTTCAGTGGTGCGGGAATGTCCTGGCAATACGCTTCT GTTGCATCATGCATTAACGCTTCAAAAGCAAATTCCTGCGGCACCAGCTGGCTGCAAAGAACCGCATGTT GGGCGACGCTGTAGAAGTGCGAAAGATGACCGGCAAAGCGACAGATATTTGAAAGGGAAACCGCGATATC GTTAATATCGATGTCGTCTTTATTTATCTTGTCATAATAAAAATGCTTCCCGGAAAAAGTTTTAATAAAT 
GACATTTTGTTCTCCACGTATATGCACTGCACCGCGCTGAATTCTGGTAAAAGGAAGCCCTCACCATCCG GTGATTATTGAGTTAATTACGTTTCCATAAATGCCCCCGCAGGGGCATTTGCAGTAATGAAATCAGGCGG TGAAAGTACCAATAAAGGTTTCTACTTTGCTGTCTTTGAATTTCTCAACAAGCAGATCACGAAATTCGTT AGCCATTTCTTCCTGCACCGCTTCCAGCTGAATAATGCGCAGAACCAGTACAGGACGATCGCCAGTGATA ATGCTGAGGCGTAATTTAAACGGACGTTCTTTCAGACCTTCAAACGGAACGCATTTAAATTCAAATGTCA CTGGCATAATGTCTTTGGTCTTCGCTTCGACAGACTCCATCAGGGAGCGTTTGCCGCTGAAGTCATTATC TTCAAAATCAGCGGTCTGGTTCGCTTCAATTGTGATTTTACGGATCGCCGCAGCCGCTTTGGTTGCCTGA ATGGCGTCACCATTAGCATCAAAGCCCACAAGGTAGTCGGCCCAGTCTTCAATCCATTCTGCCAGTGACT TCTGGGAGTTACGCTCGCCGTTAACAGACAACAGGGCAGAGAACGGTGCTGTCTTTTTCAGTTTGAGAGT GGCGGTGTTATCTGCGTGACCTGGTTCATCAATAGTACCCAGGTTAAGCACACTGACGGCTCGCATATTA TCGGCATCGATAAAGCAGCGGGTGCCTTCATCTGCAAGATCTTTAGAATAACGGGTAAAGTCATCGATGC TGGCAGTGGAAAGCGCACCACGGAAACGGAAGCGATTTAAATTAAATTTTTCCAGATCATGAATGCGGAA ATTCTCAGGCAATGCCACAGCATCGGCACCAATCTTACTGATAATTTCATTAACACCCTGAGCAGAAATA AGGGCATGGATTTGATTAATTGCGGTTGCGTCTAAGTTCTGAGACATAATAAGTCCTCACTATATAAAGA TATTCAGTGATGAGATAAATAATCAGTTTATTAAGAACGATATTAATGACCTGCTGCGCGTAGTTTTCCG TCAGGTTCACCGGCAAGAGTCAGTAATTGTCCCTGGTCTTCCTGCAGAATAGTCAGGCGACCACCGCGAT TGACATACATCGGCGTTTCGGTGGTGTCTTCTTCGGAAATTTTCCCGCGGTTAGTTGGGCGAACATATGA GAGTTTGTGTTTGATTTTCACACGGTTCTCATCAAACGGTTCGATTTCCAGGTTGAGCGAGACCTTACCT TTGGTTTTCGTGTTCATCACACCGGAAGCGACTTCACTGAGAACAGCGCCGATTTTGGTTTCAAATACGC CGCCGTCCAGCTCCCCGATAAATGCCTGCACATCAGTACTGCGTTCGCTAGCCATTTTGCTGCTCCTCAT CATATCGACCCTGCAAGGTCGGTTGGTTTCTCCACAAAACAGAGAAGAACACCTGCGGTGACTGCCGCCC GGATGGATTGGGTTATGAGCCCGTCGTCCGGTGATTCTCTTCTCTGTTTTGTAAAAAGAGCGGTACCAGC CGGAAGCAAGGGTACAAACTGGTACCGCCAAAGCAGTGGCTGTTGTGGTGGGGTTGTCACTCAGGCGTAT GGTCAACCTGACAATCCGGTGCCCTCAACGGGGAAAGAGTAACCCCGCCATACTTACCGCCGCGCCATTT CGCGGATTACCACAACGCTGAGAGCACTTAGCCAGTTACGGCACCACACTTTGTCGCGGCTCCATAAATG CCCTCATCGTTGCACCCTGGTCTCTTCCCAGGCGTCAAACCGAATCGCCACGCTGGTTAGGCGTCTTATC AGCATCATCATTGACTTGCACATTCCGGCTACCTGGTTTGTTTGCCCGAGCAAGGAGTGGATTGTCCCCT TTAACGTCCCCAGACCGCTAACGACGCATGTGCCATACGCCGTGTTACAACCAAATTTTGTTAGTACCTT 
GTTTGTTTGTCTGGAAAGAAAGATAAAATGAAGTTGCGCATTATGCAAGTGTTTTTGTTGCGAGATATGC AATTTAAAGGGTAATGAAAAGCCACCTTTGGGTGGCTAATTGATGAGGAGGTAAGGGTTAATTGTGTCGC TTAAGGGTTTGTGACTGGCTGATTAAGACCTTTCCAAAGACCATAAACCGGTGTTCATTTTCGCTGGTAA TTCCCCATTCACGGTAAATCTGGTTATCAGAAATCACCAGTAGTTTGTCAGGTATCATTTGCAGTCGTTT GACATAAATTTTATCATCAAAACCAAATACATAGATACCATCTCCATCAAACTGATTGATACTGACATCA ACGAAGATGAGATCTCCTGGCTCAATGGTTGGACACATACTGTCCCCACGAACGTTGATAACTTTAATGT GATTGGCTGGCCGTCCGCCAAACATCGATACAGCATTATCAGTTCTGTATTCAATGGCATGAATCACATC AATGACATCACCGCCCTGGATAAGGCCATTTCCCGCACTGGCACTGACATCCAGCATTTCAATACGGAAT ACATCCTTCACCTGCGCAACATCCTCACTAATACTGTTTTTACATACAGTATTACTTTTGACGTCTGAGG TAAAGAGATCAGCAATATCAACACCTAAGCTCCTGGCAATATTACTCAGGGCTTGTTCAGTGAATTGTTT CTGCTTACCTGTTTCCAGGCGTGAGATATTCGCCGCATCCACTCCTATTGCTTCAGCGAGATCGGCGATT TTCATGTTCTTCGCCTGGCGAAGTTGTCTGACTCGGTTTCCTATGTTCATGCGTTTATTACATTTCTTTA TTGCGCGTTAAGCAAATCAACTTGCGCAAAATATTTGCGTGAAATAATATGCTCATCACGCAATATGTGG AGGTCATATGCAATCACCATTACGGAATGTGCGTAAGGCGCACGGATTTACTTTGCAGCATGTTGCTGCT GGCGTTCAGGTCAATCCAGCGACGCTGAGTCGTATTGAAAGACTGGAACAAATTCCATCTATCGATCTTG CAGAACGTCTGGCCAATTTTTTTAAGGGTGAAATCAGCGAAATGCAGATTCTTTATCCGGCACGTTTTCA ATCTAGCCAAAACCAGAATGGGTTTAAACCACAGGAACAGGAGGTAAGCCGTGGGTAATCATCACTGGAA AGTGGAAAAACAGCCTGAGTGGTACGTGAAAGCTGTCAGAAAAACTATCGCGGCGTTGCCGGGGGGTTAC GCTGAAGCTGCTGAGTGGCTGGATGTAACAGAGAACGCATTATTTAACCGCCTTCGTGCCGATGGCGATC AGATTTTCCCGCTGGGATGGGCAATGATTTTACAACGTGCTGGTGGCACTCACTTCATTGCCGACGCTGT GGCGCAGTCTGCAAATGGCGTCTTTGTGTCTCTTCCTGACGTCGAGGATGTGGACAATGCCGATATTAAC CAGCGTTTACTGGAAGTCATTGAACAGATCGGCAGTTATTCAAAACAGATTCGTTCGGCAATCGAAGACG GTGTAGTGGAACCGCATGAGAAGACAGCAATTAACGACGAGCTGTACCTCTCAATTTCGAAGCTGCAGGA GCATGCAGCACTTGTCTACAAAATTTTTTGCGTTTCAGAAAGTAATGACGCCCGCGAGTGTGCAGCTCCG GGCGTCGTGGCGTCGATTGCTTCTGGTTGTGGAGAAACTAACGCATGAACAGTTTAACAACACACTACCG TCGCTCGCAACTGATTGCGCTTCCTGTACCGGGTGGAAAAGCGAAGGTGGAGTATTGCTATGCAGTAAAT GTACCAGGTGACAGGGAAATTGTAACCCACAGCTTTGCTGAGTGGGCTGTGGGAGATTTCAACCGGCAGA AGGAGACAGTCCTTTGCGACAAGTTAACCGCTGGTTCAAAGATCACTACGGAGTGCCCGTCAGAGTCATT 
CGTTGGGAGCCGGAAACACAACGGGTTATCTACCTCCGTGAAGGCTATGAGCATGAGTGCTTCAGCCCGC TCGAACAGTTTCGTCGTAAATTCAGGGAAATAGAGGTCGGTCATGAGCACTAAATTAACCGGCTATGTAT GGGATGGTTGCGCTGCGTCAGGCATGAAATTATCCAGCGTGGCAATTATGGCCCGCCTGGCTGATTTCAG TAATGACGAAGGTGTGTGCTGGCCATCAATTGAAACTATTGCCCGTCAGATTGGCGCGGGGATGAGTACC GTCAGAACGGCTATCGCACGGCTGGAAGCAGAAGGCTGGTTAACGCGTAAGGCGCGTCGCCAGGGTAACC GCAATGCGTCGAATGTTTATCAGCTTAACGTTGCGAAGCTTCAGGCAGCGGCATTTTCTCAACTGTCAGA TTCTGACCCGTCAAAATCTGACGCATCAAAATCTGACACGTCAAAATTTGATGCGTCGAAATCTGGCAAA AAAGCGGGTTTTCACCCGTCAGAATCTGGCGGGGATCCGTCAGTAAAATCAAAACATGATCCGTCAGATA AAAAACCTTCTCGTCCGGACGCTTCGCAACCGGACACGCAGACGGCTGAACAGGATTTTTTAACTCGCCA TCCTGATGCGGTTGTATTCAGCCCTAAAAAGCGCCAGTGGGGAACGCAGGATGATTTGACCTGCGCACAG TGGCTCTGGAAAAAAATCATTGCCCTGTACGAGCAGGCCGCCGAATGTGACGGCGAGGTGGTTCGTCCCA AAGAACCGAACTGGACAGCATGGGCAAACGAAATTCGCCTGATGTGTGTGCAGGATGGTCGTACTCACAA ACAAATCTGCGAGATGTACAGCCGCGTCAGCCGCGATCCGTTCTGGTGCCGTAACGTGCTCAGCCCGTCG AAGTTGCGGGAAAAATGGGATGAGCTTTCCCTGCGCTTATCGCCGTCCGTCAGCACGCACACAGAAAAAC GTGAAGACCCGTACTTCAAAGCCAGTTACGACAACGTGGACTACAGCCAGATCCCGGCAGGATTCAGGGG GTGATCATGAGTCTTTTGAATGACGTTCAGAAATTCATTGAAGCCCATCCGGGCTGTACTTCCGGAGACA TTGCGGATGCTTTTGCAGGTTACTCACGGCAGCGCGTTCTGCAGTCAGCAAGCAAGTTACGTCAGAGTGG GCGTGTGGCTCACCGTTGTGAAGGAGATACACGCAGACATTTCCCGCGCCTGACTGAGAGAGCGCAGGAG CCGGAACCACAATCTGTTCGTGAAACCAGACCTGTGCGCAATTTCTATGTCGGCACTAACGACCCCCGGG TGATTTTGTGCCTGACCCGCCAGGCGGAAGAACTGGAGTCCAGGGGCTTATACCGTCGTGCTGCAACGGT GTGGATGGCGGCATTCCGTGAAAGCCACTCCCAGCCAGAACGAAACAATTTTCTGGCGCGTCGTGAACGG TGTTTACGGAAAAGCAGTAAGCGGGCTGCATCAGGTGAAGAGTGGTACCTCTCAGGGAATTACGTGGGGG CTTAATGAGTAATAAATATTGCCAGGCGCTGGTGGAGCTGCGGAACAAACCAGCCCATGAACTGAAGGCA GTGGGCGATCAGTGGCGCACGCCGGACAACATTTTCTGGGGAATTAACACCCTGTTTGGCCCGTTTGTTC TGGATCTGTTCACTGACGGTGATAACGCCAAATGTGCTGCGTATTACACGGCGGAAGACAACGCGCTGGC GCATGACTGGTCAGAACGCCTTGCGGAGCTTAAAGGTGCTGCCTTTGGTAATCCCCCATACAGCCGCGCC AGTCAGCATGAGGGGCAATACATCACCGGCATGCGTTACATCATGAAGCATGCCAGTGCCATGCGTGATA AAGGCGGGCGCTATGTTTTCCTGATCAAAGCTGCCACCAGCAAGTGTGGTGGCCGGAAGATGCAGATCAT 
ATTGCTTTTATTCGCGGGCGTATTGGTTTTGAACTGCCTGCCTGGTTTATCCCGAAGGATGAGAAGCAGG TGCCGACAGGCGCTTTCTTCGCTGGTGCTATTGCTGTTTTCGACAAGACCTGGAAGGGACCGGCAATCAG CTACATCGGGCGCGATGAACTTGAGGCATGTGGTGAGGCCTTTCTGGCGCAGGTTCGCCAGCAGGCAGAA AAACTGGTCAGGGAGATGGCGGCATGACGACGTTAACTCAATGCCAGCAGCAGGTGCTGGATATGCTGAT TTCTTATCAGAAAGAACGTGGCTTCCCGCCAACCAATCAGGAGGTGGCAACCATGCTGGGATACCGTTCA GTGAATGCAGCGGTGGAGCATCTTCGCGCACTGGAGAAAAAAGGCGTCATCACGATAAAGCGTGGCGTGG CCCGGGGCATCACGCTTCATACCGCGGTGAAGGACGACGACAGCGAGGCGGTCGGGATTATCCGCTCACT GCTTGCCGGTGAGGAAAACGCAAGGCTGCGTGCAGCCCACTGGTTACATGAGAGGGGCCTGAAAGTATGA AGCTGATCCTGCCTTTTCCGCCCAGCGTGAACACGTACTGGCGACACCCCAACAAAGGGGCGTTTGCTGG TAAGAGCCTGATAAGCGCGGCGGGGCGAAAATTCCAGAGCGCGGCGTGCGCAGCAATAGTTGAGCAGTTA CGTCGTCTGCCGAAACCAACGTCGGCACCTGCTTCAGTGGAGATCGTGTTGTTTCCTCCGGATAACCGGA TCCGCGATCTGGACAACTATAACAAGGCGCTGTTTGACGCCCTGACCCACGCGGGGGTGTGGGAAGACGA CAGTCAGGTGAAAAGAATGCTGGTGGAGTGGGGACCGGTTATCCCGGAAGGGAAGGTCGAGATCACTATC AGTAAGTACGAGAAAACGGCGGGTGCAGCTGCCTGATTAAGAGGAGAAACGAAGTATGAATAATCTGATG GTCATTGATGGTATTGAAGTTCGTCGTGATGCTTATGGGCGTTACAGCCTGAACGATCTGCATCGCGCAG CAGTAGCATCTGGTGCAAATGCCAGAACCAAGGAGCCAGGAAAGTTTCTTTCCAGCCAACAAACTGTTGA GCTTGTTCATGAATTAACCAACACCCAGAATTTGGGTGTTGACCCGGTGAGCGTGATTCATGGGGGAAAT GAACGGGGAACGTATGTCTGTAAGGAACTGGTGTATGCCTATGCAATGTGGATCAGCCCGTCATTCCATC TGAAGGTGATCCGTACTTTCGATATGGTAACCAGCGCGCCGGAAAAGTTATCCGGACAGGCTGCTGACAA GATGCAGGCTGGCGTGATTCTGCTGGACTTTATGCGCCGGGAGTTAAATCTGTCTAACTCATCTGTGCTT GGGGCCTGTCAGAAACTCCAGGAGGCTGTTGGCTTACCGAATCTGGCACCGCGCTATGCCATTGATGCAC CTGCTGACGCGCCTGATGGCTCAAGCCGCCCCACGCTGTCGCTGAGTGCACTGCTGAAGCAGTATGGTAT CCGCCTGACGGCTAATCAGGCATATCACCAGATGGCGAAGCTGGGGATCGTTGAACAACGCGAACGATAC AGCCGTACCGCGATTAACAACATCAAAAAATTCTGGTCGCTGACCGCGAAAGGCTGCATGTTCGGCAAGA ACATCACCAGTCCTGCAAATCCGCGCGAGACGCAGCCGCATTTCTTCGAATCCCGATTTCCTGAGCTGTT AAAGCTGCTCGATACCGTTCATTGAGGTGACTGTGAGAGCACTACTGACCCCTGAAATTGCCCCGCGTAT GGGGATCGTATTGTTCAGGCCAGGTTCAGAGCTGATGCCCCTGTTTATGCAGGGGCGTGTCCTGCTGGAG CCTGAGCCGGAACGTTATTCATCTTTTGCCAGTGGTGCCGTTCCGGCTGCATCACAACCGCTGGCGGATG 
ATCCTGCCGTTCGGGCCGTGTTCCGCAATGAGGCAGTGATCCGTCGTGCTGGTGGCGTGGAATGTCTTGA AAGCTGGTTACTTCGTGAAAAAGGCTGCCAGTGGCCTCATTCCGACTGGCACAGCGAGAACATGACCACA ATGCGACACGCGCCGGGCGCAATCCGTCTGTGCTGGCACTGCGATAACCAACTGCGCGATCAGTTCACGG AACGGCTGGAATCAATGGCAACGGATAACTGTGCCCGCTGGGTGTTGTCTGTTGTGCGTCGGGATCTCGG TTTTGATGACAGTCACGTTGTGACAATGCCGGAACTGTGCTGGTGGCTGATTCGTAATGATCTGGCGGAT GCCTTACCGGAAAGTGCAGCCCGTAAGGCACTGAGATTACCGAAGCCTGTTGTGCCGTCTGTCACCCGGG AAAGTGACCTGGTGCCTTCGGTTCCTGCCACCAGCATCATCCAGGATAAAGCGAAAAAGGTGCTGGCGCT GAAAGTGGATCCGGAGTCGCCGGACTCTTTTATGTTACGCCCAAAACGTCGCCGCTGGGTTAATGAAAAG TACACGCGATGGGTTAAGACGCAGCCGTGCGCATGTTGTGGAAAACCTGCTGATGATCCCCACCACCTGA TAGGCCACGGTCAGGGTGGAATGGGGACAAAAGCGCATGACCTCTTTGTGTTGCCTTTGTGCAGAAAGCA TCACGACGAGCTGCATGCGGATAATGTGGCATTCGAAGAGAAGTATGGCTCCCAGCTGGAGCTGATATTT CGTTTTATCGATCGTGCGCTGACGATTGGTGTGCTGGCCTGATTTTGTGGAGAAAGTTGATGCGTGATAT TCAGATGGTTCTTGAGCGTTGGGGAGCGTGGGCGGCTAATAATCATGAAGATGTGACCTGGTCGTCCATT GCCGCCGGTTTTAAGGGATTAATTCCTTCAAAAGTAAAATCTCGCCCGCAATGTTGTGACGATGACGCGA TGATCATTTGCGGGTGCATGGCCCGTCTGAAAAAGAACAACAGCGATTTGCACGATTTATTAGTAGATTA TTATGTAGTCGGTATGACATTCATGTCACTGGCAGGTAAGCATTGCTGCTCTGATGGTTATATCGGGAAA AGGTTACAAAAAGCTGAGGGCATAATTGAAGGGATGTTAATGGCATTAGATATCCGGTTAGAGATGGATA TCGTTGTTAATAACTCTAATTAATATGCCAATTGTTTACTAAAAATTATTAAAAATGGGGCGTTGAGACG CCCCCAAAAATAAAGGGTAATATATAACAGAAGGTTTATATAGTTAGAAGCAAGGTTGTGCTTCTAAAGG AAGTGGCTTGAGGGAGCCACTTATATGTTGGGGAGGCAACGCCTCCCGCAACATATCTTTTTCGTAATCA GATTAGAACTGGTAAACCAGACCTACAGCAACGATGTCATCAGTATCAATACCAGCTGTTTTGGTAAACT TACTATCGTCAATTAAGTTGATTTTGTAATCAACAAAAGTGGACATGTTTTTATTAAAGTAGTAAGTAGC ACCGACATCGACATACTTGACTAAGTCTCGGTCACCATGAACACCAAGGTCTTTACCTTTTGACTGAAGG TAAGCAACAGATGGGCGCAGACCGAAGTCAAACTGATATTGTGCTACTGCTTCAAAGTTTTGTGCTTTGT TTGCAATATGGTTATTACCAAAAACGGTCATATTCTGAGTTTCAGAATATGTGGTAGCCAGATAGATATT GTTCGCATCATATTTCAGGCCTGCAGCCCATACTTCCGCATTTTTGCCGGAGGCATTGAATTTGCTCTTA CCATAGGCGACCTGACCGTCAGTGCGATCTGATTTAGCATAGGTTGCACCCACGCCGAATCCTTCATACT CATAAGTAGTGGAGAAACCGAAACCATCACCATTGGCTTCAGTTACGTCAGTGCGGTCATTTTTACCCTG 
ATACTGAGCAGCAAAGTTCAGGCCATCGACCAGACCAAAGAAGTCGTTGTTACGATAAGTTGCAACACCA GTAGTGCGACCAGTCATGAACACATCTGTTTGGGTCCAGGTATCACCACCGAATTCTGGCAGGACGTCAG TCCACGCACCGATGTCGTATGCTACACCGTAGTTACGGCCGTAATCGATTGAGCCGTAATCACCAAATTT CAGGCCTGCAAATGCAAGACGGGTTTTGTCTTTGGAAGAACCTTGAGATTCAGCGCGGTTGCCTTTGAAT TCATATTCCCACTGACCGAAACCAGTCAGTTGATCGTTGATTTGGGTTTCACCTTTGAAGCCAAGACGGG CATAAGTAGTATCACCATCATCTGCATCATTAGAGGAGAAGTAGTGCTTAGCATTAACTTTCCCGTACAG ATCCAGCTTGTTACTGTCTTTATTATAAATTTCAGCTGCCTGAGCAGACATCGCCATCAGTACTGATGCA GCTACAGCAGAAATTGCCACTGTTAATTTTTTCATCGTGAGCCCTTTTTTTTGAACTATTATTAAAAAAT GATGTCACTGCGCGATAAATATTCATCTAATCAATGTGATTATTTCAAGATGTAAGTTTTGGTTTCTCGT TTGATTTGTGAAGTAGATCTCTATTTTTATCTGAACTTTTTTCTATCGAATCCTATTCATAGCTCTTGGC TGAATAAAAATAAATCTATTAGCCAATTTATACTAACGGTTGTTATTTATAAGTGCTCTATAATTTGAAG GTTCAATTTAAATCGGCTAAAAATAACACTGGAAATTATTTGTTGGTTATTTGTTGAGATTTGCTTATGT ATTTGTAGTGGTGTTTTCAATACTCGGTAGCATTCTCGCAAATATCATTTAGTGGTTTACGTACGTAAAA AATTGGTTATGCTGTTAAGAGTGGTTACTTCGTCACACAGCTTAAACCCGCCGTCGAGCGGGTTTTTCCA TTTTTTGAGTCTCGATATTAGCTGATAACCCAATACCTGAGTTATTCACTGACTTCGAGTCTGTTACGTT TCGTAGTATTCCCTCAATTTACACCCGCTTTGTCTGCGAGGTGGGGTTATGAAATCCATGGATAAGTTAA CAACGGGTGTCGCCTATGGCACCTCAGCAGGTAGTGCCGGGTACTGGTTTTTACAGCTGCTAGATAAAGT CACGCCCTCACAGTGGGCGGCAATTGGAGTGCTGGGTAGCCTGGTATTTGGCCTGCTGACGTACCTGACA AACCTTTATTTCAAGATTAAAGAAGATAAGCGTAAGGCTGCGAGAGGTGAATAATGTCGCCATCATTACG CAAGGCTGTAGCAGCTGCTATTGGTGGTGGGGCTGTTGCCATAGCGTCTGTGCTCATCACTGGTCCGAGT GGTGACGATGGTCTGGAAGGTGTCAGCTACATACCATATAAAGATATTGTTGGTGTATGGACTGTATGTC ACGGGCATACAGGAAAAGACATCATGCTCGGTAAAACGTATACCAAAGCAGAATGCAAAGCACTCTTGAA TAAAGACCTTGCCACTGTCGCCAGACAAATTAACCCGTACATCGAAGTCGGTAATGACTCCAACTTATTG ATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGC GACTTCCGTCCCAGCCGTGCCAGGTGCTGCCTCAGATTCAGGTTATGCCGCTCAATGCGCTGCGTATATC GCTTGCTGATAACGTGCAGTTCTCCCTTCAGGCGTGATTCATAAAGCGGCCAGCCATCCGTCATCCATAC CACGACCTCAAAGGCCGACAGCAGGCCCAGAAGACGCTCCAGCGTGGCCAGCGTGCGTTCACCGAATACG TGCGCAACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGA 
CATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCGCGAGGTTAC CGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGTGCAGTTG CCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTA AGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTG CCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCAC CTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCGTTATCGTTCATGAGAAGCATAA CGTAAAGGGAAAAGCTCGATTAGACGGCAGAATTTGTCAGGGGTTATGAACGAAATTCATAAATCTGTTT GAGTGTTGCGATGGGTAGTGCAAGTTCGATATCTCCGCAATTTACAGTCCGATGAAGGAAAATGAATATC CATAAAAAATATATTGGTTTATCCTGGCATATATACCTATTTCGACGTATTTCCAATAGTTTTAATTAAA GGCAGGTCATTGTTATTCACTCTGAATAGTGAATTATTCACTGTCCGCAGAGTAAGAAATATAACTTAGG TATCTATTTAATGACTTGCACAAAAAGCTAAATTTTCCCCCATAAATAAAAATATAATCCCGCGCCCAAC CACCTGATGAGTGGCTATAGGCACTGGATATATTAGGTGGCGGTGCACTTTCTTACATAAAGGTATTTCC TTTTCTGCGGAAAAGGAAATCGGGAAATCCCCGGTTTTTCTGACAAGCAGACGCCATTATTTGTGTCTGC CTATGTTCGTTAATTCGTTCATCAGGAAATTATCTCAATGTCACATTATAAAACAGGTCATAAACAACCA CGATTTCGTTATTCAGTTCTGGCCCGCTGCGTGGCGTGGGCAAATATCTCTGTTCAGGTTCTTTTTCCAC TCGCTGTCACCTTTACCCCAGTAATGGCGGCACGTGCGCAGCATGCGGTTCAGCCACGGTTGAGCATGGG AAATACTACGGTAACTGCTGATAATAACGTGGAGAAAAATGTCGCGTCGTTTGCCGCAAATGCCGGGACA TTTTTAAGCAGTCAGCCAGATAGCGATGCGACACGTAACTTTATTACCGGAATGGCCACAGCTAAAGCTA ACCAGGAAATACAGGAGTGGCTCGGGAAATATGGTACTGCGCGCGTCAAACTGAATGTCGATAAAGATTT CTCGCTGAAGGATTCTTCGCTGGAAATGCTTTATCCGATTTATGATACGCCGACAAATATGTTGTTCACT CAGGGGGCAATACATCGTACAGACGATCGTACTCAGTCAAATATTGGTTTTGGCTGGCGTCATTTTTCAG GAAATGACTGGATGGCGGGGGTGAATACTTTTATCGATCATGATTTATCCCGTAGTCATACCCGCATTGG TGTTGGTGCGGAATACTGGCGCGATTATCTGAAACTGAGCGCCAATGGTTATATTCGGGCTTCTGGCTGG AAAAAATCGCCGGATATTGAGGATTATCAGGAACGCCCGGCGAATGGCTGGGATATTCGTGCTGAGGGCT ATTTACCCGCCTGGCCGCAGCTTGGCGCAAGCCTGATGTATGAACAGTATTATGGCGATGAAGTCGGGCT GTTTGGTAAAGATAAGCGCCAGAAAGACCCGCATGCTATTTCTGCCGAGGTGACCTATACGCCAGTGCCT CTTCTGACACTGAGCGCCGGGCATAAGCAGGGCAAGAGTGGTGAGAATGACACTCGCTTTGGCCTGGAAG TTAATTATCGGATTGGCGAACCTCTGGCGAAACAACTCGATACAGACAGCATTCGCGAGCGTCGGGTACT 
GGCAGGCAGCCGCTATGACCTGGTTGAGCGTAATAACAACATCGTTCTTGAGTACCGCAAATCTGAAGTG ATCCGTATTGCTCTGCCTGAACGTATTGAAGGTAAGGGGGGTCAGACACTTTCCCTGGGGCTTGTGGTCA GCAAAGCAACTCACGGACTGAAAAATGTGCAGTGGGAAGCGCCGTCATTACTGGCTGAGGGTGGCAAAAT TACCGGTCAGGGTAGTCAGTGGCAAGTAACGCTCCCGGCTTATCGTCCAGGCAAAGACAATTATTATGCG ATTTCTGCGGTTGCCTACGATAACAAAGGCAATGCCTCAAAACGCGTGCAGACAGAGGTGGTCATTACCG GAGCAGGTATGAGCGCCGATCGCACGGCGTTAACGCTTGACGGTCAGAGCCGTATTCAAATGCTTGCTAA CGGTAATGAGCAAAGACCGCTGGTGCTGTCTCTGCGCGACGCCGAGGGGCAGCCAGTCACGGGCATGAAA GATCAGATCAAGACTGAACTAGCCTTCAAACCGGCTGGAAATATTGTGACTCGTTCCCTGAAGGCCACTA AATCACAGGCAAAGCCAACACTGGGTGAGTTCACCGAAACTGAAGCAGGGGTGTATCAGTCTGTCTTTAC TACCGGAACGCAGTCAGGTGAGGCAACGATTACTGTTAGCGTTGATGGCATGAGCAAAACCGTCACTGCA GAACTGCGGGCCACGATGATGGATGTGGCAAACTCCACCCTGAGCGCTAACGAGCCGTCAGGTGATGTGG TTGCTGATGGTCAGCAAGCCTATACGTTGACGTTGACTGCGGTGGACTCCGAGGGTAATCCGGTGACGGG AGAAGCCAGCCGCTTGCGATTTGTTCCGCAAGACACTAATGGTGTAACCGTTGGTGCCATTTCGGAAATA AAACCAGGCGTTTACAGCGCCACGGTTTCTTCGACCCGTGCCGGAAACGTTGTTGTGCGTGCTTTCAGCG AGCAGTATCAGCTGGGCACATTACAACAAACGCTGAAGTTTGTTGCCGGTCCGCTTGATGCAGCACATTC GTCCATCACCCTGAATCCTGATAAACCGGTGGTTGGCGGTACAGTTACGGCAATCTGGACGGCAAAAGAT GCCTATGACAACCCTGTGACCAGCCTCACGCCGGAAGCGCCGTCATTAGCGGGTGCCGCTGCTGTAGGTT CTACGGCATCTGGCTGGACAAATAATGGTGATGGGACGTGGACTGCGCAGATTACTCTCGGCTCTACGGC GGGTGAATTAGAAGTTATGCCGAAGCTAAATGGACAGGATGCGGCAGCAAATGCGGCAAAAGTAACCGTG GTGGCTGATGCGTTATCTTCAAACCAGTCGAAAGTCTCTGTCGCAGAAGATCACGTAAAAGCCGGCGAAA GCACAACCGTGACGCTTATTGCAAAAGATGCACATGGCAACACTATCAGTGGTCTTTCGTTGTTGGCAAG TTTGACGGGGACCGCCTCTGAAGGGGCGACCGTTTCCAGTTGGACCGAAAAAGGTGACTGTTCCTATGTT GCTACGTTAACTACAGGCGGAAAGACGGGCGAGCTTCGTGTCATGCCGCTCTTCAACGGCCAGCCAGCAG CCACCGAAGCCGCGCAGTTGACGGTCATCGCCGGAGAGATGTCATCAGCGAACTCTACGCTTGTTGCGGA CAATAAGGCTCCGACCGTCAAAATGACGACGGAACTCACCTTCACCGTGAAGGATGCGTACGGGAACCCG GTCACCGGGCTGAAGCCAGATGCACCAGTGTTTAGCGGTGCCGCCAGCACGGGGAGTGAGCGTCCTTCAG CAGGAAACTGGACAGAGAAAGGTAATGGGGTCTACGTGGCGACCTTAACGCTGGGATCTGCCGCGGGTCA GTTGTCTGTGATGCCGCGAGTGAACGGCCAAAATGCCGTTGCTCAGCCACTGGTGCTGAACGTTGCAGGT 
GACGCATCTAAGGCTGAGATTCGTGATATGACAGTGAAGGTTAATAACCAACTGGCTAATGGACAGTCTG CTAACCAGATAACCCTGACCGTCGTGGACAGCTATGGTAACCCGTTGCAGGGGCAAGAAGTTACGCTGAC TTTACCGCAGGGTGTGACCAGCAAGACGGGGAATACAGTAACAACCAATGCGGCAGGGAAAGTGGACATT GAGCTTATGTCAACGGTTGCGGGGGAACACAGCATCACGGCCTCAGTGAATAATGCTCAGAAGACGGTTA CGGTGAAATTCAAGGCGGATTTCAGTACCGGTCAGGCGACCCTGGAGGTTGATGGCAGCACGCCAAAAGT GGCAAACGACAATGATGCCTTTACGCTGACGGCAACGGTTAAGGATCAATACGGCAACCTTCTGCCTGGC GCTGTGGTCGTCTTTAATCTGCCTCGGGGCGTCAAACCGCTTGCAGACGGTAATATCATGGTGAACGCCG ACAAGGAGGGTAAAGCGGAACTGAAAGTGGTCTCCGTGACTGCCGGAACGTATGAGATCACGGCGTCGGC AGGAAATGACCAGCCTTCGAATGCGCAGTCTGTAACGTTTGTGGCCGATAAGACTACGGCGACCATCTCC AGTATTGAGGTGATTGGCAACCGTGCAGTGGCGGATGGCAAAACCAAACAGACGTATAAAGTTACGGTGA CTGATGCCAATAACAACCTGTTGAAGGATAGCGACGTGACGCTGACTGCCAGCTCGGAAAATTTAGTTCT GGATCCTAAAGGGACGGCGAAAACTAATGAGCAAGGACAGGCTGTTTTCACCGGCTCTACCACTATCGCA GCGACATATACACTCACGGCGAAAGTGGAACAGGCCAACGGTCAGGTATCGACGAAAACTGCTGAATCTA AATTCGTCGCGGATGATAAAAACGCGGTGCTCGCCGCATCTCCAGAACGTGTAGATTCTCTGGTGGCGGA CGGGAAGACTACTGCAACAATGACGGTTACCCTGATGGCGGGAGTCAATCCCGTAGGAGGAAGTATGTGG GTCGACATTGAGGCTCCGGAAGGAGTGACGGAGAAGGATTATCAATTCCTGCCGTCGAAGGCTGACCATT TCTCAGGTGGGAAAATCACGCGTACATTTAGTACCAGCAAGCCAGGTGTCTATACGTTCACATTCAACGC ACTGACGTATGGCGGGTACGAAATGACGCCTGTGAAGGTGACAATTAACGCCGTTGCTGCAGAGACTGAA AATGGCGAGGAGGAGATGCCATAA
Region | 1 |
Region Length | 44.3Kb |
Completeness(score) | questionable(80) |
Specific Keyword | integrase,injection,lysin,transposase,tail |
Region Position | 58370-102703 |
# tRNA | 0 |
# Total Proteins | 36 |
# Phage Hit Proteins | 28 |
# Hypothetical Proteins | 7 |
Phage + Hypothetical Protein % | 97.2% |
# Bacterial Proteins | 1 |
Attachment Site | yes |
# Phage Species | 12 |
Most Common Phage Name(hit genes count) | PHAGE_Entero_mEp460_NC_019716(15) PHAGE_Shigel_SfII_NC_021857(15) PHAGE_Shigel_SfIV_NC_022749(13) PHAGE_Entero_SfV_NC_003444(13) PHAGE_Entero_cdtI_NC_009514(12) PHAGE_Salmon_ST64B_NC_004313(8) PHAGE_Salmon_SE1_NC_011802(2) PHAGE_Stx2_converting_I_NC_003525(2) PHAGE_Salmon_Fels_1_NC_010391(2) PHAGE_Entero_P1_NC_005856(2) PHAGE_Salmon_vB_SemP_Emek_NC_018275(2) PHAGE_Salmon_ST160_NC_014900(2) PHAGE_Entero_ST64T_NC_004348(2) PHAGE_Bacter_APSE_2_NC_011551(1) PHAGE_Entero_P4_NC_001609(1) PHAGE_Cronob_vB_CsaM_GAP32_NC_019401(1) PHAGE_Entero_Sf6_NC_005344(1) PHAGE_Clostr_phiC2_NC_009231(1) PHAGE_Pectob_My1_NC_018837(1) PHAGE_Entero_lambda_NC_001416(1) PHAGE_Acyrth_pisum_secondary_endosymbiont_1_NC_000935(1) PHAGE_Entero_HK022_NC_002166(1) PHAGE_Pseudo_YuA_NC_010116(1) PHAGE_Cyprin_1_NC_019491(1) PHAGE_Entero_BP_4795_NC_004813(1) PHAGE_Entero_HK140_NC_019710(1) PHAGE_Salmon_SP_058_NC_021772(1) PHAGE_Vibrio_PVA1_NC_023605(1) PHAGE_Stx2_converting_1717_NC_011357(1) PHAGE_Entero_phiP27_NC_003356(1) PHAGE_Salmon_c341_NC_013059(1) PHAGE_Salmon_SPN9CC_NC_017985(1) PHAGE_Stenot_S1_NC_011589(1) PHAGE_Stx2_converting_II_NC_004914(1) PHAGE_Entero_YYZ_2008_NC_011356(1) PHAGE_Synech_S_CBS2_NC_015463(1) PHAGE_Pseudo_MP1412_NC_018282(1) PHAGE_Escher_TL_2011c_NC_019442(1) PHAGE_Entero_ST104_NC_005841(1) PHAGE_Cronob_ENT47670_NC_019927(1) PHAGE_Escher_P13374_NC_018846(1) PHAGE_Stx1_converting_NC_004913(1) PHAGE_Pseudo_M6_NC_007809(1) |
First Most Common Phage # | 16 |
First Most Common Phage % | 41.66% |
GC % | 48.69% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains an intact or incomplete prophage based on the above criteria. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
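As a rough check of two of the derived fields defined above, the following Python sketch recomputes "Phage + Hypothetical Protein %" from the counts reported for Region 1 (28 phage hit proteins, 7 hypothetical proteins, 36 total proteins) and shows one way a GC percentage can be computed from a nucleotide string. The function names are illustrative and are not part of PHASTER, and rounding to one decimal place is an assumption about how the page displays the value.

```python
def phage_plus_hypothetical_pct(phage_hits: int, hypothetical: int, total: int) -> float:
    """Combined share of phage-hit and hypothetical proteins, in percent."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * (phage_hits + hypothetical) / total


def gc_percent(seq: str) -> float:
    """Percentage of G and C nucleotides in a DNA string (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# Counts taken from the Region 1 table above: (28 + 7) / 36.
region1_pct = round(phage_plus_hypothetical_pct(28, 7, 36), 1)
print(region1_pct)  # 97.2, the value reported for Region 1
```

The same `gc_percent` helper could be applied to the region sequence or to a short attachment-site string; it counts only unambiguous G and C characters, so ambiguity codes such as N are treated as non-GC.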
Questionable (score 70-90)
Incomplete (score < 70)
Score: | The score of the region based on the above criteria. |
Criteria for scoring prophage regions (as intact, questionable, or incomplete):
Method 1:
- If the number of hits to a certain phage organism in this table is greater than or equal to 100% of the total number of CDS of the region, the region is given a total score of 150. If it is less than 100%, methods 2 and 3 are used.
- If the number of hits to a certain phage organism in this table is more than 50% of the total number of CDS of the region, that phage organism is considered the major potential phage for that region; the percentage of the region's total proteins accounted for by that phage organism in this table is calculated and then multiplied by 100; the percentage of the length of that phage organism in this table in the length of the region is calculated and then multiplied by 50 (the phage head's encapsulation capability is considered).
- If any of the specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin') are present, the score will be increased by 10 for each keyword found.
- If the size of the region is greater than 30 Kb, the score will be increased by 10.
- If there are at least 40 proteins in the region, the score will be increased by 10.
- If all of the phage-related proteins and hypothetical proteins constitute more than 70% of the total number of proteins in the region, the score will be increased by 10.
If the region's total score is less than 70, it is marked as incomplete; if it is between 70 and 90, it is marked as questionable; if it is greater than 90, it is marked as intact.
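The additive bonuses and the final thresholds described above can be sketched in Python as follows. This is only an illustration under stated assumptions: it covers the keyword, size, protein-count and percentage bonuses plus the classification step, and it leaves out the phage-organism percentage calculations. The keyword tuple contains only the examples named in the criteria (the text says "such as", so the real list may be longer), keyword detection by simple substring search is a simplification, and all function names are hypothetical. The sketch is not claimed to reproduce the score of 80 reported for Region 1.

```python
# Keywords named as examples in the criteria; the actual list may be longer.
KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
            "coat", "transposase", "portal", "terminase", "protease", "lysin")


def additive_score(protein_names, region_length_bp, n_proteins, phage_plus_hypo_pct):
    """Sum the +10 bonuses from the keyword, size, count and percentage rules."""
    text = " ".join(protein_names).lower()
    # +10 for each distinct keyword found anywhere in the protein names.
    score = 10 * sum(1 for kw in KEYWORDS if kw in text)
    if region_length_bp > 30_000:   # region larger than 30 Kb
        score += 10
    if n_proteins >= 40:            # at least 40 proteins
        score += 10
    if phage_plus_hypo_pct > 70:    # phage + hypothetical proteins above 70%
        score += 10
    return score


def classify(score):
    """Map a total score to the completeness label used on this page."""
    if score < 70:
        return "incomplete"
    if score <= 90:
        return "questionable"
    return "intact"


# Hypothetical input, not taken from this report.
names = ["integrase", "tail fiber protein", "portal protein", "hypothetical protein"]
total = additive_score(names, region_length_bp=35_000, n_proteins=45,
                       phage_plus_hypo_pct=80.0)
print(total, classify(total))
```

Treating a score of exactly 70 or exactly 90 as questionable follows the wording "less than 70" and "greater than 90" in the criteria.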
gi|430944337|ref|ANTA01000011.1| Escherichia coli KTE44 acEod-supercont1.2.C11, whole genome [asmbl_id: NC_000000].126399, gc%: 50.28%
Download details as .txt file: detail.txt
Hits against Bacterial Database or GenBank File
Region 1, total 40 CDS
# | CDS Position | BLAST Hit | E-Value |
---|---|---|---|
1 | 58370..58381 | attL | 0.0 |
2 | 64854..64865 | attL | 0.0 |
3 | 70500..72851 | PHAGE_Pseudo_MP1412_NC_018282: integrase; WGI_00733; phage(gi399528988) | 1e-39 |
4 | 73028..73258 | hypothetical protein; WGI_00734 | 0.0 |
5 | 73248..73985 | hypothetical protein; WGI_00735 | 0.0 |
6 | 74588..74761 | hypothetical protein; WGI_00736 | 0.0 |
7 | 74766..76232 | PHAGE_Bacter_APSE_2_NC_011551: injection gp20; WGI_00737; phage(gi212499730) | 1e-113 |
8 | 76232..78337 | PHAGE_Entero_Sf6_NC_005344: gene 13 protein; WGI_00738; phage(gi41057291) | 3e-86 |
9 | 79059..79160 | hypothetical protein; WGI_00739 | 0.0 |
10 | 79581..80543 | PHAGE_Clostr_phiC2_NC_009231: putative abortive infection bacteriophage resistance protein ORF 37; WGI_00740; phage(gi134287370) | 2e-25 |
11 | complement(80545..81000) | hypothetical protein; WGI_00741 | 0.0 |
12 | 81153..81434 | PHAGE_Shigel_SfII_NC_021857: PerC transcriptional activator family protein; WGI_00742; phage(gi526244676) | 6e-05 |
13 | 82038..82580 | hypothetical protein; WGI_00743 | 0.0 |
14 | complement(82570..82932) | hypothetical protein; WGI_00744 | 0.0 |
15 | 83051..83062 | attR | 0.0 |
16 | complement(83160..84323) | PHAGE_Shigel_SfII_NC_021857: integrase; WGI_00745; phage(gi526244664) | 0.0 |
17 | complement(84550..84855) | PHAGE_Shigel_SfII_NC_021857: hypothetical protein; WGI_00746; phage(gi526244666) | 8e-52 |
18 | complement(84855..85217) | PHAGE_Shigel_SfII_NC_021857: hypothetical protein; WGI_00747; phage(gi526244667) | 4e-66 |
19 | complement(85208..85744) | PHAGE_Shigel_SfII_NC_021857: hypothetical protein; WGI_00748; phage(gi526244668) | 1e-97 |
20 | complement(85872..86696) | PHAGE_Shigel_SfIV_NC_022749: hypothetical protein; WGI_00749; phage(gi557307559) | 1e-151 |
21 | complement(86762..87124) | PHAGE_Entero_mEp460_NC_019716: hypothetical protein; WGI_00750; phage(gi428782351) | 4e-63 |
22 | 87757..87768 | attR | 0.0 |
23 | complement(87827..88474) | PHAGE_Shigel_SfII_NC_021857: CI repressor; WGI_00751; phage(gi526244671) | 9e-114 |
24 | 88617..88877 | PHAGE_Shigel_SfII_NC_021857: Cro repressor; WGI_00752; phage(gi526244672) | 3e-42 |
25 | 88870..89427 | PHAGE_Shigel_SfII_NC_021857: transcriptional regulator; WGI_00753; phage(gi526244673) | 9e-98 |
26 | 89424..89762 | PHAGE_Shigel_SfII_NC_021857: hypothetical protein; WGI_00754; phage(gi526244674) | 4e-59 |
27 | 89772..90713 | PHAGE_Shigel_SfII_NC_021857: O protein family protein; WGI_00755; phage(gi526244675) | 0.0 |
28 | 90716..91204 | PHAGE_Shigel_SfII_NC_021857: PerC transcriptional activator family protein; WGI_00756; phage(gi526244676) | 1e-85 |
29 | 91593..91856 | PHAGE_Shigel_SfII_NC_021857: DNA adenine methylase; WGI_00757; phage(gi526244677) | 9e-46 |
30 | 91853..92179 | PHAGE_Shigel_SfII_NC_021857: LexA DNA binding domain protein; WGI_00758; phage(gi526244678) | 5e-55 |
31 | 92176..92565 | PHAGE_Shigel_SfII_NC_021857: RusA family protein; WGI_00759; phage(gi526244679) | 6e-71 |
32 | 92585..93394 | PHAGE_Shigel_SfII_NC_021857: KliA-N domain protein; WGI_00760; phage(gi526244680) | 8e-130 |
33 | 93402..94391 | PHAGE_Shigel_SfII_NC_021857: hypothetical protein; WGI_00761; phage(gi526244681) | 0.0 |
34 | 94409..94792 | PHAGE_Entero_YYZ_2008_NC_011356: antitermination protein Q; WGI_00762; phage(gi209427762) | 8e-57 |
35 | complement(94982..96064) | outer membrane porin protein LC; WGI_00763 | 0.0 |
36 | 96638..96853 | PHAGE_Stx2_converting_II_NC_004914: holin; WGI_00764; phage(gi302393164) | 2e-28 |
37 | 96853..97209 | PHAGE_Entero_cdtI_NC_009514: lysin; WGI_00765; phage(gi148609440) | 5e-47 |
38 | complement(97144..97521) | PHAGE_Shigel_SfIV_NC_022749: IS1 transposase B; WGI_00766; phage(gi557307573) | 1e-64 |
39 | complement(97566..97841) | PHAGE_Entero_P1_NC_005856: InsA; WGI_00767; phage(gi46401643) | 8e-49 |
40 | 98447..102703 | PHAGE_Cronob_vB_CsaM_GAP32_NC_019401: long tail fiber proximal subunit; WGI_00768; phage(gi414087138) | 4e-12 |
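The CDS Position column uses two forms, `start..end` for the forward strand and `complement(start..end)` for the reverse strand. A minimal Python sketch for parsing these strings is shown below. It assumes only the two forms that appear in this table, and the function names are hypothetical rather than part of any PHASTER tooling.

```python
import re

# Matches either "complement(START..END)" or "START..END", nothing else.
_POSITION = re.compile(r"^(?:complement\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))$")


def parse_cds_position(text):
    """Return (start, end, strand) for a CDS Position string from the table."""
    match = _POSITION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised position: {text!r}")
    if match.group(1) is not None:
        # complement(...) form: feature lies on the reverse strand.
        return int(match.group(1)), int(match.group(2)), "-"
    return int(match.group(3)), int(match.group(4)), "+"


def cds_length(text):
    """Length in base pairs, counting both end coordinates."""
    start, end, _strand = parse_cds_position(text)
    return end - start + 1


print(parse_cds_position("complement(80545..81000)"))  # (80545, 81000, '-')
print(cds_length("58370..58381"))  # 12
```

The 12 bp length for `58370..58381` agrees with the 12-nucleotide attachment-site sequence listed for that position.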
>58370..58381
TTGCGAGATATG
>64854..64865
ATGGGGGTACTT
>70500..72851
MRNIDLIRQVISASENNWPHVLGCLNINVPDSPRRHAPCPACGGKDRFRFDDNGRGSFICNQCGAGDGLDLIKRVNNCDTTEAALLAADVLGIDYRTTETPEATSQKREQLETERQRREQERLKRAEKDEQQRRDTFSRQFDDMRRKAVNGKSDYLVAKGVGDFTFPVLPDGSLLLALVDKSGAVTAAQTITPHGEKRLLTGSAKRGAYHAINAQKRPHSIIIAEGVATALSCHLIRPDAMTVAAIDAGNLLPVAEVMRRTYPQAQIIIAADNDHQQGDSESGGINTGKDAAERAAISVAGRVSLPPTDYKADWNDYHQQHGLAAATAAFKDSMYQPRGKGAQVKNHKQSVGALNEISSGEVLSDDEIAVLEEINRTFTHVTIGGKHKVVSLKPSQTGGVSHVFEDLSQFQHYFHHKPRVARKLAGSAWLSWSGKNYKPGGVGFYPVPDKCPDDVFNLYEGLALEPIEGDCTVYLNHLLQVVCAGNEEACQYLIQWMAHIIQKPDEKPSVAIVMKSVPGTGKGTTVKPLLQILGQYAAHINGAGHISGRFNSILANKLLVFADEVTIHKPSEADRLKAIISEPTFNLERKGIDAEPMPNFARLIFASNSTQVLQAGIRERRYLVLEPSPEKAQSREYFDRLYSWLNDGGAAKLLWHLKGVDLSGFDPQRAPQTDALREEILLGLSGVELFLYGELINEPPFNGEVRLFAKDMVSRFVAWSLERGEKLKEPAARSLLGKSLAQMGLVKHGRPDRGNGVFYELPEVGVLQAAFARLIGMGGYDVF
>73028..73258
MMSTPFYKVRQLASSSGWQLRFEGRSDWLPIAAWANVEMCIDGDTVEIIIPCVASRDGCIEPTDMAAEIREVKHEQ
>73248..73985
MNNNYCIPQGMTRTEREELKSFATQCGNAGDIQSLERTLIMIAHWMRQGQRVSFTEYASQWTEAQRERSDGNHSTPEMAKQWPFSGKRCISPGGSDYYPAGVGDEPCCDETEIRHAVTVITAEYPQFNLDGLALHNRNADWENPLDNPSFIVSAKSCLRWIRDNGMSNAQIESFPQDNPTSDTLKHEVERYNQINHQHSDHPHYIPNGAFIAAMVASGYKVKPAGRMNAFFNISKKGLCAAMGKN
>74588..74761
MAISVKPVLISEKQMEAIKKIQEEQRKKSEVGVAPTIHEIARGLMDKALAYTLTGRG
>74766..76232
MATWQQGINSGGFLAGIGTQNENAPKASDINATLGLIRENNELARSGANNVALTGLRGLAGVADIYKQQQQQERKAAFQKGYADAYASGDREQMRNLITAFPEEFEEVRKGMSYVDDAQRDDYGNLALKAQVASSLGPGAFGRFMMDNEQEMRRLGIPPETIAEMQVNDPQGFQHFAGNLALFSLGHEKYFDIKDQMEGRDIERGKLAETIRSNQAGEALQARGQDISRANALTSAYAPTAAMQNYNQYAQMLKADPEGAAAFAAAAGINTNAKKLMSVRENDDGTVTKYYTDGSEEQGKLNQPISGDGFRPIALPTAQKIMEKSPEGAKKAAGFAYRVRDALDSIDTLKDQLSPQRVAIINNALGNGTLANLTLSPAEQQYVVNANDAIMAILRQETGASILPAEMSKYYQMYFPQPGDSTKTIDTKRRKMENQFNSLKAASGRTYDALRVISAVDRGTASSSQTLPQSEQVSQPAASSNFSSLWGD
>76232..78337
MAKAWKDVIASPQYQALAPEQKAQAQEQYFNEVVAPKAGESVEQAKQAFYAAYPLPSTNAIDRSQSATQNIQHTSSDNSLASGYAKLATQQREGLERSAEQGASLGAAMRDAITGESRMTPEMERLQNVASAPELNSLSMDALKAGWSQLFGSDASQEKILQGMGATLRQDEKGNTIVSLPSGDYALNKPGLSPQDLTSFLANALAFTPAGRAGTVLGAIGKSAATDLALQGATSLAGGEDIDPLQTVISAGIGGIGKGLENTASAVSRAVRGDMSPEAKAAVDFASERNLPLMTSDMLKDKTFMQSQAQTLGERVPFFGTGKNRLNQQQARENLVRTFSDGLGGISDKQLYESATKGQQKFIEAAGKRYNRIIDAMGDTPVDLSNTVKAIDNQIAVLSRPGKSQDRAAVKVLQQFKDDITSGPNDLRLARENRTDLRKRFMASSDTVDKDTLQKASDIIYKAYTADMKKAVAKNLGADEAINMARVDRSWSKFNDMMGRTRVQKAIASGKATPEDVTKLVFSQSPSERSQLYRLLDDNGRQNARAAIVQNAVDKATDPSGNISVEKFINALHRNRKQSATFFKGVHGKELDGVIKYLNDTRHAAKANVQNLNGQQLYGLLVGGGIINAAVLAGMLKTAAFVVPAAGAVGGAAKAYESPVIRNALLRLANTPKGSTAYDRAISTVTQSLTRVAQASQKESQ
>79059..79160
MTMFYAGTPITCIWCDKSRYMTSHVVLVVITTF
>79581..80543
MAVPVKVHKEYDELVDLLLSRGMDVPDREHAIKKISQVGYYRLSGFWYPCRIPHITTDNIRTRLDQVRPGTNFRAVYDLYLFDKNLRLQMMNALERIEVYVRSVIAHELGKISPLAYLDDSLINPKHFRPRSHGRPSAREEWINKHNAEIAKSREDFIKWHENKYEGLPFWVVIEVWDFGLMSKYYAMLKDSYRNRILSRLGIAAGNGAIFQNWLSAMNVLRNRCAHHSRIWNKVNEPRLMPLPNEPYFEQLNMNDDAYERMYGMIAVLWFLVKKIGPGSDWIKKVANLVDNKPNLPGCNLTAMGLPDNNGFPRHLFDIE
>complement(80545..81000)
MVFAMSKSNLVAFRIPSELQDEFNRSVLASGGDKTSWLVDAIRMKLGQPEKSIDSRMLGLVERMEKAAASLIAGKPNIPPKPYNETAVIKIIADTIRQGFDNGRVIAERINEAGYQTKAGKAWDKDIYSAWKRQGSNAEKLSVALHSAPSL
>81153..81434
MIRDRKAEELESKGLYRRAAARWMEVMLLCTEDDDREWIKRRRETCLENVKRPPVKVEDFGDLHKAVTETQHRMGIAQPNGNAFRLNGGKRQR
>82038..82580
MYQLKITIRDSKPPIWRRVLVPEQIPFSKLHAVIQLAFGWNDEHLYMFEKGRKGDPGSEYRVWGEDESMGNAAITPLWAALQNEGDKLVYTYDFGDWWDCVIVLEKQTHDTSNQPISCLRGKGTTPAENSGGLHGYNELLLQARESDNPEQAEIHNFLMLDIERRVYDLSRINDRLQAIY
>complement(82570..82932)
MRMTSRKKEILSYFEPDNLEWVTGEIGAPPFDVSGVAYLLHGMVSFDKRHQLESTRRTLESMVAGGLLERVTVYESRQIRRGGDTNATVVRYGLPGQCAVMRDTGGADNAISGEYMRVSK
>83051..83062
ATGGGGGTACTT
>complement(83160..84323)
MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDMTFEEACIRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYMPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV
>complement(84550..84855)
MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMTQQEVK
>complement(84855..85217)
MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA
>complement(85208..85744)
MSFIKTFSGKHFYYDKINKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNDLSELRKCA
>complement(85872..86696)
MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVTFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA
>complement(86762..87124)
MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH
>87757..87768
TTGCGAGATATG
>complement(87827..88474)
MKIADLAEAIGVDAANISRLETGKQKQFTEQALSNIARSLGVDIADLFTSDVKSNTVCKNSISEDVAQVKDVFRIEMLDVSASAGNGLIQGGDVIDVIHAIEYRTDNAVSMFGGRPANHIKVINVRGDSMCPTIEPGDLIFVDVSINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNQIYREWGITSENEHRFMVFGKVLISQSQTLKRHN
>88617..88877
MQSPLRNVRKAHGFTLQHVAAGVQVNPATLSRIERLEQIPSIDLAERLANFFKGEISEMQILYPARFQSSQNQNGFKPQEQEVSRG
>88870..89427
MGNHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCVSESNDARECAAPGVVASIASGCGETNA
>89424..89762
MNSLTTHYRRSQLIALPVPGGKAKVEYCYAVNVPGDREIVTHSFAEWAVGDFNRQKETVLCDKLTAGSKITTECPSESFVGSRKHNGLSTSVKAMSMSASARSNSFVVNSGK
>89772..90713
MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDTSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQTAEQDFLTRHPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTHTEKREDPYFKASYDNVDYSQIPAGFRG
>90716..91204
MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQSVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA
>91593..91856
MWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA
>91853..92179
MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRAAHWLHERGLKV
>92176..92565
MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA
>92585..93394
MNNLMVIDGIEVRRDAYGRYSLNDLHRAAVASGANARTKEPGKFLSSQQTVELVHELTNTQNLGVDPVSVIHGGNERGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMAKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH
>93402..94391
MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPDSFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADNVAFEEKYGSQLELIFRFIDRALTIGVLA
>94409..94792
MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN
>complement(94982..96064)
MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKNDRTDVTEANGDGFGFSTTYEYEGFGVGATYAKSDRTDGQVAYGKSKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFGNNHIANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVHGDRDLVKYVDVGATYYFNKNMSTFVDYKINLIDDSKFTKTAGIDTDDIVAVGLVYQF
>96638..96853
MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE
>96853..97209
MSPSLRKAVAAAIGGGAVAIASVLITGPSGDDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIEVGNDSNLLIVFYVQIMPDDFVMQLHRF
>complement(97144..97521)
MDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLGLLSAFEVVVWMTDGWPLYESRLKGELHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
>complement(97566..97841)
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
>98447..102703
MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLAKQLDTDSIRERRVLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSLLAEGGKITGQGSQWQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGNEQRPLVLSLRDAEGQPVTGMKDQIKTELAFKPAGNIVTRSLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRFVPQDTNGVTVGAISEIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDAYDNPVTSLTPEAPSLAGAAAVGSTASGWTNNGDGTWTAQITLGSTAGELEVMPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVKAGESTTVTLIAKDAHGNTISGLSLLASLTGTASEGATVSSWTEKGDCSYVATLTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGEMSSANSTLVADNKAPTVKMTTELTFTVKDAYGNPVTGLKPDAPVFSGAASTGSERPSAGNWTEKGNGVYVATLTLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVNNQLANGQSANQITLTVVDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIELMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQATLEVDGSTPKVANDNDAFTLTATVKDQYGNLLPGAVVVFNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASAGNDQPSNAQSVTFVADKTTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSDVTLTASSENLVLDPKGTAKTNEQGQAVFTGSTTIAATYTLTAKVEQANGQVSTKTAESKFVADDKNAVLAASPERVDSLVADGKTTATMTVTLMAGVNPVGGSMWVDIEAPEGVTEKDYQFLPSKADHFSGGKITRTFSTSKPGVYTFTFNALTYGGYEMTPVKVTINAVAAETENGEEEMP
Questionable (score 70-90)
Incomplete (score < 70)
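The score bands in this legend can be written as a small lookup. This is a minimal sketch, not the tool's own code: the function name `completeness_label` is hypothetical, and the "intact" band (score above 90) is an assumption taken from the tool's usual legend, since only the two bands above appear in this listing.

```python
def completeness_label(score: int) -> str:
    """Map a region score to a completeness label.

    Bands taken from the legend: questionable is 70-90, incomplete is < 70.
    The "intact" band (> 90) is an assumption; it is not shown in this listing.
    """
    if score > 90:
        return "intact"  # assumed band, see docstring
    if score >= 70:
        return "questionable"
    return "incomplete"


# Region 1 in the summary table has score 80.
print(completeness_label(80))
```

Under these bands, region 1 (score 80) falls in the questionable range, which agrees with the summary table.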
ORF Start: 58370
ORF Stop: 58381
Strand: Forward
Protein Sequence: TTGCGAGATATG
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attL
Homolog/Ortholog E-Value: N/A
ORF Start: 64854
ORF Stop: 64865
Strand: Forward
Protein Sequence: ATGGGGGTACTT
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attL
Homolog/Ortholog E-Value: N/A
ORF Start: 70500
ORF Stop: 72851
Strand: Forward
Protein Sequence: MRNIDLIRQVISASENNWPHVLGCLNINVPDSPRRHAPCPACGGKDRFRFDDNGRGSFICNQCGAGDGLDLIKRVNNCDTTEAALLAADVLGIDYRTTETPEATSQKREQLETERQRREQERLKRAEKDEQQRRDTFSRQFDDMRRKAVNGKSDYLVAKGVGDFTFPVLPDGSLLLALVDKSGAVTAAQTITPHGEKRLLTGSAKRGAYHAINAQKRPHSIIIAEGVATALSCHLIRPDAMTVAAIDAGNLLPVAEVMRRTYPQAQIIIAADNDHQQGDSESGGINTGKDAAERAAISVAGRVSLPPTDYKADWNDYHQQHGLAAATAAFKDSMYQPRGKGAQVKNHKQSVGALNEISSGEVLSDDEIAVLEEINRTFTHVTIGGKHKVVSLKPSQTGGVSHVFEDLSQFQHYFHHKPRVARKLAGSAWLSWSGKNYKPGGVGFYPVPDKCPDDVFNLYEGLALEPIEGDCTVYLNHLLQVVCAGNEEACQYLIQWMAHIIQKPDEKPSVAIVMKSVPGTGKGTTVKPLLQILGQYAAHINGAGHISGRFNSILANKLLVFADEVTIHKPSEADRLKAIISEPTFNLERKGIDAEPMPNFARLIFASNSTQVLQAGIRERRYLVLEPSPEKAQSREYFDRLYSWLNDGGAAKLLWHLKGVDLSGFDPQRAPQTDALREEILLGLSGVELFLYGELINEPPFNGEVRLFAKDMVSRFVAWSLERGEKLKEPAARSLLGKSLAQMGLVKHGRPDRGNGVFYELPEVGVLQAAFARLIGMGGYDVF
Homolog/Ortholog Species: Integrase
Homolog/Ortholog Protein: WGI_00733, integrase, phage(gi399528988), PHAGE_Pseudo_MP1412_NC_018282
Homolog/Ortholog E-Value: 1e-39
ORF Start: 73028
ORF Stop: 73258
Strand: Forward
Protein Sequence: MMSTPFYKVRQLASSSGWQLRFEGRSDWLPIAAWANVEMCIDGDTVEIIIPCVASRDGCIEPTDMAAEIREVKHEQ
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00734, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 73248
ORF Stop: 73985
Strand: Forward
Protein Sequence: MNNNYCIPQGMTRTEREELKSFATQCGNAGDIQSLERTLIMIAHWMRQGQRVSFTEYASQWTEAQRERSDGNHSTPEMAKQWPFSGKRCISPGGSDYYPAGVGDEPCCDETEIRHAVTVITAEYPQFNLDGLALHNRNADWENPLDNPSFIVSAKSCLRWIRDNGMSNAQIESFPQDNPTSDTLKHEVERYNQINHQHSDHPHYIPNGAFIAAMVASGYKVKPAGRMNAFFNISKKGLCAAMGKN
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00735, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 74588
ORF Stop: 74761
Strand: Forward
Protein Sequence: MAISVKPVLISEKQMEAIKKIQEEQRKKSEVGVAPTIHEIARGLMDKALAYTLTGRG
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00736, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 74766
ORF Stop: 76232
Strand: Forward
Protein Sequence: MATWQQGINSGGFLAGIGTQNENAPKASDINATLGLIRENNELARSGANNVALTGLRGLAGVADIYKQQQQQERKAAFQKGYADAYASGDREQMRNLITAFPEEFEEVRKGMSYVDDAQRDDYGNLALKAQVASSLGPGAFGRFMMDNEQEMRRLGIPPETIAEMQVNDPQGFQHFAGNLALFSLGHEKYFDIKDQMEGRDIERGKLAETIRSNQAGEALQARGQDISRANALTSAYAPTAAMQNYNQYAQMLKADPEGAAAFAAAAGINTNAKKLMSVRENDDGTVTKYYTDGSEEQGKLNQPISGDGFRPIALPTAQKIMEKSPEGAKKAAGFAYRVRDALDSIDTLKDQLSPQRVAIINNALGNGTLANLTLSPAEQQYVVNANDAIMAILRQETGASILPAEMSKYYQMYFPQPGDSTKTIDTKRRKMENQFNSLKAASGRTYDALRVISAVDRGTASSSQTLPQSEQVSQPAASSNFSSLWGD
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00737, injection gp20, phage(gi212499730), PHAGE_Bacter_APSE_2_NC_011551
Homolog/Ortholog E-Value: 1e-113
ORF Start: 76232
ORF Stop: 78337
Strand: Forward
Protein Sequence: MAKAWKDVIASPQYQALAPEQKAQAQEQYFNEVVAPKAGESVEQAKQAFYAAYPLPSTNAIDRSQSATQNIQHTSSDNSLASGYAKLATQQREGLERSAEQGASLGAAMRDAITGESRMTPEMERLQNVASAPELNSLSMDALKAGWSQLFGSDASQEKILQGMGATLRQDEKGNTIVSLPSGDYALNKPGLSPQDLTSFLANALAFTPAGRAGTVLGAIGKSAATDLALQGATSLAGGEDIDPLQTVISAGIGGIGKGLENTASAVSRAVRGDMSPEAKAAVDFASERNLPLMTSDMLKDKTFMQSQAQTLGERVPFFGTGKNRLNQQQARENLVRTFSDGLGGISDKQLYESATKGQQKFIEAAGKRYNRIIDAMGDTPVDLSNTVKAIDNQIAVLSRPGKSQDRAAVKVLQQFKDDITSGPNDLRLARENRTDLRKRFMASSDTVDKDTLQKASDIIYKAYTADMKKAVAKNLGADEAINMARVDRSWSKFNDMMGRTRVQKAIASGKATPEDVTKLVFSQSPSERSQLYRLLDDNGRQNARAAIVQNAVDKATDPSGNISVEKFINALHRNRKQSATFFKGVHGKELDGVIKYLNDTRHAAKANVQNLNGQQLYGLLVGGGIINAAVLAGMLKTAAFVVPAAGAVGGAAKAYESPVIRNALLRLANTPKGSTAYDRAISTVTQSLTRVAQASQKESQ
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00738, gene 13 protein, phage(gi41057291), PHAGE_Entero_Sf6_NC_005344
Homolog/Ortholog E-Value: 3e-86
ORF Start: 79059
ORF Stop: 79160
Strand: Forward
Protein Sequence: MTMFYAGTPITCIWCDKSRYMTSHVVLVVITTF
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00739, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 79581
ORF Stop: 80543
Strand: Forward
Protein Sequence: MAVPVKVHKEYDELVDLLLSRGMDVPDREHAIKKISQVGYYRLSGFWYPCRIPHITTDNIRTRLDQVRPGTNFRAVYDLYLFDKNLRLQMMNALERIEVYVRSVIAHELGKISPLAYLDDSLINPKHFRPRSHGRPSAREEWINKHNAEIAKSREDFIKWHENKYEGLPFWVVIEVWDFGLMSKYYAMLKDSYRNRILSRLGIAAGNGAIFQNWLSAMNVLRNRCAHHSRIWNKVNEPRLMPLPNEPYFEQLNMNDDAYERMYGMIAVLWFLVKKIGPGSDWIKKVANLVDNKPNLPGCNLTAMGLPDNNGFPRHLFDIE
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00740, putative abortive infection bacteriophage resistance protein ORF 37, phage(gi134287370), PHAGE_Clostr_phiC2_NC_009231
Homolog/Ortholog E-Value: 2e-25
ORF Start: 80545
ORF Stop: 81000
Strand: Backward
Protein Sequence: MVFAMSKSNLVAFRIPSELQDEFNRSVLASGGDKTSWLVDAIRMKLGQPEKSIDSRMLGLVERMEKAAASLIAGKPNIPPKPYNETAVIKIIADTIRQGFDNGRVIAERINEAGYQTKAGKAWDKDIYSAWKRQGSNAEKLSVALHSAPSL
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00741, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 81153
ORF Stop: 81434
Strand: Forward
Protein Sequence: MIRDRKAEELESKGLYRRAAARWMEVMLLCTEDDDREWIKRRRETCLENVKRPPVKVEDFGDLHKAVTETQHRMGIAQPNGNAFRLNGGKRQR
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00742, PerC transcriptional activator family protein, phage(gi526244676), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 6e-05
ORF Start: 82038
ORF Stop: 82580
Strand: Forward
Protein Sequence: MYQLKITIRDSKPPIWRRVLVPEQIPFSKLHAVIQLAFGWNDEHLYMFEKGRKGDPGSEYRVWGEDESMGNAAITPLWAALQNEGDKLVYTYDFGDWWDCVIVLEKQTHDTSNQPISCLRGKGTTPAENSGGLHGYNELLLQARESDNPEQAEIHNFLMLDIERRVYDLSRINDRLQAIY
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00743, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 82570
ORF Stop: 82932
Strand: Backward
Protein Sequence: MRMTSRKKEILSYFEPDNLEWVTGEIGAPPFDVSGVAYLLHGMVSFDKRHQLESTRRTLESMVAGGLLERVTVYESRQIRRGGDTNATVVRYGLPGQCAVMRDTGGADNAISGEYMRVSK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00744, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 83051
ORF Stop: 83062
Strand: Forward
Protein Sequence: ATGGGGGTACTT
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attR
Homolog/Ortholog E-Value: N/A
ORF Start: 83160
ORF Stop: 84323
Strand: Backward
Protein Sequence: MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDMTFEEACIRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYMPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV
Homolog/Ortholog Species: Integrase
Homolog/Ortholog Protein: WGI_00745, integrase, phage(gi526244664), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 0.0
ORF Start: 84550
ORF Stop: 84855
Strand: Backward
Protein Sequence: MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMTQQEVK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00746, hypothetical protein, phage(gi526244666), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 8e-52
ORF Start: 84855
ORF Stop: 85217
Strand: Backward
Protein Sequence: MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00747, hypothetical protein, phage(gi526244667), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 4e-66
ORF Start: 85208
ORF Stop: 85744
Strand: Backward
Protein Sequence: MSFIKTFSGKHFYYDKINKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNDLSELRKCA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00748, hypothetical protein, phage(gi526244668), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 1e-97
ORF Start: 85872
ORF Stop: 86696
Strand: Backward
Protein Sequence: MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVTFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00749, hypothetical protein, phage(gi557307559), PHAGE_Shigel_SfIV_NC_022749
Homolog/Ortholog E-Value: 1e-151
ORF Start: 86762
ORF Stop: 87124
Strand: Backward
Protein Sequence: MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00750, hypothetical protein, phage(gi428782351), PHAGE_Entero_mEp460_NC_019716
Homolog/Ortholog E-Value: 4e-63
ORF Start: 87757
ORF Stop: 87768
Strand: Forward
Protein Sequence: TTGCGAGATATG
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attR
Homolog/Ortholog E-Value: N/A
ORF Start: 87827
ORF Stop: 88474
Strand: Backward
Protein Sequence: MKIADLAEAIGVDAANISRLETGKQKQFTEQALSNIARSLGVDIADLFTSDVKSNTVCKNSISEDVAQVKDVFRIEMLDVSASAGNGLIQGGDVIDVIHAIEYRTDNAVSMFGGRPANHIKVINVRGDSMCPTIEPGDLIFVDVSINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNQIYREWGITSENEHRFMVFGKVLISQSQTLKRHN
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00751, CI repressor, phage(gi526244671), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 9e-114
ORF Start: 88617
ORF Stop: 88877
Strand: Forward
Protein Sequence: MQSPLRNVRKAHGFTLQHVAAGVQVNPATLSRIERLEQIPSIDLAERLANFFKGEISEMQILYPARFQSSQNQNGFKPQEQEVSRG
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00752, Cro repressor, phage(gi526244672), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 3e-42
ORF Start: 88870
ORF Stop: 89427
Strand: Forward
Protein Sequence: MGNHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCVSESNDARECAAPGVVASIASGCGETNA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00753, transcriptional regulator, phage(gi526244673), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 9e-98
ORF Start: 89424
ORF Stop: 89762
Strand: Forward
Protein Sequence: MNSLTTHYRRSQLIALPVPGGKAKVEYCYAVNVPGDREIVTHSFAEWAVGDFNRQKETVLCDKLTAGSKITTECPSESFVGSRKHNGLSTSVKAMSMSASARSNSFVVNSGK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00754, hypothetical protein, phage(gi526244674), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 4e-59
ORF Start: 89772
ORF Stop: 90713
Strand: Forward
Protein Sequence: MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNRNASNVYQLNVAKLQAAAFSQLSDSDPSKSDASKSDTSKFDASKSGKKAGFHPSESGGDPSVKSKHDPSDKKPSRPDASQPDTQTAEQDFLTRHPDAVVFSPKKRQWGTQDDLTCAQWLWKKIIALYEQAAECDGEVVRPKEPNWTAWANEIRLMCVQDGRTHKQICEMYSRVSRDPFWCRNVLSPSKLREKWDELSLRLSPSVSTHTEKREDPYFKASYDNVDYSQIPAGFRG
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00755, O protein family protein, phage(gi526244675), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 0.0
ORF Start: 90716
ORF Stop: 91204
Strand: Forward
Protein Sequence: MSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQSVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00756, PerC transcriptional activator family protein, phage(gi526244676), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 1e-85
ORF Start: 91593
ORF Stop: 91856
Strand: Forward
Protein Sequence: MWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00757, DNA adenine methylase, phage(gi526244677), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 9e-46
ORF Start: 91853
ORF Stop: 92179
Strand: Forward
Protein Sequence: MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRAAHWLHERGLKV
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00758, LexA DNA binding domain protein, phage(gi526244678), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 5e-55
ORF Start: 92176
ORF Stop: 92565
Strand: Forward
Protein Sequence: MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVIPEGKVEITISKYEKTAGAAA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00759, RusA family protein, phage(gi526244679), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 6e-71
ORF Start: 92585
ORF Stop: 93394
Strand: Forward
Protein Sequence: MNNLMVIDGIEVRRDAYGRYSLNDLHRAAVASGANARTKEPGKFLSSQQTVELVHELTNTQNLGVDPVSVIHGGNERGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMAKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00760, KliA-N domain protein, phage(gi526244680), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 8e-130
ORF Start: 93402
ORF Stop: 94391
Strand: Forward
Protein Sequence: MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPDSFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADNVAFEEKYGSQLELIFRFIDRALTIGVLA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: WGI_00761, hypothetical protein, phage(gi526244681), PHAGE_Shigel_SfII_NC_021857
Homolog/Ortholog E-Value: 0.0
ORF Start: 94409
ORF Stop: 94792
Strand: Forward
Protein Sequence: MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00762, antitermination protein Q, phage(gi209427762), PHAGE_Entero_YYZ_2008_NC_011356
Homolog/Ortholog E-Value: 8e-57
ORF Start: 94982
ORF Stop: 96064
Strand: Backward
Protein Sequence: MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKNDRTDVTEANGDGFGFSTTYEYEGFGVGATYAKSDRTDGQVAYGKSKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFGNNHIANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVHGDRDLVKYVDVGATYYFNKNMSTFVDYKINLIDDSKFTKTAGIDTDDIVAVGLVYQF
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: WGI_00763, outer membrane porin protein LC
Homolog/Ortholog E-Value: N/A
ORF Start: 96638
ORF Stop: 96853
Strand: Forward
Protein Sequence: MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00764, holin, phage(gi302393164), PHAGE_Stx2_converting_II_NC_004914
Homolog/Ortholog E-Value: 2e-28
ORF Start: 96853
ORF Stop: 97209
Strand: Forward
Protein Sequence: MSPSLRKAVAAAIGGGAVAIASVLITGPSGDDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIEVGNDSNLLIVFYVQIMPDDFVMQLHRF
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00765, lysin, phage(gi148609440), PHAGE_Entero_cdtI_NC_009514
Homolog/Ortholog E-Value: 5e-47
ORF Start: 97144
ORF Stop: 97521
Strand: Backward
Protein Sequence: MDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLGLLSAFEVVVWMTDGWPLYESRLKGELHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: WGI_00766, IS1 transposase B, phage(gi557307573), PHAGE_Shigel_SfIV_NC_022749
Homolog/Ortholog E-Value: 1e-64
ORF Start: 97566
ORF Stop: 97841
Strand: Backward
Protein Sequence: MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00767, InsA, phage(gi46401643), PHAGE_Entero_P1_NC_005856
Homolog/Ortholog E-Value: 8e-49
ORF Start: 98447
ORF Stop: 102703
Strand: Forward
Protein Sequence: MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLAKQLDTDSIRERRVLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSLLAEGGKITGQGSQWQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGNEQRPLVLSLRDAEGQPVTGMKDQIKTELAFKPAGNIVTRSLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRFVPQDTNGVTVGAISEIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDAYDNPVTSLTPEAPSLAGAAAVGSTASGWTNNGDGTWTAQITLGSTAGELEVMPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVKAGESTTVTLIAKDAHGNTISGLSLLASLTGTASEGATVSSWTEKGDCSYVATLTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGEMSSANSTLVADNKAPTVKMTTELTFTVKDAYGNPVTGLKPDAPVFSGAASTGSERPSAGNWTEKGNGVYVATLTLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVNNQLANGQSANQITLTVVDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIELMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQATLEVDGSTPKVANDNDAFTLTATVKDQYGNLLPGAVVVFNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASAGNDQPSNAQSVTFVADKTTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSDVTLTASSENLVLDPKGTAKTNEQGQAVFTGSTTIAATYTLTAKVEQANGQVSTKTAESKFVADDKNAVLAASPERVDSLVADGKTTATMTVTLMAGVNPVGGSMWVDIEAPEGVTEKDYQFLPSKADHFSGGKITRTFSTSKPGVYTFTFNALTYGGYEMTPVKVTINAVAAETENGEEEMP
Homolog/Ortholog Species: Fiber protein
Homolog/Ortholog Protein: WGI_00768, long tail fiber proximal subunit, phage(gi414087138), PHAGE_Cronob_vB_CsaM_GAP32_NC_019401
Homolog/Ortholog E-Value: 4e-12
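The records above repeat the same seven labeled fields, so they can be read into dictionaries with a few lines of code. The sketch below is illustrative only: `parse_orf_records`, `fasta_header`, and the short key names are hypothetical, and treating the "Homolog/Ortholog Species" field as a category label is an interpretation of how it is used in this listing (it holds values such as "Integrase" or "Attachment site"). The sample text is copied from two records above.

```python
from typing import Dict, List

# Field labels exactly as they appear in the listing, mapped to short keys.
# The short keys are hypothetical names chosen for this sketch.
FIELDS = {
    "ORF Start": "start",
    "ORF Stop": "stop",
    "Strand": "strand",
    "Protein Sequence": "sequence",
    "Homolog/Ortholog Species": "category",  # used as a category in this listing
    "Homolog/Ortholog Protein": "hit",
    "Homolog/Ortholog E-Value": "evalue",
}

SAMPLE = """\
ORF Start: 58370
ORF Stop: 58381
Strand: Forward
Protein Sequence: TTGCGAGATATG
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attL
Homolog/Ortholog E-Value: N/A
ORF Start: 97566
ORF Stop: 97841
Strand: Backward
Protein Sequence: MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: WGI_00767, InsA, phage(gi46401643), PHAGE_Entero_P1_NC_005856
Homolog/Ortholog E-Value: 8e-49
"""


def parse_orf_records(text: str) -> List[Dict[str, str]]:
    """Split the listing into one dict per record; a new record starts at 'ORF Start'."""
    records: List[Dict[str, str]] = []
    current: Dict[str, str] = {}
    for line in text.splitlines():
        label, sep, value = line.strip().partition(": ")
        if not sep or label not in FIELDS:
            continue  # skip legend lines and anything else that is not a known field
        if label == "ORF Start" and current:
            records.append(current)
            current = {}
        current[FIELDS[label]] = value.strip()
    if current:
        records.append(current)
    return records


def fasta_header(rec: Dict[str, str]) -> str:
    """Rebuild the '>start..stop' header style used in the protein listing."""
    span = f"{rec['start']}..{rec['stop']}"
    return f">complement({span})" if rec["strand"] == "Backward" else f">{span}"


records = parse_orf_records(SAMPLE)
for rec in records:
    print(fasta_header(rec), rec["category"], rec["evalue"])
```

Records marked "Strand: Backward" correspond to the `>complement(start..stop)` headers in the protein listing. Attachment-site records carry a short nucleotide string in the "Protein Sequence" field, so downstream code should not assume that field is always a protein.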