Submission Results
Sequence Name: Not Available
GenBank Accession Number: SINM01000001.1 open_in_new
GenInfo (GI) Number: 0000000000 open_in_new
Download Results: SINM01000001.1.PHASTER.zip
gi|00000000|ref|SINM01000001.1| Rhizobium leguminosarum strain SM113 chrom_SM113, whole genome 5037985, gc%: 61.05%
Download summary as .txt file: summary.txt file_download
Total: 3 prophage regions have been identified, of which 1 regions are intact, 2 regions are incomplete, and 0 regions are questionable.
Region | Region Length | Completeness | Score | # Total Proteins | Region Position | Most Common Phage | GC % | Details |
---|---|---|---|---|---|---|---|---|
1 | 10Kb | incomplete | 50 | 12 | 559068-569116 info_outline | PHAGE_Entero_phi92_NC_023693(4) | 58.52% | Show info_outline |
2 | 43.3Kb | intact | 110 | 44 | 1811452-1854766 info_outline | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431(18) | 58.98% | Show info_outline |
3 | 8.6Kb | incomplete | 30 | 9 | 1907906-1916536 info_outline | PHAGE_Agroba_Atu_ph07_NC_042013(2) | 58.73% | Show info_outline |
>1 559068-569116 ATGTCGGTCGCGCGCAGCATATTTCCTGCCTATAGTCGCGCGAGAATGGAATCGCCGGTT CATCCGGCGGTTCCAAGGGATGCAGGATTGGCGGAGGCGGCCATCGCGAGCGAAGCGGAT ATCAGGAGCGGTCTGACGGAAAATCTGGCAAGGCTCTGGCGTTATGGTCTCGTTCTCTCG CATCAGCGTGATGTCGCCGACGACCTGGTCCAGGCGACCTGCCTTCGCGCGCTCGAGCGC GCCGATCAGTTCATCCCCGGCACCCGGCTCGACCGTTGGCTGTTTTCGATCCTGCACTCG ATCTGGCTGAACGAAATTCGCTCCCGCCGGGTGCGCCAGGGCCAGGGGTTCGTCGATGCC GGGGAGACGCTGACCTTTGACGGCGCGCACGACACCGAAACGCATGTGATGGCCCATCAG GTGCTGAAACAGGTGAATGCGCTGCCTGAGGCGCAGAGGACGGTGGTTTTCCTCGCCTAT GTGGAAGGACTTTCCTATCGCGAGGTTGCAGGCATATTGGATATCCCGATCGGAACCGTG ATGAGCCGGCTGGCTGCTGCCCGCGCCAAGCTCTCCGGCGCCGGACCGGAAGGGGGACGG CAATGAACACGAAACACACCATTCCCTCCGACGAGGACCTGACCGCCTTCATCGACGGCG AGCTGACGGCCGAAGAGGCCGCGCGCATTCAAACCATGGTGGAGGAAGACGAGAGCACCG CCGAACGGCTGGAATTCCTGGCGCGCGCCAGCCTGCCGTTCAAGCAGGCCTTCGCCCCGC TGCTGTCGGAAGCCCCGCGCGAGAAGCTGGAGACGATTCTTGCCGCCATCCCGGCGCAGC CGAGCGCAAGACCCGCCTCCGCGCCCGCATTCGCCAGCCGTCGCCGCTTCCTCGGCGCGC TCGCCGCCTCGCTCGTCGCTGGCATCGCCATCGACCGCGCCGTCATCGGCATCGGCAGGA GCTTCTCGGCAAAGGACGAAAACAGCGAATGGCGCGCCGTGGTCGCCGATTATATCTCGC TCTATACCCCGGAAACCCTGGCCGGCCCCGTACCCGCTAGGGAAGATCAGGCCGCCCAGC TCGGCCCGCTCGACGAAAAACTCGGCCTGTCGCTCTCCCCCGAAGCCGTCTCGCTCCCGG GGATCGATTTCAAACGCGCCCTGCTGCTGCAATATGACGGTAAGGCGCTTGCCCAGGTCG CCTATCTCGACCCCGAGACCGGCCCGATGGCGCTCTGCATCGTCCGCTCCGATGCCGGTC CCAAAGCGCCGGACGTCGAAAACCGCAAAGGCATGAATGTCGTCTACTGGTCGAACGAGA CGCACGCCTTCATGCTGATCGGCCGCATTCCGGGCGACCGCATGAAGGAGTTGGGAGAGG ACGTCAGGAGGAGGCTTTCAGCCTAACCTCTATATTCTGCCAAATCGCAGCTATCTAATC GTGAGCGCAACGAATGGCCCACGGCTTTCGGGATTGACGGAATAGGTTTCTGGCTCTCGG TCAGAACTTCTGCTTTGTCCGCTTCGGCATAAAGTAGGTTCGGATATTGACGAGTGCTTC GGCGACGTCAGCAGCATCCTGGCGGGTGGCTTTCCGTTTTGCCTCGCCGAGCTTGCGGAT GGGTGCGATGATATCGCCAGTGACAGCATGACCCCGAAGGACGGGCTGCCAAAGCCTGGG CGCGTACCCTGTTCCCAGCAGAAAGCAGAGGTCCCAATCGAAGATATCAAAGGTCTGATC ATCGATCCTGGTAAACCTGGGCCGGTGGTCTTTCGGTGTTTCGGAAAGGCTCGCAGAGAT GCGGTTATACTCCGCAACGATCGTCTGCACGGCCTGATGTTCGGTTGTTTCCATCGGCAG GTTCAAGGCATCCGGACCGGTCAGTTCCGGGATCCATTGGCGTGGATCGATGAACTTCGG ACCGATGATGAGAGCTGTAAGATAGCCATCGAGACCGCTCATTGACCAGATTGGCGATGC CGGACGACGTCCCCTGATAAAGGCTTCGAACGCCTCGTCGTCGAGCTTCTGTCGTGCCGT CGTCTCCTGACCGTTCTGTGTCATGCAGCTCGTCGCTCCTGCTGTGTCATCGCTTCACGC TCAGCCTTCCAGGCCCAAGGCAGGAGACTTTCCATTTCGTTGGCTTTCACGTTGCAAGAG ATGATCCGCTCCAGCACATCGGCAAGCCAGTCCTCGGGATCCACGCCGTTAAGCTTTGCC GTGTTGACGAGCGATGCCAGGACAGCGAAGGACTTTCCACCCCGCTCGCTGCCCACAAAT AACGCGTTCTTTCTCGTCAGGGCCACGGATTTCATTGAACGTTCGACGACATTGGAGTCC ACCTCAACCCTGCCATCATCAAGGAAGGCTGTCAGCCCACCCCAGTGGTTGAGTGTGTAG GAGACCGCCTTGCCAAGCGTCGATTTCGGCGACACCTCGTCCCTCAGCTCGGTAAGCTTG GCCTTCAGTTCCCTCATGATGGGAGCTGCCTCGCGACGCCTGACTATAAGCCTGGTATCG GCATTTCCCCCGCGCAAGTCCGCTTCGATGCGATAGATTTCGGCAATCCTGGCAAGGATC GACAAGGCTTCCGAAGAGCCGGTCAGTTTGACGACATCAACGAACTTTCGCCGGGCGTGA GCAAGGCAGAAGGCCAACCGCATAGGGGCGACATTGCTCTTGCCCCGACGTTTGACCACG GTTTTATAGGCTTGGTATCCATCAACTTGCAGCACGCCGGCAAATGACGATAGTTGCCCC TCGATCTCGCGCGCGCTGCGGCTTTCGGCGAAGATGTACGCGACTGCCGGCGGTGCCGGA CCATTCCATGGGCGATCGTCGACCGCTTGTGCCCATAACTGGCAGACCTTGGTTCGTTTG CGCCCCGGATCGAGCCGCGGAAGCGGCGTCTCATCACAGAACACCCTCGGCTGCGAGCGG ATGAAGGCGGTGAGCGCATCGTAGAGAAGCTCAAGCCACCAGGCGACCCGCGTGACCCAG GCGCCAGCGTGCCGCGATCGATGATCACGCCGCAAGAGGCAAGCATCTGGGCTTGACGAT GAAGCGGCAGATGCCAGGCAAACTTCGAAACCGCGATATGAGCTGCAAAGGCCGTGGTCA CCATACCGCCATCCATCACGCGTGCTGGTGCGGGCGCTTGTACAATGGCATTCTCGCATG CCCGGCAGGCATAGCGTGGCCGGATCGTCCGCTTCACCCGGACAACCGCAGGCACGATGT CGAGCGCCTCGCTGACGTCCGTGCCAATACAATGGAGTTCGAACGAACAGCAGGGGCAAA TCTTGCTCTCCGGCTCGATGAGCTCGTCATAACGCGGAAGGTGCTTGAGCAAATGGCCGA TATTGCGAGCTGGCGATCGCCGTGCCTGGGTTTTGTCTTCACCGACCCGCGCGACATCGT CATTGGCAGCAACAGGAATGTCGCTCAGATCGCCAAGGTCAAGGATCGCCTGCGTCGGAT CGATTGTCGTCATCTTCTCCGATTTCGCCCCGAAGAGCTGCCCCTCAAGGAAGGCTACGC GCGCCTTGAGGTCGGCATTCTCCTCGTTGAGCGAGAGAATGATCCGGGTCAGTTGCGCAG CGTCCTGGGGTAAGGGATCGTGTCGAAGCGGCATGACAGATCCTACCACGAAGGCCGGAT TCCTCAAGCAAAACCAATAGGATCAACCCGCTTTCAAGGGGCGTTTCACCGGGTTCTGCT TCACCCTTGTCCAGTCAATACCGGCCAGCAAAAGTGAGAACTCTTCCCGTGTCATCTGCA TGGCGCCATCGCGGATCGGTGGCCAAACGAACTTGCCCGCCTCCAGCCACTTCGTCGCCA GGATCATGCCTGATCCGTCCCAATAAATGCAGCGAAGTCGATCGAGACGCTTGGCGCGGA ACACGAACACGTCGCCACAATAGGGATCAGCCGCAAGCGCTGACGCCACCAATGCCACCA GGCCATTCATGCCGCGCCGGAAGTCGACAGGTCGCGTCGCAACCATGATCTTCACTCCAC CAGGGGAAAGACCGATCACCGGCGACCCCGCAACGCCGTTACTACCGCTTGCGCCAGCTC GGGCGCTACCGAGCCGATCATGGTCAGCCGTGCGCCAGCAACCTCGATCTCAATCCGACC TGCGGCGAAGTGGGCGGCATCCGACGGCAAGGTCTCGGGCGACGTTCGCTCGCCAACCAC GGTCACCGGCACGAATATCGCCTGTTGCGCGCAGGCCTTGGCCGAGCGACGAGAGGTCAG CCCGGCATCCCGACGCCAAACATTCAGCAAGCCGCGATTGACGCCCCAGCGCCGAGCAAC AGCCGAGATGTTCACATCAGTTTCCGCGCTTTCCGCAAGGATGCGCGCCTTCTCCTCGTC AGTCCAATTCCGCCGCTGGCGCCGACCGGTGATCACCTCGATCCGACGATATTTTCCCTC ATGCCTGGCTTCATGCATGCCTTCATCCATGACTTCAAGCATGGCATCAGCGCGATCTCC AACCATCCCATTCCGCTCCTATCGGTGAAGGAGCTTATCTCGCGGCCTCGATCACAAAAG GAAAGGTGGGCTGTTCGTAGCGCTCACATCTAATCCACCAGACCACGCAAGTACTGGCCG TAATCCCCTATGCCCGCTTTGGCGGTAATCTGGGCGAAGTGCTCCTTCGTGATGAAGCCT TTCGCCAGCGCAATTTCCTCCGGACAAGCGATTTTGAGGCCCTGGCGTTTTTCAAGCGTG CGGACGAACTCGCCGGCCTCAAGGAGACTTTCGGGCGTGCCGGTATCGAGCCAGGCATAT CCCGGCCCATCATCGAGACACGCAGCTTCCCCCGCTCCAGATATGTCTTGTTGACATCGG TGATCTCGTACTCGCCGCGCGGTGATGGCTTTAGATCGGCTGCAATGTTCACCACGTCAG CATCGTAAAAATGAAGGCCGGTGATAGCCCAATGCGATTTGGGGTTCGCCGGCTTTTCTT CAATGGAAATTGCCGTCATATCGCTGCCGAATTCGACGACACCGTATCGTCCAGGATCGC GGACGTGATAGGCGAATACCGTCGCGCCGTCACCCTTCGGCACACCCTCGTCAAGCAGTT CGGGAAGGCCGTGGCCGAAATAGATATTGTCTCCAAGGATCAGACAGGATGGCCCGCCCG CCACGAAGTCTGCGCCGATGATGTAGGCCTAGGCCAGTCCATCGGGGCTCTGCTGAGCAG CATAACTCAGCGACATGCCCCAGTCGCTGTCATCACCCAGCAACCGCTGGAAAAGCGGCA TATCGTGCGGAGTGGAGATGATCAAAACTTCCCGTATGCCAGCCAGCATGAAGGTGGTCA GCGGGTACTAGATCATTGGCTTATCATAGATCGGAAGCAACTGCTTCGAGACGGAATGAG TCATCGGGTGCAGGCGACTGCCGCTCCCGCCCGCAAGCACAATACCTTTCATACGGCCTC CTTCGGCTCTTCCAAGAGCTTTGTCACAGCAGTGCGCGCTTGATATCCGCCACTCCGGCA TGCACACGCCATAAAGGCTCTTCAGCTTATCGCAACACTGTCCGGAATTGGGTGGCCGTT TCGCTGTTGTGGGATAGTCAGCGGTCGCAATGTCCTTGACGGTGAGACGTCTGCCGGTCT TTTCCTCGAGGCATCGTTTTCACGAAGTTGTGGCCAAACGTCGAATAGACCCAGGCCGTC CGCAGGATCACATGATTCTCATTGGCTGCGGCGACCGCATATTCTCCCTCGAGCTTCGAT CTGCCATAGTGATGCCGGTCCGACCGGATCGCTCTCGACATAGCGTTCGGGCTTGTCTCC ATCGAAAACATAATCGGTCGAAAGATGAATGATAGGAAGGGACAGCTCGGCAGCTGCTGC GGCGAATGCTCTTGCGCCGTCTCGATTAATGGCGAATGCCGCCGCCTCGTCGCTTTCGGC TTTGTCAACGGCGGTGTCGGCTGCCGACGAGACGACGACGTCAGGTTTGATTTTCGCAAT GATTTCACTCACCATCGAAGGCTCGAGCAAATCCAGCTCGGGCCTCCCTACGGCGATGAC CTCGGTACCTCCCGCGTACAGCGCTTGCAACGCAGACGCGACCTGGCCGTTCTTGCCTGT TACCACGATGCGCATCTCAGACCTTTTTCAACACGCCAAGCCGTTCACCGGAATACGCAC GCTCACGCAGCGGCTGCCACCACCATGCATTTTCCAGATACCACTCGACAGTCTTGCGGA TACCGCTGTCGAAATTCTCTAGTGCTCTCCAGTCGAGTTCGGTCTCGAGCTTGGTGGCAT CGACAGCATAGCGTGCATCGTGGCCCGGCCGATCGGTGACATAGTTTATAAGATCGCTAT GCGGCGCGGTTCCAGGCCGGACTTCATCCATGATCGCGCAAACGCGCTCGACCACCTCGA TGTTTCGCCGCTCGTTGCGCCCGCCGACATTGTATTTTTCGCCGGGACGACCTCGTTGCA CGATCAGCCAGAGCGCTCGTGCGTGATCGATGACATAGAGCCAATCGCGAATGTTGGAGC CGCTTCCATAGACGGGCAAAGGCTTCCGTTCCAGCGCATTCAATATGATGAGAGGAATGA GCTTTTCGGGGAAATGAAAAGGACCATAATTGTTCGAACAGTTGGAAATGATCACGGGCA GGCCGTAAGTCCTTTCCCACGCCGTTGCCAGGTGATCGCTCGCCGCCTTGGACGCGGAGT AGGGAGAGGACGGATCGTAAGGAGTCGTCTCGGCAAACAGCCCGTCTTCGCCGAGTGAAC CATAGACTTCGTCGGTGGACACATGCAGCGTCCTGAATGCCGCCTTTTCATACGCTGGAA GGTCTTGCCAATATTGCCGCGCGGCCTCCAACATGCTGAAGGTGCCGTTTATGTTCGTTT CGATAAAATCCGAAGCCCCGGTGATCGAGCGGTCGACGTGGCTTTCCGCCGCCAGATGCA CAACGTAATCGGGACGAAACTCCTCGAAAGCGTTGCTGACCGCGCTTCTGTCGCAAATGT CGGCTTTGAGAAACCTGTAGTTGGGAGCATTCTCGACTGCCTTGAGCGAAGCGAGATTGC CGGCATAGGTCAGCTTGTCGATATTGAGCACTTCGGCGCCGATGTCGCTGACGAGATAGC GAACCAAGGCCGAACCGATGAAGCCAGCGCCACCTGTTACAAGAACACGCATGAGGCCCT CCGTTTCACGAAAATTGAAAATATGCGGGGAGTTCGGAAAGCAGCGGCTGTTTGTTGTCT TTATCAGAAAGAACGTAGGCATCCATCTGCGGCCACTCGATCCCGATCTCGGCGTCGTTC CACCGCACGCCACGGTCATGCTCCGCGCTGTAGGGAGCCGTGACCCTGTAACTGATTTTC GTATCAGGCTGCAAAGTGACGAACCCATGCGCAAACCCGGCCGGCAACCAAAGCTGTTCG CCATTTTCCGGGGACAGTTCTGCGGAGACCCACTTCCCGTACGTCGGAGAGCCATGTCTG ATATCGACAGCCACATCGAGGAGCGCGCCGCGCAGGCAGCGTACAAGTTTGCCCTGCGCA AAGGGAGACAACTGAAAATGCAATCCTCGGACCGTCCCCGGTTGCGCTGAAAGGGATTCA TTATCCTGCACGAAGGAAACGTCGGCCACGTTACTGCGGAACCAAGAATCTTTGAAGACT TCGCTGAAATAGCCCCTCGAGTCACCAAATCTGGCAGGGGTGACCTTCTTGACGCCTTCG ATCGCCAGTTGCTCTACATTCAACGTATGGATCCTTGGGCTAGGCTCGACCGCTGAATGT GACGGCGGGTGGGGCCGGCAATATAGCCAATTGCAGAGCATATACATTGGCGAAGGCGAG ACGGCAACTGGGGTGCGACGTGAGATGGTATTCGTCCACCTCGTCGCATCCGTACTGGCG CTGGCCGTAGTTGTGGGATGGTTCGTCCTTATGTCGCCAGTCGCGACCGGCACGAGACGC ATAGAGGATTTGCTTCAGTCGCTGGCGGAAGTAAAGATATCTAATAGCGGAAGTAAAGAT ATCAGGACATGAAATCGGCATATGCTCGCGCTATACCATCGCTAAGCGATGTCGTCGCCT TCCACCCAAGTGCCGACAGACGGTCAACGCTCATCAGTTTGCGCGGCGTTCCGTCCGGCT TCGTCTGGTCGTAAACAATCTCGCCCGTAAACCCGACAACCCCCATGATGGCCTCGGCAA GTTCGCGGATGGTAATGTCTTCGCCCGTCCCGATATTGATGAGCCCCTCGTTGACGTCCT TTTCCATCAAAAAAACGCAAGCGTCAGCCATGTCGTCGACATACAGGAACTCACGCATTG GCCGACCGGAACCCCAGACGACCAGCTGCCGGTCGCCGCGAATTTTGGCCTCATGGACCT TTCTGATCAGGGCCGGCATCACATGGCTACTGTCCAAATCATAGTTGTCATTCGGCCCGT ACAGATTAGTCGGCATCCCCGAGACATAGCGGGTTCCATATTGCCGATTGTAGCTCTCGC AGAGCTTGACGCCCGCAATCTTGGCTATCGCATAGGGCTCGTTCGTCTGCTCCAGGAGGC CAGTCAGCAAATACTCCTCGCGAATGGGCTGAGGGCAGTCGCGTGGGTAGATGCAACTCG AGCCAAGAAAAAGCATACGTTCCACACCAGCCTGCCAGGCGGCATGAACAACATTTGTCT CGATCATCAGATTCTGGTAGAGGAACTCAGCACGATAAACGTTATTTGCATGGATGCCGC CAACCCTTGCAGCGGCCATGAAGATGTAGTCCGGCCGCTCCGCCTTCATGAATTCGGCAA CGGCCGCCTGATTGACGAGATCCAGTTCCGCGTGGCTTCTGGTGACGATGTTCATGTAGC CGCCAGCCTTCAGCCTCCGGACAATGGCCGACCCCACCATGCCTCGATGGCCTGCAACAT AGATTTTCACGTCTCTGTTCATCGGATCACTCGTGATAGTCGTATGCGGAGAAGCCGTGG CGTTTCACGAGTTCGTCCCGCTCTGCGGACTTAAGATCCTCGCGCATCATTTCCGCAACA AGCTGCTTAAATGTTATCCTTGGTTCCCAGCCGAGCTTTTCCTTCGCCTTCGACGGATCG CCCAGCAGGGTCTCGACCTCTGTGGGACGGAAGTATCGGGGGTCAACCGCGACGATGCAG CGGCCGTTCTCGTCGTAGCCTTTTTCCTCCGCGCCTGATCCCTTCCATGAGATTGGAAGT CCAATCTCATGGGCCGCGGCGTCGACAAATTCGCGGACGCTGTACTGCACGCCAGTGGCA ATGACGAAATCTTCCGGTTCGTCCTGCTGCAGCATCAGCCATTGCACCTCGACATAGTCC TTGGCATGCCCCCAGTCACGCTTTGCGTCCATATTGCCGAGATAGAGGCAGTCCTGAAGG CCGAGCTTGATGCGCGCCAGTGCACGGGTGATCTTGCGCGTCACAAATGTCTCGCCCCGC ACCGGGCTCTCGTGGTTGAAGAGGATGCCGTTGCAGGCATAAATGCCATACGCTTCCCGA TAGTTAACCGTAATCCAGTAGGCGTAGAGTTTGGCGACAGCGTAAGGCGAGCGCGGATAG AAAGGGGTCGTTTCGCGTTGCGGGATTTCCTGCACCAGGCCGTAGAGTTCGGAGGTGGAC GCCTGGTAGAAACGCGTCTTCTTCTCAAGGCCGAGAATGCGGATCGCCTCGAGGATGCGC AGCGCGCCGAGCGCGTCGGAATTGGCCGTATATTCCGGCTCTTCGAATGAGACGGCGACA TGCGACTGCGCCGCGAGGTTATAGATCTCGTCGGGTTGGACCTGCTGGATGATGCGAACG AGACTCGATGAATCCGTCATGTCCCCGTAGTGCAGCACAAGCCTGCGGTTGGTGTCGTGT GGATCCTGGTAGAGGTGGTCGATGCGGTCGGTATTGAAGAGGGACGTGCGGCGCTTGATG CCGTGCACCTCATATCCTTTCTCAATGAGAAGCTCTGCAAGATAGGAGCCGTCCTGGCCC GTGATGCCTGTGATGAGGGCTCTTTTCAT
Region | 1 |
Region Length | 10Kb |
Completeness(score) | incomplete(50) |
Specific Keyword | transposase,capsid |
Region Position | 559068-569116 |
# tRNA | 0 |
# Total Proteins | 12 |
# Phage Hit Proteins | 10 |
# Hypothetical Proteins | 0 |
Phage + Hypothetical Protein % | 83.3% |
# Bacterial Proteins | 2 |
Attachment Site | no |
# Phage Species | 6 |
Most Common Phage Name(hit genes count) | PHAGE_Entero_phi92_NC_023693(4) PHAGE_Escher_phAPEC8_NC_020079(4) PHAGE_Sphing_PAU_NC_019521(3) PHAGE_Synech_ACG_2014f_NC_026927(3) PHAGE_Synech_S_CAM7_NC_031927(2) PHAGE_Synech_S_SKS1_NC_020851(2) PHAGE_Stx2_c_1717_NC_011357(2) PHAGE_Sinorh_phiM9_NC_028676(1) PHAGE_Bacill_SP_15_NC_031245(1) PHAGE_Bacill_G_NC_023719(1) PHAGE_Synech_S_SM2_NC_015279(1) PHAGE_Rhizob_vB_RleM_P10VF_NC_025429(1) PHAGE_Prochl_P_SSM2_NC_006883(1) |
First Most Common Phage # | 4 |
First Most Common Phage % | 33.33% |
GC % | 58.52% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains a intact or incomplete prophage based on the above criteria. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
>2 1811452-1854766 CAGATTTAGGTTGTGCCTCTACATCCTAGTCACCCGTAGATACTTCCCCCTTCATATAGT AAGTTAGCCCCGGGAAACTGGGGCTTTCTTTCAATTTAAATGTATTCACTGGTGGTTACT TATCAATTCACCTGTGATTGCATGCCACATTCGTGTGTCATGGCGGGGAGAATTACCACC CACTTAGGGAGCCGCACTCGGCCCCTCGGCAGCCATGCCACAGCGATTACCCCCCACTTA AGCAAAGCCCCGAGAGCGGCCCCTAGAAGGGGACCGATAGGGAGCCTTTGGCCCACGCAA AGACGAGGACCACAGAATGACCTTCGAACAATCCATCCTTGATAATCCCCTCTTTGAACG TCAGCTCGACCTGGAACAGGAGATGAGAACCTCCGGTATTGATAGGTTCCGAAAGAACGT CGATAAGGCATCCGTTAAGGGGGCGATGAGCGACACCATGGCAGTGAACCGTCTGGTCGT CGAAGCTCACGAGAAGGTGGTAGCAGCCATCAATGAGTTTCTGACTGAGGCTAAGAGCGG CGCTGCCGGTCGACGCCACACGGCAGTGGTCTTCATCGAGAAGCTCGATGTCGACACGGT CGCCAACATCACGGCCCGCGTGATCCTCGACGAGGTGACCCGCAAGTCGAACCTCACGAA GACCTCGCTCGCCATCGGCTCGATGCTGGAGAACGAGTTCAACAGCCGCAAGTTCGAAGA GGAGATGCCGAAGGCTCACAAGAAGTTCCTCAAGAAGGCGCAGAAGGAGAGCCTCGACCG CCGCAAGTGGTCGCACCTACTGTACCCCGCACGTCTCCTCGGCGTCGAGCTGGAGGAGTG GAGCGAGAAGGATCGCATCCTCGTCGGCCTCAAGCTGGTCGACCTGTTCATCCAGTCCAC CGGCCTGATCGAGCGAGAGGTGGTCCAGTCGGCCCGCTTCGGTACCCTCGAACTGCTCGT TGCCAATGAGGCCACGCTGAAGTGGATGGAGACCGAGAACAGCCGGCTCGAACACCTGTT CCCGATCTACATGCCGACCATCGTCCCGCCCAAGCCGTGGACCTCGCCTTTCGATGGCGG TTACTTCACGGCCTTCCGCCGCCTGAAGCTGGTGAAGACCCATAACCAGCAGTACCTCGA AGAGCTGGCCAACCGAGACCTGTCGCAGGTTTACGAGGCGATCAACGCTCTGCAGGACAC TGCCTGGGCGATCAACACTCAGGTGCTGGACGTCATCCGAACCCTCTACGAGACCGGCGC TGGCGTGGCTGGCCTGCCTCAGGCTGACAAGCTGCGGATGCCTCTGCGTCCTCACTGGTT GCCGGAAGGCAAGGACAGGATGTCGACCGAGGACATGACCGAGGAGCAGCTTGAAGAGTT CAAGGCGTGGAAGGCCGAGACCCATCGGACGCACGTCGAGAACGCCGCGATCTCAGGTCG TCGGGCGAGCTTCCTGCGTACCCTCGGGGTTGCCGAGAAGTTCAAGGATGAGGAGGCGTT CTTCTACCCCCACACCCTCGACTGGCGTGGTCGGGCCTACCCGCTGCCCCTGTATCTGAC CCCTCAGGGCAACGATCTGCAGCGCGGTCTGCTCACCTTTGCCAACGCCGTGCCGATCCA CGACGAGGAAGCTGCCGAGTGGCTGGCCATCCACGGCGCGGGCTGCTGGGGCTACGACAA GGTCAGCCTTGAAGAGCGCGTGCAGTGGGTGCTTGAGCATGAGGTCGAGATCATCGCCTC TGCCCAGAACCCCTACGACAACCACTTCTGGATGGGTGCCGACAAGGGCGAGAAGAAATG GCAGTTCCTTGCGTTCTGCTTCGAGTGGGCAGCCTTCAAGGAAGAAGGATATGGCTACCT GTCCAGCCTTCCGGTCCAGATGGATGGCACCTGCAACGGCCTGCAGAACTTCTCAGCGAT GCTGCTCGACGAGGTCGGTGGCGCTGCGGTCAACCTCATCCCCGCCGACGAGCCGCAGGA CATCTACCAGAAGGTCTGCGATATCGTCTGCGAGCAGCTCGCCCGCGATCTAGAAAGCAC CGAACTGGTCACCATCAAGGGCAAGACCGACGATGGAGTGGAGTTCGAGAAGGTGGTCTG CTCGGTGGCTGACATGGCTAACGGCTGGCTGCCTAAGATGGGCCGCAAGGTCACCAAGCG GCCGGTCATGACGCTCGCCTACGGCGCACGCCGCTTTGGCTTCGTGTCACAGGTCGACGA GGACACCATCAAGGACTGGCGCTCGGGTTCGCCTGAGAGCTATCCCTTCATCAGCCAGGG CGATGATGGTAAGCCGGTCGACTATGGCTACAAGGCAGCCCAGTACATGGGCGGTCTGAT CTGGGACAGCGTGGGCGAGGTGGTCGTCAAGGCTCGGCAGGCAATGGACTGGCTGCAGGC CGTGTCCAAGGTTGCCTCGAACGAGCAGCTTCCGATCAACTGGACCACGCCTGTAGGCTT CCTCGTGCAGCAGGCCTACCGTGTGCCGAACACGAAGCGGGTGGACACCACGTTCAACTC GCAGCGCATTCGTCTCACGTACCAGCACGGGGTAGGGAAGATCGACGGTCGCCGGCAGGC GTCTGGCATCAGCCCGAACTGGGTCCACTCGCTCGACGCGGCCCACCTGATGAAGACCAT CGGGCGCTGCAGGCGGGAAGGGATCGCCTCCTTCTCGATGATCCACGACAGCTACGGCAC CCACGCAGGCAATGCCTGGGCAATGGCCCGGTATCTTAGGGAGGAGTTTGTCCAAATGTA CTCACAGGTGGATGTATTGACGCGGTTCAAAGAGGAACTGGAGGCGCAGACTGGGGAGCA ACTTCCCGATCTCCCGGCGAAGGGAAACTTGGACCTCCAGCAGGTTCTCGATAGCCCGTT TTTCTTTGCCTAAATGCATGCACGAGTGAATTGAATATCGCCATCGGTGATCCCTCTCGG GGGATTACCCCCCACTTAAGCAAACCCACGGCTCGGCCCCCGAACACAGGGCCTCAGCCG ATTACCACCCCCTTAAGCAAATAGCTCAAACAGCAGGTCTCATCACATGAAGAACTCCGA CGCCCACAAGGCTCAGAGCGCGCTCGGTCGCGCCATCGAACTCTGGAATCAGGGTCGCGA AATCTCCTTCCAGCACGGCCAGGAGCTGCGCGAAGACGGCTACGATGTAGCTGACCTCCG TCGCTTCCACTTCAAGCTCTCGATCTGAGGAGCGGCGACATGGCCAAGCTCACTCCAGCC ATTTACGGCGCAGCTCCAGTCGACCTGGGCTGCATGGAACTGGACACATCGGAAATGATG TTCTGGCTCTACCTCCCGATCAAGATGCCGGGACAGTTCATGCCGAAGCTCCCGGCCAAC CTGAAGAAGTACGAGCGCATTGTCGATGCAGTCATGGACAATGTGATCGACGACGACACG ATCAACCCACAGGGCAGACGCTGGACCGAAAGCTACGTCTACCTCTCGGTCAAGATCACG CATGTCACTCCCGATGCCCCCGGCAATCGCCCTGGCTGGCACTCCGATGGCTTCCTGACG GACGACCTGAACTACATCTGGACCGACCGAAACCCGACCGAGTTCTTCATCACCGATGCA CTCTTCGGAACCGAGCCCGATCACCGGTCGTCAATGAAACAGTTCGATTGGATGGCCCGC CATCTGCTGCGCTCCAAGGGCGACCGGCTCGAACACGCCAAGGTGAACCATCTCTACCGT CTCGATCAGACGAACATCCATCGGGTGTCGCTCAACGTCGAAAGCGGGAAGCGGGCATTC ATCAAGGTCTCTGTTTCGGACAAGCCTTACGTCCAGCTCGGCAACTCGATCAACCACGAT CTGCCTCAGCATCCGCTGCCAACACTGAAGCGGCAGGCTGACCGCAACTGCCCGCAGGGG AACAAGTGATGACCCACTCCCGCATCGGCAAGATCACCTTCAAGTCCCCGCTCTCGGGGG CTTTCGGCAGAACCATGGTCCCGGCTCGGTCGGGCCGAGGCTACGTCCATCAGTACTTCA AACGAATATCGGAACTCGAATGGGTGCCCATCACCCTCCTCGAATACGTCCGCTCCACCC GCTGATCATCAGCCCTCGACATCTGAGAGAAAACCATCATGGCTAACAAGAATTTCCACA AGACCGTCTCCGTTGCTGCATCCGCACACAACTGCAACAGCGTGGGCGAGATCACCCTCG CAACCCTGAACGCGGACAGCTACATCAATCTCATCACCTCCGCTCGCAGCCTCACGATCT CCCCGGTGGAGGCCAAGAAGCTCCGCGATCTGCTGGTCGAGGCCTACCCGCTCGAAGCGC CGGCCGCTGCCGCCAAGAGGACCAAGTTCAAGGCCGGCGACAAGGTCACCTACAAGTCCA TCGTCGGCTACGGTGCCCGTGGCATGGACGGTCGCAAGGGCTCGATCAAGGAAGTGCTGA CCAACGAATGGTACATGGTCAACTTCACGGGTGGTCCGTTCGACAACCTGATCAAGGTTC ACTCCGACTATCTCGTGGCCCAGCCGGTCCCGTCTGTTGGTGGCCTCAAGGTCGGTGATC GCGTTCGCCGCCTTCAGGGCAGCAGCAAGGGTAGCATCTTCACGGTGACCGAGCTGAACA GCTCCCTCACGGAACTGAAGGTCGATGGCATGACCGGCTGGCGTATGCCGAAGTACTTCG AGCTGGTCACTGATGCCCCGGCATCGGCCCCCGAACCGCTGGCCAAGATCGACACCGGCC GCTTCCTCGTCGTCGCCCTCGAAGGTGCCAACTACGTCCCCGGCAGCAAGCCGAAGGTCC ACGTCACGGACTTCTCGGCCAGGGTCGAGGCGGAGCGTCTGGCTCGTGATGTCGGCGGTA CCTACCACGTCTTCCGCGCGGTCTTCGAGGCCAGCCGCGAGAAGCCGGTGATCCCCCCGG TCAAGACGACCAAGCTTTAACCAAATACATCCACGAGTGGATGCTTCCCCAGCCCCTTCG CCTTGCCGGTGAGGGGGCTTTTTCATTTCCTCCAGCAGGACTTTCTACATGGCAGAACGC AAGAAGAACCCGTCGCTCATCTCCCCCCGTGGCCCCCTGAAATTCCCGAAGATCGACAAG GTCGACTACGGCACGAAGGAATACCCGAAGCCGAACGGCGAGTACTCCACCAAGCAGGTA CTCGAAGCTGACGCACCCGCAACCAAGGCCTTCATCGCCGCCCTGATGCCGCACTATCAG GCCGCAATGGAGGAGGCTGCGGCCAAGTTCAAGGAACTGAAGATCGAGACTCGGAAGAAG CTCGGCAAGGTCACCGAGAACGACCTCTTCACCACCCTCTATGACCAGGAAACCGAGCAG CCGACCGGCTACATCGAGTTCAAGTTCGCGATGGCGGCGAGCGGCGAGCGCAAGGACAAG ACGAAGTGGTCGGCAAAGCCGGCGATCTTCGACGCCAAGGGCAAGCCGATGACCAAGGTC CCCGAGATCTGGTCGGGCACCGAGGCGAAGGTCTCGTTCGAGTGCCAGCCGTACTTCATT CCGGGCACTGGCGCAGCCGGCCTCAAGCTGAAGCTGAAGGCTATCCAGATCATCGAGCTG GTCTCGGGTGGTCAGCGCTCCGCATCGAGCTACGGCTTCGGCGCCGAAGATGGCTACGAA TACGAAGAGCCGGCGACCGAAGAGAACGAAGGCGGCTTCGGTGATGAGAGCGGCGAGGAC ACCTCCTCGAAGACCATCGACGACGACATCCCGTTCTAAGGATCACATGACCTACCGCAC ATCCGCAGCAGGACTGCGCGCGGTGGGTATTCGAGAAGGCTTCCGCTCCGGTCTTGAGGA CAAGGTGGGCGACCAACTCAGAGCGCAGGGCATTGACCCACGCTACGAGGAGGTCGTCAT CCCTTACGTCAAGCCCGAGCGGAAGGCGAAATACACCCCAGACTTCCAGCTCCCGAACGG TATCTTTATCGAGACCAAGGGGCGCTTCGTCACCGAGGACCGGCAGAAGCATTTGCTGGT CAAGACGCAGCACCCTGAGCTGGACATCCGGTTCGTCTTCTCGAACCCCAAAGCCCGCAT CTCCAAGACCTCACAAACAACCTACGCCGACTGGTGCCTGAAGCATGGCTTCAAGTTCGC GGCGAAGATCATCCCCCAGGAATGGATCGATGAGTAGCATGTTCACACCTCGAAAGGTGA CCTCCTACCTCGTCGTGCATTGCTCGGCCACGCAGCCCAAGATGGACATCGGCGCGAAGG AAATCCGCCAGTGGCATCGCGAGAAGGGCTGGATCGATATCGGCTACCACTTCGTCATCC GCCGTGATGGCACGGTGGAACTCGGTCGACCCGAGAATGTGGTCGGCGCTCACGTCGAGA ACCACAATTCCAATTCAATCGGCATCTGCCTCGTGGGCGGTGTCGATGCCAAAGGCAAGG CCGAGAACAACTTCACTCCGGCCCAGTTTGCCACGCTCGCTATCAAGCTCCGCGAACTCC GCTCCAAGTACCCAGGCGTCACCGTGCAGGGACACCGGGATTTCCCAGGCGTGAAGAAGG ACTGCCCCTCGTTCGATACCCGCAAGTGGATCAACGAAACTGGTGTCTTCGAGACCTCAC ATGTGCCGGCTGAACCGGACGCCCGAGCAGTCGAGATCACCAGCGCCACCCCCACGATTT TCAGTCTGGCGAAGAAGTACGGCACGACCGTCGAGGCGATCCTCAAGGTCAACCCGCACG TCGACCCGGCGAAGCTGAAGCTTGGTCAGGTCATCCGCCTCCCGGGCTGATTACCACCCA CTTAAGCTGACCTCCAACTGGCCCTCACGGTTCCTCCGTGGGGGTCTTTTTCGTTTTTAC GACCTCAACATCCGAGAGATCACATGAACACTTTCGTTGCTGGCGCACACGTCCGCCACA AAGACCACCCGGAATACGGCAATGGCCGCATCGTTTACGTCCACACCAATGGCAAGTCGC TGGCCGTCGAGTTCGAGAACAGCACCGGCCTGCGGCACGACTGCGCCGGCCACGCCAAGT ACGACTTCTGCCGCTGGTCGCGCGCCGCCAGCCTGGAGCTGATCAACCCCTTCAAGTTTG GCGACACGGTCCTGATGTGCGCCTTCGATGCGCGCAGCAATCCAGCCTTCTACTATGACC ACGACTTTGCCAACCAGCAGGGTATCATCGATCACGCCGACAAAGACGGCTATGTCACGG TCTTGGTGGCCAGCCTTGGGCACACCCAGTTCGTGCCGATCGCTGACCTCACGCTGGTCA AGGTCGACCCCGTTGAGGCTGTCAAGAAGGCAACCCCGAAGTTCAAGACCATCGTCTTCC AGTCGGGCAGCCAGTGCGACCGCCTCGTCAAGTACCTCCTGGCCGGCAATTCGATCACCC CGCTCGTGGCCCGTCAGTTGTTCGGCGTCGAGCGCCTCGCAGCCCGCATCCTCGAAATCA AGAAGGCCGGCCACAAGGTCACCTCGACGATCAAGACCGATGTCAACGGCAAGGTCTACG CCGAGTACGCCCTCCGCAAGGCAGGGAGGGTCGGCGCATGAGCTTCTTCATTTGCGCCTT CCTGATCATCCTCTACGTCGTCCTGGGCTTTGCAGTCTCGGTAGCAACGGTTGTGGCCAG TCCCCTCGGGCTTGTCGTGATCTCTGCCTACCTCGCGGGCCTGCTCGCCATCCTCAATAA ATTCGACCTCATCTAAGGTGTCCCCATGAAACAGGCATTCTTTGCCGTGGTCATTCGTGT GGCCAACGGTCTCCTCACTGTTGCCGGCGTGCGTGAGCGCAAGGTGGCATCATGAAGGCT GCCCTCCATTTCCTTCTCAACTTCTTGACCATCGCCGGTCTCGGCTTGCTCGCCATCCTC GTAGCATCTTCTACCGGCGCGGCCTTCGTTGCCTTCGGCCCGCTCATTGGTTCCGTTGTC GCGTTCTTTTGGATCGCGGTCGGGGTTTCCTTCGAAAGGTCGGAGCGCCTCTAACCTATG CAGTCAGAAAGCTCCTTCGTTCGTAAGGAGCCATGCCCCGCGTGCGGCTCAAGGGACAAC CTTGGCCGCTACTCGGACGGCCACGGTCATTGCTTTGGCTGTGACTACTACGAGCCGGGA GATGGCTCCGTTCCCCAAACCTCAAGGAAATCAAAGATGTCGCGAGAACTTATCTCGGGC GGGGAATACCGTGCCCTTTCGAAGCGCGGCATTACCGAAGAAACCTGCCGCAAGTTCGGC TACCAGATCGGTCATTTCAAGGACCAGCTCGTCCACATTGCCCCTTACTACGACGACGAG GGAAACCTCACGGCGCAGAAGGTACGCTTCGCTGACAAGACCTTCACGGTCACCGGCAAC ATGAAGCCGGCGCTGCTCTTCGGACAGAACCTCTGGAGTGGTGGTGGCCGCAAGGTCGTC ATCACCGAAGGCGAGATCGACGCCCTGTCGGTCAGTCAAGTGCAGGGTAACAAGTGGCCT GTGGTGTCCATCCAGAATGGCGCTCAGTCGGCAAAGAAAGCCCTGTCAGCAGCACTCGAA TGGCTCACCACCTTCGAGGAGGTCGTCCTGATGTTTGATATGGACGAGCCAGGGCGCGAA GCTGCAGCCGCGTGTGCTGCGCTCTTTCCCCCAGGCAAGTGCAAGATCGCCCGACTACCG GACAAAGACCCCAACGCTCTGCTCATGGCAGGGAAGGGGGACGAGATCATCACGGCGATT TGGCAGGCTCAGGTCTATCGGCCCGATGGCGTGGTCGCATTCAAGGACATCAAGGAAGCC GCTCGTCGGCCTATCGAGATGGGCCTTCGGTGGTTCTGTGATCGCCTGACCAAGCTTACC TATGGTCGCCGTTGGGGCGAGGTCTATGCCTTCGGCGCGGGTACCGGTATCGGCAAGACC GACTTCCTCACCCAGCAGATCACCTTCGATGTCACCGAGTTGGGACAGAAGGTGGGCGTG TTCTTCCTCGAACAGATGCCGACCGAGACAGCAAAGCGGTTGGCAGGCAAGTTCGCCAAG CGTCGGTTCCATATCCCAGATGATGGATGGACTGATGCTGAGCTGGACGAAGCGCTCGAT AAGCTCGATCAAGACATGCTGTTCTTCTACGACAGCTTTGGTGCGACCGAGTGGAGCGTT ATCCGCGAAACCATCCGCTACCTCGCCCACAGCGAGGACGTGAAGGTCTTCTACATCGAC CACCTCACAGCCCTTGCGGCTGCCGAGGATGACGAACGCAAAGCCCTTGAACAGATCATG GCCGAGATGGCTGCCCTGGCCAAGGAGCTGGGCATCATCATCCACCTCGTTTCGCACCTC GCCACCCCCGAAGGCAAACCTCACGAAGAAGGTGGCCGCGTCATGATCAGGCACTTCAAG GGCAGCCGAGCCATCGGCTTCTGGTGCCACTACATGTTCGGCCTTGAACGTGATCAACAG CACGAGGACGAGCGTCTCCGTGCGGTGACAACCTTCCGTGTACTGAAGGATCGCTACACC GGCCAAGCCACAGGCGAGGTGATCTACCTCGGCTACGAGCGCGAGACCGGCATGCTCTAC GAAACCAGCATCGAGTTTGGGGACGAGACCGGCTCTGATTTCAAGGATGAGAGCGCCCCC TTTTAACCGAAACAATCCACCCGTGAATAGTTACTCCGGCGAGAGGACAACATGGACTTC AATTGGCACTGGCCAGACGGGTCGACAATCGACGCGCCTGCGCCTTCAACCTACATCTTC GACTGCGAAACCAACGGGCTGCTGGACACTCTTGATGAGGTCCACTCGTTGGTCATGCGT GACCCCAGCAGTGGCTTCACGATCTCCGCGACCTCGAACGACTACAGCTCGGATGATCCC ACGATCATCACCGACTGGTCCGTCGAGGAAGCGTTACATGCCCTGATGAACGCCGATGTC ATCATCGGTCACAACATCATCAAGTTCGACATCCCGGCATTGCAGAAGGTGTTCCCTTGG TTCCAGCCGAAAGGCTTGATCATCGACACCTTGGTCTGCTCCCGCCTGATCTGGTCCGAC ATCGCTGACCACGATCTGAAGCAGGTGCGCAAGGGCTACCCCGGCAAGCTGGTCGGCTCC CACTCCCTGAAGGCCTGGGGCTACCGCCTCGGCGTGCTGAAGGGTGACTTCGGTGAGACC TCCGACTGGCGCTACTGGTCGCCTGAAATGCAGACTTACTGCGAGCAGGACGTTGAAGTC ACTGCCCAGTTCTACGCCCGCATCAAAAAGAAGGAGCCTTCCCCGAAGTCCCTTTGGATC GAGCATGAGTTCTGCAAGATCATCGCCATGCAAGAGCGCCACGGCTTCGCCTTCAACGAG GAGGAAGCCATCAAGCTCTACAGCCAGCTCGTCACGCGGCGGCTGGAGATAGCTCGTGAG CTGCAGGTTGCCTTCCCCCCGGTCGAGAAGACCGAGGTGTTCATCCCGAAGGTCAACAAC AAGCAGCGCGGCTATGTGAAGGGCGAGCCGTTCACGAAGAAGTGGATGGTCGAGTTCAAC CCATCATCCCGGCAGATGATTGCAGATCGCCTGCAAGCCATGGGCTGGGTGCCGCAGGAG TTCACCCCTTCCGGTCAGCCGAAGATCGACGAAACGATCCTCCAGGCATTGCCCTATGCG CAAGCAAAGGTGCTGGCCGAACACTTCCTCGTCGAGAAACGCATCGGTCAACTGGCCGAG GGAGATCAGGCCTGGCTCAAGCTGGTCAAGAAAGGACGCATCCATGGCTCGGTCAACACC AATGGAGCGGTCACCGGCCGTTGCACCCATAGCAACCCAAACGTCGCTCAGGTGCCGCGT GTGGGCAGCCCTTTCGGCGCAGAGTGCCGGGCGCTATTCACAACTACCTCGCGGTGGGTT CTGGTGGGGGCCGACCTATCTGGCCTTGAGCTTCGGTGCCTTGCCCATTTCATGGCCCTC TTTGATGGAGGGGAATACGGGCGCATCGTCCTCGAAGGGGACATCCACACCGTAAACCAG AACGCTGCCGGTCTGCCGACCCGCAACGATGCCAAGACGTTCATCTACGCCTTCCTCTAT GGGGCAGGCGACCAGAAGATCGGAAGCATCGTAGCGCCTGACGCATCCCCTGAGGAGCAG AAGCGCATCGGCAAGAAGCTCAAGCGGCAGTTCCTCGCGAAGACTCCGGCACTGCGCCGC CTCCGCGAAGTCGTCGAGCTGAAGGTTCTCGGGTTCGTCCCGAAGGCCCGCCCGCTCAAC GTCAACCCTGCCTACGAGCATATGTGGCGACAGGACAGCGCCAAGCAGTGGTGGTTCAAG GCGGGTGCAGGCGGCGTCCTCGTCGGCCTTGACGGTCGCAAGCTCAATGTTCGCTCAGCT CACTCTGCCCTCAACACGCTCCTGCAATCTGCCGGCGCGCTGATCTCGAAGGCCGCGATG ATCTTCGCTTATCGAGAACTATCCACCCGTGGTTACGTCTTTGGGCGCGACTACGCCTTC GTGGCACACATCCACGACGAAATCCAAACCGAATGCCGTCCTGAATTGGCGGAAGAGGTG GGCCAGATCGTCGTCGAGGGGATGCGTGCTGCGGGGACATTCTTCGCCTTCGGTTGTCCG ATCGATGGCGAGTTCAAGATCGGCAACAACTGGAAGGAAACCCACTGACGCCATGATTGA AATCCTCAAGCAGGCCTTGGAGAACCCCTTCAAGACCAAATCAAACTTCGCCCGTGAGAA CGCCGATCTGATTGCCATGGCAGCCAGCGACGGGTTCCTCACGACCCGCATTGCCACCGG CCTGTACTCCCGCAAGTGGATGATCACGCCCGTTGGTCTCTCCCACTACTACGCGCTGAC GGGGCTGAACCATGACTGAGCGCACGGCCAAAATCCTGTCAGTGATCCTGATCGTGGCGA TCCTCGTCGACATCGTGAGGTTGGTCTCCTGATGTTCGGACGTTCCAGCTCATCCAGCTA CTCGTCGTCGGCCTCCCGCGCGGAGAACCACCTGACGATCAACCAGCAGCCCCACGATGC TGCCGATGCGGCCCGCCTCTATGGCGAGCTGCAAGACAAGGCTAACGACTCCATCACGGA AATCGTTGGTCACCAGATCGCGGACACCAAGGTCGAGTTCGTAACGCTCGATACCGCCCG CGATGTCCTGCACTTCAAGGACCACGTCCGCGTGATCTTCAAGATCAACGGCAAGACCTT CGACACGAGGGTTGAGATCGATGACCCGATCAAACTCAGCGAACCCCGCGAGCGTGTGGC CTATCGCGCAGTTGCCGAGGGCATCGCCAACACCCTGATGGATCGCTCGATCTTTCAAAT ATATCAAACCTTTGCGAGGAAACGCTGATGTCAACCAAACGCTGGGTCACCTTCGGCCGC ACCGAAAGTGGCGACGATCTGGTCCCGATCATCTGGGACGAACGCCCGCCGCACCATGTG GTCGACGACGCATACCGGGAGCTGTACCCGGATGAATACCGCTTTGTCGGCCACGTCAAC TGGACGGCAAGAGAAGCGGAGGAGGGCGTCATCATCCATGACTAGAACCCTCCTCATTGA CGCGGACGTGGTGGCCTACGTGGCCGCTTCGTCCCTTGAGGTGGCCACTGACTGGGGCGA CGGCTACTGGACTTGGCACGTCGACGAGTTCGAGGTGCAGAAGAAGGTCAAGCAGATCAT CGACGATACGATGGAAGATCTGAAGGGTGATAGCTGCAAGCTTTGCCTGACCGACAGCTT CGGCAACTTCCGCAAGTCCGTCTTGGCGACCTACAAGGGCAACCGCTCGAACATCAAGAA GCCTCTGGTCCTCATGAAGACCAAGCAGTGGATGATCGATGAGCTAGGCGCTTACTTCCG TCCCGGTCTTGAGGGCGATGACTGCATGGGCATTCTCGCCACCATGAAGGGGACCGATGA ACGCATCATCGTCTCCATCGACAAGGACATGAAGACGGTTCCCGGCAAGTTCTGCCGGTA CACCGACAGCAAGGCCAAGATCATCGAGTACTCCGAGAAGGAGGCCGACTATTGGCACCT CTATCAGACCCTCATGGGTGACGCCACGGACGGCTACCCAGGCTGCCCCGGCATTGGCCC CAAGAAGGCTGAGGCGATCCTCGGCCCGATTGACGAGTTCGATCTGACCGAAGGCTGGGC GAGGGTGCTTGCCGCCTTCGAGAAAGCAAAACTCACCGAAACCGATGCACTGACGCAGGC CCGCGTGGCCCGCATTCTTCGTGCGTCTGATTACGACTTCAAGAAGAAGGAGCCAATCCT GTGGCAACCAAAGGCAAACTGATCGCCCTCTACAGCGATGCCGCTGGCAGCGGAAAGTCT GAAGTTGCCGGCACGCTGATCAGGCACGGCTACGAGTCGGTGAAGTTCGCAGGCCCCCTG AAGAACATGGCGCGTGGCCTTCTGGGATCGATGGGTTTCGAAACCGTGACTGTCGAGCGC ATGATCGAGGGTGACCTGAAGGAAGCCGTAATCCCCGGCTTCAAGACGGTGACACCTCGC CAGATCATGCAGACGCTCGGCACCGACTGGGGTAGGGAAGCCATCGACCAAGACCTTTGG ACTAAGGTGGCTGCCGCCAAGATCGAAGGACTTCGTGACAAGGGGGTAGATGTGGTGGTC GACGATCTCCGCTTCCCCAACGAGTACGACCTTATCGCTTCCCTTGGTGGGACGCTCGTG CAGGTCGTTCGTGCTGACCCGTCACGGGAGGCTGGCGGTGCGTATGAGGGCAAGCTCTCG GGCCACCTCTTCCACCACATCGTCCACAACAACGGAACTCTTCGCGAGCTTTACAGCAAG ACGCTTCTTCTCGCGCAGTCCATTTAACCAATTGAATCCACAGGTGAATATGATGAAGTT CCTCTCCGGCCTCGTTGCCTTCGCTCTTGTCGCCATTCTAGCGCTGGCCCCGGCAATCGC TGAAGCCCGCTCCTCGTTTGGCGGTTTCAGTGGTGGCAGCAGATTCCGCTCTTCCTCCTC GTTCTCGTCCCGGTCGACGACCACCTACAGCCGCCCCTCCACGAGCTACAGCCGACCGTC AACGACCTACCGTCCTGCGCCGGTCTACAGCAGCCCGAGCTATCGTTCGTACTCCTCGAC CACGATCAACCAGTCGAGCGGTGGCGGCGGTTTCTTCTCCAGCATGGTCGGCTCGATTGC CGGCTATGGCATTGCTCAGTGGCTCTTCGGTGAAGACGAGAAGCCGGCCGAGCAGGCACC TGCAGCAGCGCCGGCACCAGCCGCCCAGGCGCCGGTAACGGGAGCTGTTCCGACCACCAC GGTGCAGGAAGCTCCGAAGGCACAGTGAGTAACCTCCAAGACCACACGAACGAGAACCTG CCCAGCAACCTGAAGGTCTCCCGAAAGGGGGGCCACCTCGTTCTGCAGACGCGAACCGCC ACGATCTACCTCGATTTCAGTCGTCGAGACGAGCTGATCGAAGCAATCAAATCCTTGGGG TGAAGACCGCCCCGAGGGCATTTTTCAATTACCCCCCACTTAAGGATAACTCGATGGTCG ACGACCAAGATCGATTTCCACACATCCCCAAAGACCTCGTCGAGGCTCTCGATAAGAGGT TCCCTGAGAGGACTCCCTCCTTAAACACCTCCTTAGATGAGATCAGATGGAAGGGTGGAG AGAGAGCTGTCGTGAGGTTTCTCCTCGAACAGTACAACCGTCAGAATGAGACGGTGATCA ACGAAAAGGTTCTCTCTTAATGTGCCCACCGAAAGTGAAGACGCAGAAGGTCGAGCCTGT CGCTCAAGCAGCGCCGCCCGCCCAGCCTGCCGCCACTGTCAATCAGTCAGCACCCCAGAC ACCCGACGAGCTGTCTCCCGAGCAGGCAGCCATCAAGGCCAAGCGCAAGGGCCGCTCCAG TCTCCGCATCCCGCTCGACGCAGGTGTCGGTAGCGGCGCAACCGGGATCAACGTCCCTCA GGCATAAGGTTCCAGAATGACCGGGCAAACTGCTTCCGGTCGTTATCAGCAGCTCAGTCA AGCAAGATCGGCCGTCCTCGAACGTGGCCGCGCTTCCGCCAAGCTGACAATCCCATCGCT CCTTCCTCCCGCTGGCCATTCCGAATCATCGTCTCTCCCGACCCCGTTCCAGGGCATCGG CGCACGGGGCGTGAACAACTTGGCCTCCAAGCTCCTGCTGGCCCTTCTCCCTCCGAACTC CCCCTTCTTCCGCCTGATGATCGACGACTTCACGCTCGAAAATCTGACGAAGCGAAAGGG CATGAGAGCCGAGGTCGAGAAGGGCCTGAACAAGATCGAACGCGCTCTGATGACAGAGAT CGAGACCACGGCAATCCGTGTGTCTGCGTTCGAAGCGCTGAAGCAGCTCCTCGTTGCTGG TAACGTCCTCATCTACCTCCCGACCGAGGGAGGCATGCGGGTCTTCCGTCTCGACCGCTA CGTGGTCAAGCGTGACCCGATGGGTAACGTGATCGAGATCATTACCCGAGAAGACATTTC CCCAGACATGGTCCCCGAGGCCATGAAGGGACACGTCAAATCCAAGTCGAGGTCCAACGA AAAGACCATCGAGCTTTACACCCACATCGTCCGTCAGCGTGACAAGTGGACCATTCGTCA AGAGATCAAGGGCATGACCGTGCCAGGGTCTCGTGGTTCCTATCCGCTCGATAAGTGCCC GTGGATTCCACTGCGCTTCACCAAGATCGACTGCGAGGACTACGGTCGAGGCTACGTCGA AGAATACTTCGGTGACCTCCTGTCCCTCGAAACGCTCACTCAAGCCATCGTTGAAGGCTC CGCAGCCGCTGCCAAAGTTCTGTTCCTCGTGAACCCGAACGGCACGACCCGCATGACTGA TATCGCCAAAGCGCCCTCGGGTGCAGTGCGAGCCGGTAACGCCGAAGACGTGAGCGTCCT GCAGCTCGATAAGTTCGCAGACTTCAAGATCGCCTCCGAGACGATCAACAACATCCAGCA GCGCCTCTCGTTCGCCTTCCTCCTGAACACCGCGATCCAGCGGGCAGGGGAGCGTGTGAC GGCCGAGGAAATCCGGTACATGGCCGGCGAGCTGGAAGATGCCCTCGGTGGCGTCTACTC GATCCTGTCTCAGGAGTTTCAGCTCCCGCTCGTGCGTGTCCTCATGTTCCGCTTGGAGCG CCAGAAGAAGATTCCGCCGCTGCCCAAGGGCGTGGTGAAGCCGACTATCACGACCGGCCT CGAGGCTCTTGGCCGTGGCCATGACATGAACAAACTCACTATCTTTGCGCAGACCGCGTC GAACATCGCCGCCCTGCCGCCTGAGATTAGCAAGGCTGACTTCCTGATGCGTGTGGGCAC AGCCCTCGGCATCGACATGGACGGCCTCGTCAAGACGCCTGAACAGCTCCAGCAGGACCA GCAGCAACTCATGATGCAGCAGCTCATCGAGAAGCTCGGCCCGAAGGGGATGGACATCCT CCGAGATCAACTCAAACCAGAGGTGCAGAATGGCCCGCAAGCCCAAGCCCAGTGAACCTC AGGCCGAGGTGACCGAGAAGGTCGCCCCTGAGGATGCCCGCAAAGGTGAGCCGAAGCGGA CTGTGCATGACGGTGGCATCGTCTCCCTCGACTACTAAGGACCACCCATGACCAGTTTCG GTGACGCCTTCAAGGCAGCCCGTAAAGCCGGTAAGACGACCTTCAAGTTCGGAGGCAAGT CCTACCACACCAAGACCAAGGACGAGATGGCCAAGACCAAGAAGGCCGTCCCCACACCTT CACCTCGCCCCGAGGCGATGAAGACCGATGCCCAGGCAGCCGTCGACAGCGCACCGAAGT CAGCCCCTAAGGCCGCTCCGAAGCAGGACTATCCACGTCCCGCAAAGCCCGTGGGCATAG CCAGCGCCAATTCGGCAATCGGCCGAGCTGCGGCTGCCCGTGAGAACGCTCCCGTCCTCA AGATTCCGTCTCGTGCCAACGCCAACGCGAATACCGTTCAGAAGCCTCGGGCCCCCGCTC GGGGCAGCTCAACGCCGGCCGAGAAGCAGGGACCGAAGCCTGAAGAGCAGCAGTGGTTCG CCCGCAAGGGCTCCGCGATCTCGCTAGGTATCGCCCGTCGCCGCAACGCCCCGGTATCGA AATAAGCAGGAACTATGGAAAACGAAGTCAAAGAACAGGACCAGATCGTGGTCCCCGGCT CTGATGAGCATAACGCTCTCATGGTCGAGAAATTCCAGAACCAGTCCGGTTCGACCGAGC AGACCAATGCACAACCCGCCGAGCGGCCGGCCTGGCTGCCCGAGAAATTCGAGAAGCCGG AAGACCTCGCAGCCTCCTACGCTGAGCTGGAACGCAAGCTTTCTGGTGGTCAACAGCAGG AAGCGCCGAAGGTTGAGGCGAATGCCGAAGAAGCTCGTGAAGCTGTCAACGCCCTCGGCC TCGACTTCGACGCCCTCGGTGCTGAGTTCGCTGAAAGCGGCGCGCTGTCCGATGATAGCT ACACCAAGCTGGCCGAGAAGGGCCTGAGCCGCGACATCGTCGACGCCTACATCGAAGGCC AGGAAGCCAAGGCCCAGCTCCACCGAGCTGAAGTCCTCCTCGCGGTCGGCGGCGAAGCCA CCTACAACGAAATCGCCAACTGGGCCGCAGGCAACCTCACGAACGAGGAGCTGCAGGCCT ACAACGATCAGGTCGAAAGCGGCAACCTCACGGCAGCCAAGATGGCTGTTCAGGGCCTGA AGGCTCGCTTTGAAGCGGAGAACGGCTCAGAGCCGCAACTGCTCAACGGCGAGACTGGCG GCAACTCCGCTGAGGTCTTCCGGTCGACCGCCGAACTCACTGCAGCTATGCGCGATCCTC GCTACAAGAAAGACCCTGCCTACCGGGCAGACGTCGAACGCAAGCTGTCCAAGTCCTCTC TATTCTAAGGAACGTCAATGACCGCACTTGCCTCTATCGTGGCGCTGGTTTGGGCACATC TCGATGACATCTTCGCAATTCTCTTTGCGCTGCAGGCTGTCCTCGTCCTGATCTCCAAGC TCACGCCGACCCCGAAGGATGACGCTGTCGCAGCGAAAATCCTGTCGGTCCTCGAAAGCA TCGCCTCGGTCCTCTCCGTCAAGCGGAAGGATTTCCCGGCAGCTCCGACATCCCCGGGCC TCTACTAGCTGAACATAGGAGAGCTACTAGTTGGTGGCTCCCTTCCTCTTTGCGAGAAGG CGTTGAACGGCGTCGCTAAGCTGCTTGTTGTCCTGTTTGAGCATGGCAATGCGTTCAACA AGATCACTCAAAACAACCCGGATCGCTTGGGCATTCTGGAGGCAGTCCTCATCACTTTCA TTGTGAAGCATCTCGCTCAAGGCACCGTGTAAAGCGTTGAGGGGGTTGTGTCCGTCAATG AGCAGCCCTTGCGGGAGGCCGGTTTTTATCTTTTCGATTGATTTGGCAAAGGAGATTTCC TTTTTCGCAGCCTTGAGCTCGTCGATCAGCTCTGCGGGAGCCCCTACGGTTTCACTGACT TTGATAATTTCGTCGAACAGATCGTTCCTGTGGTTCTCGACGACGCGACGATAGTAGGCA GATGCGGCGATCCCAAAACCCTGGTCTTCACACTGTCTTCCCTTGAGAAAAAGTTTTGCA TCCGCGTTGCCAAATAGGCGGAGAACTCGATTGGGAACTGGAATTCCGAACGGCGGAATC TCGCCGTACTTGTAGACTTTTGCCCCTCCTGCATCCGTGAACGTCAGGTGAAGCGCATAG TGCTTTACTTGGTCACGGCAGTCACCACAGGTGTAAAGCGGATGCACGTTGGCGATGGTC GTGTTCCACCCGAATGCGTCATCGGTAAGACAGCGGAAATTCCGTTCGCCCGCGCAGATA CTGCAGTACAGGCGAATCTGGGGGTAAGTGAGCCTCCTGCGATTCGCGGCGTTAAGTGTG ACCTGTGTCTGCCAAACATCGGGAACAACCTTCGTTAGTGAGGGATGCGCTTTTTCGAGA AACGCCTGAAAGCTGACAATGTTGTCATCGCCGCGATTCTCTCCCTTTTCTACAGATTCC GCCGATTCGACTTCGCTCATCTTGGTGTCGTTTCTCAATAATATCTCTCCGAATTGATAT TAGGGAAATCGCACAACCACAACGCTTGTTCTCATTGTTCACAGCAAATCTTACCAGCAC CCGAGGGCGAGCGCGTCATCCATTGCGAAGTTTGCAGTATAAGGTTCTGTCAAGCAGATA GTGGTGTAAATCAGATAACGCCAATATTATCCAACAATATCAATTACCTGTGAAAGAAGT TCGACAGTGCTGCGTGCCTTATACGCGGCTGTCCAAGGTAATCCCAAGAACACCACTCAG CAACTTAGCCCCGATGCGTCGGGATAACTTTGTCGTGTCGAGTGAAGACGTTCGGGAAGC CTGATCAACTTCCAACTCTTCACAGGAAATACTCTATATGGCAAACGCCAACGTAACTCG CATCGGCCAGATCAACGGTTCTGGTGATGTCGACGCACTCTTCCTCATGCAGTTCGCAGG CGAAGTCCTCACTGCTTTCGAGGAAACTAACGTCGCTCTCGAACACACGATGACACGCAC GATCAATTCCGGCAAGTCGGCTCAGTTCCCGGCGACCGGCAAGGTCGGCGGCGAATACCA CGTGCCCGGTACCGAGATCACGGGTCTGAACCTGAAGGCCGCTGAGACGGTCATCACCAT TGACGACCTGCTCATCTCGCACGGCTTTATCGCCAACATCGACGAGGCCAAGACCCACTA CGATCTGCGCTCGATCTACTCGACCGAGATGGGCCGCTTCCTCGCCAAGACGATGGACAA GCACCTGCTTCAGGTGGGCGTCCTCGCTGCTCGCGCCACGAACGTGGTCGACGGTGAGCC GGGTGGTTCGGTCATCCTGACGGGTGAATCTGGTCTGCCGTCCACGCCGAACTTCGACGC CAACGGCGATCACCTCGCTGCGGCCCTCTTCATCGCAGCTCAGAAGCTCGACGAAAAGGA CGTGCCGGAAGACGAGCGCGTCGCCTTCGTCCGCCCTGCGCAGTACTACAACCTCGTCAA GGCCACCAACAACCTGAACAAGGATTGGGGCGGCATGGGTTCCTACGCTGAGGGCTCGAT CCTCAAGGTCGCCGGCATCCAGATCGTGAAGACGAACCACCTGCCGAACACCGACCTATC GGCCGCAACCGGCGTCGAAGCTGGCTCGGGCCTGAAGTATCGTGGCAACTTCACGAACAC CTCCGCGCTCGTCATGCACAAGCAGGCTGTCGGCACCGTCAAGCTGCTCGATATGGGCAT GGAGTCGGCCTACGACATCCGCCGTCAGGGCCACCTGATGGTCGCCAAGTACGCGGTCGG TCACGGCATCCTGCGCCCGCAGGCCGCTGTCGAAATCCGCAATGCTGCGGCCTAATCCAT CCACCAGTGCATACAGGGGAGGCTCCTTCGGGGGTCTCCCCTTTTTTTCGTGAAGGACAT TCACATGCCTTACCAAAGCAATCGTGACCCGTACCCGAGCTTCGGGGAGCGGTCGGTGAT CAGCTCACCCACCAAGTGCTACAAGGTTGCGGCCGGGGACATCGGCGCAAAGGAGCTGCC TGTCTACTCCAAGGCGCTCGCAATCTACGTCCCCGATGGTGCGACCGCGACAATCTCGAT GGTCTACCTGAACAATCTTGATGGCGAAGCTGTCACCCGCACGTTCACCGCAGGCAACCA CTACATCGCAGCCCGCATTCGGCGCATCACGGCAGTCTCGAACGCCAGCGTTGAAATCCA CGTCGAGACGGAATAGGAGGCCCCATGCCAATCATAGGCGTGACGGTCTCTCCTCGATTT GGGAGGGCACCTCGTCGGTGGCTTAGTGGACCCCCTGCATGGGTGCCAGACCTAAACCGT TACATGCCCGCAGCGACGGGTACGCGCTGGCCAACAGGCTCGACGACGAACCCTTGGACC TACGCCGCTGGGCTTAACTACCAGTGCTCCAAGCTGTTCTTCGGGTCGCCTGACTATCCG ACCAACGACTTCCTGATCCCCTTCGTTGGCTTCGCTCTGACCGAGGGTGGGAACGCGCCC CAGGAAACCCAAGGCCCCACCACCGACACGCTGCTCGATGAGGCTTTCTTCATCCATCCA GACGGTACCGAATACCCGATCCTCTTTGGAGGTCAGGCGGCAGCCACTATCACGGCAAAT ACGGGCATCGTCTACGGGCAGGTAACCCTCCCGACATGGCTCCCCGCTTGGTCCATCTTC GGCATCCGCACGATCTACCACGGCAACGCAGGCGAGAACCGATTAGGGTCCTACCGCATC CAGAGACACCGTGGTGAGAAGTTCTGGGGCGCGGGTGACCTCGCATCCATTCGTGCGCTT GCGACCGCCAATGGTCCATCGACGCCGGCCCTAGACCCCGACAACTGGTACAACACAGTC GGCAATGCCACCAACTCCCAGCAGCAAGCCTATGGCCCCGCCATGGTGCTGGCCAAGGGT TGGGACGGCAGGCCAGTCCCGCTCATGCTCGCAGACTCCCTCGCGGAGCGCCAGGAGATC GCCGCATCGGCGGACGAACGGCGGAACATGGGTATTTGGCGGCGTTGGCTCGACCAGAGA GACCCTGTTTGGGGAAGCCTCATACCGGTCGTCATGGGTGTCCCAGGCGCTCACTCCGAA TACGAGCTAGCCGGCTCGGGTGCCTCCATCGCCACCAGACGTTGGGCCATGATCGACTAC ATTCGCGACACCTTCAATGGTGGTAAGAACATCTGGACCTTCGTGCTGGATCAGTCTGGA CGTAACGACACCAGCTCCACCCTTAGCTTGTGGCAAAGCCGTAAGTTCGGCTTGGATGAC CGAGTTAAGGCAAGGTACCCAGGCGTGCATATGGTGGGCATCACCATCATGCCTACCTTC ACGTCCAGTGACGGAGGTAGGACGGTCGCCGGCTACTCGACGTCCGCCGTCTGGAACCCC GTTACGGGCACTCTCGCCAGCCTCAACGCGTCGATCATGTCCAGCCCTCGGTTCGCCAAG GTCATCGACATGCTGCCCGCATTCATGTCGGACACCGACCCCACCAAAGGCCCTGCCGCC GAGCTGTTCCCGCTTGGTAACGTTATCGGCCATCCCGGCAATCAGGACGGAACGACAACC TGGGACACCATACGCCTACCTTCGACGGTACCGTTGGGCTCCCGTATCTCGTTTGAATAT CAGCCAGGGCAATGGGCCAGCCGAACGCTCTCCGGGCGAACGGACAGGGGTGATGGCACC GCTGACTACAGGGTCGCGGAGATCTTCGCAACGAACGTCCAGGACAACGCAACGCTCCTC GGTCACGGCATGCACACTGACTTCATCCATCCGGCCCTTCACGGTGTCCTGAGGACGGTC AGCCGCATCCCGCAGTCCGAGAAGGCGAAGTTCTACCCCGCCGCTTAACCCTCCCAAGGC CCGCCACTGAGCGGGCTTTTTCTTTTTCAGGAACACCATGTCTGTCGTCATCCTCACTCC GACCACGGAGCTTGAGGCGATCAACCTTATGCTGTCCGTCATCGGTGAAAGCCCCGTCAA CACGGTGGAAGACACTGGTCTCGTCGACGCGGTGGTCGCCCGTCAAATCCTCATCCAGTC CAGCCGAGACGTTCAGCTCGTAGGCTGGCATTGGAACACCGAGATCGACTACCCCATCGC CGCGAGCTTCCCTGAAGGTGAGCTGACGCTGCCCCCGAACACCCTCAAGGTGGACACTGC CGGGGCCGATGCGGGTCTCGATCTGGTCCAGCGAGGCAACCGCCTCTACGACCGCAAGAA CCACACCTTCAACGTTGGCCGCACGGTCTACGTCGAGATCGTCCTCCTCCTGCCATTCGA TCAGCTCCCCGAGGCTGCCCGCTCATACATCGTGATGCGCGCTGCTCGGCAGTTCCAAGA GCGCATGGTCGGCTCGGAAACCATCTGGCAGTTCAACTCCAGGGACGAACTGCGGGCCTG GGCGAACCTCATGTCGTCCGAGGCTGAGACGCAAGACCTCAACGTCTTCAACGACAATCC GTCTGTGCGAAGGGTGCTGGATCGTACTCCTCCGGGGGGCCTCGTCTAATGGCAGGCGCT CTCGTCTCTACCACCATCCCGAACCTGATCAACGGCGTCTCTCAGCAGCCCTATGCGCTC CGTCTCGCGAGCCAGTGCGAGCTTCAGGAGAATGCCCATAGCTCGGTCGTCGAGGGGCTC CGCAAGCGCCCCGGCACGACCCATCGAGCGAAGATCACGAACGCCCCCGCAGGTGAGCTT TTCACGCACACCATCAACCGCGACCGCACCGAGCAGTATGAGGTCATGGTCGGCAATGGT GATCTGAAGGTTTACGACTTGAAGACTGGCGCTGAGAAGACCGTGACGTTTCCGAACGGC AGGGCCTACCTGACAGCGGCCGACCCCCGGTCCTCCTTCAAGGCCGTGACGATTGCGGAC TATACGTTCCTGATCAACAAGACCGTGACGGTCGAGCAGGACACCACCCTCAGCACCTCG CGTGCCCCTGAGGCTGTTGTCTGGGTGAAGCAGGGCGCCTACGGTACCAAGTACACGGTC ACCCTGAATGGGGTGAGCGCCACCGTATCGACGCCTGACGGCTCGACGGCATCACACATC AACAACATCCAGACCGATGTCATCGCGTCCGGTCTCGTGTCCGCGCTGAACGCGGCGATC GGTGGCTTCTCCTTCGCCCTGAACGGATCGAGCATCTACATCAAGCGCGCCGATAACGCT GACTTCACGATCAACGTGACGGACAGCCAGGGCGACCAAGCGATGAAGCTCTTGAAGGGA ACCGTCCAGCGGTTCTCCGATCTACCGGCCAAGGGCTTTAACGGCTTCGCGGTCGAGATC GTCGGGGACCAGTCGTCCTCCTTCGACAACTATTACGTCAAGTTCGACACGGCCTCAGGC GTTGCCTCAGGTGTCTGGGTCGAAAGCGTCAAGGGTGGCGAGGCAATCCGCCTCAAAGCA TCCACCATGGCGCACGCGCTCACCCGTAACGCGGACGGCACCTTCACCTTCAAGCAGGTG GAGTGGATTGACCGGAAGACCGGCGACCTCGACAGCAGCCCGATGCCTTCCTTCGTGGGC AAGAAGATGAACGATATCTTCTTCCACCGAAACCGGCTCGGCTTCATCGCGGACGAAAAC GTAGTCTTCTCCCGGTCGGGTGACTTCTTCAACTTCTTCCGCAGCTCGGCCACCCAGGTG TTGGATACCGACCCCATCGACGCAGCCGTGTCGCACATCAAGGTCTCGATCCTTCAGCAC GCCATCCCGTTCAACGAGACGCTGCTGTTGTTCTCGGAGCAGACCCAGTTCCAGCTCGGG GCCTCTGAGCTGCTCACCCCAGAGACGATTTCGATCAACCAGACGACCGAGTTCGAATGC TCGCTCAAGGCTCGGCCTGTCGGCTCGGGCCGGAACATCTACTTCACGTTCAACCGAGGC AACTTCTCGGGCCTGCGGGAATACTACGTCGACGGTGACACCAAGACCAATGACGCATCC GATGTGACCTCGCACGTCCCGGCCTATGTGCCGAAGGATGTCTCGAAGATGGCTGCCTCC TCGTCCGAGGACACCATCGCTTTGATCAGCGAGAGCGAGCGCAACTCGATCTACGTCTAC AAATATTACTGGAACGAGCAGGAGAAGCTGCAGTCCGCATGGTACAAGTGGACGTTCCCA GCCACCGACACGATCCTCTCGGTCGAGTTCGTCGAGAGCAACCTCTACCTCATCATCCGT CGCCCTGATGGCGTGTTCCTTGAGAGCATGTCGGTCAACCCCGGCTATGTGGATGACGGC TTCGACTTTGGCCTGAACATCGACCGCAAGGCCAAGGAAGATGCCTGCACGGTTTCGTAC AATGCCGTGACGAACGAGACCACGATCACCCCGCCGTATCTGCTTCAAGCTGAACTGCTT CCGGCGAGCGAGGCTGACGTGATCGTCTCCCGTGCTGGTGACCCGATCAAGAAGCCTGGA CAGCTCATTCCCTACACCATTGATGGCAACAACATGGTGGTGAAGGGGAAGCTCGAGAAG TTCATCATCGGGCGCTCCTATGTGATGCGCTACCGGTTCTCGACGTTCGTCATCAAGGAA GAGGCAGTCGGCGGTGGTCAGATGACAGTGGGCGAGGGGCGCATCCAGCTCCGCAAGGCC ACGCTGACCTACGACAACAGCGGCTACTTCCGAATTGAGGTAACGCCCCTGCGAAGGGAA ACCTACCGATATGTCTTCTCGGGTCGTGTGATTGGCTCGGCCAAGAACGTGATCGGGCAG ACAGCGATCGACAAGGGCCGCTTCTCGTTCCCGCTCATGTCGAAGAACGATCTCGTCACG GTCGACATCGTCAATGATACCTTCCTCCCTTGCGCGTTCCTGAGCGCCGAGTGGGAGGCC CTTTACGTCATTCGTTCCAAGAGGCTTTAATGCTGGAGACACGTCCGTCCCGGCCTGAGG ATGTGACGTACCTTGCACCCCGACTTCGAGAAGCCGACCGACAGGAACTGCTTGCCGCTG GCGCACCCGGCCCAGAGCAGTCCCTTCGGGATGGCCTCATGCTGTCCAAAAACTGCATCT CGGTTGTCGATGACGAGGATAGGGCAGTCGCCATGTTTGGCGTCTGTCCGTCCCCCGTTG AGGGTCTCGGGTACATCTGGCTCCTCGGCAGTGACGATATCAAACAGAACAAGACAAGGT TCCTACGCCGCTCGAAGCAGTGGGTGGACACCTTCCATCAAGATTTCACGGTACTGACCA ACTACGTCGACCAGCGCAATGAGGTCCACATCACGTGGCTCCGCTGGCTCGGCTTCAAGT TTCTCAGGATCGTCAACGCACCGGGGCCGGGAAACCTGCCCTTCTATGAATTTGCGAGGA TACGCAATGTGTGACCCCCTCTCGATGATCGGGTTCGCGATTGGCGCTGCCCAGCAGGTG GTGAGCTATCAGGCTGAGAAGACAGCCGCCGAGCAGCAGAACCAGCTCTACAAAGAGAAC GCCGCCAGGGCGAACCAGAATGCCCGCGACCAGATGTTCCAGACCCAGCAGCGCATGCTG CAGGAGCAGGAGAAGGGCGCGGCTGAGAAGATGGACACGGTACGCGAGGCGCGCGAGGCC AAGGCAACCGCAACAGTGGCTGCCGGCGAGGCTGGTGTCTCGGGCCTGTCGGTCGACGCC CTCTTGGCTGAGTTCGATGGCCGAGCTGCTGCAGCCAATGATCGCACCGATCAGAACACC GAATGGACGCTCTCCCAGCTCAACAACGAGATGAAGGGAATTAGGGCCAACGCCGAGGAC CGCATCAACTCGGTCCAGCGGGCAGCCGCTCCGTCCTTCTTCAACACCGGCCTTAAGATC GCTGGGGTTGGTCTCGATTCCTACAATGACTTCAAGGTCAAACAACGGAGTAAGTAATGG CACGTTTGCCGGGGCTGAGGCCCATTGACGAAGAGCGCCGCCGCTCCGGTGGTGGCCAGA ACCGCGGTCGTGTTCGGACGCCTTCAGCCGACAACATCAGGGTGCAGGGGCTGTCCCCGA ACGCGTCTCCGGTCGACACTTACGCCCGCCCCGAGCAGGCGCCTATCGGTTCGAATAGCT GGGAGGCTCTTGCCAAGTCATTGGCGGGCATCCAGCCCAGCATCAACAACTTCCTGAACG TCCAGGCTGCCGAGCAGCAGGATGACGATGTCACTGCGGTGAGACAGGCATTCCTTCAAA AGTCCCCCGAAGATGTCCGCAAGGCCATCAAGGAGGGCTCAGTTCCCGGCCTTACGAGCC TCGCTGGTCGAGAGCTGGTCGGTGAGCGGTTGGCCTATGACCGGTCGCTGCAGATCATGG CATCGTATCAGACCGACTTCGACCGGCAGACCGGTGACGTGGATGCCTTCGTGCAGGAGC GCATCAAGGACGATCTGGCTGAGTTCGGAAACGACAAGGCCCTCATGGGTGCCTACACCA AGCAGATGACGGCCTTCACCGAAAAGCTGCGCAACCAGTCGGTCGACGATAAGGCCACCT TTCAGCAGGATGTTCGACAGGGTAACCTGTTCGAGAAGTGGTCCGCAAAGGCCACCTACG ACCGCGCTGAGGGCAAGGCCCCGGCTGATGTGGCCGGCAGCATGTTCGGGGAGTTCACGA AGAACCAAGAGCTGTTGCGCGTCCCATTCCAGAAGCAGCAGGAGATGATGCTCCAGCTTG CCGATCAGGCTGCCACCAGCGGCGACTACGATCTGGCCAAGGCCATCCTCCAGCACAAGC GTGAAGACGGCCCCTACAAGGGCAGCCTCATGACGGACGTGAAGGTGGGCGATACGGCCA CCAAGCTGTTCGCTCGCATCGACGCCGATCAGACCAGGGAGCGCCTCACCGCGCAGGCCC AGGAGGACGAAGAGAGCCTCTACAGCCAGGGTGTGGCAGCTGCCGAGAGTGGCTCGATCC TGGCCATCGGTGACGCTCAGGTTCGCGATAAGCAGGGCGAGATGAGGACGATCACAGCCG ACGCTCAGAAGAAGGAGGTCGCCAACCGACTGATCGCCAAGGCAGCCGATGAAGCAGCTT ACCGGGAGAAAGACCCCGAGAAGCGCCCAGCTCTGGCCCGCCGCCTGGAGAAGGAGAAGT TCGTTGGTTCCGGTCTGGAGCATCCCGTCTGGTTCAAGGCAATGAATGGTGCCCCCGGCC AGATGAACCTGAACGCTGCCACAGGTGAAATTCCACCGTCAGCCAAGGATGCCTTCGACA CCTATCAGGACCTCTACAAGGACAGCCCCCAGTACCTCGCCAAGTACCTGAACAAGGACG CTCTGGAGTTCTTCGAAAGTGCCCGATTGGCTGAGGAAGTTGGTAACGCCGGCACACCTG AGGCTGCCTTGCGCATCGCTCATATGGTTACCCAAGACCCGAACCAAATGGATGAGGCGC TGAAGCTCAAGTACGACAGCATCGACAGTGCCGTTAAATCTGCAGTGTCCAACTCGACGA GCTGGGGCCAGTGGGTCTTCGGCAAGCAGACTGCCGGCAACCAGAACTACGTCAGGAGCG AAATCGTCCGTCTGGCGAAGCAGTATGCTCTGCTCGGCAAGGACAACGACGAGGCCATCG AGCAGGCCCAGCAGACCTTCGAGAAGACCCACATCAACGTGGCCGGCTCCTACGTCAAGA ACGACAAGCGGCTGCCGGGAGACTTCGAGCCTCTGGTCAACCAGTACCTGAGCGAGTTCG TCGAGAAGCACAAGGGCGACCTGAACTACGACATCGACGATCTGACGATCACCCAAGGCA ACGGCACTGGGGCCTACATGGTGGTCAGAAAGTCCGACCGCATGCCGGCAGACCCAGCGT CTGACGACACGTTTTTCTCTCTCAACATCCTCAACGACCTGCGAACGCGCAACCGAGATC AGAAGATCAAGGAGGTGACGGCGAAGCAGAACGCACGCTAAGGAACACGAATGGCCGACA TCAGGTCCATCATCACGGATGCCGCAAACCGCTACGGTATCAACCCGCAAGACGCGCTCG AAATGGCGCAGATCGAAAGCGGCCTCAATCCCCACGCGCAGAACAAGTCCTCCACCGCAG GGGGACTTTTTCAATTCCTCGACAGCACCTGGGCGAAGTACGGGAAGGGGGCATCAAAGT ATGACCCTTATGCCAATGCTGACGCCGGGATGCGGCTGGCCCGCGACAACATCAACTTCC TCAAGAAGAAGCTCGGCCGGGACATCACGGGCGGGGAAATGTACCTCGCTCACCAGCAGG GTGCAGGCGGCGCGCTGAACCTCCTGGCCAACCCGAACACCATGGCAGTTGACCTCGTAG GCCGCGCTGCGGTCCTCGGCAACGCTGGCCAGACGGGCATGACGGCCGCTGAGTTCGCGA ACCTCTGGATCAACAAGATCGGCAGCACGAAGGTCGGGAACGGCCCAGGTCTGGTCATGC CAGGATCGATGGCAAACTCCCAGAACCCCGGTGACTTCTCGGTCCACGACCAGGGCCGCG TCAGTGCCTCGGATGTCATCCCCACGATGAACACCACGCGGGCCGAGGAGGTCCAGCAGG AAAAGGATCGGCAGGCTATGATGCCTTCCTTCGGGGAGGCTGTCGCAACCGCCGTGAAGA ACGAGTGGTCGGTACTCACCCCTTTTCGGGCACTCGGTCATTTTGATCCTGAACCGGACT ACAAGCTGACCGAGGACAAGCTACGGACTTTCGGCCAGAGCATCCCAGACGACTACCTCG ACGAGTTCGAGGACGCTGTCTCCGATGAGCATGCTGAGGCAATCCGCAACCGGCTGCTGA CCCAGCTCGAAGACAACCAGAAGATCGCCTCGCTGGGCACTGCAGGCACCATCATCTCCA TGGGAGCTGCCCTGACCGACCCCGGCGCTATCGCCGCTACGGCTGCCATTGGTGCAGTGA CGGGTGGCTTCGGCGTACCAGCGGCTGTCGCAGCTCGCCTCGGTCGCGTTGGAATGGTCG GCTTGGCTGCGGCTGAGGGTGTGGCCGGTAACCTCGCCACCGACATCCCCCTCGTGGCCG TCGACCCAACCCGTGACGTGTCCTTTGACGAGCTGAAATACAGCATCGGCACTGGGCTCG TGATGGGCGGTGTGATGGGTGCCTTCAGGCGCAACCCGATGTTCACCGAAGAGGCCAAGC AGATCGCGAAGATCGGGCAGCAGATGCAGGAGCAGGCGGTGAGCCTTCCGGCCGGCAGTC GCTCAGCAGGTGCTGCCTCGGTCATGGGTGACAACTTCACCCGATCTGACACCTCGAACC TGATCGACGACTTCAAGCGCCTCGACCCCAAGGGCACCTTCCTGAACTGGCGCGTTGACG CGGTCGGCCAGCTCATGGCCTCCAAGAACCCGATGGCGCAGACGCTCGCCCGTTACCTCG GTGAGGACGGTGTGCGAGCTGCGAAGGGCAGCGGAGTGGTCACTGAGATCGCCGCGACCG AGCGCATGCAGCGCCGGCTTCGCGTGGCCCAGATCAACTGGTACCGGGGCTATGACGATG CCTTCAAGAAGTTCCGCAAGGCCAACGGCATCAACGCCTTTCAAGCCAAGGACGCGGAGC TGAAGTTCAAGGAGCAGATCACCGATTACATCCGCGAGGAGAACCCGAGCGTCCGCGCTC AGTTCCCTGCTGAGGTGAAGCAGGCTGCCGGCGCATTCCAGGCCGAGATGAAATCTTTCT GGAAGGAAGCTCAAGAGCTGGGCCTGACGAGGACCGAGGCCGGCGTCGAGAACTACTTCC CTCGCTACGGGCACCTCGCCAAGGCCACCAAGCTGATCAGAGAGGTCGGCTACAGCATGG ACAGGAACGGTGGTCTCACCGATCTCTTTGCTGGGGCCATCCTGAAGAAGCAGCCCGGTC TCGATCCTAAGATCGCAAAGCGCATGGGCTATGCGGTCCTCGACCGCTTCCAGAAGCTCA GCTCGGGCGAGGAGATGTTCGGTACCGGCCACCTCGGGTTCGATCTCGACGACCTTGAGG TTGAGCTGAAGAACTACCTCGACGACGAGCAGATCGCGAACGTGAAGGCTTGGGCATCAC GCAACGAGAAGAAGGAAGGCGAGGCCAGCGGACCAGCTCGCATGAAGGCCCGCATCATGC TCGATGAGAACCACTTCGCTGATGTCATGACCAAGCGTGATGGCGTGAAGCGGGTGAACA TCTCCGACTTTTACGTCAAAGACCCCCACACGGCCTTCCAGCTCTACGCTCGCAACATGA GCGGTCAGCTCGCTATGGCTCGCATTCAGGTCCGCGATCCTGTCACCGGCAATCTGCTGA TCGACGGGATCAAGAACGGGAACGACTGGACCAAGCTGAAGAACCAGATCAAGTCGGTGG GTGAGGCCACAGGTGCGAACAACACCCGCGATGAGAGGAACCTCGACTTCCTCTACTCGG CCATCACAGGCACCCCGCTTGCCGGCATTGATCGTGGCTCCGATGGGGCGACGTTCCTGC GCATGCTGAGGGACTTCAACTTCCTGCGTCTGATGGGGCAGGTGGGTTTCTCTCAGGTTC CTGAGTTCGGTCGGCAGGTGGCCCAGGTCGGCGTTAAGACCACGTTCCAGGCTGTGCCTT CCTTCCGTCACCTGATCGACATGGCCCGCTCGGGCAAGATGACCGACGAGGTGGCCGAGG AGCTGGACGCGATTGGTGCCTTCGGTACCGACTACGAGCGGACGGCGCACTACCTCGACA CCGATGAGTTGGGAGTCCCGGTCACCAGCGGGAGCGATTCGACCATCCAGCGGGTGGCCG GCGCGGTGAACCCAAAGCTCCACGCGATGAACCGCTTCGTCTCAATGGGCTCGGGCATGG CCCCGATCAACCGCGTATTCCAGAAGTGGTCGGCCCGCGCGGCTGCCGTCAAGTTCACCA AGATGGCCATGTTCGGGGACAAGGTCGATGCGGAGCGCCTGCGCGCCCTCGGCCTGGACG ATGCCACGACCAAGCAGATCTTCGAAGCAATCAAGACCAATGCCACCTTCAAGGGTGGTG TGAAGTCCCCCTCGAAGCTCCAGAGCCTCGGCATCAAGAACTGGGATGGCAACACGCTGT CGGCGTTCGAAGATGCAATGTTCCGGCTGAACCGGACGATGATTCTTGAGAACGACCCCG GCCAGATGCACCGCTGGTTGGCTCATCCGCTCGGCCAGATGGTCATGCAGTTCCGCACGT TCGCGATGTCAGCCCACACCAAGGCGCTGCTGCAGGGGCTGAACCTACGCGACGGGCCGG CGCTCTTTGGCATGCTCGCTTCGAGCTTCCTCGGCGCTGCCGTCTACGCGGGCCAGACGC ACCTCAATCTGATCGGCCGGCCTGACCGTGACGACCAACTGAAGGAGCGCTTGACGTGGA ACAAACTCGGCCTCGCTGGCTTCTCCCGCTCCTCGGAAAGCGCGTTGATCCCGATGGCCG CTGATATCGGCTGGCAGTTCTTCGACGATGAGCCGCTGTTCGACACCCGCTCGTCGGGCC TGAAAACCACCGTATCGAGCTTCCTCGGTAACCCGACAGGTGACCTGATCTCGACTGGTC TGGCGGGCGCAGCGGGGGTGACCTCGGCCATGGTCGGTGACGACTACTCCCAGACCGACT GGCAGAACCTCACTCGCACCCTGCCGTTCGCACGCATGATGGGCGCTGTTCAGTTCCTCA ATTGGGTCGGCTCAGGCCTGCCTCGACGGGAGCTTCGCGACTAAACATACCCACCAGTGA ATGGCTGGCCCTCGGGGAAACTCGGGGGCCTTTTTCATTTTCAACAGAGAGACACTCATG GCTCTTGCCTACGCACAATCCCTTGGGGACGGGGTGACGAACACCTTCTCGGTCCCCTTT CCCTACATCTCGAAAAACCACGTTCAGGTGAAGGTCGACGGTGTCGCGGTTCCTTACACC TGGCTGTCCGATACCTCGATCCAAATCTCTCCGGCCCCGGCCGCTGACAAGATCGTGGAC CGCCGCCGCGTGACCCCCCGCGACACCCTGCTGGTCGACTTCGTAGATGGCTCGACGCTG GTCGAAAGCGACCTCGATCTGTCCGCGCTGCAGGTATTCTATCTCGCGCAGGAGTCGTTT GACCTGGGTGAGTCCTCCCTCGGCGTGACCGACGATGGCTCCTTCTCGGCCCTCGGCCGG CGCATCTCGAACGTCCTCACTCCGACCCTCCCGAATGATGTCGCCACCAAGCAATTCGTT GAAACCGGCGTGGCCTCGGGCGTGACTGTCGCGACCCAGAAGGCCAGCGAGGCGTCGGCG TCTGCTGTCGCTGCTGCCCTCTCCGAGACGAACGCCGCAGGCTCTGCAACCTCAGCCAAC GCGTCGAAGATCACAGCCACCACCAAGGCCGGCGAGGCTGCGACGAGCGCCACCAACGCT GCAGGGTCTGCCACCACCGCTTCTACCAAGGCGAGCGAAGCTGCATCCAGTGCCGTCGAA GCGCAGGGCTACCGCGACACCGCAGCGACAAAAGCCAGCGAGGCGGCTGCCAGCGCGGCT GCTGCTGCTATGTTCGACCCGTCGACCTTCTACACGAAGACGGAGATCAACACCTTCCTC GGCGGGAAGCTCGACAAGACCGGTGGAACGCTGACGGGCGATGTCACGATCCAGAAGGCA AATCCCTCGCTTGTGCTGGATCATACCGGCGTCAACAAGTGGGGTATCCTGAGCGCTGCG AATGGGAGCCTCTCCATCCAGAAGCTGAACGGGACAGTCGTCAATGCGCTGACTATCGGC GCTGATGGTGCGATCTCGACGGCGCTGCTTGGTGACCTCAACTCCCGCATTGAGAGCCGC GCCACTGCCTGGGCCAACGACCGCGTGGCAAACCTCGCGTTCCGTAAGGTCAGTTCCAGT TCCTTCACTGTGCCTGACAACGGCCTGATGATGTGCCCAGCGGGCGCGGTACTTACCGGG ATGAACATGCAAGGCACGTCGAACAACCCCGCAATGCATTACCACTACCTGCAATCTTTT GACCCCGTCCGTGGCTGGGTCACGTTCAGCGGATCGTAATTCATGGAAATTGTAAACTTC GGCCTCTTCAAGCCGCTCAACGACACCGGCATCGTTTTTTACGAGAACGAGTACCAGCAG GACTGGTACGACCTCCGTAAGGGCCTGACTAACTGGACCGACCAGGGCGAGTTCGTCGAC GCTGTCTATGGAGCCTGGGCACTCGTCCGCCCCGAGGATGGGGTCATCACGAACGTGGAA CACGACCCCTCCCGCCTCGTGCCCCACAACAAGATCGTAATCGGCATCGACGCGTCCCCC TCCGAGGTCACTCCTGGGATGATCTTCAAGGAGGGCGCTCTGCTTCCCGCAGTCCCTGTG GACGAGCCTTTACCGAACCTCTCGCCCCGTCAGCTCTGGCTGGCCGCGCTGGAGATTAAC ACCACCAAGGCTCAGGTCATGGCCCAGATCGGTACCATCACTGATGCCAAGCTCCGCGCT ACGTTGGAGATCGAGCTGACCGAACCGCCCCTCGAAGGCTACGTTCGGGATAGCTTTGCG GTCGAGCGGCTGCGCGAGATGATGGGCATACCCGTTGACCAGTTCGACACCCTCTGGCTC TGGGCAAGGACACTGTAATGGAACACATGAACACCGAGGCCCTGCTGTTGATCGGCAGGG TCGAGGGCAAGGTAGACACGCTCATCAGCCTCTCGTCCGCGCAATCCCAGCGCATCGATC AGCTCGAAGGGCGCATGTCGGCGGGGGAGGTTGATATCGCCTCCCTCAAAGCCAAGTCAA CGACCAACCAATCCTTCGTCACCAATATCACCGCGATCCTGGCCCTCATCGTTGCCGCGA TCTCGGCCTATCTGAGCTACAAGTAATGGACCTCAAAGACATCCTCTCGAAGCTCCATGA GGAAATGGCCCAGAAGCTCCTCGACAAGGTCAGGAGCGGGGAGGTTACAGCCGCTGAGCT GAACGTTGCCCGCCAGTTCCTCAAGGACAACAACATCGACTCGATCCCGAAGGAAGGCAG CCCGCTCAAGTCCCTGACTGACGAGCTTCCCTTCACCGGCGACGACGACCGTCCCTCCTA CAACTAAACCCCTCGCAGCCCCTCAGGTCTGACTCAGGGCGCGCTCCAGCGACCTTCTGG CTACTACCCTAGCCGGGAGCGCTGACGCGCGTCTGTGAGTCCCTGTGGGCCGTTATATCC ACGGGTGGATTAATGACAGCCGATAGCCTGAAGACAGGCACCCATCTTTCTCCCGCCGTC GACCCCCTGAAGAAGGATTTCAGAAACTTCCTCTTCGTGGTGTGGAAGCACCTCAATCTT CCGGTCCCGACAGCCGTTCAATACGACATCGCCGGCTACCTCCAGCACGGCCCCAAGCGT TGCGTGATCGAGGCCTTCCGAGGCGTGGGCAAGTCCTACGTTACCTCGGCCTTCGTGGTC TGGCTCCTCTACTGCAACCCCCAGCTCAACATCCTCGTGGTCTCGGCCTCGAAGGACCGC TCCGACCAATTCTCCAGCTTCACCAAGAGGCTGATCGCTGAGATGCCGATCCTGGCTCAC CTCCGCGCCCGCCCAGGGCAGCGTGATTCGATGGTGGCCTTCGATGTCGGCCCAGCCCGC AACTCTCACTCCCCCTCCGTCAAGTCCGTGGGCATCACCGGCCAGCTCGCCGGCTCCCGT GCTGACATCATCATCGCGGATGACGTTGAGGTCCCCAACAACTCCATGACCCAGCTCCAG CGAGATCAGCTCTCGGAGCGCGTGAAGGAGTTCGACGCCATCTTGAAGCCGCTCCCCACG AGCCGCATCATCTACCTCGGCACGCCCCAGACCGAGATGAGCCTCTACAACAGGCTGCCC GAGCGCGGCTACGAAATCCGCATCTGGCCGGCCCGTGTGCCCATCGACCCCGAGCGCTAC CTCGGTCGCCTCTCGAAGTTCGTCATGGACATGATCGAGGCCGGCGCTCAGCCGCGTCAG CCGGTCGACCCTCAGCGCTTCCAAGAGCAAGACCTCATCGAACGCGAGGCATCCTACGCC CGCTCAGGCTTCGCCCTGCAGTTCATGCTCGACACCTCGCTCAGTGACCAAGACAAATAC CCCCTGAAGCTCTCCGACCTGATCGTCGCTTCCCTCGACCCGCGCATGGCTCCGGCCAAG CTGGTTTGGTGCAACGACCCCGACAGGGTGATCTCCGATCTCCCCGCAGTGGGCCTCCAG GGCGACCGTCTGCATCGCCCCATGTGGGTGGCCAATGAGATGGGCGAGTACACCGGCACG GTCATGGCGATCGATCCCTCGGGCAAGGGCGGCGACGAGACTGCCTATGCCATCGTCAAG ATACTCCACGGCAACCTCTTCTTGGTCGCTTCAGGGGGCTTCAAAGAAGGCTACTCCGAG GCAACGCTCAAGTCCCTCGCGGTGCTGGGCAAGACCCACAACGTCAACCGTGTGATCGTC GAAGCCAACTTCGGTGACGGCATGTTCACGCAGCTCCTGAAGCCGGTGTTCACCCGCGTC CATCCGGTGACCATTGAGGAGGTCAAGCACTCGACCCAGAAGGAGCGCCGCATCTGCGAC GTTCTCGAACCTGTCCTCAACCAGCATCGTCTCATAGTCGACGCTGCGGTCATAAAGCGT GACCACGAGGCTGAGCCCCATCGGCAGCTCTTCTACCAGCTCACCCGCATCACCCGAGAT CGAGGTGCCCTGATTAACGACGACCGACTCGATGCCCTGGCCATCGCTGTGACCTACTGG GTCGAGCATATGGCCCGAGACACCGACAAGGCCGCTGACGAACACAAGGCAGCGCTGCTC GAACAGGAGCTGAGGAGCTTCTCCGAACACATCTTCGGGGCACCCGCAGACAGCGACCTG AGGTGGTATAACATCGGTTGAAGACCAACCTTTAATTACCACCCACTTAAGGAGAACCCC GAGAGGGGAAGACCAAGGAATAAAGGATGAGGAGAGGGGGACTGATGAAAAAGAGTCCCT CTCTCCAGATGATCTTTATGAAGGATGAGGGTGGGATGAACCCTCGATCTACCGCATGAG ACCGATGAGAAACCTAAAGGTAGCCTAGCCGGAGCGCGAAAGGTATCGCCGCGTGTCATA CTATTGACGCAACGGAATGTTTGAAACCGCAACTCGATTTAAGGCATCGCTCACACCGTT GCTCGCGAGAAGATTTTGGAATGCGGCCCTCATGACATTAGCGTGCTGCGCGGGGCCAAT GATTACCCGGTCAATGGCCTCAACGAACTGTTTGTCGCCGAATTGACCTGGGAATTTAAA CTTGCAAATTTGCTGAGGGACTCCAGCAATCGTGACCGGCAGTCGCTGCATTGGGGATGG ATCGTGTGACGGTGAATGCATAATTCTCCACTCCCGTTCCTCACGAAATCCAGGGTGCTT GTTTGCCAGGATGGAAAACCTCAGCGCCGAGTGGAGCGAATTAACGATAAATTCTCTCTT CATTAATCGAAGCTGGCCTGCATTTGCCGCAATAGCTGCAGCCATTCGCCAAAGCGCGCT TTCAGCCTCTTTCGCGGTAAAGTAGGCAACGGGACTGGCGGTTACACCCGATATTGGGAC AGGGCCTGGATTGGTGAATACGTCGCCCTTTAAAACGATTGCGACATTGCCGTAGGCACG CCACATCGAAAGTCGACCGATTTCATCTTCATCATTTTCGTGGACCGCGAAACTGGCAAG AAACGTGTCGTTCTGGAAGTACGGTTGCCACTTCTGAAACAGATCATCCACTGTTGTAAC TATCGACGGATCAATGGAGTTCAAAGCATTCGTGAACGCAATGCCTGCAGGCCCAGCACG AGCGGCGCTGAGGCAGTCAATACCATAACTAATTTCTGAGAAGTCGTTCATAACCGCAGC GTTACGAAGCCAGATTTCTTTATTCTCAATTATCTTGTAAGCCGTTTCAGCGGACGTATA GTGAACAAACCTGTTTCTTGCTGTCCTAATCTTTTCGAGTTCATCATTGAATGTCTGAAA AAATGTCGCCGCTAACAAACTTTGTTCACTTGCGGTTGGCCACATAAACATCGCTCCGAA TTGTTGGCTTCTCGTGCCTTAGAGTCGTAAAGCCTCTTCCGACTGATCTTCTCGACGACT GAAGTCGCGACTGTCATCCCACCTTTATTACAATAGGCGTATCACCCTTCAACACACTCT CGAGCAACTCCTGCAAAACCTTTTGCTTGGCTGTGACATCCTCGTCGGCCGGGAAAGATT TTATATCGAGCGGCAGCGATGTCTCTCCTGCGAACAGCTTGTAGAGGCCCTTCCGGATGA CCTCTTGCTCCCCCTGCACAATGGAATGCCCAACGGGCGAATAGACTTCCCTTTCCAGTT CCAATCTTGAAAATTCGTATCCAACTGCAGCCGAGAGCTTCAGAAGAAGATCGACAAAGA GCGTTTGACGCCTTTGGACCCACAGGTCCATATTTGTCTGTTCCTTCCCGAGATGGTCGA GAAGCATTCTCCAAGCTTCGACCACCTCCTTAAAAACTCGAGATTTGCCATAGAACTCAA TAGGAATAGCGTTCAACGCTTCAACGTGGTCTGGTGAAAGTGAAGACGCTCTGGTGGCCA TCAGTGTGCGAAATATAGTCACGCGCCGATTTCGCTTTTCGCGGGCCAAGTCAACCAACC GCTGAGCCTGGACAGCCAGAATGGGCCCCATCAACGTCGCCGCGACTACTGCAAAGTTCC AATACCATCCATCAGCCATGAAAAGTCTCCCAGCTCATGACACGATAGATTTTTTTAGTT GGTTTGTTTCACTGGGCGTTGAATGATCCACTATTATTTTTCGGGGAAGTTGCAGCCGGC AAATATTTGCTGCAAAAATTTCTGAGGTCAGATCAGATATAAGGAACCGGCAGTTCCCCC CGTGGCCCCTCGACGACACGCAATCGACCCTTCGGGCTGCCGCCAATGCCACATGAACGT GACACAAGATGGCTAAGCCTTTGAAATCATTAGAGCCAACAGCAGATGATTGATCTACAT ATTGCGCATGATTGATCACGAGCGGCTGCCTTGAGTGGGTTTGTGGCTGAATATGTGGGC GTCTGTTTCTTCTGGGGCGATAATGCCTTTTTGTGATATCGCACGGTTGCACATTAGGAG GCACATCAATGGCATTTATAACGGTTGAAGAAGTGCACAGTCAGCAAGACAGGGTGATAG GCACTGTCTTTCTGAACGTGGAGTTCATTATCAAATACGAGTCGAAGTCGACCGAGCAGC CTTTCACCAGCAAAATAACGTACCTCACCGGAAGCTCGATAAGCGAATTGATCGTGATGG GTGCGCCAGTTGATATCACCTCAAAGATTTCTAACGCTTCAAGAAGCTGACACAAAGCTG CCAGCAAATTCACGAGTGAATAAATAAGATAAAGCACCATCAATGCCTTAGCGCGTCGAT GGTGTTTTTTTCATATCTACCACTTTGCAATCAATCCACCTGTGAATACATTCCAACCAC CAAATCGGGGCGGCTAGGTCGGCCTGAGGCGGCGGGTGAGGGGCAGGCAAGCCACTCGGT TCTTTGACAATTGAAGACGCTAGCATCGACCTTTCGGGGTTGGTGGGTAGCACTGGGTAC AGCGGCGAAGCTGTGTCTAGTCATATCCACGGGTGAATGAAAAGGAGACATTCAATGTTC AAGGGAACACTCATTCGGTCTGGCAACAATGCGAAGACGATCAAGGGTGATGGGGAATAT GAAACCGCCATCATGTACCTTGCGCCCTTCACCATGGCAGGCGCGAACGTCTGCCCTATG GCTGAACAAGCGGGCTGCGTTAAAGGCTGCCTCAATACGGCGGGCAGGGGCGCTTACAAT AACGTTCAGCAAGCCCGCATCGCCAAGACAAAGCGCTATCTGGCCAGCCGCACGGCTTTC ATGGCTGATTTGGTCACCGATCTGGAACGCTTCGTGGCCTACTGCAAGCGCAAGGGCGTC AAGCCTGCCGTTCGCCTGAATGGCACCTCAGATATTCAATGGGAGGTGGCCCACTACGCT AGCCGGGGTGACGCTCGCGGCTCGGTCTTTGAGTTGTTCCCCGAGGTGCAGTTTTACGAC TACACGAAGGTTTACAAGCGGGCTTATCGCCAGTTGCCCGCCAACTACGCCTTGACGCTG AGCTATAGCGCGGCAAACCCGGTCTACGCTGAGGTGGTCACGAAGGTTGCCCATGAGACC GGCGCTAACTTGGCCATCGTCTACCGCACAAAGGAATTGCGCGACTACTTTGTCGGCAAG CTTGTGCAATACGGTGATGCCTGCCGCGATGTTATCGACGGCGACGAAACTGACATGCGG TTCCTTGATCCCAAAGGCGTGATTGTTGGTCTCTACGCCAAGGGCAAAGCCAAGGGTGAC CAATCGGGCTTTGTCGTGGGCTAACGAACTCACCTGTGAATTGAATACTGCCGAAACCGG CTGCCATGGGTGGCCGGTCGCGGGGCTTGGCATGCGCCCGCCTGATGATGGCTGCCGATA GGAGAATGGGAATGGCGAGGACCTATTACACGCTCTTGCAACGGGTTGACGATCACTGGT CACCGCAGTTTGGAGCCTATGACCGCGAGGACGTTGAAAGCGAACGCGATGACTACCGCG ACCATGGCGTAAAGGCCAAAGACCTCAAGATCGTGACCACGCAAGGCCACTCTTGGAAAG CGATTGAAGCTGTCCTGAACAAGCTGAACGGGAGGGCACGCTAATGGCCACCACGTACCA CGTTCGCAAGGTTGCCAAGGGTCGCTGGGGCATTACGGCCGCACATTCCGGCTGGGTTAC GCCAATCGGCAACTATCCCAAGCGTTCGGCCGCTATCACGGTCGCAAAGCTGCTCGCGGG CTGGCGCTGCTCGGTCGTCGTTCACTCCTCCTAAAACTGCCGAAACGGGCCTCGCAGGGG CCTGTCTTCGCCCTTGGCATGGGTGGGCTGATGATGGCTGCCAATTGGAGAATGACCATG GACAACTCGATCCTTTTCAACTGCTGCACTGACCATGTTGCCCCTGATTGGTCGCAGTAT GATGCGCTCGAACTGGGCGGCTGCGTCGAAGCCAAATGCACGCTCACCAACGACACTTGG ACCGAGGGCGGCTATCATCGAAATGACGCCGAGTTCTTCACGGTCTACGGGCACCTCAAG GAAGGTGGCTGCGAGGCAATCACCGACTGGCATGGCAGCTTCGATGAGGCTGTTTGCACT GCCGAGGAACTGGCGAGGCTCTCAGGCCTGCCCCTTGAAGTCTGCTGCTAACCACCTGAT GAGGCCGGCAAGGCCGAAACCTAAGGGATTGTTACGCGGTCCCTGACGGTCGTGGTTCAT CCACAACAATCCACAGGGTTTACTGGATTGACCGGGCATAAACGTCAGGTTGATCGGCAG CTCCTGTTAAGTGCCTCTTTTGACTTCGCTTTGGGCCATGAGCGCCCGCGTATCATCACC ATGTATCGGTGGAGCAGAGCTAAAGATTTTGCATGGCTGATATGCTCTGATCGCAATGCT GAATAGGATAGAATTCCTATTTCATTCACTAAATCCAATGTGAAATCAGCGACATGCGCG ACACACGGGCCAGATGAAACGATACTTTCCTGCCTAGACCTATTCACCGGTGGCCCTTTT GCAGCTTGTGGTACGGGTCAGCGCGGGAGTATATCCGCTTCCGTCCTAGAACGAGAACAG GACGCGAACGGAAAGCGAGGGATTTGAAGGTGCGGGTATTCAAGAAAACAAAGCGATTTG AAATATATATCGAGATCGAACGGTCTCGAATTCTGGATCGCTTTTTTAGCTATCGCTGCG ACGATGGGAACCACGAACTGTGGCTCGGCCGCTGCTACATCACACTGTTTATTGCACAAA CCTAATCACGGGTGTATGGGTTCCATCGTACAGCCCCGGGCTGGGGCCAATAAGAGGGAC ATTCCACATGCCTAAGGCATTGTTTTCGCGCATGGGCGCGGTCATGAATGAGGTGCGCCA GCTCCAAGATGAGAAGGCGCTCATGTCGGTGCAGACTTTCGAGGTCTTCCTCGTCATCGC CTCAAAGGATGGCATTCCGTCTTCCGAGATCAGGAAGATAACGGGCATTCCACAGCCTTC TGTAAGCCGCGCTCTGGGTGATCTCGGTGAAAAGGCCGTCCGCCGTGACGCCGAAGGCCT CAAGCTCATCAAAACTGAGCGTGACCCCAGCGATATGCGCAATGTGGTCTGCTTCCTGAC CCCCAAGGGGAAACTATTGGCTGCCCGCATAGCGCAGCTAATGGGCATCAACGACACCAA GGTAGACGGGTCGTTCGAACGCAATGCCCAGTGAGGGAGAGATACCGATGCCGAGCCTTT TCAGAGGACTTTCGGCTGCCGTGTGGCGGGAGCGTTTTGAGGCTGCTGGTTGGCCGAAAG AAGCTGCCGATCACATGGCCCTGTGGCTCACCATTGGAGCGAAAGACCCGCCACCTGAGT GGCAGGAAATTCTCTCCAAAGTAAAGGGTGGTGCCCAGTGAGGGACTCGAACCCCCACAC CTCTCGGCGCTTGGACCTAAACCAAGTGCGTCTACCAATTCCGCCAACTGGGCAACGAAA GGAGATGCGATGCCTGTTAAACCACGCGGAGCCTCATGGCAAGCCGCTGTCTCTCATAAG GGGACACGGTTGAGAAAGGACTTCCCAACAAAGCTCGAAGCTGAGATTTGGGAAGCCGAG ACGAAGGCAGCACTGCTGTCTGGTAAAGAGGTGGTCGTAAAGACTGCCGAGCCTGTCATG ACCCTGCAGCAGCTCTTCGATCTGGTCGCTGAGACCCGATGGAGGGGCACGAAGGGCGAG AAGACGGCGCTGATCAATGGTCAGCATGTGGTCAACATCCTCGGCCCTCAGAGGGACGTT AAAACTCTCTGCTATGAGGACAGTCTGACCATCAAGAAGACGGTGACCGGCTGGAAACGG GCGGATGCCACGATCAACCGCAAGCTCGCTGCTTTCTCCACCATGGTGAAAGAGGCGTAC AAGCTGGGCAAGATCGACAAGCTGTTCGACATTGGCCTGATCAAGGAGCGAAACACCCGC GTCAGGTACTATGAGGACAAGGAACTCGATCAGATGCTGGCTTGGTGCGACGAGATGCTT GAAGATGAGCTGAGGGACTATATCATCGTCTCTCTGGACACCGGTTTCCGGCAGGGTGAG GTCTTGAAGATCACCAAGCGGGATGCCGAACTGGAAGACCTCTGGACCTTCGACACGAAG GCGGGGGACAAACGGGATGTGCCGCTCACAGCCAGGGCGAGGGAAGTTCTCCTCCGCAGG GCCAAGCCTCTCAACGATCCCGATGCGAAGCTCTTCACTCAGAAGCCTGCTTGGTACCGG GAACATTGGAAGAGCATGCAGTCGGACCTCGGCATGACCGATGACAACAACTACGTGCCG CACGTTCTGCGCCACACGTTCGTCACCAACATGCTACTGCATACCGACATTCGCACGGTG CAGGAGCTGGCCGGTCACAAGCGCATCGAGACGACCATGCGCTATGCCAAGACATCAGCC GAACGCAAACGTCTTGCAATTAAGCGGATGTCGGACTATCAGGGGGCCGAAATCGGGGCG TGACACATGACACATTGTGCCATTTCTCCGTGACACAGCAGTAAGTAAAAGGAATGCTCA GAGCCTCAAGCTATTGAAAAGCTTGTGATTCTGGGGATGGTAAAACGGGAAAAGC
Region | 2 |
Region Length | 43.3Kb |
Completeness(score) | intact(110) |
Specific Keyword | head,capsid,tail,virion,integrase |
Region Position | 1811452-1854766 |
# tRNA | 1 |
# Total Proteins | 44 |
# Phage Hit Proteins | 29 |
# Hypothetical Proteins | 14 |
Phage + Hypothetical Protein % | 97.7% |
# Bacterial Proteins | 1 |
Attachment Site | yes |
# Phage Species | 10 |
Most Common Phage Name(hit genes count) | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431(18) PHAGE_Pelagi_HTVC011P_NC_020482(13) PHAGE_Pelagi_HTVC019P_NC_020483(10) PHAGE_Ralsto_RSB2_NC_023736(5) PHAGE_Vibrio_ICP3_NC_015159(5) PHAGE_Prochl_P_SSP3_NC_020874(4) PHAGE_Vibrio_N4_NC_013651(3) PHAGE_Cyanop_P_SSP2_NC_016656(3) PHAGE_Synech_S_CBP42_NC_029031(3) PHAGE_Salmon_phiSG_JL2_NC_010807(2) PHAGE_Entero_T7_NC_001604(2) PHAGE_Synech_S_CBP1_NC_025456(2) PHAGE_Morgan_MmP1_NC_011085(2) PHAGE_Synech_S_CBP2_NC_025455(2) PHAGE_Entero_UAB_Phi78_NC_020414(1) PHAGE_Mannhe_vB_MhS_587AP2_NC_028743(1) PHAGE_Acinet_133_NC_015250(1) PHAGE_Synech_Syn5_NC_009531(1) PHAGE_Pseudo_Pf_10_NC_027292(1) PHAGE_Morgan_vB_MmoP_MP2_NC_031115(1) PHAGE_Vibrio_VP4_NC_007149(1) PHAGE_Sulfit_NYA_2014a_NC_027299(1) PHAGE_Mycoba_Peaches_NC_013694(1) PHAGE_Entero_E_2_NC_029102(1) PHAGE_Escher_vB_EcoP_GA2A_NC_031943(1) PHAGE_Synech_S_CBP4_NC_025464(1) PHAGE_Rhodob_RcRhea_NC_028954(1) PHAGE_Sulfit_pCB2047_A_NC_020858(1) PHAGE_Rhizob_RHEph10_NC_034248(1) PHAGE_Citrob_CR8_NC_023548(1) PHAGE_Entero_K1E_NC_007637(1) PHAGE_Rhizob_vB_RleM_P10VF_NC_025429(1) PHAGE_Mycoba_Kampy_NC_024141(1) PHAGE_Escher_LM33_P1_NC_031937(1) PHAGE_Sulfit_pCB2047_C_NC_020856(1) PHAGE_Acinet_Acj9_NC_014663(1) PHAGE_Citrob_SH4_NC_031018(1) PHAGE_Rhizob_RHEph04_NC_041908(1) PHAGE_Cyanop_PSS2_NC_013021(1) PHAGE_Pseudo_phi15_NC_015208(1) PHAGE_Sphing_Lacusarx_NC_041927(1) PHAGE_Rhizob_vB_RleS_L338C_NC_023502(1) PHAGE_Mycoba_LHTSCC_NC_023745(1) PHAGE_Halocy_JM_2012_NC_017975(1) PHAGE_Entero_K1_5_NC_008152(1) PHAGE_Cyanop_NATL2A_133_NC_016659(1) PHAGE_Podovi_Lau218_NC_024329(1) PHAGE_Entero_EcoDS1_NC_011042(1) PHAGE_Entero_13a_NC_011045(1) PHAGE_Cronob_Dev2_NC_023558(1) PHAGE_Rhodob_RcCronus_NC_042049(1) PHAGE_Prochl_P_SSP7_NC_006882(1) PHAGE_Punice_HMO_2011_NC_021864(1) PHAGE_Rhizob_RHEph06_NC_027296(1) PHAGE_Pseudo_PPpW_4_NC_023005(1) PHAGE_Citrob_SH2_NC_031092(1) PHAGE_Acinet_vB_AbaP_Acibel007_NC_025457(1) PHAGE_Salmon_SPN1S_NC_016761(1) PHAGE_Erwini_vB_EamP_L1_NC_019510(1) PHAGE_Achrom_JWF_NC_029075(1) PHAGE_Salmon_epsilon15_NC_004775(1) PHAGE_Entero_vB_EcoP_ACG_C91_NC_019403(1) PHAGE_Salmon_Vi06_NC_015271(1) |
First Most Common Phage # | 13 |
First Most Common Phage % | 40.9% |
GC % | 58.98% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains a intact or incomplete prophage based on the above criteria. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
>3 1907906-1916536 ATGAACCTGAATACACCCGCATTTTCAAGCTTCACCCATGATGGATTACAGCTCGCCTTC TTCGATGAGGGCGATCCGGCCGGTGTGCCCGTGTTGTTGATCCACGGCTTTGCCTCGACC GCAAACGTCAACTGGGTGCATCCGGGCTGGCTGAAGACGCTCGGTGATGCCGGCTACCGG GTGATCGCCATGGACAATCGCGGCCACGGCGCAAGCGACAAGCCGCACGATGCCGAAGCC TATCGTCCATGGATCATGGCCGGCGATGCGATCGCGCTGCTCGACCATCTCGGCATCCCG GAAGCCAATGTCATGGGCTATTCGATGGGCGCGCGCATTTCCGTCTTTGCCGCACTTGCC AATCCGCATCGGGTCCGTTCGCTGGTTCTCGGCGGCCTCGGCATCGGCATGACCGACGGC GTCGGCGATTGGGACCCGATCGCCGATGCGCTGCTGGCTCCCTCGCTCGACGCGGTGACG CATGCCCGCGGCCGGATGTTCCGCGCCTTCGCCGAGCAGACGAAGAGTGACCGCGTCGCC CTTGCCGACTGCATCCGCGGCTCGCGCGATCTCGTTGCCCGTTCCGATATGGCCAAGCTC GACATGCCGACGCTGATCGGTGTCGGCACCAAGGATGATATCGCCGGCTCGCCGCGGGAA TTGGCGGCGCTGATGCAAAATGCCGAAGCGCTCGATATTCCAGGCCGCGATCATATGCTC GCCGTCGGCGACAGGGTTTTCAAGCAGGCGGTGCTGACCTTCTATGCAAGGGTCGCCCAT CGCTGACAACATCGTGATGCCGAAAAACCATGCTAAAAAACCATGGCGACGGCACCCATT TATGTTATTGGCGTTTTCCTCTATATATTGGCATTCCGAAAACGGCATTCCGGCTGGCAA CGAGGAGAGCGGCGATGGTCGCAAAGACTGACATCCGTGCTTTTGACACAGGCCATCCGC TGAAGGTGATGGACCCCATCTGGGACAGCCTACGCGAGGAAGCCCGGCTCGCCGCCGAAC GGGACCCGGTTCTCGCCGCCTTCCTCTATTCTACGGTGATCAACTACCATTCGCTCGAAG AATGCGTCATCCACCGCATCTGCGAACGTCTCGATCATCCCGACATGCAGGCGAACCTGC TTCGCCAGACCTTCGAGGAAATGCTTCTCGACTGGCCTGACTGGAGCTCCATCCTGCGCG TCGATATCCAGGCGATCTATGACCGCGATCCCGCATGCCTGCGCTTCATGGAGGCGGTGC TTTATTTCAAGGGCTTCCATGCGCTGCAGACACACCGTCTGGCCCATTGGCTACTGAACC GCGGCCGGCGTGATTTCGCGCTCTATCTGCAGAGCCGTTCCTCCAGCGTCTTCCAGACCG ACATCAATCCAGCCGCTCGTATCGGCAAGGGCATCTTCCTCGATCACGCCACCGGCCTCG TCGTCGGCGAGACGGCCGTTATCGGCGACAACGTCTCGATCCTGCACGGCGTGACGCTCG GTGGCACCGGCAAGGAGGGCGCCGACCGCCATCCGAAGATCGGCTCTGGCGTCATGATCG GTGCCGGCGCCAAGATCCTCGGCAATATCGAGATCGGCTACTGCTCACGCGTCGCCGCCG GCTCTGTCGTCCTGAAGGCGGTGCCACCCAAGAAGACGGTCGCGGGCGTGCCGGCCAAGG TCGTAGGGGAGGCCGGTTGTTCCGAGCCGTCGCGCAACATGGACCAGGTAATCGGCGCCG ATATTTGAGCGCTCGTTGGAAACAAGGATAGGAAAAGTCGGCGGGATCCGCGCGGGGAGG GCCATGAGCAAACGGCAACCTTGCCTTTACACCGCTGCATTTCCTGTGCAAGAAGCGGCC AATCAAGACTGCTTACGGAGATGACAGAGTGAAGCCAGAAGAAATCAAGAAGCTCGACGC CTATTTCAAGCGCATGTTCAATCCGCAGATGATCGTCAAGGCGCGTCCGCGCAAGGATGA TTCTGCGGAAGTCTATCTCGGCGAAGAATTTCTGGGTGTCGTCTATATCGATGACGAGGA CGGCGACCGCTCCTACAACTTTTCGATGGCGATCCTCGACGTCGATCTCTGATCGCGTTC GAAAGCTTCAAAAGGACCGAACGGCGCGATCCGTTCGGTCCTTTTTTCGTTTTTTCGATT CGCAACGAGAATTTATTCAAAACTTCCGGTTGCGGCATTCTGATATATTTTAGCAAAATC AGTTAAAGACGTACGTGCCCCGGCGCGCTGCCTTTGTTATGTTATTGTAATGCAACCTTT GGATGTTTTGCAACGCACAAGACTACTTGACTATTTGTGCATTGCAGCTACCCTGCGGCC ACCAACAGCCCAACGGAGGATGGCTCATGTTCAACTTTGAAGACGCCAACAAGAAGAGCA AGGAAGCCGTCGACACGGCCCTGAAGACCTATACCGACACGTCCAAGGGCTTTCAGGCAA TCGCCGCAGAAGCCACTGAATATTCGAAGAAATCTTTTCAGGACGCGGTGACGCATTTCG AAACGCTGGCCGGCGTCAAGAGCTTCGAGGCTGCCTTCGAGCTGCAGACGAGCTACGTCA AGGCGTATTTTGAAGGCTTTGTCTCCGAGACGACGAAGCTCAGTGAGATGTATGCCGATC TCGCCAAATCAGCCTACAAGCCCTATGAGGCACCGATCGCCGCTGCGGTCGTCAAGACCG CCAAGTCGGTGTCGGCGGCGACGCCTGCTGCTGCTTGAACTGATTTCAAAGGCGCAACCT GCGCTACATATTGAAAAATGAAGACCGGCTACGCATCTGTAGCCGGTCTTTTTGTTTCCA CTTTCGCTGCAGCGCCCGCATGGGATCGACCGGGGACTTTTTTGGTCTTTCCCGTGCATG CCGGCCATTTCTTGATTGCAGTGTCTGACAGGGGGCTTAAAATCAGCCTATTATGAACTA AGTTAGTGTTTCAGATATTCGGCGGGAAAAGCACCCTCCCATGCTCCGCTGGATTGCTTG AGGAATGAATGACAATGATCGCAAAGCCGATCCGGATGCAGAACGACAGCGAAAGGAACG GGGACAACGCAAATCGAACCTCGGTCATCACGCGCACCAAGCCGAAGACCAAGAAGCCCA ATCTTTATCGTGTGCTGCTTTTGAATGACGACTACACTCCCATGGAATTCGTCATTCATA TTCTGGAGCGTTTTTTTCAGAAGGATCGTGAAAGTGCCACCCGCATCATGCTCCATGTCC ATAACCACGGCGTCGGCGAATGCGGAATATTCACATACGAGGTAGCGGAAACGAAGGTCA GCCAGGTGATGGACTTCGCCCGGCAGCACCAGCATCCGCTGCAATGCGTCATGGAAAAGA AGTGAGGATCTGAACGTGCCAACATTTTCGCCTAGTTTAGAGAAGGCGCTCCATCAGGCA CTGACCTTTGCCAACGAGCGGCACCACGAATATGCGACGCTCGAGCATCTGCTGCTCGCC CTGATCGACGATGCCGATGCGGCCGCGGTCATGGGTGCCTGCAATGTCGATCTCGACGCG CTGCGCAAGACGCTCGTCGAATATGTCGATAACGAACTTTCCAACCTGATCACCGGTTAT GACGAGGATTCGAAGCCGACCTCCGGCTTCCAGCGCGTCATCCAGCGTGCCGTCATCCAC GTGCAATCGTCCGGCCGCGAAGAGGTGACCGGCGCCAACGTGCTCGTCGCGATCTTCGCC GAGCGCGAAAGCCACGCCGCTTATTTCCTGCAGGAGCAGGAGATGACCCGCTACGATGCC GTCAACTATATCTCCCACGGCATCGGCAAGCGCCCGGGCGTTTCGGAAGCGCGTCCCCCG CGCGGCGCCGAGGACGAAGCCGAAAGCAGCAAGCCGACGGCGCGCGGCGGCGAGGAAGAG GGCGGCCCCAAGAAGCAGCAGGACGCGCTCAAGGCCTATTGCGTCAATCTCAACGAGAAA GCCAAGGGCGGTAAGATCGATCCGCTGATCGGCCGTCACGCCGAGGTGAGCCGCACGATC CAGATCCTGTGCCGCCGTTCGAAGAACAATCCGCTCTATGTCGGTGATCCCGGCGTCGGC AAGACGGCGATCGCCGAAGGCCTTGCCAAGCGCATCGTCGAAGGCAAGGTTCCGGAAGCG CTCGCCGATGCAACGATCTTTTCGCTCGACATGGGCACGCTCTTGGCCGGCACGCGCTAC CGCGGCGATTTCGAGGAACGCCTGAAGCAGGTCGTCAAGGAACTGGAAGAATATCCGGGC GCCGTGCTCTTTATCGACGAGATCCACACGGTGATCGGCGCCGGCGCCACCTCAGGCGGC GCAATGGATGCATCGAACCTCCTGAAGCCGGCCCTGTCATCGGGCGCGATTCGCTGCATT GGTTCGACCACCTACAAGGAATACCGCCAGTTCTTCGAGAAGGATCGGGCGCTGGTCCGT CGTTTCCAGAAGATCGACGTCAGCGAGCCGTCGATCGAAGATGCGATCGAGATCATGAAG GGCTTGAAGCCCTATTTCGAAGAGTATCACCACCTGCGTTATTCGAACGACGCCATCAAG TCGGCCGTCGAATTGTCGGCCCGCTACATCTCCGACCGCAAACTGCCCGACAAGGCGATC GACGTGATCGACGAAACCGGTGCGGCGCAGATGCTGCTGCCGCCGTCCAAGCGCCGCAAG CTGATCACCGAAAAGGAGATCGAGGCGACGGTCGCGACGATGGCGCGCATTCCGCCGAAG ACCGTCTCCAAGGACGATGAAGCCGTGCTTGCCAATCTCGAGAAGGAACTGCGCTCGGTC GTCTACGGCCAGGATATCGCCATCGAAGCCCTTTCGACTTCGATCAAGCTGGCGCGCGCC GGTCTTCGCGAGCCGAACAAACCGATCGGCGCCTATGTCTTCTCCGGTCCGACCGGCGTT GGCAAGACCGAGGTGGCAAAACAGCTGGCCTCGTCGCTCGGCGTCGAACTTCTGCGCTTC GACATGTCGGAATATATGGAGCGGCACACGGTTTCGCGTCTGCTCGGCGCGCCTCCCGGC TATGTCGGCTTCGACCAGGGCGGCCTTCTCACAGATGGCGTCGATCAGCACCCGCATTGT GTGGTGCTGCTCGACGAAATCGAGAAGGCGCATCCCGACATCTACAATATCCTGCTGCAG GTCATGGACCACGGCACGCTGACCGACCATAACGGCAAGAAGATCGATTTCCGCAACGTC ATCCTGATCATGACGACCAATGCCGGTGCCTCCGAAATGGCCAAGGCGGCGATCGGCTTC GGTTCGTCCAAGCGCACCGGCGAGGACGAGGAGGCGCTGACCCGTCTCTTCACGCCGGAA TTCCGCAACCGTCTCGACGCGATCATTCCTTTCGCGGCGTTGCCGACGGCCGTCATCCAC AAGGTCGTGCAGAAGTTCATCATGCAGCTGGAGGCCCAGCTTTCCGAAAGGAACGTCACC TTCGACCTGCACGAGGATGCGATCGCCTGGCTGGCGGAAAAGGGTTACGACGAGAAGATG GGCGCCCGCCCGCTTGCTCGTGTCATTCAGGATACGATCAAGAAGCCGCTCGCCAACGAA ATCCTCTTCGGCAAGCTGAAGAAGGGTGGTGTCGTGAACGTCACTGTCGGCCCGAAGGAA GATGGCAAGCCCGGCATCGTGCTCGAAGCCATTTCGGAAACGGCGCCGATCAAGCCGAAG CCCGAAGCCGAGGTCGTGCATCCCGAAGGCGATGATGGGGATGACGGCGAGCTGAAGACG AAGGCAGCCCGCAAGACCCGCGCCAAGGCGGTGCCGCAGGCCGAGCCTGAGGTTCGCGAC GCTCCGAAGAAGGGAAGCGCGGTTCCGAAGGTTCCGCGCAAGAAGTAAGATACCGTCACC GAATTGGAAAAGGCCGCGTCACCGCGGCCTCAGACTGCTGACAAACCACTGGCCCCGTCC AGGGGTTTGTGATTCATTGGGACATGTTGAAGAAACCTGCCCCCACCCAGACGGCTCTTG AGATGGTGACGCTCGACAGCCTGGTGCCAAAGGATCACGTGCTTCGCAAGATCGATGCGG TGATCGACTTTTCCTTCATCCATGGCCGGGTTGCGGGGCTTTATTGCGCCGACAACGGCC GCCCGCCGCTCGATCCGACCTTGATGTTCAAGGCGCTGTTCATTGGCTACCTGTTCGGCA TCCGCTCGGAGCGTCAGCTGGTGCGCGAGATTGAGGTCAACGTCGCCTATCGCTGGTTTC TGCAGATGAAGCTGACGGATGGTGTGTTTGACGCCTCGACGCTGTCGCAAAACCGCCGCC GGCGCTTCAACGACACCTCGGTTGCACAGGACATCTTTGATCATATCGTTGAGCAGGCGA TCCGTCATGGCCTGGTGGATGGCACGGTGCTTTATACGGATTCAACGCATCTGAAGGCGA ATGCCAACAAGGGCAAATATGATCTTGCGATGATCGAAAAGTCGCGTTCCGATTACTGGG CCGACCTCGACCGAGCGATTGAGGCCGAGCGGGCACTCCACGGCCAGAAGCCCTTGAAGG AAAAAGAGCGCGAGCCGGAGGTGAAGGAAACCAAGGTGTCGCGCACCGATCCCGACAGCG GCTACATGGTGCGCGACGGCAAGCCGAAGGGCTTCTTCTACCTCGATCACCGCACGGTGG ATGGCAAGCTGGCGATCATCACCGACACCCATGTCACGCCCTCCAATGTGCATGACAGTA TCGTCTATCTCGACCGGCTGGACCGGCAGCGTGAGCGGTTCGGCTTCGAGGTCGGTGCTG TCGGGCTGGATGCCGGCTATGCGACATCCGGCATCGCCAAGGGTCTTGAAGACCGCACCA TCCTCGGTGTCACCGGCTATCGCAATCCGACCCCGCCCAGAGCCGGCATGATGCGCAAGT CGAAATTCGGCTATGAGCCCGAGACAGACGGTTACCGCTGCCCCGAAGGCCAACTGCTCG CCTATGCCACCACCGACCGCAACGGTTATCGCCACTATCGTTCGGATCCTGCCATCTGCC GCGATTGTCCGTTGTTGGCCTCCTGCACCAATAACGCCACGGCGACACGTACCATCACCC GCCATGTCTGGGCCGATGCCCGCCAGCGCACCGATGCCAACCGCCTGACCCCTTGGGGCA AGGCAATCTACAAGCGGCGAAAGGAGACGGTCGAACGTTCCTTCGCCGATGCCAAGCAGC TTCACGGGCATCGCTATGCCCGCTTCCGAAGTCTCACCCGCGTCTCATGCCAGTGCCTGT TGGCTGCGGCCGCCCAAAACATCAAGAAGATCGCAATGGCGCTCACCACAGCGTCAAAAC CAGCCATGGCGTGAGCCAATACATCCTTCTTCACCTCTGCCAAAACACTACCGCCAAAAA AATCCAATCCGCAAAAACAAAACCCGCCGAAAAATCGACGGGTTTGTCAGCGGGCTGAGG CCGCGTCACCGCGGCCTTTTTCGTTTTCTGAACCCTGCATCACCACTCCTCAGTGCGCCT AGTGTGGTCGACCGCGACAAAGTCGGTGACCTCGGACATTGTAGGTGTCGTAGCCAGCCT GCTGCATAATTCCTTAAACCGGATCGATTTAAGGATAAAACTAAAGACTCAACAGGCTAG CGAGTTGACGCGCTATGTAGACGTCGGCGAACATCGGCCGGTGCAATATCGAAACCGGTT GCACGGACACCACTCCAGTTGATACCGACCCTCACGTTATCCTGCTCCAAACCTTGCAGC CAACGATCAAGAAACACCTCCACTGGCAGCTTGTGCGGAATGAACCCCCGATAAGCGGGC ACTTGATCAACAATGCGTCGAGCCCGAATTTCTGACGACCAGAACGGCATGGCCGCTTCA TTTGACTGGTTTGTTGACGTTGGGAAGCCTGCTTCGTCACGAATAGTCCATACGAACCCA CTCTGTGTGACTTCATTGAAAAATGCGTGGGTGTGGGCTGCCACAAGGCTCATAGGACTA CGTAGGTGAGTAAATGAAGAGATCTGTGTCGGTTTGAGCGGCCATCAAAGCGCGCCATGC CGCGCTCGCCGAGGCGAGATCGATACTTTTGAATAGCTTTGCGAACTGGTCACGAAGCAC CTTCAGCTTATTTCCAGCACTATATCCAGGAAATCACCGTTTTAAGCAAGGACAGGCTAT CAGTTTGAGTCCGCCCTCGCGTGACATCGGCTGGTGACCCTACGCCAGCGCGGCCCTGAT CTTTTCGGCATTGGCGGCAAGCACGGCGCCGTCCTCCATCTTGCCGGAATGCGGCTTGAG GGCGGTGCCCTCATGGCGGGGGATGACGTGGAAATGCAGATGGAACACCGTTTGCCCGGC GGCCGGTTCGTTGAACTGGGCGATGAACACGCCGTCAGCATCGAAAACACCCTTGACCGC ATTGGCGACCTTCTGGACGACCGTAATCGCATGGGTGAGGGTGGCGGGATCGGCATCGAA GATATTGCGCGACGCTGCCTTCGGAACGACGAGCACGTGGCCCGGCGCCTGCGGCATCAC ATCCATGAACGCGACGGTATGCTCGTCCTCATAAACCCGGTGCGAAGGGATTTCGCCGCG CAGGATCTTGGCGAAGATGTTGCTGTCGTCATAGGCGGCTGGGCTGGTCAT
Region | 3 |
Region Length | 8.6Kb |
Completeness(score) | incomplete(30) |
Specific Keyword | virion,transposase |
Region Position | 1907906-1916536 |
# tRNA | 0 |
# Total Proteins | 9 |
# Phage Hit Proteins | 6 |
# Hypothetical Proteins | 0 |
Phage + Hypothetical Protein % | 66.6% |
# Bacterial Proteins | 3 |
Attachment Site | no |
# Phage Species | 6 |
Most Common Phage Name(hit genes count) | PHAGE_Agroba_Atu_ph07_NC_042013(2) PHAGE_Yersin_fHe_Yen9_04_NC_042116(2) PHAGE_Serrat_BF_NC_041917(2) PHAGE_Strept_9871_NC_031069(1) PHAGE_Cronob_vB_CsaM_GAP32_NC_019401(1) PHAGE_Strept_9872_NC_031094(1) PHAGE_Mycoba_ZoeJ_NC_024147(1) PHAGE_Mycoba_Milly_NC_026598(1) PHAGE_Staphy_phiPV83_NC_002486(1) PHAGE_Lactoc_P162_NC_024214(1) PHAGE_Strept_phiSASD1_NC_014229(1) PHAGE_Mycoba_Bactobuster_NC_031279(1) PHAGE_Mycoba_Mufasa_NC_028759(1) PHAGE_Strept_9874_NC_031023(1) PHAGE_Staphy_Pvl108_NC_008689(1) |
First Most Common Phage # | 1 |
First Most Common Phage % | 22.22% |
GC % | 58.73% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains a intact or incomplete prophage based on the above criteria. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
Questionable (score 70-90)
Incomplete (score < 70)
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains a intact or incomplete prophage based on the above criteria. |
Score: | The score of the region based on the above criteria. |
# Total Proteins: | The number of ORFs present in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
GC %: | The percentage of GC nucleotides of the region. |
Criteria for scoring prophage regions (as intact, questionable, or incomplete):
Method 1:
- If the number of certain phage organism in this table is more than or equal to 100% of the total number of CDS of the region, the region is marked with total score 150. If less than 100%, method 2 and 3 will be used.
- If the number of certain phage organism in this table is more than 50% of the total number of CDS of the region, that phage organism is considered as the major potential phage for that region; the percentage of the total number of that phage organism in this table in the total number of proteins of the region is calculated and then multipled by 100; the percentage of the length of that phage organism in this table in the length of the region is calculated and then multipled by 50 (phage head's encapsulation capability is considered).
- If any of the specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin') are present, the score will be increased by 10 for each keyword found.
- If the size of the region is greater than 30 Kb, the score will be increased by 10.
- If there are at least 40 proteins in the region, the score will be increased by 10.
- If all of the phage-related proteins and hypothetical proteins constitute more than 70% of the total number of proteins in the region, the score will be increased by 10.
If the region's total score is less than 70, it is marked as incomplete; if between 70 to 90, it is marked as questionable; if greater than 90, it is marked as intact.
gi|00000000|ref|SINM01000001.1| Rhizobium leguminosarum strain SM113 chrom_SM113, whole genome 5037985, gc%: 61.05%
Download details as .txt file: detail.txt file_download
Hits against Bacterial Database or GenBank File
Region 1, total 12 CDS
# | CDS Position | BLAST Hit | E-Value | Sequence |
---|---|---|---|---|
1 | 559068..559673 | PHAGE_Sinorh_phiM9_NC_028676: DNA-directed RNA polymerase specialized sigma subunit; ELH61_02880; phage(gi966199376) | 3.19e-12 | Showinfo_outline |
2 | 559670..560473 | anti-sigma factor; ELH61_02885 | 0.0 | Showinfo_outline |
3 | complement(560568..561131) | YecA family protein; ELH61_02890 | 0.0 | Showinfo_outline |
4 | complement(561128..562701) | PHAGE_Stx2_c_1717_NC_011357: transposase; ELH61_02895; phage(gi209447153) | 5.23e-19 | Showinfo_outline |
5 | complement(562750..563106) | PHAGE_Stx2_c_1717_NC_011357: transposase; ELH61_02900; phage(gi209447152) | 1.21e-22 | Showinfo_outline |
6 | complement(563103..563573) | PROPHAGE_Ralsto_GMI1000: ISRSO10-transposase ORFA protein; ELH61_02905; phage(gi17546153) | 4.01e-05 | Showinfo_outline |
7 | complement(563657..564519) | PHAGE_Entero_phi92_NC_023693: Phi92_gp066; ELH61_02910; phage(gi726646999) | 4.09e-84 | Showinfo_outline |
8 | complement(564516..565202) | PHAGE_Entero_phi92_NC_023693: Phi92_gp064; ELH61_02915; phage(gi726646997) | 1.83e-12 | Showinfo_outline |
9 | complement(565204..566259) | PHAGE_Entero_phi92_NC_023693: Phi92_gp067; ELH61_02920; phage(gi726647000) | 3.82e-114 | Showinfo_outline |
10 | complement(566273..566839) | PHAGE_Entero_phi92_NC_023693: Phi92_gp065; ELH61_02925; phage(gi726646998) | 6.78e-41 | Showinfo_outline |
11 | complement(567109..568029) | PHAGE_Synech_S_CAM7_NC_031927: minor capsid protein inhibitor of protease; ELH61_02930; phage(gi100226) | 1.06e-100 | Showinfo_outline |
12 | complement(568034..569116) | PHAGE_Synech_S_SKS1_NC_020851: GDP-D-mannose 4,6-dehydratase; ELH61_02935; phage(gi472340900) | 2.11e-178 | Showinfo_outline |
>559068..559673
RhizobiumleguminosarumMSVARSIFPAYSRARMESPVHPAVPRDAGLAEAAIASEADIRSGLTENLARLWRYGLVLSHQRDVADDLVQATCLRALERADQFIPGTRLDRWLFSILHSIWLNEIRSRRVRQGQGFVDAGETLTFDGAHDTETHVMAHQVLKQVNALPEAQRTVVFLAYVEGLSYREVAGILDIPIGTVMSRLAAARAKLSGAGPEGGRQ
>559670..560473
RhizobiumleguminosarumMNTKHTIPSDEDLTAFIDGELTAEEAARIQTMVEEDESTAERLEFLARASLPFKQAFAPLLSEAPREKLETILAAIPAQPSARPASAPAFASRRRFLGALAASLVAGIAIDRAVIGIGRSFSAKDENSEWRAVVADYISLYTPETLAGPVPAREDQAAQLGPLDEKLGLSLSPEAVSLPGIDFKRALLLQYDGKALAQVAYLDPETGPMALCIVRSDAGPKAPDVENRKGMNVVYWSNETHAFMLIGRIPGDRMKELGEDVRRRLSA
>complement(560568..561131)
RhizobiumleguminosarumMTQNGQETTARQKLDDEAFEAFIRGRRPASPIWSMSGLDGYLTALIIGPKFIDPRQWIPELTGPDALNLPMETTEHQAVQTIVAEYNRISASLSETPKDHRPRFTRIDDQTFDIFDWDLCFLLGTGYAPRLWQPVLRGHAVTGDIIAPIRKLGEAKRKATRQDAADVAEALVNIRTYFMPKRTKQKF
>complement(561128..562701)
RhizobiumleguminosarumMPLRHDPLPQDAAQLTRIILSLNEENADLKARVAFLEGQLFGAKSEKMTTIDPTQAILDLGDLSDIPVAANDDVARVGEDKTQARRSPARNIGHLLKHLPRYDELIEPESKICPCCSFELHCIGTDVSEALDIVPAVVRVKRTIRPRYACRACENAIVQAPAPARVMDGGMVTTAFAAHIAVSKFAWHLPLHRQAQMLASCGVIIDRGTLAPGSRGSPGGLSFSTMRSPPSSARSRGCSVMRRRFRGSIRGANEPRSASYGHKRSTIAHGMVRHRRQSRTSSPKAAARARSRGNYRHLPACCKLMDTKPIKPWSNVGARAMSPLCGWPSALLTPGESSLMSSNPALRKPCRSLPGLPKSIASKRTCAGEMPIPGLSGVARQLPSGNRPSLPSGTRCRRNRRLARRSPTHSTTGVGQPSLMMAGLRWTPMSSNVQNPWPRERTRYLWAASGVESPSLSWHRSSTRQSLTAWIPRTGLPMCWSGSSLATKPTKWKVSCLGPGRLSVKRHSRSDELHGA
>complement(562750..563106)
RhizobiumleguminosarumMIGLSPGGVKIMVATRPVDFRRGMNGLVALVASALAADPYCGDVFVFRAKRLDRLRCIYWDGSGMILATKWLEAGKFVWPPIRDGAMQMTREEFSLLLAGIDWTRVKQNPVKRPLKAG
>complement(563103..563573)
RhizobiumleguminosarumMVGDRADAMLEVMDEGMHEARHEGKYRRIEVITGRRQRRNWTDEEKARILAESAETDVNISAVARRWGVNRGLLNVWRRDAGLTSRRSAKACAQQAIFVPVTVVGERTSPETLPSDAAHFAAGRIEIEVAGARLTMIGSVAPELAQAVVTALRGRR
>complement(563657..564519)
RhizobiumleguminosarumMKGIVLAGGSGSRLHPMTHSVSKQLLPIYDKPMIYPLTTFMLAGIREVLIISTPHDMPLFQRLLGDDSDWGMSLSYAAQQSPDGLAAYIIGADFVAGGPSCLILGDNIYFGHGLPELLDEGVPKGDGATVFAYHVRDPGRYGVVEFGSDMTAISIEEKPANPKSHWAITGLHFYDADVVNIAADLKPSPRGEYEITDVNKTYLERGKLRVSMMGRDMPGSIPARPKVSLRPASSSARLKNARASKSLVRRKLRWRKASSRRSTSPRLPPKRAGITASTCVVWWIAG
>complement(564516..565202)
RhizobiumleguminosarumMRIVVTGKNGQVASALQALYAGGTEVIAVGRPELDLLEPSMVSEIIAKIKPDVVVSSAADTAVDKAESDEAAAFAINRDGARAFAAAAAELSLPIIHLSTDYVFDGDKPERYVESDPVGPASLWQIEARGRICGRRSQESCDPADGLGLFDVWPQLRENDASRKRPADVSPSRTLRPLTIPQQRNGHPIPDSVAISRAFMACACRSGGYQARTAVTKLLEEPKEAV
>complement(565204..566259)
RhizobiumleguminosarumMRVLVTGGAGFIGSALVRYLVSDIGAEVLNIDKLTYAGNLASLKAVENAPNYRFLKADICDRSAVSNAFEEFRPDYVVHLAAESHVDRSITGASDFIETNINGTFSMLEAARQYWQDLPAYEKAAFRTLHVSTDEVYGSLGEDGLFAETTPYDPSSPYSASKAASDHLATAWERTYGLPVIISNCSNNYGPFHFPEKLIPLIILNALERKPLPVYGSGSNIRDWLYVIDHARALWLIVQRGRPGEKYNVGGRNERRNIEVVERVCAIMDEVRPGTAPHSDLINYVTDRPGHDARYAVDATKLETELDWRALENFDSGIRKTVEWYLENAWWWQPLRERAYSGERLGVLKKV
>complement(566273..566839)
RhizobiumleguminosarumMHTLNVEQLAIEGVKKVTPARFGDSRGYFSEVFKDSWFRSNVADVSFVQDNESLSAQPGTVRGLHFQLSPFAQGKLVRCLRGALLDVAVDIRHGSPTYGKWVSAELSPENGEQLWLPAGFAHGFVTLQPDTKISYRVTAPYSAEHDRGVRWNDAEIGIEWPQMDAYVLSDKDNKQPLLSELPAYFQFS
>complement(567109..568029)
RhizobiumleguminosarumMNRDVKIYVAGHRGMVGSAIVRRLKAGGYMNIVTRSHAELDLVNQAAVAEFMKAERPDYIFMAAARVGGIHANNVYRAEFLYQNLMIETNVVHAAWQAGVERMLFLGSSCIYPRDCPQPIREEYLLTGLLEQTNEPYAIAKIAGVKLCESYNRQYGTRYVSGMPTNLYGPNDNYDLDSSHVMPALIRKVHEAKIRGDRQLVVWGSGRPMREFLYVDDMADACVFLMEKDVNEGLINIGTGEDITIRELAEAIMGVVGFTGEIVYDQTKPDGTPRKLMSVDRLSALGWKATTSLSDGIARAYADFMS
>complement(568034..569116)
RhizobiumleguminosarumMKRALITGITGQDGSYLAELLIEKGYEVHGIKRRTSLFNTDRIDHLYQDPHDTNRRLVLHYGDMTDSSSLVRIIQQVQPDEIYNLAAQSHVAVSFEEPEYTANSDALGALRILEAIRILGLEKKTRFYQASTSELYGLVQEIPQRETTPFYPRSPYAVAKLYAYWITVNYREAYGIYACNGILFNHESPVRGETFVTRKITRALARIKLGLQDCLYLGNMDAKRDWGHAKDYVEVQWLMLQQDEPEDFVIATGVQYSVREFVDAAAHEIGLPISWKGSGAEEKGYDENGRCIVAVDPRYFRPTEVETLLGDPSKAKEKLGWEPRITFKQLVAEMMREDLKSAERDELVKRHGFSAYDYHE
Region 2, total 47 CDS
# | CDS Position | BLAST Hit | E-Value | Sequence |
---|---|---|---|---|
1 | 1811452..1811463 | attL | 0.0 | Showinfo_outline |
2 | 1811768..1814344 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: DNA-directed RNA polymerase; ELH61_09290; phage(gi712913252) | 2.91e-174 | Showinfo_outline |
3 | 1814671..1815360 | PHAGE_Acinet_vB_AbaP_Acibel007_NC_025457: hypothetical protein; ELH61_09295; phage(gi712915495) | 2.36e-21 | Showinfo_outline |
4 | 1815570..1816391 | hypothetical protein; ELH61_09300 | 0.0 | Showinfo_outline |
5 | 1816480..1817190 | PHAGE_Pelagi_HTVC011P_NC_020482: single-stranded DNA-binding protein; ELH61_09305; phage(gi460042334) | 8.49e-11 | Showinfo_outline |
6 | 1817198..1817608 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: putative endonuclease I; ELH61_09310; phage(gi712913258) | 5.50e-48 | Showinfo_outline |
7 | 1817601..1818221 | PHAGE_Escher_vB_EcoP_GA2A_NC_031943: hypothetical protein; ELH61_09315; phage(gi100022) | 3.40e-57 | Showinfo_outline |
8 | 1818315..1818992 | PHAGE_Sulfit_pCB2047_C_NC_020856: DNA binding protein; ELH61_09320; phage(gi472341794) | 1.44e-05 | Showinfo_outline |
9 | 1819243..1819425 | hypothetical protein; ELH61_09325 | 0.0 | Showinfo_outline |
10 | 1819429..1821057 | PHAGE_Pelagi_HTVC011P_NC_020482: DNA primase; ELH61_09330; phage(gi460042337) | 9.80e-121 | Showinfo_outline |
11 | 1821226..1823079 | PHAGE_Pelagi_HTVC011P_NC_020482: DNA polymerase family A; ELH61_09335; phage(gi460042339) | 2.10e-165 | Showinfo_outline |
12 | 1823084..1823290 | PHAGE_Morgan_vB_MmoP_MP2_NC_031115: membrane-associated initiation of head vertex; ELH61_09340; phage(gi100034) | 3.76e-05 | Showinfo_outline |
13 | 1823363..1823779 | hypothetical protein; ELH61_09345 | 0.0 | Showinfo_outline |
14 | 1823867..1824733 | PHAGE_Pelagi_HTVC011P_NC_020482: exonuclease; ELH61_09350; phage(gi460042342) | 8.29e-50 | Showinfo_outline |
15 | 1824712..1825278 | PHAGE_Sphing_Lacusarx_NC_041927: ssDNA binding protein; ELH61_09355; phage(gi100044) | 1.32e-25 | Showinfo_outline |
16 | 1825304..1825759 | hypothetical protein; ELH61_09360 | 0.0 | Showinfo_outline |
17 | 1825965..1826171 | PHAGE_Pelagi_HTVC011P_NC_020482: hypothetical protein; ELH61_09365; phage(gi460042344) | 1.73e-05 | Showinfo_outline |
18 | 1826171..1826398 | hypothetical protein; ELH61_09370 | 0.0 | Showinfo_outline |
19 | 1826408..1827946 | PHAGE_Pelagi_HTVC019P_NC_020483: head-to-tail joining protein; ELH61_09375; phage(gi460042400) | 9.75e-157 | Showinfo_outline |
20 | 1828059..1828556 | hypothetical protein; ELH61_09380 | 0.0 | Showinfo_outline |
21 | 1828566..1829339 | PHAGE_Pelagi_HTVC011P_NC_020482: scaffolding protein; ELH61_09385; phage(gi460042348) | 5.57e-51 | Showinfo_outline |
22 | 1829349..1829579 | hypothetical protein; ELH61_09390 | 0.0 | Showinfo_outline |
23 | complement(1829598..1830491) | hypothetical protein; ELH61_09395 | 0.0 | Showinfo_outline |
24 | 1830869..1831846 | PHAGE_Pelagi_HTVC011P_NC_020482: capsid protein; ELH61_09400; phage(gi460042352) | 2.62e-80 | Showinfo_outline |
25 | 1831967..1832227 | hypothetical protein; ELH61_09405 | 0.0 | Showinfo_outline |
26 | 1832245..1833879 | PHAGE_Rhizob_RHEph04_NC_041908: hypothetical protein; ELH61_09410; phage(gi100013) | 0.0 | Showinfo_outline |
27 | 1833941..1834540 | PHAGE_Pelagi_HTVC011P_NC_020482: tail tubular protein A; ELH61_09415; phage(gi460042353) | 4.88e-31 | Showinfo_outline |
28 | 1834540..1836921 | PHAGE_Pelagi_HTVC011P_NC_020482: tail tubular protein B; ELH61_09420; phage(gi460042354) | 0.0 | Showinfo_outline |
29 | 1836921..1837385 | PHAGE_Pelagi_HTVC011P_NC_020482: hypothetical protein; ELH61_09425; phage(gi460042355) | 5.96e-28 | Showinfo_outline |
30 | 1837378..1837908 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: hypothetical protein; ELH61_09430; phage(gi712913288) | 3.02e-24 | Showinfo_outline |
31 | 1837908..1839932 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: hypothetical protein; ELH61_09435; phage(gi712913289) | 5.03e-21 | Showinfo_outline |
32 | 1839942..1843715 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: putative internal virion protein; ELH61_09440; phage(gi712913290) | 9.78e-74 | Showinfo_outline |
33 | 1843732..1845030 | PHAGE_Pelagi_HTVC011P_NC_020482: tail fiber; ELH61_09445; phage(gi460042359) | 2.01e-22 | Showinfo_outline |
34 | 1845310..1845609 | hypothetical protein; ELH61_09450 | 0.0 | Showinfo_outline |
35 | 1845609..1845857 | hypothetical protein; ELH61_09455 | 0.0 | Showinfo_outline |
36 | 1845857..1846078 | PHAGE_Pelagi_HTVC011P_NC_020482: DNA maturase A; ELH61_09460; phage(gi460042367) | 1.06e-15 | Showinfo_outline |
37 | 1846204..1847892 | PHAGE_Pelagi_HTVC011P_NC_020482: DNA maturase B; ELH61_09465; phage(gi460042368) | 0.0 | Showinfo_outline |
38 | complement(1848112..1849056) | DUF2971 domain-containing protein; ELH61_09470 | 0.0 | Showinfo_outline |
39 | complement(1849145..1849750) | hypothetical protein; ELH61_09475 | 0.0 | Showinfo_outline |
40 | 1850160..1850381 | hypothetical protein; ELH61_09480 | 0.0 | Showinfo_outline |
41 | 1850686..1851435 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: hypothetical protein; ELH61_09485; phage(gi712913236) | 6.89e-103 | Showinfo_outline |
42 | 1851543..1851755 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: hypothetical protein; ELH61_09490; phage(gi712913238) | 9.61e-11 | Showinfo_outline |
43 | 1852009..1852302 | hypothetical protein; ELH61_09495 | 0.0 | Showinfo_outline |
44 | 1853015..1853365 | PHAGE_Synech_S_CBP2_NC_025455: MarR family transcription regulator; ELH61_09500; phage(gi712915375) | 5.70e-10 | Showinfo_outline |
45 | complement(1853540..1853624) | tRNA | 0.0 | Showinfo_outline |
46 | 1853641..1854654 | PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431: putative integrase; ELH61_09510; phage(gi712913246) | 5.15e-28 | Showinfo_outline |
47 | 1854766..1854777 | attR | 0.0 | Showinfo_outline |
>1811452..1811463
CAGATTTAGGTT
>1811768..1814344
RhizobiumleguminosarumMTFEQSILDNPLFERQLDLEQEMRTSGIDRFRKNVDKASVKGAMSDTMAVNRLVVEAHEKVVAAINEFLTEAKSGAAGRRHTAVVFIEKLDVDTVANITARVILDEVTRKSNLTKTSLAIGSMLENEFNSRKFEEEMPKAHKKFLKKAQKESLDRRKWSHLLYPARLLGVELEEWSEKDRILVGLKLVDLFIQSTGLIEREVVQSARFGTLELLVANEATLKWMETENSRLEHLFPIYMPTIVPPKPWTSPFDGGYFTAFRRLKLVKTHNQQYLEELANRDLSQVYEAINALQDTAWAINTQVLDVIRTLYETGAGVAGLPQADKLRMPLRPHWLPEGKDRMSTEDMTEEQLEEFKAWKAETHRTHVENAAISGRRASFLRTLGVAEKFKDEEAFFYPHTLDWRGRAYPLPLYLTPQGNDLQRGLLTFANAVPIHDEEAAEWLAIHGAGCWGYDKVSLEERVQWVLEHEVEIIASAQNPYDNHFWMGADKGEKKWQFLAFCFEWAAFKEEGYGYLSSLPVQMDGTCNGLQNFSAMLLDEVGGAAVNLIPADEPQDIYQKVCDIVCEQLARDLESTELVTIKGKTDDGVEFEKVVCSVADMANGWLPKMGRKVTKRPVMTLAYGARRFGFVSQVDEDTIKDWRSGSPESYPFISQGDDGKPVDYGYKAAQYMGGLIWDSVGEVVVKARQAMDWLQAVSKVASNEQLPINWTTPVGFLVQQAYRVPNTKRVDTTFNSQRIRLTYQHGVGKIDGRRQASGISPNWVHSLDAAHLMKTIGRCRREGIASFSMIHDSYGTHAGNAWAMARYLREEFVQMYSQVDVLTRFKEELEAQTGEQLPDLPAKGNLDLQQVLDSPFFFA
>1814671..1815360
RhizobiumleguminosarumMAKLTPAIYGAAPVDLGCMELDTSEMMFWLYLPIKMPGQFMPKLPANLKKYERIVDAVMDNVIDDDTINPQGRRWTESYVYLSVKITHVTPDAPGNRPGWHSDGFLTDDLNYIWTDRNPTEFFITDALFGTEPDHRSSMKQFDWMARHLLRSKGDRLEHAKVNHLYRLDQTNIHRVSLNVESGKRAFIKVSVSDKPYVQLGNSINHDLPQHPLPTLKRQADRNCPQGNK
>1815570..1816391
RhizobiumleguminosarumMANKNFHKTVSVAASAHNCNSVGEITLATLNADSYINLITSARSLTISPVEAKKLRDLLVEAYPLEAPAAAAKRTKFKAGDKVTYKSIVGYGARGMDGRKGSIKEVLTNEWYMVNFTGGPFDNLIKVHSDYLVAQPVPSVGGLKVGDRVRRLQGSSKGSIFTVTELNSSLTELKVDGMTGWRMPKYFELVTDAPASAPEPLAKIDTGRFLVVALEGANYVPGSKPKVHVTDFSARVEAERLARDVGGTYHVFRAVFEASREKPVIPPVKTTKL
>1816480..1817190
RhizobiumleguminosarumMAERKKNPSLISPRGPLKFPKIDKVDYGTKEYPKPNGEYSTKQVLEADAPATKAFIAALMPHYQAAMEEAAAKFKELKIETRKKLGKVTENDLFTTLYDQETEQPTGYIEFKFAMAASGERKDKTKWSAKPAIFDAKGKPMTKVPEIWSGTEAKVSFECQPYFIPGTGAAGLKLKLKAIQIIELVSGGQRSASSYGFGAEDGYEYEEPATEENEGGFGDESGEDTSSKTIDDDIPF
>1817198..1817608
RhizobiumleguminosarumMTYRTSAAGLRAVGIREGFRSGLEDKVGDQLRAQGIDPRYEEVVIPYVKPERKAKYTPDFQLPNGIFIETKGRFVTEDRQKHLLVKTQHPELDIRFVFSNPKARISKTSQTTYADWCLKHGFKFAAKIIPQEWIDE
>1817601..1818221
RhizobiumleguminosarumMSSMFTPRKVTSYLVVHCSATQPKMDIGAKEIRQWHREKGWIDIGYHFVIRRDGTVELGRPENVVGAHVENHNSNSIGICLVGGVDAKGKAENNFTPAQFATLAIKLRELRSKYPGVTVQGHRDFPGVKKDCPSFDTRKWINETGVFETSHVPAEPDARAVEITSATPTIFSLAKKYGTTVEAILKVNPHVDPAKLKLGQVIRLPG
>1818315..1818992
RhizobiumleguminosarumMNTFVAGAHVRHKDHPEYGNGRIVYVHTNGKSLAVEFENSTGLRHDCAGHAKYDFCRWSRAASLELINPFKFGDTVLMCAFDARSNPAFYYDHDFANQQGIIDHADKDGYVTVLVASLGHTQFVPIADLTLVKVDPVEAVKKATPKFKTIVFQSGSQCDRLVKYLLAGNSITPLVARQLFGVERLAARILEIKKAGHKVTSTIKTDVNGKVYAEYALRKAGRVGA
>1819243..1819425
RhizobiumleguminosarumMKAALHFLLNFLTIAGLGLLAILVASSTGAAFVAFGPLIGSVVAFFWIAVGVSFERSERL
>1819429..1821057
RhizobiumleguminosarumMQSESSFVRKEPCPACGSRDNLGRYSDGHGHCFGCDYYEPGDGSVPQTSRKSKMSRELISGGEYRALSKRGITEETCRKFGYQIGHFKDQLVHIAPYYDDEGNLTAQKVRFADKTFTVTGNMKPALLFGQNLWSGGGRKVVITEGEIDALSVSQVQGNKWPVVSIQNGAQSAKKALSAALEWLTTFEEVVLMFDMDEPGREAAAACAALFPPGKCKIARLPDKDPNALLMAGKGDEIITAIWQAQVYRPDGVVAFKDIKEAARRPIEMGLRWFCDRLTKLTYGRRWGEVYAFGAGTGIGKTDFLTQQITFDVTELGQKVGVFFLEQMPTETAKRLAGKFAKRRFHIPDDGWTDAELDEALDKLDQDMLFFYDSFGATEWSVIRETIRYLAHSEDVKVFYIDHLTALAAAEDDERKALEQIMAEMAALAKELGIIIHLVSHLATPEGKPHEEGGRVMIRHFKGSRAIGFWCHYMFGLERDQQHEDERLRAVTTFRVLKDRYTGQATGEVIYLGYERETGMLYETSIEFGDETGSDFKDESAPF
>1821226..1823079
RhizobiumleguminosarumMRDPSSGFTISATSNDYSSDDPTIITDWSVEEALHALMNADVIIGHNIIKFDIPALQKVFPWFQPKGLIIDTLVCSRLIWSDIADHDLKQVRKGYPGKLVGSHSLKAWGYRLGVLKGDFGETSDWRYWSPEMQTYCEQDVEVTAQFYARIKKKEPSPKSLWIEHEFCKIIAMQERHGFAFNEEEAIKLYSQLVTRRLEIARELQVAFPPVEKTEVFIPKVNNKQRGYVKGEPFTKKWMVEFNPSSRQMIADRLQAMGWVPQEFTPSGQPKIDETILQALPYAQAKVLAEHFLVEKRIGQLAEGDQAWLKLVKKGRIHGSVNTNGAVTGRCTHSNPNVAQVPRVGSPFGAECRALFTTTSRWVLVGADLSGLELRCLAHFMALFDGGEYGRIVLEGDIHTVNQNAAGLPTRNDAKTFIYAFLYGAGDQKIGSIVAPDASPEEQKRIGKKLKRQFLAKTPALRRLREVVELKVLGFVPKARPLNVNPAYEHMWRQDSAKQWWFKAGAGGVLVGLDGRKLNVRSAHSALNTLLQSAGALISKAAMIFAYRELSTRGYVFGRDYAFVAHIHDEIQTECRPELAEEVGQIVVEGMRAAGTFFAFGCPIDGEFKIGNNWKETH
>1823084..1823290
RhizobiumleguminosarumMIEILKQALENPFKTKSNFARENADLIAMAASDGFLTTRIATGLYSRKWMITPVGLSHYYALTGLNHD
>1823363..1823779
RhizobiumleguminosarumMFGRSSSSSYSSSASRAENHLTINQQPHDAADAARLYGELQDKANDSITEIVGHQIADTKVEFVTLDTARDVLHFKDHVRVIFKINGKTFDTRVEIDDPIKLSEPRERVAYRAVAEGIANTLMDRSIFQIYQTFARKR
>1823867..1824733
RhizobiumleguminosarumMWSTTHTGSCTRMNTALSATSTGRQEKRRRASSSMTRTLLIDADVVAYVAASSLEVATDWGDGYWTWHVDEFEVQKKVKQIIDDTMEDLKGDSCKLCLTDSFGNFRKSVLATYKGNRSNIKKPLVLMKTKQWMIDELGAYFRPGLEGDDCMGILATMKGTDERIIVSIDKDMKTVPGKFCRYTDSKAKIIEYSEKEADYWHLYQTLMGDATDGYPGCPGIGPKKAEAILGPIDEFDLTEGWARVLAAFEKAKLTETDALTQARVARILRASDYDFKKKEPILWQPKAN
>1824712..1825278
RhizobiumleguminosarumMATKGKLIALYSDAAGSGKSEVAGTLIRHGYESVKFAGPLKNMARGLLGSMGFETVTVERMIEGDLKEAVIPGFKTVTPRQIMQTLGTDWGREAIDQDLWTKVAAAKIEGLRDKGVDVVVDDLRFPNEYDLIASLGGTLVQVVRADPSREAGGAYEGKLSGHLFHHIVHNNGTLRELYSKTLLLAQSI
>1825304..1825759
RhizobiumleguminosarumMKFLSGLVAFALVAILALAPAIAEARSSFGGFSGGSRFRSSSSFSSRSTTTYSRPSTSYSRPSTTYRPAPVYSSPSYRSYSSTTINQSSGGGGFFSSMVGSIAGYGIAQWLFGEDEKPAEQAPAAAPAPAAQAPVTGAVPTTTVQEAPKAQ
>1825965..1826171
RhizobiumleguminosarumMVDDQDRFPHIPKDLVEALDKRFPERTPSLNTSLDEIRWKGGERAVVRFLLEQYNRQNETVINEKVLS
>1826171..1826398
RhizobiumleguminosarumMCPPKVKTQKVEPVAQAAPPAQPAATVNQSAPQTPDELSPEQAAIKAKRKGRSSLRIPLDAGVGSGATGINVPQA
>1826408..1827946
RhizobiumleguminosarumMTGQTASGRYQQLSQARSAVLERGRASAKLTIPSLLPPAGHSESSSLPTPFQGIGARGVNNLASKLLLALLPPNSPFFRLMIDDFTLENLTKRKGMRAEVEKGLNKIERALMTEIETTAIRVSAFEALKQLLVAGNVLIYLPTEGGMRVFRLDRYVVKRDPMGNVIEIITREDISPDMVPEAMKGHVKSKSRSNEKTIELYTHIVRQRDKWTIRQEIKGMTVPGSRGSYPLDKCPWIPLRFTKIDCEDYGRGYVEEYFGDLLSLETLTQAIVEGSAAAAKVLFLVNPNGTTRMTDIAKAPSGAVRAGNAEDVSVLQLDKFADFKIASETINNIQQRLSFAFLLNTAIQRAGERVTAEEIRYMAGELEDALGGVYSILSQEFQLPLVRVLMFRLERQKKIPPLPKGVVKPTITTGLEALGRGHDMNKLTIFAQTASNIAALPPEISKADFLMRVGTALGIDMDGLVKTPEQLQQDQQQLMMQQLIEKLGPKGMDILRDQLKPEVQNGPQAQAQ
>1828059..1828556
RhizobiumleguminosarumMTSFGDAFKAARKAGKTTFKFGGKSYHTKTKDEMAKTKKAVPTPSPRPEAMKTDAQAAVDSAPKSAPKAAPKQDYPRPAKPVGIASANSAIGRAAAARENAPVLKIPSRANANANTVQKPRAPARGSSTPAEKQGPKPEEQQWFARKGSAISLGIARRRNAPVSK
>1828566..1829339
RhizobiumleguminosarumMENEVKEQDQIVVPGSDEHNALMVEKFQNQSGSTEQTNAQPAERPAWLPEKFEKPEDLAASYAELERKLSGGQQQEAPKVEANAEEAREAVNALGLDFDALGAEFAESGALSDDSYTKLAEKGLSRDIVDAYIEGQEAKAQLHRAEVLLAVGGEATYNEIANWAAGNLTNEELQAYNDQVESGNLTAAKMAVQGLKARFEAENGSEPQLLNGETGGNSAEVFRSTAELTAAMRDPRYKKDPAYRADVERKLSKSSLF
>1829349..1829579
RhizobiumleguminosarumMTALASIVALVWAHLDDIFAILFALQAVLVLISKLTPTPKDDAVAAKILSVLESIASVLSVKRKDFPAAPTSPGLY
>complement(1829598..1830491)
RhizobiumleguminosarumMSEVESAESVEKGENRGDDNIVSFQAFLEKAHPSLTKVVPDVWQTQVTLNAANRRRLTYPQIRLYCSICAGERNFRCLTDDAFGWNTTIANVHPLYTCGDCRDQVKHYALHLTFTDAGGAKVYKYGEIPPFGIPVPNRVLRLFGNADAKLFLKGRQCEDQGFGIAASAYYRRVVENHRNDLFDEIIKVSETVGAPAELIDELKAAKKEISFAKSIEKIKTGLPQGLLIDGHNPLNALHGALSEMLHNESDEDCLQNAQAIRVVLSDLVERIAMLKQDNKQLSDAVQRLLAKRKGATN
>1830869..1831846
RhizobiumleguminosarumMANANVTRIGQINGSGDVDALFLMQFAGEVLTAFEETNVALEHTMTRTINSGKSAQFPATGKVGGEYHVPGTEITGLNLKAAETVITIDDLLISHGFIANIDEAKTHYDLRSIYSTEMGRFLAKTMDKHLLQVGVLAARATNVVDGEPGGSVILTGESGLPSTPNFDANGDHLAAALFIAAQKLDEKDVPEDERVAFVRPAQYYNLVKATNNLNKDWGGMGSYAEGSILKVAGIQIVKTNHLPNTDLSAATGVEAGSGLKYRGNFTNTSALVMHKQAVGTVKLLDMGMESAYDIRRQGHLMVAKYAVGHGILRPQAAVEIRNAAA
>1831967..1832227
RhizobiumleguminosarumMISSPTKCYKVAAGDIGAKELPVYSKALAIYVPDGATATISMVYLNNLDGEAVTRTFTAGNHYIAARIRRITAVSNASVEIHVETE
>1832245..1833879
RhizobiumleguminosarumMGVTVSPRFGRAPRRWLSGPPAWVPDLNRYMPAATGTRWPTGSTTNPWTYAAGLNYQCSKLFFGSPDYPTNDFLIPFVGFALTEGGNAPQETQGPTTDTLLDEAFFIHPDGTEYPILFGGQAAATITANTGIVYGQVTLPTWLPAWSIFGIRTIYHGNAGENRLGSYRIQRHRGEKFWGAGDLASIRALATANGPSTPALDPDNWYNTVGNATNSQQQAYGPAMVLAKGWDGRPVPLMLADSLAERQEIAASADERRNMGIWRRWLDQRDPVWGSLIPVVMGVPGAHSEYELAGSGASIATRRWAMIDYIRDTFNGGKNIWTFVLDQSGRNDTSSTLSLWQSRKFGLDDRVKARYPGVHMVGITIMPTFTSSDGGRTVAGYSTSAVWNPVTGTLASLNASIMSSPRFAKVIDMLPAFMSDTDPTKGPAAELFPLGNVIGHPGNQDGTTTWDTIRLPSTVPLGSRISFEYQPGQWASRTLSGRTDRGDGTADYRVAEIFATNVQDNATLLGHGMHTDFIHPALHGVLRTVSRIPQSEKAKFYPAA
>1833941..1834540
RhizobiumleguminosarumMLTPTTELEAINLMLSVIGESPVNTVEDTGLVDAVVARQILIQSSRDVQLVGWHWNTEIDYPIAASFPEGELTLPPNTLKVDTAGADAGLDLVQRGNRLYDRKNHTFNVGRTVYVEIVLLLPFDQLPEAARSYIVMRAARQFQERMVGSETIWQFNSRDELRAWANLMSSEAETQDLNVFNDNPSVRRVLDRTPPGGLV
>1834540..1836921
RhizobiumleguminosarumMAGALVSTTIPNLINGVSQQPYALRLASQCELQENAHSSVVEGLRKRPGTTHRAKITNAPAGELFTHTINRDRTEQYEVMVGNGDLKVYDLKTGAEKTVTFPNGRAYLTAADPRSSFKAVTIADYTFLINKTVTVEQDTTLSTSRAPEAVVWVKQGAYGTKYTVTLNGVSATVSTPDGSTASHINNIQTDVIASGLVSALNAAIGGFSFALNGSSIYIKRADNADFTINVTDSQGDQAMKLLKGTVQRFSDLPAKGFNGFAVEIVGDQSSSFDNYYVKFDTASGVASGVWVESVKGGEAIRLKASTMAHALTRNADGTFTFKQVEWIDRKTGDLDSSPMPSFVGKKMNDIFFHRNRLGFIADENVVFSRSGDFFNFFRSSATQVLDTDPIDAAVSHIKVSILQHAIPFNETLLLFSEQTQFQLGASELLTPETISINQTTEFECSLKARPVGSGRNIYFTFNRGNFSGLREYYVDGDTKTNDASDVTSHVPAYVPKDVSKMAASSSEDTIALISESERNSIYVYKYYWNEQEKLQSAWYKWTFPATDTILSVEFVESNLYLIIRRPDGVFLESMSVNPGYVDDGFDFGLNIDRKAKEDACTVSYNAVTNETTITPPYLLQAELLPASEADVIVSRAGDPIKKPGQLIPYTIDGNNMVVKGKLEKFIIGRSYVMRYRFSTFVIKEEAVGGGQMTVGEGRIQLRKATLTYDNSGYFRIEVTPLRRETYRYVFSGRVIGSAKNVIGQTAIDKGRFSFPLMSKNDLVTVDIVNDTFLPCAFLSAEWEALYVIRSKRL
>1836921..1837385
RhizobiumleguminosarumMLETRPSRPEDVTYLAPRLREADRQELLAAGAPGPEQSLRDGLMLSKNCISVVDDEDRAVAMFGVCPSPVEGLGYIWLLGSDDIKQNKTRFLRRSKQWVDTFHQDFTVLTNYVDQRNEVHITWLRWLGFKFLRIVNAPGPGNLPFYEFARIRNV
>1837378..1837908
RhizobiumleguminosarumMCDPLSMIGFAIGAAQQVVSYQAEKTAAEQQNQLYKENAARANQNARDQMFQTQQRMLQEQEKGAAEKMDTVREAREAKATATVAAGEAGVSGLSVDALLAEFDGRAAAANDRTDQNTEWTLSQLNNEMKGIRANAEDRINSVQRAAAPSFFNTGLKIAGVGLDSYNDFKVKQRSK
>1837908..1839932
RhizobiumleguminosarumMARLPGLRPIDEERRRSGGGQNRGRVRTPSADNIRVQGLSPNASPVDTYARPEQAPIGSNSWEALAKSLAGIQPSINNFLNVQAAEQQDDDVTAVRQAFLQKSPEDVRKAIKEGSVPGLTSLAGRELVGERLAYDRSLQIMASYQTDFDRQTGDVDAFVQERIKDDLAEFGNDKALMGAYTKQMTAFTEKLRNQSVDDKATFQQDVRQGNLFEKWSAKATYDRAEGKAPADVAGSMFGEFTKNQELLRVPFQKQQEMMLQLADQAATSGDYDLAKAILQHKREDGPYKGSLMTDVKVGDTATKLFARIDADQTRERLTAQAQEDEESLYSQGVAAAESGSILAIGDAQVRDKQGEMRTITADAQKKEVANRLIAKAADEAAYREKDPEKRPALARRLEKEKFVGSGLEHPVWFKAMNGAPGQMNLNAATGEIPPSAKDAFDTYQDLYKDSPQYLAKYLNKDALEFFESARLAEEVGNAGTPEAALRIAHMVTQDPNQMDEALKLKYDSIDSAVKSAVSNSTSWGQWVFGKQTAGNQNYVRSEIVRLAKQYALLGKDNDEAIEQAQQTFEKTHINVAGSYVKNDKRLPGDFEPLVNQYLSEFVEKHKGDLNYDIDDLTITQGNGTGAYMVVRKSDRMPADPASDDTFFSLNILNDLRTRNRDQKIKEVTAKQNAR
>1839942..1843715
RhizobiumleguminosarumMADIRSIITDAANRYGINPQDALEMAQIESGLNPHAQNKSSTAGGLFQFLDSTWAKYGKGASKYDPYANADAGMRLARDNINFLKKKLGRDITGGEMYLAHQQGAGGALNLLANPNTMAVDLVGRAAVLGNAGQTGMTAAEFANLWINKIGSTKVGNGPGLVMPGSMANSQNPGDFSVHDQGRVSASDVIPTMNTTRAEEVQQEKDRQAMMPSFGEAVATAVKNEWSVLTPFRALGHFDPEPDYKLTEDKLRTFGQSIPDDYLDEFEDAVSDEHAEAIRNRLLTQLEDNQKIASLGTAGTIISMGAALTDPGAIAATAAIGAVTGGFGVPAAVAARLGRVGMVGLAAAEGVAGNLATDIPLVAVDPTRDVSFDELKYSIGTGLVMGGVMGAFRRNPMFTEEAKQIAKIGQQMQEQAVSLPAGSRSAGAASVMGDNFTRSDTSNLIDDFKRLDPKGTFLNWRVDAVGQLMASKNPMAQTLARYLGEDGVRAAKGSGVVTEIAATERMQRRLRVAQINWYRGYDDAFKKFRKANGINAFQAKDAELKFKEQITDYIREENPSVRAQFPAEVKQAAGAFQAEMKSFWKEAQELGLTRTEAGVENYFPRYGHLAKATKLIREVGYSMDRNGGLTDLFAGAILKKQPGLDPKIAKRMGYAVLDRFQKLSSGEEMFGTGHLGFDLDDLEVELKNYLDDEQIANVKAWASRNEKKEGEASGPARMKARIMLDENHFADVMTKRDGVKRVNISDFYVKDPHTAFQLYARNMSGQLAMARIQVRDPVTGNLLIDGIKNGNDWTKLKNQIKSVGEATGANNTRDERNLDFLYSAITGTPLAGIDRGSDGATFLRMLRDFNFLRLMGQVGFSQVPEFGRQVAQVGVKTTFQAVPSFRHLIDMARSGKMTDEVAEELDAIGAFGTDYERTAHYLDTDELGVPVTSGSDSTIQRVAGAVNPKLHAMNRFVSMGSGMAPINRVFQKWSARAAAVKFTKMAMFGDKVDAERLRALGLDDATTKQIFEAIKTNATFKGGVKSPSKLQSLGIKNWDGNTLSAFEDAMFRLNRTMILENDPGQMHRWLAHPLGQMVMQFRTFAMSAHTKALLQGLNLRDGPALFGMLASSFLGAAVYAGQTHLNLIGRPDRDDQLKERLTWNKLGLAGFSRSSESALIPMAADIGWQFFDDEPLFDTRSSGLKTTVSSFLGNPTGDLISTGLAGAAGVTSAMVGDDYSQTDWQNLTRTLPFARMMGAVQFLNWVGSGLPRRELRD
>1843732..1845030
RhizobiumleguminosarumMAGPRGNSGAFFIFNRETLMALAYAQSLGDGVTNTFSVPFPYISKNHVQVKVDGVAVPYTWLSDTSIQISPAPAADKIVDRRRVTPRDTLLVDFVDGSTLVESDLDLSALQVFYLAQESFDLGESSLGVTDDGSFSALGRRISNVLTPTLPNDVATKQFVETGVASGVTVATQKASEASASAVAAALSETNAAGSATSANASKITATTKAGEAATSATNAAGSATTASTKASEAASSAVEAQGYRDTAATKASEAAASAAAAAMFDPSTFYTKTEINTFLGGKLDKTGGTLTGDVTIQKANPSLVLDHTGVNKWGILSAANGSLSIQKLNGTVVNALTIGADGAISTALLGDLNSRIESRATAWANDRVANLAFRKVSSSSFTVPDNGLMMCPAGAVLTGMNMQGTSNNPAMHYHYLQSFDPVRGWVTFSGS
>1845310..1845609
RhizobiumleguminosarumMIFKEGALLPAVPVDEPLPNLSPRQLWLAALEINTTKAQVMAQIGTITDAKLRATLEIELTEPPLEGYVRDSFAVERLREMMGIPVDQFDTLWLWARTL
>1845609..1845857
RhizobiumleguminosarumMEHMNTEALLLIGRVEGKVDTLISLSSAQSQRIDQLEGRMSAGEVDIASLKAKSTTNQSFVTNITAILALIVAAISAYLSYK
>1845857..1846078
RhizobiumleguminosarumMDLKDILSKLHEEMAQKLLDKVRSGEVTAAELNVARQFLKDNNIDSIPKEGSPLKSLTDELPFTGDDDRPSYN
>1846204..1847892
RhizobiumleguminosarumMTADSLKTGTHLSPAVDPLKKDFRNFLFVVWKHLNLPVPTAVQYDIAGYLQHGPKRCVIEAFRGVGKSYVTSAFVVWLLYCNPQLNILVVSASKDRSDQFSSFTKRLIAEMPILAHLRARPGQRDSMVAFDVGPARNSHSPSVKSVGITGQLAGSRADIIIADDVEVPNNSMTQLQRDQLSERVKEFDAILKPLPTSRIIYLGTPQTEMSLYNRLPERGYEIRIWPARVPIDPERYLGRLSKFVMDMIEAGAQPRQPVDPQRFQEQDLIEREASYARSGFALQFMLDTSLSDQDKYPLKLSDLIVASLDPRMAPAKLVWCNDPDRVISDLPAVGLQGDRLHRPMWVANEMGEYTGTVMAIDPSGKGGDETAYAIVKILHGNLFLVASGGFKEGYSEATLKSLAVLGKTHNVNRVIVEANFGDGMFTQLLKPVFTRVHPVTIEEVKHSTQKERRICDVLEPVLNQHRLIVDAAVIKRDHEAEPHRQLFYQLTRITRDRGALINDDRLDALAIAVTYWVEHMARDTDKAADEHKAALLEQELRSFSEHIFGAPADSDLRWYNIG
>complement(1848112..1849056)
RhizobiumleguminosarumMWPTASEQSLLAATFFQTFNDELEKIRTARNRFVHYTSAETAYKIIENKEIWLRNAAVMNDFSEISYGIDCLSAARAGPAGIAFTNALNSIDPSIVTTVDDLFQKWQPYFQNDTFLASFAVHENDEDEIGRLSMWRAYGNVAIVLKGDVFTNPGPVPISGVTASPVAYFTAKEAESALWRMAAAIAANAGQLRLMKREFIVNSLHSALRFSILANKHPGFREEREWRIMHSPSHDPSPMQRLPVTIAGVPQQICKFKFPGQFGDKQFVEAIDRVIIGPAQHANVMRAAFQNLLASNGVSDALNRVAVSNIPLRQ
>complement(1849145..1849750)
RhizobiumleguminosarumMADGWYWNFAVVAATLMGPILAVQAQRLVDLAREKRNRRVTIFRTLMATRASSLSPDHVEALNAIPIEFYGKSRVFKEVVEAWRMLLDHLGKEQTNMDLWVQRRQTLFVDLLLKLSAAVGYEFSRLELEREVYSPVGHSIVQGEQEVIRKGLYKLFAGETSLPLDIKSFPADEDVTAKQKVLQELLESVLKGDTPIVIKVG
>1850160..1850381
RhizobiumleguminosarumMAFITVEEVHSQQDRVIGTVFLNVEFIIKYESKSTEQPFTSKITYLTGSSISELIVMGAPVDITSKISNASRS
>1850686..1851435
RhizobiumleguminosarumMFKGTLIRSGNNAKTIKGDGEYETAIMYLAPFTMAGANVCPMAEQAGCVKGCLNTAGRGAYNNVQQARIAKTKRYLASRTAFMADLVTDLERFVAYCKRKGVKPAVRLNGTSDIQWEVAHYASRGDARGSVFELFPEVQFYDYTKVYKRAYRQLPANYALTLSYSAANPVYAEVVTKVAHETGANLAIVYRTKELRDYFVGKLVQYGDACRDVIDGDETDMRFLDPKGVIVGLYAKGKAKGDQSGFVVG
>1851543..1851755
RhizobiumleguminosarumMARTYYTLLQRVDDHWSPQFGAYDREDVESERDDYRDHGVKAKDLKIVTTQGHSWKAIEAVLNKLNGRAR
>1852009..1852302
RhizobiumleguminosarumMDNSILFNCCTDHVAPDWSQYDALELGGCVEAKCTLTNDTWTEGGYHRNDAEFFTVYGHLKEGGCEAITDWHGSFDEAVCTAEELARLSGLPLEVCC
>1853015..1853365
RhizobiumleguminosarumMNEVRQLQDEKALMSVQTFEVFLVIASKDGIPSSEIRKITGIPQPSVSRALGDLGEKAVRRDAEGLKLIKTERDPSDMRNVVCFLTPKGKLLAARIAQLMGINDTKVDGSFERNAQ
>complement(1853540..1853624)
GCCCAGTTGGCGGAATTGGTAGACGCACTTGGTTTAGGTCCAAGCGCCGAGAGGTGTGGGGGTTCGAGTCCCTCACTGGGCACCA
>1853641..1854654
RhizobiumleguminosarumMPVKPRGASWQAAVSHKGTRLRKDFPTKLEAEIWEAETKAALLSGKEVVVKTAEPVMTLQQLFDLVAETRWRGTKGEKTALINGQHVVNILGPQRDVKTLCYEDSLTIKKTVTGWKRADATINRKLAAFSTMVKEAYKLGKIDKLFDIGLIKERNTRVRYYEDKELDQMLAWCDEMLEDELRDYIIVSLDTGFRQGEVLKITKRDAELEDLWTFDTKAGDKRDVPLTARAREVLLRRAKPLNDPDAKLFTQKPAWYREHWKSMQSDLGMTDDNNYVPHVLRHTFVTNMLLHTDIRTVQELAGHKRIETTMRYAKTSAERKRLAIKRMSDYQGAEIGA
>1854766..1854777
CAGATTTAGGTT
Region 3, total 9 CDS
# | CDS Position | BLAST Hit | E-Value | Sequence |
---|---|---|---|---|
1 | 1907906..1908691 | PHAGE_Mycoba_Milly_NC_026598: hydrolase; ELH61_09850; phage(gi764160985) | 4.24e-07 | Showinfo_outline |
2 | 1908820..1909653 | PHAGE_Strept_9871_NC_031069: hypothetical protein; ELH61_09855; phage(gi100019) | 9.67e-07 | Showinfo_outline |
3 | 1909794..1909997 | DUF3126 family protein; ELH61_09860 | 0.0 | Showinfo_outline |
4 | 1910272..1910643 | phasin family protein; ELH61_09865 | 0.0 | Showinfo_outline |
5 | 1910920..1911270 | PHAGE_Serrat_BF_NC_041917: putative DnaB helicase; ELH61_09870; phage(gi100317) | 2.14e-08 | Showinfo_outline |
6 | 1911281..1913773 | PHAGE_Agroba_Atu_ph07_NC_042013: putative virion structural protein; ELH61_09875; phage(gi100292) | 0.0 | Showinfo_outline |
7 | 1913869..1915239 | PHAGE_Staphy_Pvl108_NC_008689: putative transposase; ELH61_09880; phage(gi119443708) | 1.02e-09 | Showinfo_outline |
8 | complement(1915522..1915878) | DUF2750 domain-containing protein; ELH61_09885 | 0.0 | Showinfo_outline |
9 | complement(1916105..1916536) | PHAGE_Strept_phiSASD1_NC_014229: gp4; ELH61_09890; phage(gi298103512) | 1.01e-07 | Showinfo_outline |
>1907906..1908691
RhizobiumleguminosarumMNLNTPAFSSFTHDGLQLAFFDEGDPAGVPVLLIHGFASTANVNWVHPGWLKTLGDAGYRVIAMDNRGHGASDKPHDAEAYRPWIMAGDAIALLDHLGIPEANVMGYSMGARISVFAALANPHRVRSLVLGGLGIGMTDGVGDWDPIADALLAPSLDAVTHARGRMFRAFAEQTKSDRVALADCIRGSRDLVARSDMAKLDMPTLIGVGTKDDIAGSPRELAALMQNAEALDIPGRDHMLAVGDRVFKQAVLTFYARVAHR
>1908820..1909653
RhizobiumleguminosarumMVAKTDIRAFDTGHPLKVMDPIWDSLREEARLAAERDPVLAAFLYSTVINYHSLEECVIHRICERLDHPDMQANLLRQTFEEMLLDWPDWSSILRVDIQAIYDRDPACLRFMEAVLYFKGFHALQTHRLAHWLLNRGRRDFALYLQSRSSSVFQTDINPAARIGKGIFLDHATGLVVGETAVIGDNVSILHGVTLGGTGKEGADRHPKIGSGVMIGAGAKILGNIEIGYCSRVAAGSVVLKAVPPKKTVAGVPAKVVGEAGCSEPSRNMDQVIGADI
>1909794..1909997
RhizobiumleguminosarumMKPEEIKKLDAYFKRMFNPQMIVKARPRKDDSAEVYLGEEFLGVVYIDDEDGDRSYNFSMAILDVDL
>1910272..1910643
RhizobiumleguminosarumMFNFEDANKKSKEAVDTALKTYTDTSKGFQAIAAEATEYSKKSFQDAVTHFETLAGVKSFEAAFELQTSYVKAYFEGFVSETTKLSEMYADLAKSAYKPYEAPIAAAVVKTAKSVSAATPAAA
>1910920..1911270
RhizobiumleguminosarumMIAKPIRMQNDSERNGDNANRTSVITRTKPKTKKPNLYRVLLLNDDYTPMEFVIHILERFFQKDRESATRIMLHVHNHGVGECGIFTYEVAETKVSQVMDFARQHQHPLQCVMEKK
>1911281..1913773
RhizobiumleguminosarumMPTFSPSLEKALHQALTFANERHHEYATLEHLLLALIDDADAAAVMGACNVDLDALRKTLVEYVDNELSNLITGYDEDSKPTSGFQRVIQRAVIHVQSSGREEVTGANVLVAIFAERESHAAYFLQEQEMTRYDAVNYISHGIGKRPGVSEARPPRGAEDEAESSKPTARGGEEEGGPKKQQDALKAYCVNLNEKAKGGKIDPLIGRHAEVSRTIQILCRRSKNNPLYVGDPGVGKTAIAEGLAKRIVEGKVPEALADATIFSLDMGTLLAGTRYRGDFEERLKQVVKELEEYPGAVLFIDEIHTVIGAGATSGGAMDASNLLKPALSSGAIRCIGSTTYKEYRQFFEKDRALVRRFQKIDVSEPSIEDAIEIMKGLKPYFEEYHHLRYSNDAIKSAVELSARYISDRKLPDKAIDVIDETGAAQMLLPPSKRRKLITEKEIEATVATMARIPPKTVSKDDEAVLANLEKELRSVVYGQDIAIEALSTSIKLARAGLREPNKPIGAYVFSGPTGVGKTEVAKQLASSLGVELLRFDMSEYMERHTVSRLLGAPPGYVGFDQGGLLTDGVDQHPHCVVLLDEIEKAHPDIYNILLQVMDHGTLTDHNGKKIDFRNVILIMTTNAGASEMAKAAIGFGSSKRTGEDEEALTRLFTPEFRNRLDAIIPFAALPTAVIHKVVQKFIMQLEAQLSERNVTFDLHEDAIAWLAEKGYDEKMGARPLARVIQDTIKKPLANEILFGKLKKGGVVNVTVGPKEDGKPGIVLEAISETAPIKPKPEAEVVHPEGDDGDDGELKTKAARKTRAKAVPQAEPEVRDAPKKGSAVPKVPRKK
>1913869..1915239
RhizobiumleguminosarumMLKKPAPTQTALEMVTLDSLVPKDHVLRKIDAVIDFSFIHGRVAGLYCADNGRPPLDPTLMFKALFIGYLFGIRSERQLVREIEVNVAYRWFLQMKLTDGVFDASTLSQNRRRRFNDTSVAQDIFDHIVEQAIRHGLVDGTVLYTDSTHLKANANKGKYDLAMIEKSRSDYWADLDRAIEAERALHGQKPLKEKEREPEVKETKVSRTDPDSGYMVRDGKPKGFFYLDHRTVDGKLAIITDTHVTPSNVHDSIVYLDRLDRQRERFGFEVGAVGLDAGYATSGIAKGLEDRTILGVTGYRNPTPPRAGMMRKSKFGYEPETDGYRCPEGQLLAYATTDRNGYRHYRSDPAICRDCPLLASCTNNATATRTITRHVWADARQRTDANRLTPWGKAIYKRRKETVERSFADAKQLHGHRYARFRSLTRVSCQCLLAAAAQNIKKIAMALTTASKPAMA
>complement(1915522..1915878)
RhizobiumleguminosarumMSLVAAHTHAFFNEVTQSGFVWTIRDEAGFPTSTNQSNEAAMPFWSSEIRARRIVDQVPAYRGFIPHKLPVEVFLDRWLQGLEQDNVRVGINWSGVRATGFDIAPADVRRRLHSASTR
>complement(1916105..1916536)
RhizobiumleguminosarumMTSPAAYDDSNIFAKILRGEIPSHRVYEDEHTVAFMDVMPQAPGHVLVVPKAASRNIFDADPATLTHAITVVQKVANAVKGVFDADGVFIAQFNEPAAGQTVFHLHFHVIPRHEGTALKPHSGKMEDGAVLAANAEKIRAALA
Questionable (score 70-90)
Incomplete (score < 70)
Viewer Options
Click on a region in the genome above to show details here.
ORF Start: 559068
ORF Stop: 559673
Strand: Forward
Protein Sequence: MSVARSIFPAYSRARMESPVHPAVPRDAGLAEAAIASEADIRSGLTENLARLWRYGLVLSHQRDVADDLVQATCLRALERADQFIPGTRLDRWLFSILHSIWLNEIRSRRVRQGQGFVDAGETLTFDGAHDTETHVMAHQVLKQVNALPEAQRTVVFLAYVEGLSYREVAGILDIPIGTVMSRLAAARAKLSGAGPEGGRQ
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02880, DNA-directed RNA polymerase specialized sigma subunit, phage(gi966199376), PHAGE_Sinorh_phiM9_NC_028676
Homolog/Ortholog E-Value: 3.19e-12
ORF Start: 559670
ORF Stop: 560473
Strand: Forward
Protein Sequence: MNTKHTIPSDEDLTAFIDGELTAEEAARIQTMVEEDESTAERLEFLARASLPFKQAFAPLLSEAPREKLETILAAIPAQPSARPASAPAFASRRRFLGALAASLVAGIAIDRAVIGIGRSFSAKDENSEWRAVVADYISLYTPETLAGPVPAREDQAAQLGPLDEKLGLSLSPEAVSLPGIDFKRALLLQYDGKALAQVAYLDPETGPMALCIVRSDAGPKAPDVENRKGMNVVYWSNETHAFMLIGRIPGDRMKELGEDVRRRLSA
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_02885, anti-sigma factor
Homolog/Ortholog E-Value: N/A
ORF Start: 560568
ORF Stop: 561131
Strand: Backward
Protein Sequence: MTQNGQETTARQKLDDEAFEAFIRGRRPASPIWSMSGLDGYLTALIIGPKFIDPRQWIPELTGPDALNLPMETTEHQAVQTIVAEYNRISASLSETPKDHRPRFTRIDDQTFDIFDWDLCFLLGTGYAPRLWQPVLRGHAVTGDIIAPIRKLGEAKRKATRQDAADVAEALVNIRTYFMPKRTKQKF
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_02890, YecA family protein
Homolog/Ortholog E-Value: N/A
ORF Start: 561128
ORF Stop: 562701
Strand: Backward
Protein Sequence: MPLRHDPLPQDAAQLTRIILSLNEENADLKARVAFLEGQLFGAKSEKMTTIDPTQAILDLGDLSDIPVAANDDVARVGEDKTQARRSPARNIGHLLKHLPRYDELIEPESKICPCCSFELHCIGTDVSEALDIVPAVVRVKRTIRPRYACRACENAIVQAPAPARVMDGGMVTTAFAAHIAVSKFAWHLPLHRQAQMLASCGVIIDRGTLAPGSRGSPGGLSFSTMRSPPSSARSRGCSVMRRRFRGSIRGANEPRSASYGHKRSTIAHGMVRHRRQSRTSSPKAAARARSRGNYRHLPACCKLMDTKPIKPWSNVGARAMSPLCGWPSALLTPGESSLMSSNPALRKPCRSLPGLPKSIASKRTCAGEMPIPGLSGVARQLPSGNRPSLPSGTRCRRNRRLARRSPTHSTTGVGQPSLMMAGLRWTPMSSNVQNPWPRERTRYLWAASGVESPSLSWHRSSTRQSLTAWIPRTGLPMCWSGSSLATKPTKWKVSCLGPGRLSVKRHSRSDELHGA
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: ELH61_02895, transposase, phage(gi209447153), PHAGE_Stx2_c_1717_NC_011357
Homolog/Ortholog E-Value: 5.23e-19
ORF Start: 562750
ORF Stop: 563106
Strand: Backward
Protein Sequence: MIGLSPGGVKIMVATRPVDFRRGMNGLVALVASALAADPYCGDVFVFRAKRLDRLRCIYWDGSGMILATKWLEAGKFVWPPIRDGAMQMTREEFSLLLAGIDWTRVKQNPVKRPLKAG
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: ELH61_02900, transposase, phage(gi209447152), PHAGE_Stx2_c_1717_NC_011357
Homolog/Ortholog E-Value: 1.21e-22
ORF Start: 563103
ORF Stop: 563573
Strand: Backward
Protein Sequence: MVGDRADAMLEVMDEGMHEARHEGKYRRIEVITGRRQRRNWTDEEKARILAESAETDVNISAVARRWGVNRGLLNVWRRDAGLTSRRSAKACAQQAIFVPVTVVGERTSPETLPSDAAHFAAGRIEIEVAGARLTMIGSVAPELAQAVVTALRGRR
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: ELH61_02905, ISRSO10-transposase ORFA protein, phage(gi17546153), PROPHAGE_Ralsto_GMI1000
Homolog/Ortholog E-Value: 4.01e-05
ORF Start: 563657
ORF Stop: 564519
Strand: Backward
Protein Sequence: MKGIVLAGGSGSRLHPMTHSVSKQLLPIYDKPMIYPLTTFMLAGIREVLIISTPHDMPLFQRLLGDDSDWGMSLSYAAQQSPDGLAAYIIGADFVAGGPSCLILGDNIYFGHGLPELLDEGVPKGDGATVFAYHVRDPGRYGVVEFGSDMTAISIEEKPANPKSHWAITGLHFYDADVVNIAADLKPSPRGEYEITDVNKTYLERGKLRVSMMGRDMPGSIPARPKVSLRPASSSARLKNARASKSLVRRKLRWRKASSRRSTSPRLPPKRAGITASTCVVWWIAG
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02910, Phi92_gp066, phage(gi726646999), PHAGE_Entero_phi92_NC_023693
Homolog/Ortholog E-Value: 4.09e-84
ORF Start: 564516
ORF Stop: 565202
Strand: Backward
Protein Sequence: MRIVVTGKNGQVASALQALYAGGTEVIAVGRPELDLLEPSMVSEIIAKIKPDVVVSSAADTAVDKAESDEAAAFAINRDGARAFAAAAAELSLPIIHLSTDYVFDGDKPERYVESDPVGPASLWQIEARGRICGRRSQESCDPADGLGLFDVWPQLRENDASRKRPADVSPSRTLRPLTIPQQRNGHPIPDSVAISRAFMACACRSGGYQARTAVTKLLEEPKEAV
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02915, Phi92_gp064, phage(gi726646997), PHAGE_Entero_phi92_NC_023693
Homolog/Ortholog E-Value: 1.83e-12
ORF Start: 565204
ORF Stop: 566259
Strand: Backward
Protein Sequence: MRVLVTGGAGFIGSALVRYLVSDIGAEVLNIDKLTYAGNLASLKAVENAPNYRFLKADICDRSAVSNAFEEFRPDYVVHLAAESHVDRSITGASDFIETNINGTFSMLEAARQYWQDLPAYEKAAFRTLHVSTDEVYGSLGEDGLFAETTPYDPSSPYSASKAASDHLATAWERTYGLPVIISNCSNNYGPFHFPEKLIPLIILNALERKPLPVYGSGSNIRDWLYVIDHARALWLIVQRGRPGEKYNVGGRNERRNIEVVERVCAIMDEVRPGTAPHSDLINYVTDRPGHDARYAVDATKLETELDWRALENFDSGIRKTVEWYLENAWWWQPLRERAYSGERLGVLKKV
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02920, Phi92_gp067, phage(gi726647000), PHAGE_Entero_phi92_NC_023693
Homolog/Ortholog E-Value: 3.82e-114
ORF Start: 566273
ORF Stop: 566839
Strand: Backward
Protein Sequence: MHTLNVEQLAIEGVKKVTPARFGDSRGYFSEVFKDSWFRSNVADVSFVQDNESLSAQPGTVRGLHFQLSPFAQGKLVRCLRGALLDVAVDIRHGSPTYGKWVSAELSPENGEQLWLPAGFAHGFVTLQPDTKISYRVTAPYSAEHDRGVRWNDAEIGIEWPQMDAYVLSDKDNKQPLLSELPAYFQFS
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02925, Phi92_gp065, phage(gi726646998), PHAGE_Entero_phi92_NC_023693
Homolog/Ortholog E-Value: 6.78e-41
ORF Start: 567109
ORF Stop: 568029
Strand: Backward
Protein Sequence: MNRDVKIYVAGHRGMVGSAIVRRLKAGGYMNIVTRSHAELDLVNQAAVAEFMKAERPDYIFMAAARVGGIHANNVYRAEFLYQNLMIETNVVHAAWQAGVERMLFLGSSCIYPRDCPQPIREEYLLTGLLEQTNEPYAIAKIAGVKLCESYNRQYGTRYVSGMPTNLYGPNDNYDLDSSHVMPALIRKVHEAKIRGDRQLVVWGSGRPMREFLYVDDMADACVFLMEKDVNEGLINIGTGEDITIRELAEAIMGVVGFTGEIVYDQTKPDGTPRKLMSVDRLSALGWKATTSLSDGIARAYADFMS
Homolog/Ortholog Species: Head protein
Homolog/Ortholog Protein: ELH61_02930, minor capsid protein inhibitor of protease, phage(gi100226), PHAGE_Synech_S_CAM7_NC_031927
Homolog/Ortholog E-Value: 1.06e-100
ORF Start: 568034
ORF Stop: 569116
Strand: Backward
Protein Sequence: MKRALITGITGQDGSYLAELLIEKGYEVHGIKRRTSLFNTDRIDHLYQDPHDTNRRLVLHYGDMTDSSSLVRIIQQVQPDEIYNLAAQSHVAVSFEEPEYTANSDALGALRILEAIRILGLEKKTRFYQASTSELYGLVQEIPQRETTPFYPRSPYAVAKLYAYWITVNYREAYGIYACNGILFNHESPVRGETFVTRKITRALARIKLGLQDCLYLGNMDAKRDWGHAKDYVEVQWLMLQQDEPEDFVIATGVQYSVREFVDAAAHEIGLPISWKGSGAEEKGYDENGRCIVAVDPRYFRPTEVETLLGDPSKAKEKLGWEPRITFKQLVAEMMREDLKSAERDELVKRHGFSAYDYHE
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_02935, GDP-D-mannose 4,6-dehydratase, phage(gi472340900), PHAGE_Synech_S_SKS1_NC_020851
Homolog/Ortholog E-Value: 2.11e-178
ORF Start: 1811452
ORF Stop: 1811463
Strand: Forward
Protein Sequence: CAGATTTAGGTT
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attL
Homolog/Ortholog E-Value: N/A
ORF Start: 1811768
ORF Stop: 1814344
Strand: Forward
Protein Sequence: MTFEQSILDNPLFERQLDLEQEMRTSGIDRFRKNVDKASVKGAMSDTMAVNRLVVEAHEKVVAAINEFLTEAKSGAAGRRHTAVVFIEKLDVDTVANITARVILDEVTRKSNLTKTSLAIGSMLENEFNSRKFEEEMPKAHKKFLKKAQKESLDRRKWSHLLYPARLLGVELEEWSEKDRILVGLKLVDLFIQSTGLIEREVVQSARFGTLELLVANEATLKWMETENSRLEHLFPIYMPTIVPPKPWTSPFDGGYFTAFRRLKLVKTHNQQYLEELANRDLSQVYEAINALQDTAWAINTQVLDVIRTLYETGAGVAGLPQADKLRMPLRPHWLPEGKDRMSTEDMTEEQLEEFKAWKAETHRTHVENAAISGRRASFLRTLGVAEKFKDEEAFFYPHTLDWRGRAYPLPLYLTPQGNDLQRGLLTFANAVPIHDEEAAEWLAIHGAGCWGYDKVSLEERVQWVLEHEVEIIASAQNPYDNHFWMGADKGEKKWQFLAFCFEWAAFKEEGYGYLSSLPVQMDGTCNGLQNFSAMLLDEVGGAAVNLIPADEPQDIYQKVCDIVCEQLARDLESTELVTIKGKTDDGVEFEKVVCSVADMANGWLPKMGRKVTKRPVMTLAYGARRFGFVSQVDEDTIKDWRSGSPESYPFISQGDDGKPVDYGYKAAQYMGGLIWDSVGEVVVKARQAMDWLQAVSKVASNEQLPINWTTPVGFLVQQAYRVPNTKRVDTTFNSQRIRLTYQHGVGKIDGRRQASGISPNWVHSLDAAHLMKTIGRCRREGIASFSMIHDSYGTHAGNAWAMARYLREEFVQMYSQVDVLTRFKEELEAQTGEQLPDLPAKGNLDLQQVLDSPFFFA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09290, DNA-directed RNA polymerase, phage(gi712913252), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 2.91e-174
ORF Start: 1814671
ORF Stop: 1815360
Strand: Forward
Protein Sequence: MAKLTPAIYGAAPVDLGCMELDTSEMMFWLYLPIKMPGQFMPKLPANLKKYERIVDAVMDNVIDDDTINPQGRRWTESYVYLSVKITHVTPDAPGNRPGWHSDGFLTDDLNYIWTDRNPTEFFITDALFGTEPDHRSSMKQFDWMARHLLRSKGDRLEHAKVNHLYRLDQTNIHRVSLNVESGKRAFIKVSVSDKPYVQLGNSINHDLPQHPLPTLKRQADRNCPQGNK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09295, hypothetical protein, phage(gi712915495), PHAGE_Acinet_vB_AbaP_Acibel007_NC_025457
Homolog/Ortholog E-Value: 2.36e-21
ORF Start: 1815570
ORF Stop: 1816391
Strand: Forward
Protein Sequence: MANKNFHKTVSVAASAHNCNSVGEITLATLNADSYINLITSARSLTISPVEAKKLRDLLVEAYPLEAPAAAAKRTKFKAGDKVTYKSIVGYGARGMDGRKGSIKEVLTNEWYMVNFTGGPFDNLIKVHSDYLVAQPVPSVGGLKVGDRVRRLQGSSKGSIFTVTELNSSLTELKVDGMTGWRMPKYFELVTDAPASAPEPLAKIDTGRFLVVALEGANYVPGSKPKVHVTDFSARVEAERLARDVGGTYHVFRAVFEASREKPVIPPVKTTKL
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09300, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1816480
ORF Stop: 1817190
Strand: Forward
Protein Sequence: MAERKKNPSLISPRGPLKFPKIDKVDYGTKEYPKPNGEYSTKQVLEADAPATKAFIAALMPHYQAAMEEAAAKFKELKIETRKKLGKVTENDLFTTLYDQETEQPTGYIEFKFAMAASGERKDKTKWSAKPAIFDAKGKPMTKVPEIWSGTEAKVSFECQPYFIPGTGAAGLKLKLKAIQIIELVSGGQRSASSYGFGAEDGYEYEEPATEENEGGFGDESGEDTSSKTIDDDIPF
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09305, single-stranded DNA-binding protein, phage(gi460042334), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 8.49e-11
ORF Start: 1817198
ORF Stop: 1817608
Strand: Forward
Protein Sequence: MTYRTSAAGLRAVGIREGFRSGLEDKVGDQLRAQGIDPRYEEVVIPYVKPERKAKYTPDFQLPNGIFIETKGRFVTEDRQKHLLVKTQHPELDIRFVFSNPKARISKTSQTTYADWCLKHGFKFAAKIIPQEWIDE
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09310, putative endonuclease I, phage(gi712913258), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 5.50e-48
ORF Start: 1817601
ORF Stop: 1818221
Strand: Forward
Protein Sequence: MSSMFTPRKVTSYLVVHCSATQPKMDIGAKEIRQWHREKGWIDIGYHFVIRRDGTVELGRPENVVGAHVENHNSNSIGICLVGGVDAKGKAENNFTPAQFATLAIKLRELRSKYPGVTVQGHRDFPGVKKDCPSFDTRKWINETGVFETSHVPAEPDARAVEITSATPTIFSLAKKYGTTVEAILKVNPHVDPAKLKLGQVIRLPG
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09315, hypothetical protein, phage(gi100022), PHAGE_Escher_vB_EcoP_GA2A_NC_031943
Homolog/Ortholog E-Value: 3.40e-57
ORF Start: 1818315
ORF Stop: 1818992
Strand: Forward
Protein Sequence: MNTFVAGAHVRHKDHPEYGNGRIVYVHTNGKSLAVEFENSTGLRHDCAGHAKYDFCRWSRAASLELINPFKFGDTVLMCAFDARSNPAFYYDHDFANQQGIIDHADKDGYVTVLVASLGHTQFVPIADLTLVKVDPVEAVKKATPKFKTIVFQSGSQCDRLVKYLLAGNSITPLVARQLFGVERLAARILEIKKAGHKVTSTIKTDVNGKVYAEYALRKAGRVGA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09320, DNA binding protein, phage(gi472341794), PHAGE_Sulfit_pCB2047_C_NC_020856
Homolog/Ortholog E-Value: 1.44e-05
ORF Start: 1819243
ORF Stop: 1819425
Strand: Forward
Protein Sequence: MKAALHFLLNFLTIAGLGLLAILVASSTGAAFVAFGPLIGSVVAFFWIAVGVSFERSERL
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09325, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1819429
ORF Stop: 1821057
Strand: Forward
Protein Sequence: MQSESSFVRKEPCPACGSRDNLGRYSDGHGHCFGCDYYEPGDGSVPQTSRKSKMSRELISGGEYRALSKRGITEETCRKFGYQIGHFKDQLVHIAPYYDDEGNLTAQKVRFADKTFTVTGNMKPALLFGQNLWSGGGRKVVITEGEIDALSVSQVQGNKWPVVSIQNGAQSAKKALSAALEWLTTFEEVVLMFDMDEPGREAAAACAALFPPGKCKIARLPDKDPNALLMAGKGDEIITAIWQAQVYRPDGVVAFKDIKEAARRPIEMGLRWFCDRLTKLTYGRRWGEVYAFGAGTGIGKTDFLTQQITFDVTELGQKVGVFFLEQMPTETAKRLAGKFAKRRFHIPDDGWTDAELDEALDKLDQDMLFFYDSFGATEWSVIRETIRYLAHSEDVKVFYIDHLTALAAAEDDERKALEQIMAEMAALAKELGIIIHLVSHLATPEGKPHEEGGRVMIRHFKGSRAIGFWCHYMFGLERDQQHEDERLRAVTTFRVLKDRYTGQATGEVIYLGYERETGMLYETSIEFGDETGSDFKDESAPF
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09330, DNA primase, phage(gi460042337), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 9.80e-121
ORF Start: 1821226
ORF Stop: 1823079
Strand: Forward
Protein Sequence: MRDPSSGFTISATSNDYSSDDPTIITDWSVEEALHALMNADVIIGHNIIKFDIPALQKVFPWFQPKGLIIDTLVCSRLIWSDIADHDLKQVRKGYPGKLVGSHSLKAWGYRLGVLKGDFGETSDWRYWSPEMQTYCEQDVEVTAQFYARIKKKEPSPKSLWIEHEFCKIIAMQERHGFAFNEEEAIKLYSQLVTRRLEIARELQVAFPPVEKTEVFIPKVNNKQRGYVKGEPFTKKWMVEFNPSSRQMIADRLQAMGWVPQEFTPSGQPKIDETILQALPYAQAKVLAEHFLVEKRIGQLAEGDQAWLKLVKKGRIHGSVNTNGAVTGRCTHSNPNVAQVPRVGSPFGAECRALFTTTSRWVLVGADLSGLELRCLAHFMALFDGGEYGRIVLEGDIHTVNQNAAGLPTRNDAKTFIYAFLYGAGDQKIGSIVAPDASPEEQKRIGKKLKRQFLAKTPALRRLREVVELKVLGFVPKARPLNVNPAYEHMWRQDSAKQWWFKAGAGGVLVGLDGRKLNVRSAHSALNTLLQSAGALISKAAMIFAYRELSTRGYVFGRDYAFVAHIHDEIQTECRPELAEEVGQIVVEGMRAAGTFFAFGCPIDGEFKIGNNWKETH
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09335, DNA polymerase family A, phage(gi460042339), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 2.10e-165
ORF Start: 1823084
ORF Stop: 1823290
Strand: Forward
Protein Sequence: MIEILKQALENPFKTKSNFARENADLIAMAASDGFLTTRIATGLYSRKWMITPVGLSHYYALTGLNHD
Homolog/Ortholog Species: Head protein
Homolog/Ortholog Protein: ELH61_09340, membrane-associated initiation of head vertex, phage(gi100034), PHAGE_Morgan_vB_MmoP_MP2_NC_031115
Homolog/Ortholog E-Value: 3.76e-05
ORF Start: 1823363
ORF Stop: 1823779
Strand: Forward
Protein Sequence: MFGRSSSSSYSSSASRAENHLTINQQPHDAADAARLYGELQDKANDSITEIVGHQIADTKVEFVTLDTARDVLHFKDHVRVIFKINGKTFDTRVEIDDPIKLSEPRERVAYRAVAEGIANTLMDRSIFQIYQTFARKR
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09345, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1823867
ORF Stop: 1824733
Strand: Forward
Protein Sequence: MWSTTHTGSCTRMNTALSATSTGRQEKRRRASSSMTRTLLIDADVVAYVAASSLEVATDWGDGYWTWHVDEFEVQKKVKQIIDDTMEDLKGDSCKLCLTDSFGNFRKSVLATYKGNRSNIKKPLVLMKTKQWMIDELGAYFRPGLEGDDCMGILATMKGTDERIIVSIDKDMKTVPGKFCRYTDSKAKIIEYSEKEADYWHLYQTLMGDATDGYPGCPGIGPKKAEAILGPIDEFDLTEGWARVLAAFEKAKLTETDALTQARVARILRASDYDFKKKEPILWQPKAN
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09350, exonuclease, phage(gi460042342), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 8.29e-50
ORF Start: 1824712
ORF Stop: 1825278
Strand: Forward
Protein Sequence: MATKGKLIALYSDAAGSGKSEVAGTLIRHGYESVKFAGPLKNMARGLLGSMGFETVTVERMIEGDLKEAVIPGFKTVTPRQIMQTLGTDWGREAIDQDLWTKVAAAKIEGLRDKGVDVVVDDLRFPNEYDLIASLGGTLVQVVRADPSREAGGAYEGKLSGHLFHHIVHNNGTLRELYSKTLLLAQSI
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09355, ssDNA binding protein, phage(gi100044), PHAGE_Sphing_Lacusarx_NC_041927
Homolog/Ortholog E-Value: 1.32e-25
ORF Start: 1825304
ORF Stop: 1825759
Strand: Forward
Protein Sequence: MKFLSGLVAFALVAILALAPAIAEARSSFGGFSGGSRFRSSSSFSSRSTTTYSRPSTSYSRPSTTYRPAPVYSSPSYRSYSSTTINQSSGGGGFFSSMVGSIAGYGIAQWLFGEDEKPAEQAPAAAPAPAAQAPVTGAVPTTTVQEAPKAQ
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09360, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1825965
ORF Stop: 1826171
Strand: Forward
Protein Sequence: MVDDQDRFPHIPKDLVEALDKRFPERTPSLNTSLDEIRWKGGERAVVRFLLEQYNRQNETVINEKVLS
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09365, hypothetical protein, phage(gi460042344), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 1.73e-05
ORF Start: 1826171
ORF Stop: 1826398
Strand: Forward
Protein Sequence: MCPPKVKTQKVEPVAQAAPPAQPAATVNQSAPQTPDELSPEQAAIKAKRKGRSSLRIPLDAGVGSGATGINVPQA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09370, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1826408
ORF Stop: 1827946
Strand: Forward
Protein Sequence: MTGQTASGRYQQLSQARSAVLERGRASAKLTIPSLLPPAGHSESSSLPTPFQGIGARGVNNLASKLLLALLPPNSPFFRLMIDDFTLENLTKRKGMRAEVEKGLNKIERALMTEIETTAIRVSAFEALKQLLVAGNVLIYLPTEGGMRVFRLDRYVVKRDPMGNVIEIITREDISPDMVPEAMKGHVKSKSRSNEKTIELYTHIVRQRDKWTIRQEIKGMTVPGSRGSYPLDKCPWIPLRFTKIDCEDYGRGYVEEYFGDLLSLETLTQAIVEGSAAAAKVLFLVNPNGTTRMTDIAKAPSGAVRAGNAEDVSVLQLDKFADFKIASETINNIQQRLSFAFLLNTAIQRAGERVTAEEIRYMAGELEDALGGVYSILSQEFQLPLVRVLMFRLERQKKIPPLPKGVVKPTITTGLEALGRGHDMNKLTIFAQTASNIAALPPEISKADFLMRVGTALGIDMDGLVKTPEQLQQDQQQLMMQQLIEKLGPKGMDILRDQLKPEVQNGPQAQAQ
Homolog/Ortholog Species: Head protein
Homolog/Ortholog Protein: ELH61_09375, head-to-tail joining protein, phage(gi460042400), PHAGE_Pelagi_HTVC019P_NC_020483
Homolog/Ortholog E-Value: 9.75e-157
ORF Start: 1828059
ORF Stop: 1828556
Strand: Forward
Protein Sequence: MTSFGDAFKAARKAGKTTFKFGGKSYHTKTKDEMAKTKKAVPTPSPRPEAMKTDAQAAVDSAPKSAPKAAPKQDYPRPAKPVGIASANSAIGRAAAARENAPVLKIPSRANANANTVQKPRAPARGSSTPAEKQGPKPEEQQWFARKGSAISLGIARRRNAPVSK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09380, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1828566
ORF Stop: 1829339
Strand: Forward
Protein Sequence: MENEVKEQDQIVVPGSDEHNALMVEKFQNQSGSTEQTNAQPAERPAWLPEKFEKPEDLAASYAELERKLSGGQQQEAPKVEANAEEAREAVNALGLDFDALGAEFAESGALSDDSYTKLAEKGLSRDIVDAYIEGQEAKAQLHRAEVLLAVGGEATYNEIANWAAGNLTNEELQAYNDQVESGNLTAAKMAVQGLKARFEAENGSEPQLLNGETGGNSAEVFRSTAELTAAMRDPRYKKDPAYRADVERKLSKSSLF
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09385, scaffolding protein, phage(gi460042348), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 5.57e-51
ORF Start: 1829349
ORF Stop: 1829579
Strand: Forward
Protein Sequence: MTALASIVALVWAHLDDIFAILFALQAVLVLISKLTPTPKDDAVAAKILSVLESIASVLSVKRKDFPAAPTSPGLY
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09390, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1829598
ORF Stop: 1830491
Strand: Backward
Protein Sequence: MSEVESAESVEKGENRGDDNIVSFQAFLEKAHPSLTKVVPDVWQTQVTLNAANRRRLTYPQIRLYCSICAGERNFRCLTDDAFGWNTTIANVHPLYTCGDCRDQVKHYALHLTFTDAGGAKVYKYGEIPPFGIPVPNRVLRLFGNADAKLFLKGRQCEDQGFGIAASAYYRRVVENHRNDLFDEIIKVSETVGAPAELIDELKAAKKEISFAKSIEKIKTGLPQGLLIDGHNPLNALHGALSEMLHNESDEDCLQNAQAIRVVLSDLVERIAMLKQDNKQLSDAVQRLLAKRKGATN
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09395, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1830869
ORF Stop: 1831846
Strand: Forward
Protein Sequence: MANANVTRIGQINGSGDVDALFLMQFAGEVLTAFEETNVALEHTMTRTINSGKSAQFPATGKVGGEYHVPGTEITGLNLKAAETVITIDDLLISHGFIANIDEAKTHYDLRSIYSTEMGRFLAKTMDKHLLQVGVLAARATNVVDGEPGGSVILTGESGLPSTPNFDANGDHLAAALFIAAQKLDEKDVPEDERVAFVRPAQYYNLVKATNNLNKDWGGMGSYAEGSILKVAGIQIVKTNHLPNTDLSAATGVEAGSGLKYRGNFTNTSALVMHKQAVGTVKLLDMGMESAYDIRRQGHLMVAKYAVGHGILRPQAAVEIRNAAA
Homolog/Ortholog Species: Head protein
Homolog/Ortholog Protein: ELH61_09400, capsid protein, phage(gi460042352), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 2.62e-80
ORF Start: 1831967
ORF Stop: 1832227
Strand: Forward
Protein Sequence: MISSPTKCYKVAAGDIGAKELPVYSKALAIYVPDGATATISMVYLNNLDGEAVTRTFTAGNHYIAARIRRITAVSNASVEIHVETE
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09405, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1832245
ORF Stop: 1833879
Strand: Forward
Protein Sequence: MGVTVSPRFGRAPRRWLSGPPAWVPDLNRYMPAATGTRWPTGSTTNPWTYAAGLNYQCSKLFFGSPDYPTNDFLIPFVGFALTEGGNAPQETQGPTTDTLLDEAFFIHPDGTEYPILFGGQAAATITANTGIVYGQVTLPTWLPAWSIFGIRTIYHGNAGENRLGSYRIQRHRGEKFWGAGDLASIRALATANGPSTPALDPDNWYNTVGNATNSQQQAYGPAMVLAKGWDGRPVPLMLADSLAERQEIAASADERRNMGIWRRWLDQRDPVWGSLIPVVMGVPGAHSEYELAGSGASIATRRWAMIDYIRDTFNGGKNIWTFVLDQSGRNDTSSTLSLWQSRKFGLDDRVKARYPGVHMVGITIMPTFTSSDGGRTVAGYSTSAVWNPVTGTLASLNASIMSSPRFAKVIDMLPAFMSDTDPTKGPAAELFPLGNVIGHPGNQDGTTTWDTIRLPSTVPLGSRISFEYQPGQWASRTLSGRTDRGDGTADYRVAEIFATNVQDNATLLGHGMHTDFIHPALHGVLRTVSRIPQSEKAKFYPAA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09410, hypothetical protein, phage(gi100013), PHAGE_Rhizob_RHEph04_NC_041908
Homolog/Ortholog E-Value: 0.0
ORF Start: 1833941
ORF Stop: 1834540
Strand: Forward
Protein Sequence: MLTPTTELEAINLMLSVIGESPVNTVEDTGLVDAVVARQILIQSSRDVQLVGWHWNTEIDYPIAASFPEGELTLPPNTLKVDTAGADAGLDLVQRGNRLYDRKNHTFNVGRTVYVEIVLLLPFDQLPEAARSYIVMRAARQFQERMVGSETIWQFNSRDELRAWANLMSSEAETQDLNVFNDNPSVRRVLDRTPPGGLV
Homolog/Ortholog Species: Tail protein
Homolog/Ortholog Protein: ELH61_09415, tail tubular protein A, phage(gi460042353), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 4.88e-31
ORF Start: 1834540
ORF Stop: 1836921
Strand: Forward
Protein Sequence: MAGALVSTTIPNLINGVSQQPYALRLASQCELQENAHSSVVEGLRKRPGTTHRAKITNAPAGELFTHTINRDRTEQYEVMVGNGDLKVYDLKTGAEKTVTFPNGRAYLTAADPRSSFKAVTIADYTFLINKTVTVEQDTTLSTSRAPEAVVWVKQGAYGTKYTVTLNGVSATVSTPDGSTASHINNIQTDVIASGLVSALNAAIGGFSFALNGSSIYIKRADNADFTINVTDSQGDQAMKLLKGTVQRFSDLPAKGFNGFAVEIVGDQSSSFDNYYVKFDTASGVASGVWVESVKGGEAIRLKASTMAHALTRNADGTFTFKQVEWIDRKTGDLDSSPMPSFVGKKMNDIFFHRNRLGFIADENVVFSRSGDFFNFFRSSATQVLDTDPIDAAVSHIKVSILQHAIPFNETLLLFSEQTQFQLGASELLTPETISINQTTEFECSLKARPVGSGRNIYFTFNRGNFSGLREYYVDGDTKTNDASDVTSHVPAYVPKDVSKMAASSSEDTIALISESERNSIYVYKYYWNEQEKLQSAWYKWTFPATDTILSVEFVESNLYLIIRRPDGVFLESMSVNPGYVDDGFDFGLNIDRKAKEDACTVSYNAVTNETTITPPYLLQAELLPASEADVIVSRAGDPIKKPGQLIPYTIDGNNMVVKGKLEKFIIGRSYVMRYRFSTFVIKEEAVGGGQMTVGEGRIQLRKATLTYDNSGYFRIEVTPLRRETYRYVFSGRVIGSAKNVIGQTAIDKGRFSFPLMSKNDLVTVDIVNDTFLPCAFLSAEWEALYVIRSKRL
Homolog/Ortholog Species: Tail protein
Homolog/Ortholog Protein: ELH61_09420, tail tubular protein B, phage(gi460042354), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 0.0
ORF Start: 1836921
ORF Stop: 1837385
Strand: Forward
Protein Sequence: MLETRPSRPEDVTYLAPRLREADRQELLAAGAPGPEQSLRDGLMLSKNCISVVDDEDRAVAMFGVCPSPVEGLGYIWLLGSDDIKQNKTRFLRRSKQWVDTFHQDFTVLTNYVDQRNEVHITWLRWLGFKFLRIVNAPGPGNLPFYEFARIRNV
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09425, hypothetical protein, phage(gi460042355), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 5.96e-28
ORF Start: 1837378
ORF Stop: 1837908
Strand: Forward
Protein Sequence: MCDPLSMIGFAIGAAQQVVSYQAEKTAAEQQNQLYKENAARANQNARDQMFQTQQRMLQEQEKGAAEKMDTVREAREAKATATVAAGEAGVSGLSVDALLAEFDGRAAAANDRTDQNTEWTLSQLNNEMKGIRANAEDRINSVQRAAAPSFFNTGLKIAGVGLDSYNDFKVKQRSK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09430, hypothetical protein, phage(gi712913288), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 3.02e-24
ORF Start: 1837908
ORF Stop: 1839932
Strand: Forward
Protein Sequence: MARLPGLRPIDEERRRSGGGQNRGRVRTPSADNIRVQGLSPNASPVDTYARPEQAPIGSNSWEALAKSLAGIQPSINNFLNVQAAEQQDDDVTAVRQAFLQKSPEDVRKAIKEGSVPGLTSLAGRELVGERLAYDRSLQIMASYQTDFDRQTGDVDAFVQERIKDDLAEFGNDKALMGAYTKQMTAFTEKLRNQSVDDKATFQQDVRQGNLFEKWSAKATYDRAEGKAPADVAGSMFGEFTKNQELLRVPFQKQQEMMLQLADQAATSGDYDLAKAILQHKREDGPYKGSLMTDVKVGDTATKLFARIDADQTRERLTAQAQEDEESLYSQGVAAAESGSILAIGDAQVRDKQGEMRTITADAQKKEVANRLIAKAADEAAYREKDPEKRPALARRLEKEKFVGSGLEHPVWFKAMNGAPGQMNLNAATGEIPPSAKDAFDTYQDLYKDSPQYLAKYLNKDALEFFESARLAEEVGNAGTPEAALRIAHMVTQDPNQMDEALKLKYDSIDSAVKSAVSNSTSWGQWVFGKQTAGNQNYVRSEIVRLAKQYALLGKDNDEAIEQAQQTFEKTHINVAGSYVKNDKRLPGDFEPLVNQYLSEFVEKHKGDLNYDIDDLTITQGNGTGAYMVVRKSDRMPADPASDDTFFSLNILNDLRTRNRDQKIKEVTAKQNAR
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09435, hypothetical protein, phage(gi712913289), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 5.03e-21
ORF Start: 1839942
ORF Stop: 1843715
Strand: Forward
Protein Sequence: MADIRSIITDAANRYGINPQDALEMAQIESGLNPHAQNKSSTAGGLFQFLDSTWAKYGKGASKYDPYANADAGMRLARDNINFLKKKLGRDITGGEMYLAHQQGAGGALNLLANPNTMAVDLVGRAAVLGNAGQTGMTAAEFANLWINKIGSTKVGNGPGLVMPGSMANSQNPGDFSVHDQGRVSASDVIPTMNTTRAEEVQQEKDRQAMMPSFGEAVATAVKNEWSVLTPFRALGHFDPEPDYKLTEDKLRTFGQSIPDDYLDEFEDAVSDEHAEAIRNRLLTQLEDNQKIASLGTAGTIISMGAALTDPGAIAATAAIGAVTGGFGVPAAVAARLGRVGMVGLAAAEGVAGNLATDIPLVAVDPTRDVSFDELKYSIGTGLVMGGVMGAFRRNPMFTEEAKQIAKIGQQMQEQAVSLPAGSRSAGAASVMGDNFTRSDTSNLIDDFKRLDPKGTFLNWRVDAVGQLMASKNPMAQTLARYLGEDGVRAAKGSGVVTEIAATERMQRRLRVAQINWYRGYDDAFKKFRKANGINAFQAKDAELKFKEQITDYIREENPSVRAQFPAEVKQAAGAFQAEMKSFWKEAQELGLTRTEAGVENYFPRYGHLAKATKLIREVGYSMDRNGGLTDLFAGAILKKQPGLDPKIAKRMGYAVLDRFQKLSSGEEMFGTGHLGFDLDDLEVELKNYLDDEQIANVKAWASRNEKKEGEASGPARMKARIMLDENHFADVMTKRDGVKRVNISDFYVKDPHTAFQLYARNMSGQLAMARIQVRDPVTGNLLIDGIKNGNDWTKLKNQIKSVGEATGANNTRDERNLDFLYSAITGTPLAGIDRGSDGATFLRMLRDFNFLRLMGQVGFSQVPEFGRQVAQVGVKTTFQAVPSFRHLIDMARSGKMTDEVAEELDAIGAFGTDYERTAHYLDTDELGVPVTSGSDSTIQRVAGAVNPKLHAMNRFVSMGSGMAPINRVFQKWSARAAAVKFTKMAMFGDKVDAERLRALGLDDATTKQIFEAIKTNATFKGGVKSPSKLQSLGIKNWDGNTLSAFEDAMFRLNRTMILENDPGQMHRWLAHPLGQMVMQFRTFAMSAHTKALLQGLNLRDGPALFGMLASSFLGAAVYAGQTHLNLIGRPDRDDQLKERLTWNKLGLAGFSRSSESALIPMAADIGWQFFDDEPLFDTRSSGLKTTVSSFLGNPTGDLISTGLAGAAGVTSAMVGDDYSQTDWQNLTRTLPFARMMGAVQFLNWVGSGLPRRELRD
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09440, putative internal virion protein, phage(gi712913290), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 9.78e-74
ORF Start: 1843732
ORF Stop: 1845030
Strand: Forward
Protein Sequence: MAGPRGNSGAFFIFNRETLMALAYAQSLGDGVTNTFSVPFPYISKNHVQVKVDGVAVPYTWLSDTSIQISPAPAADKIVDRRRVTPRDTLLVDFVDGSTLVESDLDLSALQVFYLAQESFDLGESSLGVTDDGSFSALGRRISNVLTPTLPNDVATKQFVETGVASGVTVATQKASEASASAVAAALSETNAAGSATSANASKITATTKAGEAATSATNAAGSATTASTKASEAASSAVEAQGYRDTAATKASEAAASAAAAAMFDPSTFYTKTEINTFLGGKLDKTGGTLTGDVTIQKANPSLVLDHTGVNKWGILSAANGSLSIQKLNGTVVNALTIGADGAISTALLGDLNSRIESRATAWANDRVANLAFRKVSSSSFTVPDNGLMMCPAGAVLTGMNMQGTSNNPAMHYHYLQSFDPVRGWVTFSGS
Homolog/Ortholog Species: Fiber protein
Homolog/Ortholog Protein: ELH61_09445, tail fiber, phage(gi460042359), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 2.01e-22
ORF Start: 1845310
ORF Stop: 1845609
Strand: Forward
Protein Sequence: MIFKEGALLPAVPVDEPLPNLSPRQLWLAALEINTTKAQVMAQIGTITDAKLRATLEIELTEPPLEGYVRDSFAVERLREMMGIPVDQFDTLWLWARTL
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09450, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1845609
ORF Stop: 1845857
Strand: Forward
Protein Sequence: MEHMNTEALLLIGRVEGKVDTLISLSSAQSQRIDQLEGRMSAGEVDIASLKAKSTTNQSFVTNITAILALIVAAISAYLSYK
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09455, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1845857
ORF Stop: 1846078
Strand: Forward
Protein Sequence: MDLKDILSKLHEEMAQKLLDKVRSGEVTAAELNVARQFLKDNNIDSIPKEGSPLKSLTDELPFTGDDDRPSYN
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09460, DNA maturase A, phage(gi460042367), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 1.06e-15
ORF Start: 1846204
ORF Stop: 1847892
Strand: Forward
Protein Sequence: MTADSLKTGTHLSPAVDPLKKDFRNFLFVVWKHLNLPVPTAVQYDIAGYLQHGPKRCVIEAFRGVGKSYVTSAFVVWLLYCNPQLNILVVSASKDRSDQFSSFTKRLIAEMPILAHLRARPGQRDSMVAFDVGPARNSHSPSVKSVGITGQLAGSRADIIIADDVEVPNNSMTQLQRDQLSERVKEFDAILKPLPTSRIIYLGTPQTEMSLYNRLPERGYEIRIWPARVPIDPERYLGRLSKFVMDMIEAGAQPRQPVDPQRFQEQDLIEREASYARSGFALQFMLDTSLSDQDKYPLKLSDLIVASLDPRMAPAKLVWCNDPDRVISDLPAVGLQGDRLHRPMWVANEMGEYTGTVMAIDPSGKGGDETAYAIVKILHGNLFLVASGGFKEGYSEATLKSLAVLGKTHNVNRVIVEANFGDGMFTQLLKPVFTRVHPVTIEEVKHSTQKERRICDVLEPVLNQHRLIVDAAVIKRDHEAEPHRQLFYQLTRITRDRGALINDDRLDALAIAVTYWVEHMARDTDKAADEHKAALLEQELRSFSEHIFGAPADSDLRWYNIG
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09465, DNA maturase B, phage(gi460042368), PHAGE_Pelagi_HTVC011P_NC_020482
Homolog/Ortholog E-Value: 0.0
ORF Start: 1848112
ORF Stop: 1849056
Strand: Backward
Protein Sequence: MWPTASEQSLLAATFFQTFNDELEKIRTARNRFVHYTSAETAYKIIENKEIWLRNAAVMNDFSEISYGIDCLSAARAGPAGIAFTNALNSIDPSIVTTVDDLFQKWQPYFQNDTFLASFAVHENDEDEIGRLSMWRAYGNVAIVLKGDVFTNPGPVPISGVTASPVAYFTAKEAESALWRMAAAIAANAGQLRLMKREFIVNSLHSALRFSILANKHPGFREEREWRIMHSPSHDPSPMQRLPVTIAGVPQQICKFKFPGQFGDKQFVEAIDRVIIGPAQHANVMRAAFQNLLASNGVSDALNRVAVSNIPLRQ
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_09470, DUF2971 domain-containing protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1849145
ORF Stop: 1849750
Strand: Backward
Protein Sequence: MADGWYWNFAVVAATLMGPILAVQAQRLVDLAREKRNRRVTIFRTLMATRASSLSPDHVEALNAIPIEFYGKSRVFKEVVEAWRMLLDHLGKEQTNMDLWVQRRQTLFVDLLLKLSAAVGYEFSRLELEREVYSPVGHSIVQGEQEVIRKGLYKLFAGETSLPLDIKSFPADEDVTAKQKVLQELLESVLKGDTPIVIKVG
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09475, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1850160
ORF Stop: 1850381
Strand: Forward
Protein Sequence: MAFITVEEVHSQQDRVIGTVFLNVEFIIKYESKSTEQPFTSKITYLTGSSISELIVMGAPVDITSKISNASRS
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09480, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1850686
ORF Stop: 1851435
Strand: Forward
Protein Sequence: MFKGTLIRSGNNAKTIKGDGEYETAIMYLAPFTMAGANVCPMAEQAGCVKGCLNTAGRGAYNNVQQARIAKTKRYLASRTAFMADLVTDLERFVAYCKRKGVKPAVRLNGTSDIQWEVAHYASRGDARGSVFELFPEVQFYDYTKVYKRAYRQLPANYALTLSYSAANPVYAEVVTKVAHETGANLAIVYRTKELRDYFVGKLVQYGDACRDVIDGDETDMRFLDPKGVIVGLYAKGKAKGDQSGFVVG
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09485, hypothetical protein, phage(gi712913236), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 6.89e-103
ORF Start: 1851543
ORF Stop: 1851755
Strand: Forward
Protein Sequence: MARTYYTLLQRVDDHWSPQFGAYDREDVESERDDYRDHGVKAKDLKIVTTQGHSWKAIEAVLNKLNGRAR
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09490, hypothetical protein, phage(gi712913238), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 9.61e-11
ORF Start: 1852009
ORF Stop: 1852302
Strand: Forward
Protein Sequence: MDNSILFNCCTDHVAPDWSQYDALELGGCVEAKCTLTNDTWTEGGYHRNDAEFFTVYGHLKEGGCEAITDWHGSFDEAVCTAEELARLSGLPLEVCC
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09495, hypothetical protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1853015
ORF Stop: 1853365
Strand: Forward
Protein Sequence: MNEVRQLQDEKALMSVQTFEVFLVIASKDGIPSSEIRKITGIPQPSVSRALGDLGEKAVRRDAEGLKLIKTERDPSDMRNVVCFLTPKGKLLAARIAQLMGINDTKVDGSFERNAQ
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09500, MarR family transcription regulator, phage(gi712915375), PHAGE_Synech_S_CBP2_NC_025455
Homolog/Ortholog E-Value: 5.70e-10
ORF Start: 1853540
ORF Stop: 1853624
Strand: Backward
Protein Sequence: AQLAELVDALGLGPSAERCGGSSPSLGTA
Homolog/Ortholog Species: Trna
Homolog/Ortholog Protein: tRNA
Homolog/Ortholog E-Value: N/A
ORF Start: 1853641
ORF Stop: 1854654
Strand: Forward
Protein Sequence: MPVKPRGASWQAAVSHKGTRLRKDFPTKLEAEIWEAETKAALLSGKEVVVKTAEPVMTLQQLFDLVAETRWRGTKGEKTALINGQHVVNILGPQRDVKTLCYEDSLTIKKTVTGWKRADATINRKLAAFSTMVKEAYKLGKIDKLFDIGLIKERNTRVRYYEDKELDQMLAWCDEMLEDELRDYIIVSLDTGFRQGEVLKITKRDAELEDLWTFDTKAGDKRDVPLTARAREVLLRRAKPLNDPDAKLFTQKPAWYREHWKSMQSDLGMTDDNNYVPHVLRHTFVTNMLLHTDIRTVQELAGHKRIETTMRYAKTSAERKRLAIKRMSDYQGAEIGA
Homolog/Ortholog Species: Integrase
Homolog/Ortholog Protein: ELH61_09510, putative integrase, phage(gi712913246), PHAGE_Mesorh_phagevB_MloP_Lo5R7ANS_NC_025431
Homolog/Ortholog E-Value: 5.15e-28
ORF Start: 1854766
ORF Stop: 1854777
Strand: Forward
Protein Sequence: CAGATTTAGGTT
Homolog/Ortholog Species: Attachment site
Homolog/Ortholog Protein: attR
Homolog/Ortholog E-Value: N/A
ORF Start: 1907906
ORF Stop: 1908691
Strand: Forward
Protein Sequence: MNLNTPAFSSFTHDGLQLAFFDEGDPAGVPVLLIHGFASTANVNWVHPGWLKTLGDAGYRVIAMDNRGHGASDKPHDAEAYRPWIMAGDAIALLDHLGIPEANVMGYSMGARISVFAALANPHRVRSLVLGGLGIGMTDGVGDWDPIADALLAPSLDAVTHARGRMFRAFAEQTKSDRVALADCIRGSRDLVARSDMAKLDMPTLIGVGTKDDIAGSPRELAALMQNAEALDIPGRDHMLAVGDRVFKQAVLTFYARVAHR
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09850, hydrolase, phage(gi764160985), PHAGE_Mycoba_Milly_NC_026598
Homolog/Ortholog E-Value: 4.24e-07
ORF Start: 1908820
ORF Stop: 1909653
Strand: Forward
Protein Sequence: MVAKTDIRAFDTGHPLKVMDPIWDSLREEARLAAERDPVLAAFLYSTVINYHSLEECVIHRICERLDHPDMQANLLRQTFEEMLLDWPDWSSILRVDIQAIYDRDPACLRFMEAVLYFKGFHALQTHRLAHWLLNRGRRDFALYLQSRSSSVFQTDINPAARIGKGIFLDHATGLVVGETAVIGDNVSILHGVTLGGTGKEGADRHPKIGSGVMIGAGAKILGNIEIGYCSRVAAGSVVLKAVPPKKTVAGVPAKVVGEAGCSEPSRNMDQVIGADI
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH61_09855, hypothetical protein, phage(gi100019), PHAGE_Strept_9871_NC_031069
Homolog/Ortholog E-Value: 9.67e-07
ORF Start: 1909794
ORF Stop: 1909997
Strand: Forward
Protein Sequence: MKPEEIKKLDAYFKRMFNPQMIVKARPRKDDSAEVYLGEEFLGVVYIDDEDGDRSYNFSMAILDVDL
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_09860, DUF3126 family protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1910272
ORF Stop: 1910643
Strand: Forward
Protein Sequence: MFNFEDANKKSKEAVDTALKTYTDTSKGFQAIAAEATEYSKKSFQDAVTHFETLAGVKSFEAAFELQTSYVKAYFEGFVSETTKLSEMYADLAKSAYKPYEAPIAAAVVKTAKSVSAATPAAA
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_09865, phasin family protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1910920
ORF Stop: 1911270
Strand: Forward
Protein Sequence: MIAKPIRMQNDSERNGDNANRTSVITRTKPKTKKPNLYRVLLLNDDYTPMEFVIHILERFFQKDRESATRIMLHVHNHGVGECGIFTYEVAETKVSQVMDFARQHQHPLQCVMEKK
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09870, putative DnaB helicase, phage(gi100317), PHAGE_Serrat_BF_NC_041917
Homolog/Ortholog E-Value: 2.14e-08
ORF Start: 1911281
ORF Stop: 1913773
Strand: Forward
Protein Sequence: MPTFSPSLEKALHQALTFANERHHEYATLEHLLLALIDDADAAAVMGACNVDLDALRKTLVEYVDNELSNLITGYDEDSKPTSGFQRVIQRAVIHVQSSGREEVTGANVLVAIFAERESHAAYFLQEQEMTRYDAVNYISHGIGKRPGVSEARPPRGAEDEAESSKPTARGGEEEGGPKKQQDALKAYCVNLNEKAKGGKIDPLIGRHAEVSRTIQILCRRSKNNPLYVGDPGVGKTAIAEGLAKRIVEGKVPEALADATIFSLDMGTLLAGTRYRGDFEERLKQVVKELEEYPGAVLFIDEIHTVIGAGATSGGAMDASNLLKPALSSGAIRCIGSTTYKEYRQFFEKDRALVRRFQKIDVSEPSIEDAIEIMKGLKPYFEEYHHLRYSNDAIKSAVELSARYISDRKLPDKAIDVIDETGAAQMLLPPSKRRKLITEKEIEATVATMARIPPKTVSKDDEAVLANLEKELRSVVYGQDIAIEALSTSIKLARAGLREPNKPIGAYVFSGPTGVGKTEVAKQLASSLGVELLRFDMSEYMERHTVSRLLGAPPGYVGFDQGGLLTDGVDQHPHCVVLLDEIEKAHPDIYNILLQVMDHGTLTDHNGKKIDFRNVILIMTTNAGASEMAKAAIGFGSSKRTGEDEEALTRLFTPEFRNRLDAIIPFAALPTAVIHKVVQKFIMQLEAQLSERNVTFDLHEDAIAWLAEKGYDEKMGARPLARVIQDTIKKPLANEILFGKLKKGGVVNVTVGPKEDGKPGIVLEAISETAPIKPKPEAEVVHPEGDDGDDGELKTKAARKTRAKAVPQAEPEVRDAPKKGSAVPKVPRKK
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09875, putative virion structural protein, phage(gi100292), PHAGE_Agroba_Atu_ph07_NC_042013
Homolog/Ortholog E-Value: 0.0
ORF Start: 1913869
ORF Stop: 1915239
Strand: Forward
Protein Sequence: MLKKPAPTQTALEMVTLDSLVPKDHVLRKIDAVIDFSFIHGRVAGLYCADNGRPPLDPTLMFKALFIGYLFGIRSERQLVREIEVNVAYRWFLQMKLTDGVFDASTLSQNRRRRFNDTSVAQDIFDHIVEQAIRHGLVDGTVLYTDSTHLKANANKGKYDLAMIEKSRSDYWADLDRAIEAERALHGQKPLKEKEREPEVKETKVSRTDPDSGYMVRDGKPKGFFYLDHRTVDGKLAIITDTHVTPSNVHDSIVYLDRLDRQRERFGFEVGAVGLDAGYATSGIAKGLEDRTILGVTGYRNPTPPRAGMMRKSKFGYEPETDGYRCPEGQLLAYATTDRNGYRHYRSDPAICRDCPLLASCTNNATATRTITRHVWADARQRTDANRLTPWGKAIYKRRKETVERSFADAKQLHGHRYARFRSLTRVSCQCLLAAAAQNIKKIAMALTTASKPAMA
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: ELH61_09880, putative transposase, phage(gi119443708), PHAGE_Staphy_Pvl108_NC_008689
Homolog/Ortholog E-Value: 1.02e-09
ORF Start: 1915522
ORF Stop: 1915878
Strand: Backward
Protein Sequence: MSLVAAHTHAFFNEVTQSGFVWTIRDEAGFPTSTNQSNEAAMPFWSSEIRARRIVDQVPAYRGFIPHKLPVEVFLDRWLQGLEQDNVRVGINWSGVRATGFDIAPADVRRRLHSASTR
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH61_09885, DUF2750 domain-containing protein
Homolog/Ortholog E-Value: N/A
ORF Start: 1916105
ORF Stop: 1916536
Strand: Backward
Protein Sequence: MTSPAAYDDSNIFAKILRGEIPSHRVYEDEHTVAFMDVMPQAPGHVLVVPKAASRNIFDADPATLTHAITVVQKVANAVKGVFDADGVFIAQFNEPAAGQTVFHLHFHVIPRHEGTALKPHSGKMEDGAVLAANAEKIRAALA
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH61_09890, gp4, phage(gi298103512), PHAGE_Strept_phiSASD1_NC_014229
Homolog/Ortholog E-Value: 1.01e-07
Terminase
Portal Protein
Coat Protein
Tail Shaft
Integrase
Phage-like Protein
Other
Transposase
Plate Protein
tRNA
Download data as .txt file: png_input file_download