Submission Results
Sequence Name: Not Available
GenBank Accession Number: SIMU01000003.1
GenInfo (GI) Number: 0000000000
Download Results: SIMU01000003.1.PHASTER.zip
gi|00000000|ref|SIMU01000003.1| Rhizobium leguminosarum strain SM95 plasmid pSM95_Rh02 623038, gc%: 60.95%
Download summary as .txt file: summary.txt
Total: 1 prophage region has been identified, of which 0 regions are intact, 1 region is incomplete, and 0 regions are questionable.
Region | Region Length | Completeness | Score | # Total Proteins | Region Position | Most Common Phage | GC % |
---|---|---|---|---|---|---|---|
1 | 8Kb | incomplete | 20 | 9 | 106598-114601 | PHAGE_Xantho_XcP1_NC_048147(1) | 58.97% |
>1 106598-114601 TCAGTCGATCCTCGTGATGAAGGCGCGGTCTTGATCGACGACGAGCCCTACATTCGCGCC AGGCGAAAGCGGGTTATCCGCCATGAACCGCAAAATGACGCCGTTAACCTCGGCCTCGAC AAGGTAGCGATCTCCTTTGAACACCACATGACGGATCGTCGCGTTCAGGCCATCCGTGCC AATACGGACATGCTCCGGCCGAATGGCGAGCACTGCCTTATCCCCTTTCGCCAATCCCTC CACTGCCGGGCATTGCACGATACCGGTGGCCGTATCGATCGCAGCCGATCCGGAGCCCCC GCCGGCATCGACGATTTCGCCTGGAATGAGTGCCGCGGCGCCGATGAAGTCAGCAACGAA AGCGTTGGCAGGACGGGCATAGATTGTCAGCGGCGCAGCCCTTTGAACGATGCGCCCCTT GTTCATGACAACGACGATGTCGGAGAGCGCAAAGGCCTCCTCCTGATCGTGCGTGACGTA GACGAAGGCAATACCGAGACGCCGCTGCAAATCCTTCAGTTCGATTTGCAAATGCCCGCG CATCTTTTTGTCCAGTGCAGACAAGGGCTCGTCGAGAAGCAGCAGATGCGGCTCGGCGAT GATGGCGCGGGCCACCGCGACCCGCTGCTGCTGGCCGCCAGAGAGCTGGTGTGGATACCG ATCGCCGAACTCCGCCATATGCACGGCGTCCAGAGCCCTTTTGACGCGCGCGGCCACATC GACATTGCTTTGGCGCCGCAGTGTCAGCGAGAACGCGACATTTCCGAATACGGTGAGGTG CGGAAACAGCGCGTAGCTCTGGAATACGGTGTTGACCGGACGTCGCTCCGGCGGAACGCC GGAAAGCGACCGTCCGTCAAGCACGATCGATCCCGCGTCCGGCGTCTCGAAGCCGGCGAT CATGCGCAAGATCGTCGTTTTCCCGCATCCGGAAGGGCCGAGAAAGGTAGTGAAAGCCCT TTCCGGAATATCGAGATCGACGCTATCGACCGCGCGGTCCCTGCCGTAATTCTTGGTCAG ACCGCGCGCCGAAAGCAAAAGCCCGGATCTTGACGGATCGGCCGCCATGTCAGGCGTCGT CGTCATGATCAGCTTGCCTTGACTTCATTCCAGATCTTCAGCCACTCGGAATAATTCTCC GGGGCGACCGGCCACATGAACGACTTCATCACATCCATGTCGTCGACGAAGATCGCTTCC TGCTGCTGCTTCGACAATTCGTCGCGGATGATCGAGGATGTGGTCGCATAGTTGCCGATC TTAGCAATCTGGGTCGCGAATTCCGCCCCAAGCAGATAATCGATGAACTTGTAGCCGATC TCGCTCTTTTCCGCAGACAGCGTAGCCGGCATGGCAAAGCAGTCGCACCAGCCCATAATG CCGGCTTTCGGCTTGGCCATCCCCATCGCCAGCTTGTCTTTCAACGCGTCATAGGGAACC CGCCACGAAAACGCGCAGGAAACCTCGCCGGTGGCAAACAGATTGGTGAGATCGCCGATC GTCTGCCAATAGGTGCGCAGCAGCGGCTTCTGCGCAACCAAAAGCTTCTTGGCTTCTGCC AGCTCCTTGGTCTCCATGAGAAAGACCTTGTCGCGCGGCACGCCGGCGACCAGACCGGCG ATGCCAATGGATTCCAGCGCATAGTCGCGCATGGCGAGCTGGCCCTTGTATTTCTCGTTG AACAACGTCGTGTAGTCAGGCTCGCTGTCGAACTTGTCCTTTCGGTATACGATTGGATTG AGGCCCCAGAGATAGGGAACCGCGAACTGCCTGCCCGAGGCATCGAGAACCTTGGGCGTG GTCTTGAAGACGTCGTACATCTTGGCGGCGTTGGGGATCTTCGCCATGTCGATTTCTTCA AGCAGGCCTGCCTTGATATAGCGCCAGCTTCCATTCAGCGACGGATTGATCATGTCCCAG 
TCGGAAGCCGTGCCGGTTTTGAGCGCCGCGAACTGCGCGTCCTCGCTGGAAAGATAGGAT AGCTTTACCTTGACGCCGGTCTGCGCCTCGAAGGCCGCAATGTATTCAGGGTGCCCGTTG GAATCCCAGGTCGCCCAAACCATTTCGGAAACGGCAGCTTTGGCACTGCTCAGTGAACCG AATATCCCCGCCGCCGCGCCGGCGGCGATCCCTGCAAGCACTTCACGTCGGCTAACTCCT GAACCTGCCATCATGCCCTCATCTGGTTGCGAGTGAACTGCGACCAATGTCGTTTGTTTG GACAGGAAAGCCTATAACCGAACCCAAATAGACTATATTGCACTGCGCAAAAGCCCGGCG ACCTCCGTCGCGATCCCGCGAGATTGGTTTAAACGTAGACCGGCGCTCCGGATCGCAGGA ATGTGATCGGTACCGACTCCGTCGGATCGAAGACGCGGACGCCGGCCGCAACCGACTTGA TCATCTGCCGGAGCCAGACGTGATCGGGAGACTGGTGCTTGCACTCATGCCAGAGCATGT AGAACTGCATCTGTCCGAGTTCGCGCGGTGCCTCGAGAACGACGAGCGGAAAGGCCTGAG CGATCTGCTCGGCAAAAATTCGGCCCGTGGTGAACACGAGGTCCGATTGTGTCAGCACGT AGGGCGTGATCGCATATTCTGGCACGGATACAGCGATGTTGCGCTTGAGCCCGAGTTCGA TCAGCCGCCCGTCGATCGGGCTGAGATGGGCCCGCTTGTCGGAAGTCGGAGAAAGATGCT TTTCCTTCAGGAACTCGCCCATCGAGATCGGCTCACGTTGACTTGCCAGTCTATGGTTCG GGCGGACGACGCAGACAATGTCGGTGGTTAACAGGGGAGACATCCGCAGATACTCCGGCG GATGCGGCCAATTGCCGATGACCGCATCGATGGACCCGTCCGCAAGCTGGCTGATAAGAT CATCATAGCTCGGCATGTGGGACGCATCGACGCCGACATTGGGTGCCTTTTGTGCAATTT TTCCGACCAGCTGAGGCAGAAACACGGCTCCGAGGCAATTGTCTGCGATGATGCGGAAGT TTCGCGTCGCATTGTCCGGGTCGAAGGTAGCGTGAGGCGACAGATGCGCGTCGATCTGGC CGAGAATGTCGCGCAACGTCTCCTTCAGGGCGAGACCTCGCTCCGTTGGTACCAGCGCCC CTCCGGACCTGACCAGAAGCGGATCGTCCAGCATTTCCCGCAGCCGTTTCAGCGTCAGGC TGACCGTCGGCTGGGCCTGCCCCAGAATGTCGGCGGTCTTTGAAACGCTGCATTCCGTCA GCAAGGTCAGGAGCGTGCGCATCAGACGCACATCCATCATGCCATTCGCCTGTCGCATGA TGATCAGCCTTCCCTTGTCGAACGGTGCAACAAGCTTGAGAAGAGGCATTTTTCATGCCA GCCGGGGAGCTCGCTGGTCATGCCTGCAATTACTGCGATCAAGAGCATTGAAAGTTGTCG GGCAGAGGATCGCCCGATTTGGAGGTATTCCCGCACGGGGAATTTGCGCCCACTACCGTG TCCGGGGCTACTCCAATATATCCGGTAACGCGTGTTGTTCGAGGCGCGCCGCAACATTCA CGCCGTCGCCGAACACATCGTCCTCGGTGATCATGACCTCGGCGTAGTTTATACCAATTC GAATTTTAAGAGGATCGGTGCCGCTGGCGGCTACAGCCTCGCTGGCCAGCAGATGCTGTA CCTGCAACGCCGTCGAGATGGCGACGGCCGGGCTTTCGAATAGCGAAAGTAGGCCGTCAC CCAGAACTTTGATTGTCCGACCGCCACTTGTAGCCGCAAAGTTGAAGTGTCCTGTTTCTG CAAAGTTGGAATGTCACTCTCCCCCCATCTGCCAAGCCGGACCCATATGTACAATTGGCA 
TCGTCCGCACGGCAGCCTGAAATCAAAAACACCCATCAGTCGCCTGGGCCTCAGCCGAGA CAACCTCTTGAGCATCCACAGCTAGAGCGTGTCAGAGCGTGTCGATCAGCCCTCCGTCCA CACGCATCGAAGCGCCGGTCGTTGCGGATGCCAAAGGTGAGGCCAGATAGGTGACGAGAT TGGCGACTTCCTCGACGCTCGCAGCTCGTTGGATGATCGAACCGCTGCGGTGCTTCTTGA CGAAGTCCGCTGCGACCTCCTCGATCGGTTTCCCCGTCTTTGCACGCTCCTCGGCAAGCA TCGCCTCGACGCCTTCGGACAGCGTGGGGCCCGGAAGGACGGAGTTCACGGTGACGCCCG TGCCAGCCATGCGCTTGGCAAGGCCGCGGGCAACCGCGATGTCTGCGGTCTTGCTGACGC CGTAGTGGATCATCTCGACCGGAATGTTGAAGCCGGATTCCGACGCGATGAAGATAACGC GGCCCCAGTCGAGTTTCTGCATCCCGGGAAGATATGCGCGCGACAGCCTGACGGCGGACA TGACGTTCACCTGCCAGTGGCGGTCCCACACTTCGTCGTCCGCCTCGAAGAAGTCGAGGG GCTGAAAGATGCCCGCGTTGTTGATGAGGATGTCTACATGAGGAACTTTGGCGACGAGAG CGTCACATCCTTCAGCGGTTGCGAGATCGGCTGCCACGGCGGCTACGCTTCCCTTGGCAC CTTCGCCTTTCAGGCGATCGGCCGCCTTGGCGGTCTTTTCTTCGGACCGGCCATTGACCA CCACGTTCGCGCCCGCCCTCGCGAGCTGGCGGACGATCGCGTAGCCAATGCCTTCGGTGG ATCCGGTAACTAGAGCGGTTTTGCCTGTGAGGTCGATCTGCATCTGCTGCTCCTGTTGAG GCTCGATCACGGAACCTATGGATTGGCGACCGGTCTCGCAACAGGCAAATATCCGAGGCG GTTTGCATAGTCACACCCCGGCGGGTGCCCGCGGCCCTTGAATCGGCCTGTAAGCACCAA TACGCCGACCAAGTGGGGATCTAAGCAAGGATTTTGTTTGTGACTTGGCGGACAGAACAG CAGGGCGCAAGCCGGATTGTTCTGCCTTCTGCTCTGGCCCCGGATAGCTATAATCCCCGG GCGGACGGGTCGGTTTCAAATGGACTACACGTTACGCCGCCAATTGCCAGGCGTTGCGCC AACGACACTTGCGAAAACCCTTGTGAAATGACTTTGATCGGCGAAGCCGCAGGCGATCGC CACTTCGGCGAGCGGCGCATTCGAGGTGCGCAGCAGGGTGCGGGCTCGATCTATTCGTTG GCTGAGAAGCCATTGATAAGGCGTCATGCCCGTCGTCTCGCGAAAGGCGCGAATGAAATA GCCGCGAGACAGATTGCAGGCCTGCGCCACTTGCTCGATCGAGATGTCGCCGTCAAGGTT TTCGAGAAGCAGACTTTTCGCCAGATGCTCGTGCGAGCGCGACAGGCTTCGTGATCTGTT GGGGGTTGCAACGGACCGGCCGCCATAGCGTTGAACCAGGTAAGTCCCGATGGCAGTGGT CATCTGGTCGATGAACAGCGCGCTTGCCTCCTCCGGTTTTTCGAGCGCCGGAATCAGCGC CCGAGCGAGGTTGGCAAGCACGATGTCTTTCGAGGCGGTTTCGGCTGCAAGCGAGGTGAT GCCGGAAAGTTCCGCACCATCGGCAATCCTGGTGAGAGCCGCAGGTGAGATTTCGAACAG GAGAAAATCGAAAGAACCGCTGAGATCTGCTTTGTAGGCCTCGGACAGATCGCGAATGTA GATCGAGTTCTCCGCAAAGTCGTGGGTCGTCACATGATGTTGGTGATGGATGCGCCGCGT GTGACCGCCGATCGAAGAGACACCGACAACGAACCCTCGGTCGCTAGCCGATGTCATGAC 
CTGATCGAGCCGCGCGACGCTTGTTCCCTTGCGAAAGACCGCAATATCGGAACCTGCCAG TGCCTGGTTGATCGACAACTGTTTTAGAGGACATCCGAAGGTGTCGGTTGTCGTTGCGCC GAGTGATTTCACGTGACTGTCGGACATAGGAACCAAGACGCCCAGCTTTCGTTTTTGGAA GACCAATCTCGCGGCTCAGCCCTTCAGCGCAGCCTTCTGGAAATTCGCCGAATCTCGTCG ACGAGGATGCTTTTGTGCAAGATAGATGCTTTGAAGATGTATAGGAACTCTCAAATCGTG GCATCGCAATGGTCGGGCTATACCGTTCCCAGCCCGGAGGATCGAGACACATTTGAGTGA AGCGTTGGCGGTCAGCGACAGGATTGCAACATGTTTTGGCATCGGTGCAGCGAGGGCCTT GGTCATCAAGCCTCTTCGCGAGGCGAAGCTCTCTGTCGTTCATCTTGACCACGTTTATGG AGATGCGGAGCATCCGGTCTTCCTCCCGGCCGATGATGCCTTCCTGCTCATGCTCTATCT CGTCGATGTCGATCATCGCGACATCCGTCCGGACCAGACCGTCGCACCCTTGAAGACCTA TCCCAAAGGTTCTGTCTGTCTCATCAGTCTGAGGCACGGAGCCGCAATCTCGATTCGCGG CCATTTCGAAGCACTTGCCTTTCATATCCCGAACTCGCACTTTGCCGAACTCGCCGAGGA GGCAGGCGAACCGCGCGTCGATGACCTTGCGACCTGCCGAGGAATTGACGATCAGGTGAT CCGCAATATTGGTGGAGCACTGATGCCGATGTTCGACATGCCGGACGAAGTCAGGGATCA GCTGCTTCCCCATATAGGGCTGGCGTTAAACGCCCATCTCGCGCATCGTTACGGACGCTC ACCCGCACAACGGCTGTCGGCCAGCGGTCGGCTGTCTCCCATGCAGGAAAAGCGCATCAA GACGTACATGGCCGCAAACCTGTCGGCCAACATGACCGTCGATCAGATTGCCGAGGCCAC CGGTTTTTCAGTGGATGAGCTGCGTTCAGGCTTCTTGAATACAACCGGGCAGTCTGTCGC CGAATGGATGTCGGCGTACCGGATGACCAGGGCCCAAGCACAGCTCAGCAGAACCGGTGA TCCTATCGCGCAGGTGGCTGCGACGTGCGGTTTTGCTGATGAGGACACATTCATCGACGC CTTCTCGAAGACTGTGGGAGTGGCTCCGACGGAGTGGCGCTCACGCAATCGGCACTAGGA GAGTTGAAGCTGGAGCGGCGCTGCACCGCAATCTTTCCCCTGATCTCGGTTTCAATGAAA AGCGCCGATAACGTCTTGGTGAAAACAGCTTCAGACGCGTTGTGCGCGGTCCGGATGCGG CACTAAAGCCGCACCCGGACGGACCAATCAGGCTTCGACGAACGCCAGGAGATCGGCGTT CAGGACATCGGCATGGGTGGTCAGCATGCCGTGCGGATAGCCCTTGTAGACTTTCAAAGT GCCATTCTGCACCAGCGTTGCCGACAACCGGCCGGCATTGTCGATCGGCACGACCTGATC GTCGTCGCCGTGCATGACGAGGGTCGGAACGGTGATCGCCTTCAAGTCTTGCGTCTGGTC GGTTTCGGAGAAGGCCTTGATCCCATCGTAATGGGCCTTGGCGCTGCCCATCATGCCCTG CCGCCACCAATTCTGGATGACGCCCGGGTAGACCTTCGCGTCGGACCGGTTGAAGCCATA GAACGGACCGGTGGGGAAGTCGATGAACAGCTGCGCGCGGTTATCGGCGACGCCCTTGCG GATGCCGTCGAAGACTTCCATCGGCAAGCCGCCCGGATTTGTCTCGGTCTTCAGCATCAG CGGCGGAACGGCGGAAACCAGAACGGCCTTGGCGACGCGCCCGGTCGGCTGGCCGTATTT 
TGCAACGTAGCGGGCGACTTCTCCCCCGCCGGTCGAATGGCCGATATGGACGGCATTCTT CAGATCGAGCGCTTCGACGACAGCGAAGGCGTCGGCGGCATAGTGGTCCATGTCGTGTCC GTCGGAAACCTGTGCCGAGCGGCCGTGACCACGACGGTCGTGGGCGACGACGCGATAACC CTTCGACAGGAAGAACAGCATCTGTGCGTCCCAGTCGTCCGACGAGAGCGGCCAGCCATG GTGGAAGACGATCGGCTGGGCATCCTTGCGCCCCCAATCCTTATAAAAGATCTCGACGCC GTCCTTGGTGGTGATGTAGCCCAT
Region | 1 |
Region Length | 8Kb |
Completeness(score) | incomplete(20) |
Specific Keyword | transposase |
Region Position | 106598-114601 |
# tRNA | 0 |
# Total Proteins | 9 |
# Phage Hit Proteins | 6 |
# Hypothetical Proteins | 0 |
Phage + Hypothetical Protein % | 66.6% |
# Bacterial Proteins | 3 |
Attachment Site | no |
# Phage Species | 5 |
Most Common Phage Name(hit genes count) | PHAGE_Xantho_XcP1_NC_048147(1) PHAGE_Bacill_SPbeta_NC_001884(1) PHAGE_Mycoba_Heldan_NC_042328(1) PHAGE_Mycoba_Bactobuster_NC_031279(1) PHAGE_Mycoba_Larenn_NC_028877(1) PHAGE_Bordet_vB_BbrM_PHB04_NC_047861(1) PHAGE_Bacill_G_NC_023719(1) PHAGE_Mycoba_Goose_NC_042340(1) PHAGE_Plankt_PaV_LD_NC_016564(1) |
First Most Common Phage # | 2 |
First Most Common Phage % | 11.11% |
GC % | 58.97% |
Region: | The number assigned to the region. |
Region Length: | The length of the sequence of that region (in bp). |
Completeness: | A prediction of whether the region contains an intact, questionable, or incomplete prophage, based on the scoring criteria below. |
Specific Keyword: | The specific phage-related keyword(s) found in protein name(s) in the region. |
Region Position: | The start and end positions of the region on the bacterial chromosome. |
# tRNA: | The number of tRNA genes present in the region. |
# Total Proteins: | The number of ORFs present in the region. |
# Phage Hit Proteins: | The number of proteins in the region with matches in the phage protein database. |
# Hypothetical Proteins: | The number of hypothetical proteins in the region without a match in the database. |
Phage + Hypothetical Protein %: | The combined percentage of phage proteins and hypothetical proteins in the region. |
# Bacterial Proteins: | The number of proteins in the region with matches in the nrfilt database. |
Attachment Site: | The putative phage attachment site. |
# Phage Species: | The number of different phages that have similar proteins to those in the region. |
Most Common Phage: | The phage(s) with the highest number of proteins most similar to those in the region. |
First Most Common Phage #: | The highest number of proteins in a phage most similar to those in the region. |
First Most Common Phage %: | The percentage of proteins in # Phage Hit Proteins that are most similar to the Most Common Phage proteins. |
GC %: | The percentage of GC nucleotides of the region. |
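As a quick consistency check, several of the fields defined above can be recomputed from the values reported for region 1. The sketch below assumes region coordinates are 1-based and inclusive; the function names are illustrative only and are not part of PHASTER.

```python
def region_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def phage_plus_hypothetical_pct(phage_hits: int, hypothetical: int, total: int) -> float:
    """Combined percentage of phage-hit and hypothetical proteins in a region."""
    return 100.0 * (phage_hits + hypothetical) / total


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        return 0.0
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the region 1 rows of this report.
length_bp = region_length(106598, 114601)
combined_pct = phage_plus_hypothetical_pct(phage_hits=6, hypothetical=0, total=9)

print(length_bp)               # 8004
print(round(combined_pct, 2))  # 66.67
```

The 8004 bp result agrees with the reported 8Kb region length, and 66.67 is close to the reported 66.6%; the small difference looks like truncation in the report, though that is a guess. `gc_percent` can be applied to the region's nucleotide sequence to compare against the reported GC %.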
Questionable (score 70-90)
Incomplete (score < 70)
Score: | The score of the region, based on the scoring criteria below. |
Criteria for scoring prophage regions (as intact, questionable, or incomplete):
Method 1:
- If the number of hits to a particular phage organism in this table is equal to or greater than 100% of the total number of CDS in the region, the region is given a total score of 150. If it is less than 100%, methods 2 and 3 are used.
Method 2:
- If the number of hits to a particular phage organism in this table is more than 50% of the total number of CDS in the region, that phage organism is considered the major potential phage for the region. The percentage of the region's proteins that hit that phage organism is calculated and multiplied by 100; the percentage of the length of that phage organism relative to the length of the region is calculated and multiplied by 50 (the phage head's encapsulation capability is taken into account).
Method 3:
- If any of the specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin') are present, the score is increased by 10 for each keyword found.
- If the size of the region is greater than 30 Kb, the score is increased by 10.
- If there are at least 40 proteins in the region, the score is increased by 10.
- If the phage-related proteins and hypothetical proteins together constitute more than 70% of the total number of proteins in the region, the score is increased by 10.
If the region's total score is less than 70, it is marked as incomplete; if it is between 70 and 90, it is marked as questionable; if it is greater than 90, it is marked as intact.
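The thresholds and the keyword, size, protein-count and percentage increments above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not PHASTER's implementation: the function names are hypothetical, keywords are matched as case-insensitive substrings of protein names and counted once each, and 1 Kb is taken as 1000 bp. It does not cover the 100% and 50% phage-organism rules or how the methods are combined, so it should not be expected to reproduce the scores reported on this page.

```python
from typing import Iterable

# Keywords listed in the scoring criteria above.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber",
    "coat", "transposase", "portal", "terminase", "protease", "lysin",
)


def classify_region(score: float) -> str:
    """Map a total score to a completeness label using the stated thresholds."""
    if score < 70:
        return "incomplete"
    if score <= 90:
        return "questionable"
    return "intact"


def keyword_and_size_increments(
    protein_names: Iterable[str],
    region_length_bp: int,
    total_proteins: int,
    phage_plus_hypothetical_pct: float,
) -> int:
    """Sum the +10 increments; substring matching is an assumption."""
    text = " ".join(protein_names).lower()
    score = 10 * sum(1 for keyword in PHAGE_KEYWORDS if keyword in text)
    if region_length_bp > 30_000:        # assumes 1 Kb = 1000 bp
        score += 10
    if total_proteins >= 40:
        score += 10
    if phage_plus_hypothetical_pct > 70:
        score += 10
    return score


# Region 1 on this page has a reported score of 20.
print(classify_region(20))  # incomplete

# A made-up region (not from this report) to exercise every increment.
hypothetical_score = keyword_and_size_increments(
    ["integrase", "tail fiber protein", "hypothetical protein"],
    region_length_bp=35_000,
    total_proteins=45,
    phage_plus_hypothetical_pct=80.0,
)
print(hypothetical_score, classify_region(hypothetical_score))  # 60 incomplete
```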
Download details as .txt file: detail.txt
Hits against Bacterial Database or GenBank File
Region 1, total 9 CDS
# | CDS Position | BLAST Hit | E-Value |
---|---|---|---|
1 | complement(106598..107683) | PHAGE_Bacill_G_NC_023719: gp245; ELH43_31745; phage(gi593777701) | 9.57e-32 |
2 | complement(107686..108738) | extracellular solute-binding protein; ELH43_31750 | 0.0 |
3 | complement(108906..109946) | LysR family transcriptional regulator; ELH43_31755 | 0.0 |
4 | complement(110097..110345) | adenylate/guanylate cyclase domain-containing protein; ELH43_31760 | 0.0 |
5 | 110421..110522 | PROPHAGE_Brucel_1330: ISBm3, transposase, programmed frameshift; ELH43_31765; phage(gi52430457) | 1.96e-13 |
6 | complement(110529..111320) | PHAGE_Xantho_XcP1_NC_048147: primase; ELH43_31770; phage(gi100047) | 7.36e-06 |
7 | complement(111602..112435) | PHAGE_Bordet_vB_BbrM_PHB04_NC_047861: exonuclease; ELH43_31775; phage(gi100048) | 1.67e-18 |
8 | 112722..113615 | PHAGE_Bordet_vB_BbrM_PHB04_NC_047861: exonuclease; ELH43_31780; phage(gi100048) | 1.51e-09 |
9 | complement(113765..114601) | PHAGE_Mycoba_Goose_NC_042340: hypothetical protein; ELH43_31785; phage(gi100061) | 1.09e-12 |
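The CDS Position and BLAST Hit columns can be read programmatically. The sketch below parses the two position forms that appear in this table (`start..end` and `complement(start..end)`) and tallies phage hits. It assumes that a phage hit is any BLAST Hit label beginning with `PHAGE_` or `PROPHAGE_`, and that coordinates are 1-based and inclusive; the names `CdsLocation`, `parse_cds_position` and `is_phage_hit` are illustrative, not part of PHASTER.

```python
import re
from typing import NamedTuple


class CdsLocation(NamedTuple):
    start: int
    end: int
    strand: str  # "+" for forward, "-" for complement

    @property
    def length(self) -> int:
        """Length in bp, assuming inclusive coordinates."""
        return self.end - self.start + 1


# Only the two location forms seen in this table are handled.
_LOCATION = re.compile(r"(?:complement\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))")


def parse_cds_position(text: str) -> CdsLocation:
    match = _LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised CDS position: {text!r}")
    if match.group(1) is not None:
        return CdsLocation(int(match.group(1)), int(match.group(2)), "-")
    return CdsLocation(int(match.group(3)), int(match.group(4)), "+")


def is_phage_hit(blast_hit: str) -> bool:
    """Assumption: phage hits are labelled with a PHAGE_ or PROPHAGE_ prefix."""
    return blast_hit.startswith(("PHAGE_", "PROPHAGE_"))


# (CDS Position, start of BLAST Hit) for the nine rows of region 1.
ROWS = [
    ("complement(106598..107683)", "PHAGE_Bacill_G_NC_023719: gp245"),
    ("complement(107686..108738)", "extracellular solute-binding protein"),
    ("complement(108906..109946)", "LysR family transcriptional regulator"),
    ("complement(110097..110345)", "adenylate/guanylate cyclase domain-containing protein"),
    ("110421..110522", "PROPHAGE_Brucel_1330: ISBm3, transposase, programmed frameshift"),
    ("complement(110529..111320)", "PHAGE_Xantho_XcP1_NC_048147: primase"),
    ("complement(111602..112435)", "PHAGE_Bordet_vB_BbrM_PHB04_NC_047861: exonuclease"),
    ("112722..113615", "PHAGE_Bordet_vB_BbrM_PHB04_NC_047861: exonuclease"),
    ("complement(113765..114601)", "PHAGE_Mycoba_Goose_NC_042340: hypothetical protein"),
]

locations = [parse_cds_position(position) for position, _ in ROWS]
phage_hits = sum(is_phage_hit(hit) for _, hit in ROWS)

print(phage_hits, len(ROWS) - phage_hits)         # 6 3
print(all(loc.length % 3 == 0 for loc in locations))  # True
```

Under these assumptions the tallies match the reported # Phage Hit Proteins (6) and # Bacterial Proteins (3), and every CDS length is a whole number of codons.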
>complement(106598..107683) Rhizobium leguminosarum
MTTTPDMAADPSRSGLLLSARGLTKNYGRDRAVDSVDLDIPERAFTTFLGPSGCGKTTILRMIAGFETPDAGSIVLDGRSLSGVPPERRPVNTVFQSYALFPHLTVFGNVAFSLTLRRQSNVDVAARVKRALDAVHMAEFGDRYPHQLSGGQQQRVAVARAIIAEPHLLLLDEPLSALDKKMRGHLQIELKDLQRRLGIAFVYVTHDQEEAFALSDIVVVMNKGRIVQRAAPLTIYARPANAFVADFIGAAALIPGEIVDAGGGSGSAAIDTATGIVQCPAVEGLAKGDKAVLAIRPEHVRIGTDGLNATIRHVVFKGDRYLVEAEVNGVILRFMADNPLSPGANVGLVVDQDRAFITRID
>complement(107686..108738) Rhizobium leguminosarum
MLAGIAAGAAAGIFGSLSSAKAAVSEMVWATWDSNGHPEYIAAFEAQTGVKVKLSYLSSEDAQFAALKTGTASDWDMINPSLNGSWRYIKAGLLEEIDMAKIPNAAKMYDVFKTTPKVLDASGRQFAVPYLWGLNPIVYRKDKFDSEPDYTTLFNEKYKGQLAMRDYALESIGIAGLVAGVPRDKVFLMETKELAEAKKLLVAQKPLLRTYWQTIGDLTNLFATGEVSCAFSWRVPYDALKDKLAMGMAKPKAGIMGWCDCFAMPATLSAEKSEIGYKFIDYLLGAEFATQIAKIGNYATTSSIIRDELSKQQQEAIFVDDMDVMKSFMWPVAPENYSEWLKIWNEVKAS
>complement(108906..109946) Rhizobium leguminosarum
MPLLKLVAPFDKGRLIIMRQANGMMDVRLMRTLLTLLTECSVSKTADILGQAQPTVSLTLKRLREMLDDPLLVRSGGALVPTERGLALKETLRDILGQIDAHLSPHATFDPDNATRNFRIIADNCLGAVFLPQLVGKIAQKAPNVGVDASHMPSYDDLISQLADGSIDAVIGNWPHPPEYLRMSPLLTTDIVCVVRPNHRLASQREPISMGEFLKEKHLSPTSDKRAHLSPIDGRLIELGLKRNIAVSVPEYAITPYVLTQSDLVFTTGRIFAEQIAQAFPLVVLEAPRELGQMQFYMLWHECKHQSPDHVWLRQMIKSVAAGVRVFDPTESVPITFLRSGAPVYV
>complement(110097..110345) Rhizobium leguminosarum
SGGRTIKVLGDGLLSLFESPAVAISTALQVQHLLASEAVAASGTDPLKIRIGINYAEVMITEDDVFGDGVNVAARLEQHALPD
>110421..110522 Rhizobium leguminosarum
HMYNWHRPHGSLKSKTPISRLGLSRDNLLSIHS
>complement(110529..111320) Rhizobium leguminosarum
MQIDLTGKTALVTGSTEGIGYAIVRQLARAGANVVVNGRSEEKTAKAADRLKGEGAKGSVAAVAADLATAEGCDALVAKVPHVDILINNAGIFQPLDFFEADDEVWDRHWQVNVMSAVRLSRAYLPGMQKLDWGRVIFIASESGFNIPVEMIHYGVSKTADIAVARGLAKRMAGTGVTVNSVLPGPTLSEGVEAMLAEERAKTGKPIEEVAADFVKKHRSGSIIQRAASVEEVANLVTYLASPLASATTGASMRVDGGLIDTL
>complement(111602..112435) Rhizobium leguminosarum
MSINQALAGSDIAVFRKGTSVARLDQVMTSASDRGFVVGVSSIGGHTRRIHHQHHVTTHDFAENSIYIRDLSEAYKADLSGSFDFLLFEISPAALTRIADGAELSGITSLAAETASKDIVLANLARALIPALEKPEEASALFIDQMTTAIGTYLVQRYGGRSVATPNRSRSLSRSHEHLAKSLLLENLDGDISIEQVAQACNLSRGYFIRAFRETTGMTPYQWLLSQRIDRARTLLRTSNAPLAEVAIACGFADQSHFTRVFASVVGATPGNWRRNV
>112722..113615 Rhizobium leguminosarum
MAVSDRIATCFGIGAARALVIKPLREAKLSVVHLDHVYGDAEHPVFLPADDAFLLMLYLVDVDHRDIRPDQTVAPLKTYPKGSVCLISLRHGAAISIRGHFEALAFHIPNSHFAELAEEAGEPRVDDLATCRGIDDQVIRNIGGALMPMFDMPDEVRDQLLPHIGLALNAHLAHRYGRSPAQRLSASGRLSPMQEKRIKTYMAANLSANMTVDQIAEATGFSVDELRSGFLNTTGQSVAEWMSAYRMTRAQAQLSRTGDPIAQVAATCGFADEDTFIDAFSKTVGVAPTEWRSRNRH
>complement(113765..114601) Rhizobium leguminosarum
MGYITTKDGVEIFYKDWGRKDAQPIVFHHGWPLSSDDWDAQMLFFLSKGYRVVAHDRRGHGRSAQVSDGHDMDHYAADAFAVVEALDLKNAVHIGHSTGGGEVARYVAKYGQPTGRVAKAVLVSAVPPLMLKTETNPGGLPMEVFDGIRKGVADNRAQLFIDFPTGPFYGFNRSDAKVYPGVIQNWWRQGMMGSAKAHYDGIKAFSETDQTQDLKAITVPTLVMHGDDDQVVPIDNAGRLSATLVQNGTLKVYKGYPHGMLTTHADVLNADLLAFVEA
ORF Start: 106598
ORF Stop: 107683
Strand: Backward
Protein Sequence: MTTTPDMAADPSRSGLLLSARGLTKNYGRDRAVDSVDLDIPERAFTTFLGPSGCGKTTILRMIAGFETPDAGSIVLDGRSLSGVPPERRPVNTVFQSYALFPHLTVFGNVAFSLTLRRQSNVDVAARVKRALDAVHMAEFGDRYPHQLSGGQQQRVAVARAIIAEPHLLLLDEPLSALDKKMRGHLQIELKDLQRRLGIAFVYVTHDQEEAFALSDIVVVMNKGRIVQRAAPLTIYARPANAFVADFIGAAALIPGEIVDAGGGSGSAAIDTATGIVQCPAVEGLAKGDKAVLAIRPEHVRIGTDGLNATIRHVVFKGDRYLVEAEVNGVILRFMADNPLSPGANVGLVVDQDRAFITRID
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH43_31745, gp245, phage(gi593777701), PHAGE_Bacill_G_NC_023719
Homolog/Ortholog E-Value: 9.57e-32
ORF Start: 107686
ORF Stop: 108738
Strand: Backward
Protein Sequence: MLAGIAAGAAAGIFGSLSSAKAAVSEMVWATWDSNGHPEYIAAFEAQTGVKVKLSYLSSEDAQFAALKTGTASDWDMINPSLNGSWRYIKAGLLEEIDMAKIPNAAKMYDVFKTTPKVLDASGRQFAVPYLWGLNPIVYRKDKFDSEPDYTTLFNEKYKGQLAMRDYALESIGIAGLVAGVPRDKVFLMETKELAEAKKLLVAQKPLLRTYWQTIGDLTNLFATGEVSCAFSWRVPYDALKDKLAMGMAKPKAGIMGWCDCFAMPATLSAEKSEIGYKFIDYLLGAEFATQIAKIGNYATTSSIIRDELSKQQQEAIFVDDMDVMKSFMWPVAPENYSEWLKIWNEVKAS
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH43_31750, extracellular solute-binding protein
Homolog/Ortholog E-Value: N/A
ORF Start: 108906
ORF Stop: 109946
Strand: Backward
Protein Sequence: MPLLKLVAPFDKGRLIIMRQANGMMDVRLMRTLLTLLTECSVSKTADILGQAQPTVSLTLKRLREMLDDPLLVRSGGALVPTERGLALKETLRDILGQIDAHLSPHATFDPDNATRNFRIIADNCLGAVFLPQLVGKIAQKAPNVGVDASHMPSYDDLISQLADGSIDAVIGNWPHPPEYLRMSPLLTTDIVCVVRPNHRLASQREPISMGEFLKEKHLSPTSDKRAHLSPIDGRLIELGLKRNIAVSVPEYAITPYVLTQSDLVFTTGRIFAEQIAQAFPLVVLEAPRELGQMQFYMLWHECKHQSPDHVWLRQMIKSVAAGVRVFDPTESVPITFLRSGAPVYV
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH43_31755, LysR family transcriptional regulator
Homolog/Ortholog E-Value: N/A
ORF Start: 110097
ORF Stop: 110345
Strand: Backward
Protein Sequence: SGGRTIKVLGDGLLSLFESPAVAISTALQVQHLLASEAVAASGTDPLKIRIGINYAEVMITEDDVFGDGVNVAARLEQHALPD
Homolog/Ortholog Species: Non phage-like protein
Homolog/Ortholog Protein: ELH43_31760, adenylate/guanylate cyclase domain-containing protein
Homolog/Ortholog E-Value: N/A
ORF Start: 110421
ORF Stop: 110522
Strand: Forward
Protein Sequence: HMYNWHRPHGSLKSKTPISRLGLSRDNLLSIHS
Homolog/Ortholog Species: Transposase
Homolog/Ortholog Protein: ELH43_31765, ISBm3, transposase, programmed frameshift, phage(gi52430457), PROPHAGE_Brucel_1330
Homolog/Ortholog E-Value: 1.96e-13
ORF Start: 110529
ORF Stop: 111320
Strand: Backward
Protein Sequence: MQIDLTGKTALVTGSTEGIGYAIVRQLARAGANVVVNGRSEEKTAKAADRLKGEGAKGSVAAVAADLATAEGCDALVAKVPHVDILINNAGIFQPLDFFEADDEVWDRHWQVNVMSAVRLSRAYLPGMQKLDWGRVIFIASESGFNIPVEMIHYGVSKTADIAVARGLAKRMAGTGVTVNSVLPGPTLSEGVEAMLAEERAKTGKPIEEVAADFVKKHRSGSIIQRAASVEEVANLVTYLASPLASATTGASMRVDGGLIDTL
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH43_31770, primase, phage(gi100047), PHAGE_Xantho_XcP1_NC_048147
Homolog/Ortholog E-Value: 7.36e-06
ORF Start: 111602
ORF Stop: 112435
Strand: Backward
Protein Sequence: MSINQALAGSDIAVFRKGTSVARLDQVMTSASDRGFVVGVSSIGGHTRRIHHQHHVTTHDFAENSIYIRDLSEAYKADLSGSFDFLLFEISPAALTRIADGAELSGITSLAAETASKDIVLANLARALIPALEKPEEASALFIDQMTTAIGTYLVQRYGGRSVATPNRSRSLSRSHEHLAKSLLLENLDGDISIEQVAQACNLSRGYFIRAFRETTGMTPYQWLLSQRIDRARTLLRTSNAPLAEVAIACGFADQSHFTRVFASVVGATPGNWRRNV
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH43_31775, exonuclease, phage(gi100048), PHAGE_Bordet_vB_BbrM_PHB04_NC_047861
Homolog/Ortholog E-Value: 1.67e-18
ORF Start: 112722
ORF Stop: 113615
Strand: Forward
Protein Sequence: MAVSDRIATCFGIGAARALVIKPLREAKLSVVHLDHVYGDAEHPVFLPADDAFLLMLYLVDVDHRDIRPDQTVAPLKTYPKGSVCLISLRHGAAISIRGHFEALAFHIPNSHFAELAEEAGEPRVDDLATCRGIDDQVIRNIGGALMPMFDMPDEVRDQLLPHIGLALNAHLAHRYGRSPAQRLSASGRLSPMQEKRIKTYMAANLSANMTVDQIAEATGFSVDELRSGFLNTTGQSVAEWMSAYRMTRAQAQLSRTGDPIAQVAATCGFADEDTFIDAFSKTVGVAPTEWRSRNRH
Homolog/Ortholog Species: Phage-like protein
Homolog/Ortholog Protein: ELH43_31780, exonuclease, phage(gi100048), PHAGE_Bordet_vB_BbrM_PHB04_NC_047861
Homolog/Ortholog E-Value: 1.51e-09
ORF Start: 113765
ORF Stop: 114601
Strand: Backward
Protein Sequence: MGYITTKDGVEIFYKDWGRKDAQPIVFHHGWPLSSDDWDAQMLFFLSKGYRVVAHDRRGHGRSAQVSDGHDMDHYAADAFAVVEALDLKNAVHIGHSTGGGEVARYVAKYGQPTGRVAKAVLVSAVPPLMLKTETNPGGLPMEVFDGIRKGVADNRAQLFIDFPTGPFYGFNRSDAKVYPGVIQNWWRQGMMGSAKAHYDGIKAFSETDQTQDLKAITVPTLVMHGDDDQVVPIDNAGRLSATLVQNGTLKVYKGYPHGMLTTHADVLNADLLAFVEA
Homolog/Ortholog Species: Hypothetical protein
Homolog/Ortholog Protein: ELH43_31785, hypothetical protein, phage(gi100061), PHAGE_Mycoba_Goose_NC_042340
Homolog/Ortholog E-Value: 1.09e-12
Genome viewer legend: Terminase, Portal Protein, Coat Protein, Tail Shaft, Integrase, Phage-like Protein, Other, Transposase, Plate Protein, tRNA
Download data as .txt file: png_input