Skip to main content

BADASS: BActeriocin-Diversity ASsessment Software



Bacteriocins are defined as thermolabile peptides produced by bacteria with biological activity against taxonomically related species. These antimicrobial peptides have a wide application including disease treatment, food conservation, and probiotics. However, even with a large industrial and biotechnological application potential, these peptides are still poorly studied and explored. BADASS is software with a user-friendly graphical interface applied to the search and analysis of bacteriocin diversity in whole-metagenome shotgun sequencing data.


The search for bacteriocin sequences is performed with tools such as BLAST or DIAMOND using the BAGEL4 database as a reference. The putative bacteriocin sequences identified are used to determine the abundance and richness of the three classes of bacteriocins. Abundance is calculated by comparing the reads identified as bacteriocins to the reads identified as 16S rRNA gene using SILVA database as a reference. BADASS has a complete pipeline that starts with the quality assessment of the raw data. At the end of the analysis, BADASS generates several plots of richness and abundance automatically as well as tabular files containing information about the main bacteriocins detected. The user is able to change the main parameters of the analysis in the graphical interface. To demonstrate how the software works, we used four datasets from WMS studies using default parameters. Lantibiotics were the most abundant bacteriocins in the four datasets. This class of bacteriocin is commonly produced by Streptomyces sp.


With a user-friendly graphical interface and a complete pipeline, BADASS proved to be a powerful tool for prospecting bacteriocin sequences in Whole-Metagenome Shotgun Sequencing (WMS) data. This tool is publicly available at


Characterization of bioactive molecules produced by free-living microorganisms has been very important in recent years because of their biotechnological applications. It is well known that the overwhelming majority of free-living microorganisms are not capable of being grown in laboratory conditions [1], which is a bottleneck to the identification and isolation of bioactive compounds. Thus, an alternative to search for new compounds is the well-established method of WMS, where the nucleic acids of the microbial community are extracted and sequenced directly from environmental samples [2]. Thus, genes involved in the synthesis of peptides or non-peptides bioactive compounds can be assessed. The main bottleneck lies in the development of user-friendly tools that allow the user to analyze a large amount of data in a simple and interactive way.

Several genes and molecules were prospected by WMS such as amylolytic or cellulolytic enzymes [3], antimicrobial compounds [4], antibiotic resistance genes (ARGs) [5,6,7] and bacteriocins [8]. Bacteriocins are small cationic thermostable bacterial peptides with a narrow spectrum of activity [9,10,11]. Unlike antibiotics, bacteriocins are produced by ribosomal activity and therefore are protease-sensitive. They are divided into three classes according to their synthesis mechanism.

Class I consists of peptides that after their translation, undergo structural changes. This class is also called lantibiotics. They have a molecular weight below 5 kDa and a size smaller than 28 amino acids [12, 13]. Class II is characterized by peptides that do not undergo post-translational modifications. They are larger than class I bacteriocins and have a molecular weight below 10 kDa [14]. Class III is composed of peptides with a molecular weight higher than 30 kDa. Bacteriocins of this class have a mechanism of action different from the other two classes, eliminating bacterial cells through cell wall hydrolysis [12, 15, 16].

A variety of software has been developed to search for ARGs or secondary metabolites such as nonribosomal peptide synthase (NRPS) or polyketide synthase (PKS) in WMS data [6, 17]. However, none of this software is focused on the prospecting of bacteriocins. Anti-SMASH [18] is an excellent tool to analyze genomes while RiPPER [19] works better for pan-genome data. BAGEL web tool (BActeriocin GEnome mining tooL) [17] is one of the first tools developed for the identification of peptides and bacteriocins in genome data. However, the tool has a maximum size for the input file, making difficult the analysis of WMS data.

In this article, we present BADASS software (BActeriocin-Diversity ASSessment Software), an automated pipeline with an intuitive graphical interface that allows users to analyze the diversity of bacteriocins using WMS raw data. Diversity measurement is based on the abundance and richness of the three bacteriocin classes currently described. The software is available at



The pipeline of BADASS (Fig. 1) starts with the automatic loading of bacteriocin sequences to the database. It is worth noting that this process needs to be executed only on the first use of the software. The input file consists of a WMS sequencing sample in FASTQ format. After saving the project the following steps are performed.

  1. (a)

    Quality assessment The user can choose to evaluate the raw data with a boxplot chart that correlates the Phred score of a base (y axis) with base position (x axis). This is an optional step that helps users to decide about the quality filter values that will be used in the next step. A boxplot with the result of the FastQC analysis is produced at this stage and displayed to the user.

  2. (b)

    Trimming and quality filter Raw data is trimmed to remove bases at the end of the reads with a Phred score below the cut-off value provided by the user. Subsequently, sequences are filtered according to parameters such as alignment score and e-value. The Fastx Toolkit software is used in this step.

  3. (c)

    Parser fastq2fasta The trimmed and filtered file is converted into FASTA format.

  4. (d)

    Mapping against the bacteriocin database In order to identify bacteriocins, we adapted a search method used in several works [20, 21] which consists of: Firstly, a database is built using non-redundant BAGEL4 sequences [22]. Subsequently, the BLAST+ tool [22, 23] is used to compare the translated reads against the BAGEL4 database with blastx. The best hit for each read identified as bacteriocin is retrieved. The user can define an e-value cut-off for the homology analysis.

  5. (e)

    Mapping against the 16S rRNA SILVA database The same file of the previous step is used to align the reads against the SILVA database using DIAMOND [24]. Two files in.csv format are generated. The first contains the list of subject nucleotide sequences with their respective identity values. The other file contains the best hit for each query sequence based on an identity cut-off value provided by the user. Cut-off values are adjustable in the graphical interface of BADASS.

Fig. 1
figure 1

Diagram of the software pipeline

The number of reads identified as bacteriocins and 16S rRNA are used to calculate the richness and abundance of bacteriocins in the WMS dataset as mentioned later.

Programming language and database

BADASS was developed in JAVA ( and used the Maven tool ( to build and manage the project. Maven was used due to the automated management and generation of the JAR package containing the software dependencies. Swing library was used to produce the graphical interface. The database management system used to control the steps and manage the project was SQLite v.3 (

Data source and software validation

The software validation was performed using four whole-metagenome shotgun sequencing datasets. Samples were obtained in the Tucuruí Hydroelectric Power Plant water reservoir submitted in EBI database under the accession numbers ERS1560860, ERS1560861, and ERS1562591 [8] and a sample obtained from Unai's Hot Spring from the ENA (European Nucleotide Archive) database with the accession number PRJEB8864. The following parameters were used: Quality threshold: 20, minimum length: 100, minimum quality score to keep: 16, minimum percent: 80%, e-value: 10, threads: 6, identity: 50.

Quality assessment and BLAST

BADASS uses the statistical platform R v.4 ( for quality assessment of the raw data, through the FastQC package ( Reads are trimmed and quality filtered using the FASTX-Toolkit ( The user is able to adjust the parameters of quality filtering in the graphical interface of BADASS. BLAST v.2.0.9-3 and Diamond v. software are used to map the reads against the BAGEL4 [22, 25] and SILVA [26] databases, respectively. BAGEL4 was updated and now has about 500 Ribosomally synthesized and post-translationally modified peptides (RiPPs) (class 1), 230 non-modified bacteriocins (class 2), and 90 bacteriocins with more than 30 kDa (class 3). RippMIner [27] and the data repository MIBiG [28] were used to build the BAGEL4 database.

Abundance analysis

Diversity of bacteriocins was analyzed in terms of abundance and richness. In order to calculate the abundance of bacteriocins we adapted the formula proposed formula in studies involving the search of abundance of resistance genes in WMS data. [20]. Where: (1) n is the amount of bacteriocins that were found in sample; (2).Nbacteriocin sequences is the number of reads that mapped to a specific bacteriocin; (3) Tread is the average size of reads; (4) Tbacteriocin is the average size of the bacteriocin; (5) N16S rRNA sequences is the number of reads that mapped to 16S rRNA sequences; and (6) T16S rRNA sequence is the average size of the 16S rRNA sequence.

$$Abundance = \mathop \sum \limits_{1}^{n} \frac{{N_{bacteriocinsequences} \times \frac{{T_{read} }}{{T_{bacteriocin} }}}}{{N_{16SrRNAsequences} \times \frac{{T_{read} }}{{T_{16SrRNA} }}}}.$$


Analyses were performed in a Desktop equipment Intel® Core™ i7-10510U CPU @ 1.80 GHz with 8 processing cores, 16 GB of RAM memory, and tests were run on Ubuntu 21.10, 64-bit, Windows 11 and macOS Ventura 13.0 operating systems.

Results and discussion

BADASS was developed using the BLAST and DIAMOND alignment tools to identify bacteriocin and 16S rRNA sequences in WMS raw data. The choice of tool can be defined in the BADASS GUI. Other studies have used similar methodologies for prospecting relevant genes such as ARGs [7, 20, 29, 30]. For example, ARGs-OAP is an online pipeline for antibiotic resistance genes detection in metagenomic data through similarity sequence analysis [5]. In addition to the homology search, BADASS calculates the abundance of each bacteriocin class by taking into account the size of the genes and the size of the reads produced by the sequencing library [22]. The number of reads identified as bacteriocins are compared to the number of reads identified as the gene of the 16S rRNA, which is present in a few copies per cell. This approach makes the size of the reads as well as the size of the genes not interfere with the analysis of abundance. Thus, this pipeline is a powerful tool for rapid and comprehensive evaluation of bacteriocin diversity using WMS raw data.

The main results of BADASS are the description of richness and the values of the abundance of bacteriocins using WMS raw data as input file in a simple and intuitive way. The software provides a set of adjustable parameters in the graphical interface. Users can also choose to process the samples using default parameters. In more detail, the results obtained by the software include: (1) quality assessment box plots of the raw data directly in the graphical interface or even in the results folder; (2) spreadsheets in.xlsx or.csv format containing information about the identified bacteriocins (richness) including their frequency (ratio between the number of reads identified as bacteriocin and the total number of reads in a sample) and abundance (calculated using the formula previously mentioned), organized by class; (3) bar plots of abundance based on the.csv files; (4) a.csv file containing the list of 16S rRNA sequences identified in the dataset including the value of percentage identity; (5) Trimmed.fastq and QV.fastq files containing the trimmed reads and the reads filtered by minimum size, respectively.

It is also possible to detect bacteriocin in genome sequences using other software. Table 1 presents several computational tools and databases developed to help in the identification of these antimicrobial peptides. The main features of each software or database are compared in the table. It is worth noting that BADASS is the only who has a graphical interface, supports WMS data, and performs diversity analysis (Table 1). BAGEL4 stands out for having one of the most complete databases containing a large number of annotated and experimentally verified bacteriocin sequences. In addition, the database is divided into three classes according to the genetic information and mechanism of action. Because of these features, BAGEL4 was used as a reference bacteriocin database in BADASS. BACTIBASE [30] is a database containing detailed information about the physicochemical properties of bacteriocins. This information allows a fast and accurate prediction of the structure–function relationship and possible target organisms of the antimicrobial peptides. Other relevant software includes BOA (Bacteriocin Operon Associator) [31] which uses Hidden Markov Models to predict bacteriocin clusters, Neubi [32] which identifies bacteriocins using a word embedding approach, and Anti-SMASH (Antibiotics and Secondary Metabolite Analysis Shell) which was launched in 2011 and is used not only for bacteriocin prediction but for a number of other secondary metabolites [18, 33].

Table 1 Comparison of the main features of software used for bacteriocin gene mining

The identification of bacteriocins, however, is still quite challenging due to the limited number of known and experimentally analyzed sequences. Choosing the most appropriate and up-to-date tool is essential for the search and identification of bacteriocin genes. BADASS is a user-friendly software, with a robust pipeline that starts with the quality assessment of the raw data and ends with the analysis of the richness and abundance of bacteriocins.

A pilot analysis was performed using four datasets with default parameters. The results of the dataset ERR1816708 are presented in Table 2. First column of the table shows the BAGEL4 accession number and name of the bacteriocins. Other columns correspond to frequency, abundance and class, respectively. Thus, users are able to identify the diversity of bacteriocins in the dataset. Additionally, complementary analyses such as taxonomic affiliation of the microbial community are important to determine the ecological context of the described bacteriocins [8].

Table 2 CSV file generated by the BADASS presenting the names and other information of the detected bacteriocins

Two bar plots were generated by the software containing an overview of the bacteriocin diversity. The first plot (Fig. 2) is designed based on a.csv file similar to Table 2. The plot presents the top ten most abundant bacteriocins in the dataset. Information about the bacterial species that commonly produce the peptides are presented in the legend. The second plot (Fig. 3) presents the abundance of bacteriocins by class. The best parameters for each study should be carefully chosen by the user according to it dataset characteristics. A variation in the results is expected since the parameters adjust the analysis performed by the software. The choice of parameters by the user will result in changes in the result, being able to restrict to stricter or looser parameters [34].

Fig. 2
figure 2

Output file of BADASS. Bar plots for the four datasets analyzed presenting the top ten most abundant bacteriocins

Fig. 3
figure 3

Output file of BADASS. Bar plots presenting the abundance of each class of bacteriocin in the four datasets analyzed

The time required for similarity search and post-alignment analysis has become a bottleneck as sequence costs decrease and the size of the datasets increases [35]. We also highlight that all the analysis, starting from the filtering of the raw data can be done in the BADASS pipeline. The software allows users to modify most of the parameters such as e-value, identity cut-off, and others.


In the environment, a countless number of microbial species coexist and, in order to succeed in colonize their ecological niches, many have developed mechanisms to eliminate other species through the production of antimicrobial molecules. In this chemical warfare, bacteriocins are narrow-spectrum antimicrobial peptides synthesized by ribosomal activity that are widely distributed in bacterial species. Thus, the development of computational tools to identify, classify and quantify bacteriocins in WMS datasets is of great importance for microbial ecology and biotechnology.

BADASS provides the user with a robust and automated computational tool with a simple and intuitive graphical interface, where the parameters can be adjusted by the user, allowing greater independence in the analysis of different samples. The integration of the software with the R statistical platform allows the generation of plots that helps in data interpretation. For those looking to prospect antimicrobial peptides in WMS raw data, BADASS is a powerful solution.

Availability and requirements

Project name: BADASS

Project home page:

Operating system(s): platform independent

Programming language: Java

Other requirements: e.g. Java 19.0.1 or higher

License: GNU GPL

Any restrictions to use by non-academics: license needed.

Availability of data and materials

The software is available at and samples were obtained in the Tucuruí Hydroelectric Power Plant water reservoir submitted in EBI database under the accession numbers ERS1560860, ERS1560861, and ERS1562591 and a sample obtained from Unai's Hot Spring from the ENA (European Nucleotide Archive) database with the accession number PRJEB8864.



BActeriocin-Diversity ASsessment Software


Whole-Metagenome Shotgun Sequencing


Antibiotic resistance genes


Nonribosomal peptide synthase


Polyketide synthase


BActeriocin GEnome mining tool


European Nucleotide Archive


Ribosomally synthesized and post-translationally modified peptides


  1. Stefani FOP, Bell TH, Marchand C, De La Providencia IE, El Yassimi A, St-Arnaud M, et al. Culture-dependent and -independent methods capture different microbial community fractions in hydrocarbon-contaminated soils. PLoS ONE. 2015;10:1–16.

    Article  Google Scholar 

  2. Handelsman J, Rondon MR, Brady SF, Clardy J, Goodman RM. Molecular biological access to the chemistry of unknown soil microbes: a new frontier for natural products. Chem Biol. 1998;5:R245–9.

    Article  CAS  Google Scholar 

  3. Vester JK, Glaring MA, Stougaard P. An exceptionally cold-adapted alpha-amylase from a metagenomic library of a cold and alkaline environment. Appl Microbiol Biotechnol. 2015;99:717–27.

    Article  CAS  Google Scholar 

  4. Arivaradarajan P, Paramasamy G, Nair SP, Allan E, Mullany P. Identification of an antibacterial protein by functional screening of a human oral metagenomic library. FEMS Microbiol Lett. 2015;362:1–6.

    Article  Google Scholar 

  5. Yang Y, Jiang X, Chai B, Ma L, Li B, Zhang A, et al. ARGs-OAP: Online analysis pipeline for antibiotic resistance genes detection from metagenomic data using an integrated structured ARG-database. Bioinformatics. 2016;32:2346–51.

    Article  CAS  Google Scholar 

  6. Yang S-C, Lin C-H, Sung CT, Fang J-Y. Antibacterial activities of bacteriocins: application in foods and pharmaceuticals. Front Microbiol. 2014;5:241.

    Google Scholar 

  7. Li H, Zheng X, Tan L, Shao Z, Cao H, Xu Y. The vertical migration of antibiotic-resistant genes and pathogens in soil and vegetables after the application of different fertilizers. Environ Res. 2021;2022(203):111884.

    Google Scholar 

  8. Costa SS, Lago LAB, Silva A, Das Graças DA, Lameira J, Baraúna RA. Diversity of bacteriocins in the microbiome of the Tucuruí Hydroelectric Power Plant water reservoir and three-dimensional structure prediction of a zoocin. Genet Mol Biol. 2022;45:1–9.

    Article  Google Scholar 

  9. Cotter PD, Ross RP, Hill C. Bacteriocins-a viable alternative to antibiotics? Nat Rev Microbiol. 2013;11:95–105.

    Article  CAS  Google Scholar 

  10. Yang Y, Jiang XT, Zhang T. Evaluation of a hybrid approach using UBLAST and BLASTX for metagenomic sequences annotation of specific functional genes. PLoS ONE. 2014;9:e110947.

    Article  Google Scholar 

  11. Montville TJ, Winkowski K, Ludescher RD. Models and mechanisms for bacteriocin action and application. Int Dairy J. 1995;5:797–814.

    Article  CAS  Google Scholar 

  12. Riley MA, Wertz JE. Bacteriocins: evolution, ecology, and application. Annu Rev Microbiol. 2002;56:117–37.

    Article  CAS  Google Scholar 

  13. Dobson A, Cotter PD, Ross RP, Hill C. Bacteriocin production: a probiotic trait? Appl Environ Microbiol. 2012;78:1–6.

    Article  CAS  Google Scholar 

  14. Cotter PD, Ross RP, Hill C. Bacteriocins - a viable alternative to antibiotics? Nat Rev Microbiol. 2013;11:95–105.

    Article  CAS  Google Scholar 

  15. Cotter PD, Hill C, Ross RP. Bacteriocins: developing innate immunity for food. Nat Rev Microbiol. 2005;3:777–88.

    Article  CAS  Google Scholar 

  16. Huang F, Teng K, Liu Y, Cao Y, Wang T, Ma C, et al. Bacteriocins: potential for human health. Oxid Med Cell Longev. 2021;2021:1–17.

    CAS  Google Scholar 

  17. de Jong A, van Hijum SAFT, Bijlsma JJE, Kok J, Kuipers OP. BAGEL: a web-based bacteriocin genome mining tool. Nucleic Acids Res. 2006;34(Web Server Issue):W273–9.

    Article  Google Scholar 

  18. Medema MH, Blin K, Cimermancic P, De Jager V, Zakrzewski P, Fischbach MA, et al. AntiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39(SUPPL. 2):339–46.

    Article  Google Scholar 

  19. Santos-Aberturas J, Chandra G, Frattaruolo L, Lacret R, Pham TH, Vior NM, et al. Uncovering the unexplored diversity of thioamidated ribosomal peptides in Actinobacteria using the RiPPER genome mining tool. Nucleic Acids Res. 2019;47:4624–37.

    Article  CAS  Google Scholar 

  20. Li B, Yang Y, Ma L, Ju F, Guo F, Tiedje JM, et al. Metagenomic and network analysis reveal wide distribution and co-occurrence of environmental antibiotic resistance genes. ISME J. 2015;9:2490–502.

    Article  CAS  Google Scholar 

  21. Port JA, Cullen AC, Wallace JC, Smith MN, Faustman EM. Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments. Environ Health Perspect. 2014;122(3):222–8.

    Article  CAS  Google Scholar 

  22. van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res. 2018;46:W278–88.

    Article  Google Scholar 

  23. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.

    Article  CAS  Google Scholar 

  24. Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–1.

    Article  CAS  Google Scholar 

  25. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2014;12:59–60.

    Article  Google Scholar 

  26. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41(Database Issue):D590–6.

    CAS  Google Scholar 

  27. Agrawal P, Khater S, Gupta M, Sain N, Mohanty D. RiPPMiner: A bioinformatics resource for deciphering chemical structures of RiPPs based on prediction of cleavage and cross-links. Nucleic Acids Res. 2017;45:W80–8.

    Article  CAS  Google Scholar 

  28. Kautsar SA, Blin K, Shaw S, Navarro-Muñoz JC, Terlouw BR, Van Der Hooft JJJ, et al. MIBiG 2.0: a repository for biosynthetic gene clusters of known function. Nucleic Acids Res. 2020;48:D454–8.

    Google Scholar 

  29. Manoharan RK, Srinivasan S, Shanmugam G, Ahn YH. Shotgun metagenomic analysis reveals the prevalence of antibiotic resistance genes and mobile genetic elements in full scale hospital wastewater treatment plants. J Environ Manag. 2020;2021(296):113270.

    Google Scholar 

  30. Hammami R, Zouhir A, Le Lay C, Ben Hamida J, Fliss I. BACTIBASE second release: a database and tool platform for bacteriocin characterization. BMC Microbiol. 2010;10:22.

    Article  Google Scholar 

  31. Morton JT, Freed SD, Lee SW, Friedberg I. A large scale prediction of bacteriocin gene blocks suggests a wide functional spectrum for bacteriocins. BMC Bioinform. 2015;16:1–9.

    Article  Google Scholar 

  32. Hamid MN, Friedberg I. Identifying antimicrobial peptides using word embedding with deep recurrent neural networks. Bioinformatics. 2019;35:2009–16.

    Article  CAS  Google Scholar 

  33. Blin K, Shaw S, Kloosterman AM, Charlop-Powers Z, Van Wezel GP, Medema MH, et al. AntiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 2021;49:W29–35.

    Article  CAS  Google Scholar 

  34. Dash S, Rahman S, Hines H, Feng W. Incremental BLAST: incremental addition of new sequence databases through e-value correction. bioRxiv. 2018:476218.

  35. Mardis ER. A decade’s perspective on DNA sequencing technology. Nature. 2011;470:198–203.

    Article  CAS  Google Scholar 

Download references


The authors would like to thank to Universidade Federal do Pará (UFPA) for the structure to the development for this research.


The authors would like to thank to Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for the scholarship, Centrais Elétricas do Norte do Brasil (Eletronorte) for the research funding and assistance during environmental sampling, and Pró-Reitoria de Pesquisa e Pós-Graduação (PROPESP) from Universidade Federal do Pará (UFPA) for the assistance in the payment of publication fees through the PAPQ program. The funding body did not interfere with the results achieved by the research.

Author information

Authors and Affiliations



SSC, RAB and AS participated in the design and implementation of the pipeline. GM and AOV participated in the coding of the software. All authors contributed to the adaptation of the pipeline. All authors wrote the manuscript and all authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Sávio S. Costa.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Costa, S.S., da Silva Moia, G., Silva, A. et al. BADASS: BActeriocin-Diversity ASsessment Software. BMC Bioinformatics 24, 24 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Antimicrobial peptides
  • Bacteriocin
  • Metagenome mining
  • Software development