Figure 3From: N-gram analysis of 970 microbial organisms reveals presence of biological language modelsUnigram distribution in the proteomes of different genera. Unigram distribution of species from the genera (A) Brucella, (B) Burkholderia, (C) Bacillus, (D) Xanthomonas, (E) Pseudonomas and (F) Escherichia are shown. Within a specific genus, and to some extent within the same class, most species show a similar unigram distribution.Back to article page