Figure 8From: N-gram analysis of 970 microbial organisms reveals presence of biological language modelsUsage of very distinct n-gram language models by some organisms. Top 40 most frequently used 4-grams in (A) Homo sapiens (shown as bold, cyan line), (B) Shigella dysenteria (shown as bold, magenta line). The corresponding frequencies of 4-grams in other microbes are shown in thin red for animal pathogens and blue for plant pathogens.Back to article page