 Methodology article
 Open Access
 Published:
Exploiting geometric similarity for statistical quantification of fluorescence spatial patterns in bacterial colonies
BMC Bioinformatics volume 21, Article number: 224 (2020)
Abstract
Background
Currently the combination of molecular tools, imaging techniques and analysis software offer the possibility of studying gene activity through the use of fluorescent reporters and infer its distribution within complex biological threedimensional structures. For example, the use of Confocal Scanning Laser Microscopy (CSLM) is a regularlyused approach to visually inspect the spatial distribution of a fluorescent signal. Although a plethora of generalist imaging software is available to analyze experimental pictures, the development of tailormade software for every specific problem is still the most straightforward approach to perform the best possible image analysis. In this manuscript, we focused on developing a simple methodology to satisfy one particular need: automated processing and analysis of CSLM image stacks to generate 3D fluorescence profiles showing the average distribution detected in bacterial colonies grown in different experimental conditions for comparison purposes.
Results
The presented method processes batches of CSLM stacks containing threedimensional images of an arbitrary number of colonies. Quasicircular colonies are identified, filtered and projected onto a normalized orthogonal coordinate system, where a numerical interpolation is performed to obtain fluorescence values within a spatially fixed grid. A statistically representative threedimensional fluorescent pattern is then generated from this data, allowing for standardized fluorescence analysis regardless of variability in colony size. The proposed methodology was evaluated by analyzing fluorescence from GFP expression subject to regulation by a stressinducible promoter.
Conclusions
This method provides a statistically reliable spatial distribution profile of fluorescence detected in analyzed samples, helping the researcher to establish general correlations between gene expression and spatial allocation under differential experimental regimes. The described methodology was coded into a MATLAB script and shared under an open source license to make it accessible to the whole community.
Background
The combination of new optical visualization techniques that use fluorophores to study gene expression with efficient algorithms to analyze data has been pushing synthetic biology to new levels [1]. Recent software and hardware developments are increasing the analysis capabilities of researchers, providing them with enhanced accuracy and specificity when studying gene expression within complex populations [2, 3], differentiation of subpopulations in microbial colonies [4, 5] or spatially location and inspection of areas of interest at individual cell level [6], among other uses. Despite the growing tendency in biology to rely upon imaging analysis software, there are still various fields in which use of such software is not wide spread [1]. Often, this resistance is due to researchers not finding a software package that effectively responds to their needs: many software tools were initially developed to deal with specific problems in a certain field and thus are tightly fitted to that field of study [7]. These software programs are then further expanded in a generalist fashion to adapt to a broader user community, not taking into account the specific needs of every potential user [8, 9]. This situation suggests that, although very powerful software does currently exist, tailored software is still an essential component for meeting more specific needs of many researchers.
An example of the need for more tailored software is found in the study of microbial colonies by microbiologists and biophysicists, where the spatial allocation of fluorescent regions (associated with extra or intracellular probes) is essential for analyses of processes such as morphogenesis [10, 11], cellular differentiation [4] or the physicalchemical conditions affecting the development of multicellular communities [12, 13]. These studies are strongly limited by the intrinsic morphological variability associated with the cellular growth process, which requires manual analysis of data. Case studies where physical arrangement of bodies exhibits randomness or fractal patterning (i.e. neuron development [14, 15] and fungal fruiting bodies [16]) involves an additional level of difficulty due to the vast structural heterogeneity displayed (thus hindering the systematic collection of measurements for statistical purposes, as well as the ability to generalize results). Nevertheless, in cases where morphogenetic processes lead to a set of geometrically similar structures which can be systematically transformed onto a common reference frame, structural tendencies of pattern formation can be studied at population level.
In this work a specific methodology is presented to systematize the gathering and analysis of bacterial colonies exhibiting circular symmetry, despite variations in size and depth of samples.
Results
Rationale
The proposed methodology is based on exploiting the geometric similarity that bacterial colonies (3–6 days of growth) normally exhibit, uniformizing their shapes by applying similarity transformations, a subset of a broader group of operations termed affine transformations [17]. Recall that a similarity transformation is any mapping function that preserves not only collinearity, parallelism, convexity, and ratios of distances among parallel lines (common characteristics to all affine transformations), but also angles and proportionality (specific of that subgroup). As a result, transformed objects are similar to the original body (they resemble the same shape, angles and proportion, according to a ratio of magnification [17]). Specifically, uniform scaling, rotation and translation are the applied operations in this methodology.
The circular symmetry and narrowly bounded variability in axial direction allow for the establishment of a computational workflow that applies a systematic set of filters and transformations, which are depicted in Fig. 1. This allows a mapping from a physical coordinate system to a normalized dimensionless reference frame where spatial positioning among replicas is comparable for statistical purposes. The specifics of this workflow are described next.
Labeling and filtering
Images are first loaded into memory (Fig. 1a). The analysis pipeline then starts with a selection and filtering stage of all colonies present in the images. To clearly delimit the boundaries of individual colonies, two independent fluorescent reporters (GFP and mCherry) were used to, respectively, monitor the activity of the promoter and spatially locate colony boundary. The filtering process was based on discriminating circular objects from topview images containing an arbitrary number of colonies. The first step to automate colony detection consisted on applying the XY sum projection (Proy) in both fluorescent channels along axial direction (zstack) to obtain two single planar images (one for each channel), that further served as a stencil for object detection (Fig. 1b). Thus:
where S^{z} is a threedimensional matrix of size P_{x} x P_{y} x Z_{stack} containing the whole Zstack image, Z_{stack} is the number of Zplanes in the stack, P_{x} and P_{y} are the pixel resolution of the gathered image and Proy is the sum projection matrix (size P_{x} x P_{y}) in the Z direction.
A binarization stage was applied to discard the background of the image (using as a threshold value the average intensity of the image multiplied by a factor of 1.1), generating a boolean mask wherein pixel values were either 1 or 0 (Fig. 1c). We define this threshold as Th in order to create a filtering operator F(X) that drops values of X smaller than Th:
where Proy_{ij} are the elements of the Proy matrix, Th is the threshold value to filter and F(X) is the filtering criteria applied to every element X_{ij} of matrix \( \mathbf{X}\boldsymbol{\in }{\mathbf{\mathcal{M}}}_{{\boldsymbol{P}}_{\boldsymbol{x}}\times {\boldsymbol{P}}_{\boldsymbol{y}}}\left(\mathbb{R}\right) \). The binary operator Bin(F) is next applied to the filtered projection matrix ProyF to create Boolean mask B:
Binary connected components within B matrices were detected and labeled using the algorithm described by Haralik and Shapiro [18] (Fig. 1d). Each object was treated as an array of pixels \( {\mathbf{b}}^{\boldsymbol{n}}\boldsymbol{\in}{\mathbf{\mathcal{M}}}_{\mathbf{1}\times {\mathrm{N}}_{\mathrm{pixel}}^{\mathrm{n}}}\left(\mathbb{N}\right) \) with value 1 whose respective row and column indices are given in two vectors: \( {\mathbf{bx}}^{\boldsymbol{n}}\boldsymbol{\in}{\mathbf{\mathcal{M}}}_{\mathbf{1}\times {\mathrm{N}}_{\mathrm{pixel}}^{\mathrm{n}}}\left(\mathbb{N}\right) \) and \( {\mathbf{by}}^{\boldsymbol{n}}\boldsymbol{\in}{\mathbf{\mathcal{M}}}_{\mathbf{1}\times {\mathrm{N}}_{\mathrm{pixel}}^{\mathrm{n}}}\left(\mathbb{N}\right) \) respectively. XY object area (A) and mass center for every detected object were calculated by computing their zeroth and first moments as follows:
where n represents each individual object detected in the image, \( {\mathrm{N}}_{\mathrm{pixel}}^{\mathrm{n}} \) is the number of pixels detected in object n and CM_{n} is the point position in the XY plane (in pixel units) of the mass center of object n.
Detected elements with a XY area smaller than an empirically chosen value (an equivalent circular area of 20 pixels in the present case) were discarded. Major and minor axis lengths in the XY plane for each object n were derived from maximum and minimum detected X and Y values found within b_{x}^{n} and b_{y}^{n}. At this point those elements having a majorminor axis length ratio larger than 15% were also excluded to avoid noncircular geometries (e.g. merged colonies, Fig. 1e).
Performing a pointbypoint transposition of the fluorescence distribution into a threedimensional body is quite complex because it is impossible to observe all points simultaneously. Instead it is more advisable to use fixed planes to study the distribution of the signal inside the whole body. Although this is tricky for bodies with arbitrary geometry, for cases with circular symmetry a very convenient choice is to inspect an axial projection (XY) and radial crosssection (XZ): the former shows the trend of the studied signal for increasing radial distances, while the later depicts the behavior along the Z axis in a diametric plane. Thus, XY and XZ planes were gathered from every colony (Fig. 1f).
Aligning and mapping
Prior to mapping the data, every image was geometrically aligned. XY projections did not need any adjustment because all colonies exhibit circular symmetry. However, XZ images needed to be horizontally aligned due to the presence of imperfections on the agar surface, variability among samples and the natural curvature of agar when it is close to the edge of the culture plate. XZ profiles were realigned along the X axis by obtaining the orientation of the major axis with respect to the X axis (given by the angle θ between both axis, see [19] for more details) and then rotating the point coordinates of all pixels using a rotation matrix (Fig. 1g). So where P^{m}_{xz} and P^{m}_{xz}’ are, respectively, the unaligned and aligned point coordinates of the chosen radial crosssection m (with length 2 x N^{m}_{pixel}), they relate by means of the following rotation matrix around the Y axis:
As mentioned previously, to respond to colony size variability, a mapping process was applied to the XY and XZ planes of every colony. Data was transformed from a physical dimensionbased reference frame to a normalized XYZ domain bounded within X ∈ [−1, 1], Y ∈ [−1, 1], Z ∈ [0, 1], using as normalizing dimensions the radius (for XY projection) and maximum height (for XZ plane) of each colony (Fig. 1h). This is equivalent to rescaling the spatial dimensions of all colonies to a similar size. We first compute the radius R, the height H and the minimum height h (defined as being where the base of the colony lays) as follows:
where we make an abuse of notation to indicate that maximum and minimum operator are applied only on x, y or z components of points P^{m}_{XY}^{’}_{n} and P^{m}_{XZ}^{’}_{n}. n_{1} and n_{2} are the number of points forming the XY and XZ planes, respectively. CM^{m}_{XY} and CM^{m}_{XZ} are the mass centers of the XY and XZ planes (calculated as previously described). These three variables are then used to normalize all points:
here ()_{x} and ()_{z} denote the x and z components of the considered points, and the hat operator is used to design normalized versions of rotated points.
To spatially correlate position and signal, cell locations need to be fixed in a reference grid. An experimental solution to this issue would be extremely complex due to natural replica variability. Nevertheless, it is still possible to numerically overcome this problem by using data interpolation to estimate the values of signal intensity for every colony within a fixed grid of coordinates, using experimental data from arbitrary positions throughout the colony (Fig. 1i). The interpolant grid should have a density point smaller than the original image resolution to minimize lack of data when estimating values. In this work two grids of smaller resolution ([256 × 256] points for XY projections and [256 × 10] points for XZ planes) were used to cover the normalized domain. A barycentricbased coordinate cubic interpolation algorithm supported by a Delaunay triangulation of the pixel coordinates was chosen to estimate values [20]. Interpolated profiles were finally stored in a sequential manner to create a stack of profiles (Fig. 1j) from which statistical measurements were performed.
Intensity normalization
Stored XY and XZ profiles were used to estimate the central tendency of the intensity distribution for every set of experimental conditions and in every fluorescent channel (mean, median, see Fig. 1k). Depending on a researcher’s needs, intensity values either can be handled as raw data or can be first normalized with respect to maximum and minimum reference values (Fig. 1l). Raw values can be used to establish foldchange comparison of measurements among samples using a semibounded scale ([0, ∞)), provided that all samples are gathered with the same microscope settings. Normalized data can be used to locate heat areas of intensity signal in the colony and local variations within colonies when microscope settings cannot be standardized among samples. The most common option to normalize data is to work with positive / negative control samples to perform the transformation:
where I* and I are, respectively, normalized and nonnormalized intensity matrices, and M, C^{+} and C^{−} subscripts denote the sources of the samples (regular sample, positive control and negative control respectively).
Unfortunately, there are circumstances in which any one of these controls may not be suitable to use due to modification microscopy settings to avoid image acquisition quality problems (i.e. signal saturation due to the existence of large differences in intensity values between samples and the positive control). An alternative choice for these cases is to scale values by using the initial maximum and minimum intensity values found in every profile, as follows:
where I^{*}_{XY} and I^{*}_{XZ} are the XY and XZnormalized intensity profiles for each experimental condition and fluorescent channel, I_{XY} and I_{XZ} are the absolute intensity profiles, and max (I_{XY}), max(I_{XZ}), min(I_{XY}), min(I_{XZ}) are the maximum and minimum values of intensity detected in I_{XY} and I_{XZ} respectively for the chosen experimental conditions.
Experimental validation
In order to validate the proposed methodology, we performed a growth experiment on agar plates using a P. putida KT2440mCherry strain carrying a plasmid that produces GFP as regulated by a promoter whose expression varies with spatial position within the colony. In this case we chose a promoter that has been reported to respond to environmental humidity [in preparation], thus colonies exhibit a spatial fluorescence pattern according to water access within the colony. The experimental procedure followed is detailed in the methods section and depicted in Figs. 2a (experiment) and 2B (image analysis). To summarize this procedure in brief: individual bacterial colonies were streaked onto 60mm culture dishes and incubated at 30 °C for 5 days, letting the agar dry. Colonies were imaged using CSLM technique to generate Zstack images that were sequentially analyzed to gather the fluorescence profile of every individual colony. These profiles were transformed to produce statistically treatable data.
As a benchmark test, three sets of colonies were cultured: one carrying the desiccationinducible promoter (the sample to analyze, named M), a chemicallyinducible promoter (serving as a positive control, named C+) and a nonexpressing plasmid (serving as a negative control, named C). The results of the analysis are shown in Fig. 3 (XY projection) and Additional Files 1, 2 and 3 (XZ slices). Raw fluorescence analysis (Fig. 3 upper row) showed a spatially dependent behavior of sample M when compared with controls, as it exhibited a ringshaped distribution. Values of C+ and C are, respectively, above and below the range of intensities exhibited by sample M. The differences in magnitude of the observed intensity made a direct comparison of the signal spatial distributions impossible, so intensity normalization was applied to correct this effect. The resulting heatmaps (Fig. 3 lower row) exhibited differences in fluorescent pattern that suggest a proper functioning of the desiccationresponsive promoter. Sample C exhibited a noisy distribution not associated to the measured biological reporter, but rather to unspecific phenomena (i.e. selffluorescence). Fluorescence in sample C+ displayed a classical 2D Gaussian distribution, resembling the overall biomass distribution of the colony and implying an accumulation of constitutively expressed fluorescent protein in the center of the colony. Since colonies are expected to gradually dry out during the course of the experiment, sample M should show a spatially dependent fluorescence profile mirroring the water distribution within the colony. The presence of a ringshaped pattern confirms that the reporter distribution is not spatially correlated with biomass distribution nor associated to an unspecific phenomenon, but rather follows a welldefined arrangement. The coefficient of variations derived from the statistical analysis (see Fig. 4) showed a moderate error bar size in all samples except in positive control C+, where errors are large due to a low number of analyzed colonies.
Although the generated evidence does not confirm a direct relationship between the humidity distribution and the observed ringshaped fluorescence pattern, the nonarbitrary order of the fluorescence profile does confirm an interaction between the tested reporter and a spatiallydependent variable. This conclusion is enough to validate the proposed methodology as a reliable method to measure spatial distribution of fluorescent signal in colonies.
Discussion
The presented methodology was developed to automate analysis of CSLM image stacks of circular bacterial colonies, the most frequently observed pattern when cells grow on solid media (1.5% agar, w/v) in regular laboratory conditions. Nevertheless, its potential use can be extended to any microorganism that develop colonies of circular morphology. The method is based not on the type of microorganism, but rather on taking advantage of the geometric similarity of analyzed colonies. This allows for routine application of a set of mathematical transformations to gather normalized data regardless of colony size. The application of this method to nonsymmetric samples could also be possible, but with numeric adaptations to deal with its asymmetric shape provided that those samples are all geometrically similar. The required algorithmic adjustment to process highly irregular bodies would involve the processing of the data using more general analysis techniques: modal matching [21,22,23], moment based methods [24, 25], geometric hashing [26, 27] or pose clustering [28] could help to deal with complex geometries. Random geometries on the contrary (i.e. diffusion driven spreading [29, 30]) are not treatable using this approach, because mapping function cannot be computed to project data into the normalized grid.
There is additionally a major limitation in this approach when applying the numerical procedure to generate the mapped intensity profile. Ideally, all images should be taken at the largest possible resolution to provide the smallest possible pixel area: this allows the generation of interpolated intensity profiles with larger resolution and more accuracy, even in smaller interpolating grids [31]. An insufficient biological sampling or the use of lowresolution images may lead to poor accuracy during interpolation when computing values. Furthermore, as the method is based on performing interpolation to estimate values in fixed spatial positions, this methodology will provide worse estimations close to the boundaries, where colonies exhibit a larger degree of variability (see Fig. 4). If the number of processed samples is low (n < 10 in our methodology using a conservative criterion, as is the case in Fig. 4 sample C+), the standard deviation values may drastically increase. This effect can be partially diminished by improving the statistical sampling (i.e. increasing the number of samples to process) or by enhancing the quality of data (i.e. increasing the bit depth of the gathered image), as the most straightforward alternatives to overcome the issue among others related with sampling process [32]. A computational approach based in replacing the interpolant algorithm (i.e. use of Radial Basis Functions [33]) can increase the precision of predictions in the grid of evaluation if required. Thus special attention must be paid when designing the experiments and computing each case to minimize these issues.
Conclusion
The statistical study of fluorescent reporter distribution within bacterial colonies is cumbersome because of colony size variability (in both area and thickness) among samples, and the need to acquire data from a sufficient number of replicas to ensure reliable statistics. In this work a specific methodology was implemented in a MATLAB script in order to automate the selection and extraction of useful data from CSLM stack images of circular bacterial colonies. The process is computationally performed to allow an image analysis of all colonies independent of size variability. The methodology was experimentally validated by comparing the distribution of fluorescence exhibited by P. putida colonies carrying a plasmid regulated by a humiditysensitive promoter. As the proposed numerical procedure exploits the geometric similarity of measured bodies and uses an interpolating approach to generate statistically comparable data, there are limitations when working with a low number of samples, poor quality images or nonsymmetric bodies. Despite these limitations, the proposed approach offers a powerful and simple framework to study signal distribution of CSLM images in an automated and statistically reliable fashion.
Methods
Strains, plasmids media and growth conditions
The bacterial strain used here is a derivative of P. putida KT2440 with an mCherry fluorescent cassette integrated in its genome that constitutively expresses the red fluorescent protein. This fluorescent signal was used to locate the colony boundaries to perform the image processing [34]. This strain was transformed with three versions of the plasmid pGLR2, which contains a promoterless dual GFPluxCDABE reporter system [35]. The three plasmids used here are: i) pGLR2 (a plasmid without promoter that does not fluoresce) used as a negative control; ii) pGLR2Ptrc (with an IPTGinducible promoter controlling fluorescence expression) used as a positive control; and iii) pGLR2P_{4707} (with a humiditysensitive promoter controlling the fluorescence [in preparation]) as the experimental sample. All the strains were grown overnight on regular M9 minimal medium Petri dishes [36] solidified with 1.5% (w/v) agar and amended with 0.2% (w/v) glucose, 1 mM IPTG. Samples were supplemented when required with 50 μg/ml Kanamycin and 15 μg/ml Gentamycin. Individual colonies were restreaked to microscopecompatible 35 × 14 mm culture dishes (Ibidi) containing 2 ml of M9agar with glucose as the carbon source (prepared as mentioned before) and incubated at 30 °C during 5 days (120 h). The first 72 h, all dishes were covered with individual lids. For the remaining 48 h, the dishes were incubated without their own lids but within a 92 × 16 mm Petri dish to promote a higher drying of the growing media. Relative humidity was not controlled during the experiment.
Imaging acquisition
Colony images were gathered using a Confocal Multispectral Leica SP5 system with a HCX PL APO CS 10 × 0.40 DRY UV objective using 488 and 561 nm laser lines to detect GFP and mCherry fluorescent signals, respectively. Images were captured at 8bit resolution (1024 × 1024) with no amplification factor and a frequency rate of 400 Hz. Distance between XY pixels and gathered Z planes included in stack images were 1.5137 and 6 μm, respectively. The numerical method was implemented in a script written in MATLAB (The Mathworks) containing the imaging toolbox and the MATLAB compatible bioformats package (https://docs.openmicroscopy.org/bioformats/5.9.2/users/matlab/index.html#) on a regular PC. Confocal images shown in the manuscript were treated to enhance brightness and contrast using ImageJ software.
Statistics
Experimental data gathered from different conditions was retrieved from two biological replicas, with at least 3 images for every condition. The number of processed colonies varied depending on the filtering criteria and the quality of the gathered image. For the parameters used in this manuscript, the final number of analyzed colonies was 21 for the humiditysensitive strain, 5 colonies for the positive control and 3 colonies for the negative control.
Availability of data and materials
A copy of the software together with CSLM images analyzed within this manuscript can be found in: https://fairdomhub.org/data_files/2946?version=1. The software can be found also in http://github.com/drespeso/GEOSIMCO. Operative system: Tested under Windows. Programming Language: Matlab. Other requirements: Image Processing Toolbox for Matlab, MATLAB compatible bioformats package. License: MIT.
Abbreviations
 CSLM:

Confocal Scanning Laser Microscopy
 IPTG:

Isopropyl βD1thiogalactopyranoside
References
 1.
Eliceiri KW, Berthold MR, Goldberg IG, Ibáñez L, Manjunath BS, Martone ME, Murphy RF, Peng H, Plant AL, Roysam B, et al. Biological imaging software tools. Nat Methods. 2012;9:697.
 2.
Cao H, Kuipers OP. Influence of global gene regulatory networks on single cell heterogeneity of green fluorescent protein production in Bacillus subtilis. Microb Cell Factories. 2018;17(1):134.
 3.
Chalfie M, Tu Y, Euskirchen G, Ward WW, Prasher DC. Green fluorescent protein as a marker for gene expression. Science. 1994;263(5148):802.
 4.
Chai L, Vlamakis H, Kolter R. Extracellular signal regulation of cell differentiation in biofilms. MRS Bull. 2011;36(5):374–9.
 5.
Tecon R, Or D. Cooperation in carbon source degradation shapes spatial selforganization of microbial consortia on hydrated surfaces. Sci Rep. 2017;7:43726.
 6.
Knott GW, Holtmaat A, Trachtenberg JT, Svoboda K, Welker E. A protocol for preparing GFPlabeled neurons previously imaged in vivo and in slice preparations for light and electron microscopic analysis. Nat Protoc. 2009;4:1145.
 7.
ReyVillamizar N, Somasundar V, Megjhani M, Xu Y, Lu Y, Padmanabhan R, Trett K, Shain W, Roysam B. Largescale automated image analysis for computational profiling of brain tissue surrounding implanted neuroprosthetic devices using Python. Front Neuroinformatics. 2014:8(39).
 8.
Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH, Friman O, Guertin DA, Chang JH, Lindquist RA, Moffat J, et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7(10):R100.
 9.
de Chaumont F, Dallongeville S, Chenouard N, Hervé N, Pop S, Provoost T, MeasYedid V, Pankajakshan P, Lecomte T, Le Montagner Y, et al. Icy: an open bioimage informatics platform for extended reproducible research. Nat Methods. 2012;9:690.
 10.
Asally M, Kittisopikul M, Rue P, Du Y, Hu Z, Cagatay T, Robinson AB, Lu H, GarciaOjalvo J, Sueel GM. Localized cell death focuses mechanical forces during 3D patterning in a biofilm. Proc Natl Acad Sci U S A. 2012;109(46):18891–6.
 11.
Rusconi R, Lecuyer S, Guglielmini L, Stone HA. Laminar flow around corners triggers the formation of biofilm streamers. J R Soc Interface. 2010;7(50):1293–9.
 12.
Wilking JN, Zaburdaev V, De Volder M, Losick R, Brenner MP, Weitz DA. Liquid transport facilitated by channels in Bacillus subtilis biofilms. Proc Natl Acad Sci U S A. 2013;110(3):848–52.
 13.
Cho H, Jonsson H, Campbell K, Melke P, Williams JW, Jedynak B, Stevens AM, Groisman A, Levchenko A. Selforganization in highdensity bacterial colonies: efficient crowd control. PLoS Biol. 2007;5(11):2614–23.
 14.
Shi P, Huang Y, Hong J. Automated threedimensional reconstruction and morphological analysis of dendritic spines based on semisupervised learning. Biomed Optics Express. 2014;5(5):1541–53.
 15.
Janoos F, Mosaliganti K, Xu X, Machiraju R, Huang K, Wong STC. Robust 3D reconstruction and identification of dendritic spines from optical microscopy imaging. Med Image Anal. 2009;13(1):167–79.
 16.
Lux R, Li Y, Lu QA. Detailed threedimensional analysis of structural features of Myxococcus xanthus fruiting bodies using confocal laser scanning microscopy, vol. 1; 2004.
 17.
Coxeter HSM, Greitzer SL. Geometry revisited: mathematical Association of America; 1967.
 18.
Haralick RM, Shapiro LG. Computer and robot vision: AddisonWesley publishing company; 1993.
 19.
Jain R, Kasturi R, Schunck BG. Machine vision: McGrawhill; 1995.
 20.
Yang TY. Finite element structural analysis: prentice hall PTR; 1986.
 21.
Sadek RA. SVD based image processing applications: state of the art, contributions and research challenges. Int J Adv Comput Technol. 2012;3(7):26–34.
 22.
van Otterloo PJ. A contouroriented approach to shape analysis: prentice hall; 1991.
 23.
Günsel B, Murat Tekalp A. Shape similarity matching for querybyexample. Pattern Recogn. 1998;31(7):931–44.
 24.
Khotanzad A, Hong YH. Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell. 1990;12(5):489–97.
 25.
Dudani SA, Breeding KJ, McGhee RB. Aircraft identification by moment invariants. IEEE Trans Comput. 1977;C26(1):39–46.
 26.
Wolfson HJ, Rigoutsos I. Geometric hashing: an overview. IEEE Comput Sci Eng. 1997;4(4):10–21.
 27.
Schwartz JT, Sharir M. Identification of partially obscured objects in two and three dimensions by matching Noisy characteristic curves. Int J Robotics Res. 1987;6(2):29–44.
 28.
Stockman G. Object recognition and localization via pose clustering. Computer Vision Graphics Imag Processing. 1987;40(3):361–87.
 29.
Morris JD, Hewitt JL, Wolfe LG, Kamatkar NG, Chapman SM, Diener JM, Courtney AJ, Leevy WM, Shrout JD. Imaging and analysis of Pseudomonas aeruginosa swarming and rhamnolipid production. Appl Environ Microbiol. 2011;77(23):8310–7.
 30.
BenJacob E, Cohen I, Gutnick DL. Cooperative organization of bacterial colonies: from genotype to morphotype. Annu Rev Microbiol. 1998;52:779–806.
 31.
Englund E, Weber D, Leviant N. The effects of sampling design parameters on block selection. Math Geol. 1992;24(3):329–43.
 32.
Li J, Heap AD. A review of comparative studies of spatial interpolation methods in environmental sciences: performance and impact factors. Ecological Informatics. 2011;6(3):228–41.
 33.
Myers DE. Spatial interpolation: an overview. Geoderma. 1994;62(1):17–28.
 34.
MartínezGarcía E, Jatsenko T, Kivisaar M, de Lorenzo V. Freeing Pseudomonas putida KT2440 of its proviral load strengthens endurance to environmental stresses. Environ Microbiol. 2015;17(1):76–90.
 35.
Benedetti IM, de Lorenzo V, SilvaRocha R. Quantitative, nondisruptive monitoring of transcription in single cells with a broadhost range GFPluxCDABE dual reporter system. PLoS One. 2012;7(12):e52000.
 36.
Maniatis T, Fritsch EF, Sambrook J. Molecular cloning: a laboratory manual: cold Spring Harbor laboratory; 1982.
Acknowledgements
The authors thank Silvia Gutierrez Erlandsson, group leader of advanced light microscopy unit at CNB for fruitful discussions. Authors declare that they have no conflict of interest.
Funding
This work was funded by the SETH Project of the Spanish Ministry of Science RTI 2018–095584BC42, MADONNA (H2020FETOPENRIA20171766975), BioRoboost (H2020NMBPBIOCSA2018), and SYNBIO4FLAV (H2020NMBP/0500) Contracts of the European Union and the S2017/BMD3691 InGEMICSCM funded by the Comunidad de Madrid (European Structural and Investment Funds). The Funding agencies did not have any role in the design, collection, analysis or interpretation of the data or writing of the manuscript.
Author information
Affiliations
Contributions
DRE designed and coded the numerical methodology. EA performed the experiments to validate the algorithm. DRE, EA, EMG and VDL analyzed the data and wrote the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Additional file 1.
Raw fluorescence profiles for a XZ section (Y=0 plane) of monitored promoter (M), 4 positive control (C+) and negative control (C).
Additional file 2.
Normalized fluorescence profiles for a XZ section (Y=0 plane) of monitored promoter (M), positive control (C+) and negative control (C)
Additional file 3.
Variation coefficient for a XZ section (Y=0 plane) of monitored promoter (M), positive control (C+) and negative control (C).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Espeso, D.R., Algar, E., MartínezGarcía, E. et al. Exploiting geometric similarity for statistical quantification of fluorescence spatial patterns in bacterial colonies. BMC Bioinformatics 21, 224 (2020). https://0doiorg.brum.beds.ac.uk/10.1186/s1285902034901
Received:
Accepted:
Published:
DOI: https://0doiorg.brum.beds.ac.uk/10.1186/s1285902034901
Keywords
 CSLM
 Software
 Bacteria
 Geometric similarity
 Colonies
 Statistical analysis
 Spatial distribution
 Pattern
 GFP