Skip to main content
Figure 1 | BMC Bioinformatics

Figure 1

From: Facilitating the development of controlled vocabularies for metabolomics technologies with text mining

Figure 1

The flow of data in a TM approach to CV expansion. The information retrieval module is used to gather a corpus of documents relevant for a given CV from the literature databases. Automatic term recognition is applied against the corpus to extract terms as domain-specific lexical units. Some of the extracted terms not directly related to the CV are filtered out by using the knowledge about typically co-occurring types of terms.

Back to article page