Figure 1.

The flow of data in a TM approach to CV expansion. The information retrieval module is used to gather a corpus of documents relevant for a given CV from the literature databases. Automatic term recognition is applied against the corpus to extract terms as domain-specific lexical units. Some of the extracted terms not directly related to the CV are filtered out by using the knowledge about typically co-occurring types of terms.

Spasić et al. BMC Bioinformatics 2008 9(Suppl 5):S5   doi:10.1186/1471-2105-9-S5-S5