Healthc Inform Res >Volume 19(4); 2013 > Article |
Review Article
Healthcare Informatics Research 2013;19(4):235-242.
Published online:December 31, 2013
1Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Graz, Austria.
2European Centre for Disease Prevention and Control, Stockholm, Sweden.
3Department of Medical Informatics, Academic Medical Center, Amsterdam, The Netherlands.
4Department of Biomedical Engineering, Linköping University, Linköping, Sweden.
5National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
© 2013 The Korean Society of Medical Informatics
Keywords:Data Mining,Terminology,Semantics,Vocabulary,Publishing
Time line analysis of the occurrence of the phrase "medical concept representation" using theScopus term analyser [4], extraction of the contextual environment usingUltimate Research Assistant [5] and visualization of the results using a tag cloud [6];
Using the toolPublish orPerish [7] to identify the authors of the most influential papers, using seven sources, viz.Web of Science,Scopus,Embase,PubMed,Google Scholar,Cochrane Library,British Library on-line catalogue. The question was to have an idea of the persistence of the influential authors from the first period to the second one. The Boolean search expression "concept representation" AND ("medical" OR "medicine") AND ("knowledge" OR "information") was submitted to all of them, with variations according to their proprietary syntax. For identifying the top ten papers, the results of the seven lists were consolidated into a common table. For this, available citation ranks were taken, otherwise the source's own ranking mechanism was used. In the following, the top ten papers were the source for extracting the top thirty authors, which were ranked in a second step. For this, the following heuristics was used: Thenth author in the list was assigned a score of 11 -n, the eleventh and following authors was given a zero value. The scoring was weighted, favouring multiple appearances of authors in different sources: a final score was calculated as a net score (0.8 + 0.2 × occurrence).
In the post-2000 analysis, due to the significant drop of the usage of the exact phrase "medical concept representation" the resulting paper population would have been too small for applying the same procedure as described for the first period. Therefore, instead of summing up the citation data only for papers matching the query, here the citation data for all papers per author were used. This same method, however, could not be used for same analysis backwards to the previous period, due to limitations of the tool used [7].
The hypothesis of a paradigm shift was studied, comparing relevant papers published during the years from 1988 to 1999 with those appearing between 2000 and 2012, focusing the same subject area. The reason for starting with 1988 was the availability of bibliographic databases, being almost accordant with the period of our interest, viz. the activities of the IMIA WG onMedical Knowledge Representation. Author lists were compared and all the titles of the two full paper sets were text mined usingTextalyser [8].
The second, more recent set was cross-checked against a third set from the same period, obtained by an online survey targeted to the specifically interested audience. For this survey (open from August to October 2012) the primary source was theLinkedIn group of the MCR WG, having at that time over fifty members of widely various backgrounds. Secondary sources were additional LinkedIn Groups in broader domain. Participants were asked to quote and to share the papers they found to be most influential in their work or research. We usedDatagle [9] and a Google document to collect survey data.
Propagation of the paradigm of ontological realism, the proponents of which have been arguing against the usage of this word in the context of ontologies, contending that the representation of concepts as "entities of thought" is inappropriate for the representation of a scientific domain and obfuscates the difference between the entities and names given to them [32];
The preference of "class" over "concept" in the SemanticWeb and description logics community, especially regarding the influential OWL family of representation languages [33];
The obvious polysemy of the word itself [34].
The capture of medical information and knowledge leverages (standards) ontologies;
Open reference resources for content are developed collaboratively, shared, and reused;
Web enabled standards help achieve transparent results;
"Big data" opens new ways for knowledge acquisition;
However, a large part of clinical information continues being recorded as free text, which keeps the need of processing medical language on the research agenda.
Bold face highlights the terms that also occur in the top-ten list from the 1988-1999 period (Table 2).
MCR WG: Working Group on Medical Concept Representation, OBO: Open Biological and Biomedical Ontologies, SNOMED CT: Systematized Nomenclature of Medicine Clinical Terms, OBI: Ontology for Biomedical Investigations.
aFrequency ranking is based on incidence in survey lists. The first ranked paper was mentioned the most times in various lists. Papers with rank '2' shared the second highest number of occurrence and so on. Papers with same rank are in order of their citation frequency, shown above initalics.
Recent Movement on Education and Training in Health Informatics2014 April;20(2)
Contents and Methodologies of Medical Informatics Education2006 December;12(4)
![]() | ![]() |