Journal of Power Sources, Vol.110, No.1, 163-176, 2002
Electrochemical power text mining using bibliometrics and database tomography
Database tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multi-word phrase frequencies and phrase proximities (physical closeness of the multi-word technical phrases) from any type of large textual database, to augment (2) interpretative capabilities of the expert human analyst. DT was used to derive technical intelligence from an electrochemical power database drawn from the Science Citation Index (SCI). Phrase frequency analysis by the technical domain experts provided the pervasive technical themes of the electrochemical power database, and the phrase proximity analysis provided the relationships among the pervasive technical themes. Bibliometric analysis of the electrochemical power literature supplemented the DT results with author/journal/institution publication and citation data.
Keywords: electrochemical power; database tomography; bibliometric analysis; text mining; information retrieval; technical intelligence
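
The two computational steps named in the abstract, phrase frequency and phrase proximity, can be illustrated with a minimal Python sketch. This is not the DT system's actual algorithm, which the abstract does not specify. The function names (`tokenize`, `phrase_frequencies`, `phrase_proximities`), the fixed phrase length `n`, and the token `window` are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def phrase_frequencies(docs, n=2):
    """Count every n-word phrase across all documents (hypothetical helper)."""
    counts = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def phrase_proximities(docs, theme, n=2, window=5):
    """Count n-word phrases that start within `window` tokens of each
    occurrence of `theme` (hypothetical helper; the window size is assumed).
    Phrases that overlap the theme are counted; the theme itself is not."""
    theme_tokens = tokenize(theme)
    theme_text = " ".join(theme_tokens)
    m = len(theme_tokens)
    nearby = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        for i in range(len(tokens) - m + 1):
            if tokens[i:i + m] != theme_tokens:
                continue
            lo = max(0, i - window)
            hi = min(len(tokens) - n + 1, i + m + window)
            for j in range(lo, hi):
                phrase = " ".join(tokens[j:j + n])
                if phrase != theme_text:
                    nearby[phrase] += 1
    return nearby


if __name__ == "__main__":
    # Made-up sample records, not taken from the SCI database.
    sample = [
        "Lithium ion battery cathode materials",
        "Lithium ion battery anode materials improve cycle life",
    ]
    print(phrase_frequencies(sample).most_common(3))
    print(phrase_proximities(sample, "lithium ion").most_common(3))
```

In this sketch, high-frequency phrases stand in for candidate themes, and the proximity counts for a chosen theme suggest which other phrases tend to appear near it. Interpreting either list is left to the analyst, in line with the abstract's description of the algorithms as an aid to expert judgment.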