Authors:
Sebastian Unger
1
;
Christa K. Raak
2
and
Thomas Ostermann
1
Affiliations:
1
Department for Psychology and Psychotherapy, Faculty of Health, Witten/Herdecke University, Witten, Germany
;
2
Center for Integrative Medicine, Faculty of Health, Witten/Herdecke University, Witten, Germany
Keyword(s):
Evaluation, Search Engine, Information Storage and Retrieval, Sentence-Pair Regression, Semantic Similarity, Online Publications.
Abstract:
Although there is a steady increase of scientific publications in integrative medicine, it is still difficult to get a valid overview of published evidence. The open accessible bibliographical database CAMbase 3.0 (available at https://cambase.de) hosted by Witten/Herdecke University is one of such established databases in this field. In 2020, CAMbase 2.0 was migrated to a newer 64-bit operating systems, resulting in a variety of issues. A promising solution of keeping and accessing the data of CAMbase 2.0 was to replace the business logic with the open-source platform Solr, which uses a score ranking algorithm instead of a semantic-syntactic interpretation of search queries as in CAMbase 2.0. As a result, the before-after analysis with T-tests showed mainly no significant differences in the equality of the queried titles after applying SBERT, not even in the number of search hits (t = 1.43, df = 35, p = 0.17), but in query times (t = 4.2, df = 35, p < 0.01). While search hits remain
ed stable as the speed increases, the approach with Solr is more efficient, making this technical report a possible blueprint for similar bibliography-based databases projects.
(More)