Robust clustering of languages across Wikipedia growth
Kristina Ban, Matjaž Perc, Zoran Levnajić, 2017, original scientific article

Abstract: Wikipedia is the largest existing knowledge repository, and it grows through genuine crowdsourcing. While the English Wikipedia, with over 5 million articles, is the most extensive and the most researched, comparatively little is known about the behaviour and growth of the remaining 283 smaller Wikipedias, the smallest of which, Afar, has only one article. Here, we use a subset of these data consisting of 14 962 different articles, each of which exists in 26 different languages, from Arabic to Ukrainian. We study the growth of Wikipedias in these languages over a time span of 15 years. We show that, while an average article follows a random path from one language to another, there exist six well-defined clusters of Wikipedias that share common growth patterns. The make-up of these clusters is remarkably robust against the method used for their determination, as we verify via four different clustering methods. Interestingly, the identified Wikipedia clusters have little correlation with language families and groups. Rather, the growth of Wikipedia across different languages is governed by different factors, ranging from similarities in culture to information literacy.
Keywords: Wikipedia, language, growth dynamics, data analysis, clustering
Published in DKUM: 13.11.2017; Views: 961; Downloads: 358
.pdf Full text (1004,06 KB)

Community structure and the evolution of interdisciplinarity in Slovenia's scientific collaboration network
Borut Lužar, Zoran Levnajić, Janez Povh, Matjaž Perc, 2014, original scientific article

Abstract: Interaction among the scientific disciplines is of vital importance in modern science. Focusing on the case of Slovenia, we study the dynamics of interdisciplinary sciences from 1960 to 2010. Our approach relies on quantifying the interdisciplinarity of research communities detected in the coauthorship network of Slovenian scientists over time. Examining the evolution of the community structure, we find that the frequency of interdisciplinary research is merely proportional to the overall growth of the network. Although marginal improvements in favor of interdisciplinarity can be inferred during the 1970s and 1980s, the overall trends during the past 20 years are constant and indicative of stalemate. We conclude that the flow of knowledge between different fields of research in Slovenia is in need of further stimulation.
Keywords: community structure, interdisciplinarity, scientific collaboration, research funding, Slovenia
Published in DKUM: 19.06.2017; Views: 773; Downloads: 340
.pdf Full text (652,72 KB)

