Abstract
PurposeDevelop a comprehensive framework for assessing the knowledge organization systems (KOSs), including the taxonomy of Wikipedia and the ontologies of Wikidata, with a specific focus on enhancing management and retrieval with a gender nonbinary perspective.Design/methodology/approachThis study employs heuristic and inspection methods to assess Wikipedia’s KOS, ensuring compliance with international standards. It evaluates the efficiency of retrieving non-masculine gender-related articles using the Catalan Wikipedian category scheme, identifying limitations. Additionally, a novel assessment of Wikidata ontologies examines their structure and coverage of gender-related properties, comparing them to Wikipedia’s taxonomy for advantages and enhancements.FindingsThis study evaluates Wikipedia’s taxonomy and Wikidata’s ontologies, establishing evaluation criteria for gender-based categorization and exploring their structural effectiveness. The evaluation process suggests that Wikidata ontologies may offer a viable solution to address Wikipedia’s categorization challenges.Originality/valueThe assessment of Wikipedia categories (taxonomy) based on KOS standards leads to the conclusion that there is ample room for improvement, not only in matters concerning gender identity but also in the overall KOS to enhance search and retrieval for users. These findings bear relevance for the design of tools to support information retrieval on knowledge-rich websites, as they assist users in exploring topics and concepts.
Reference69 articles.
1. Abián, D., Meroño-Peñuela, A. and Simperl, E. (2022), “An analysis of content gaps versus user needs in the Wikidata knowledge graph”, in Sattler, U., Hogan, A., Keet, M., Presutti, V., Almeida, J.P.A., Takeda, H., Monnin, P., Pirrò, G. and d'Amato, C. (Eds), Lecture Notes in Computer Science, Springer Science and Business Media Deutschland GmbH; Scopus, Vol. 13489 LNCS, pp. 354-374, doi: 10.1007/978-3-031-19433-7_21.
2. Testing the validity of Wikipedia categories for subject matter labelling of open-domain corpus data;Journal of Information Science,2022
3. Albuquerque, F.A.A.C. (2017), “Arcabouço de arquitetura da informação para ciclo de vida de projeto de vocabulário controlado: uma aplicação em Engenharia de Software [Fernando Antônio de Araújo Chacon de]”, available at: https://repositorio.unb.br/handle/10482/31288
4. Assessing the practice of biomedical ontology evaluation: gaps and opportunities;Journal of Biomedical Informatics,2018