Taxonomies and Ontologies in Wikipedia and Wikidata: An In-Depth Examination of Knowledge Organization Systems

Main Article Content

Miquel Centelles
Núria Ferran Ferrer

This article examines Wikipedia’s knowledge organization system (KOS) and the broader KOS of Wikidata. We study the structure, functions, and relationship of Wikipedia’s KOS to concepts like taxonomies and folksonomies, highlighting its unique characteristics compared to social media. A significant aspect of our examination is the gender-related content classification in the Catalan edition of Wikipedia (Viquipèdia), which notably excludes female categories and non-binary gender classifications. We explore the potential implications of these restrictions on gender bias within the platform. Furthermore, we broaden our investigative methodology to assess the KOS of Wikidata. Wikidata is a dataset built on ontological principles, designed to enhance and enrich Wikipedia’s digital, collaborative encyclopedia. The findings shed light on the presence or absence of gender bias and contribute to the ongoing discourse on promoting inclusivity and diversity in online knowledge sharing.

Keywords
KOS, Knowledge Organization System, Ontology, Taxonomy, Wikidata, Wikipedia

Article Details

How to Cite
Centelles, Miquel; Ferran Ferrer, Núria. “Taxonomies and Ontologies in Wikipedia and Wikidata: An In-Depth Examination of Knowledge Organization Systems”. Hipertext.net, 2024, no. 28, pp. 33-48, doi:10.31009/hipertext.net.2024.i28.04.
Author Biographies

Miquel Centelles, Universitat de Barcelona

He is a professor at the Faculty of Information and Audiovisual Media at the University of Barcelona (FIMA).
He holds a degree in Library Science and Documentation and a bachelor’s degree in Philology. His teaching and research focus on the representation and organization of information, as well as the application of semantic technologies in information and knowledge management. He coordinated the Master’s in Digital Content Management from 2005 to 2008, and since 2020, he has been the coordinator of the Master’s in Digital Humanities, involving five faculties at the UB. In research, he has collaborated on the Archiver project for the digital preservation of research data (Archiver TENDER – European Union), and the I+D+I project, Women and Wikipedia (PID2020-116936RA-I00).

Núria Ferran Ferrer, Universitat de Barcelona

An associate professor at the Faculty of Information and Audiovisual Media at the University of Barcelona (UB) since 2021, previously at the UOC from 2005. She coordinates the Doctoral Program in Information and Communication at UB. She holds a European doctorate from UB (2010), with degrees in Journalism (UAB, 1998), Documentation (UOC, 2003), and a Master’s in Information Society (IN3-UOC, 2005). She has been an associate professor at several universities, including UAB, UB, and UPF. Currently, she serves as the delegate of the rector for the Directorate of the Equality Unit. In research, she is the principal investigator of the I+D+I project, Women and Wikipedia (PID2020-116936RA-I00), where she supervises two theses. She has also collaborated on open science and citizen science projects and conducted research stays at the University of Sheffield (United Kingdom, 2009) and theUniversity of Tallin (Estonia, 2015).

References

AENOR. (2014). UNE-ISO 25964-1: Información y documentación: Tesauros e interoperabilidad con otros vocabularios. Parte 1: Tesauros para la recuperación de la información. AENOR. https://www.une.org/encuentra-tu-norma/busca-tu-norma/norma?c=N0053960

Dawe, L. & Robinson, A. (2017). Wikipedia editing and information literacy: A case study. Information and Learning Science, 118(1/2), 5–16.https://doi.org/10.1108/ILS-09-2016-0067

Ferran-Ferrer, N., Castellanos-Pineda, P., Minguillón, J. & Meneses,J. (2021). The gender gap on the spanish wikipedia: Listening to the voices of women editors. Profesional de La Informacion, 30(5), e300516. Scopus. https://doi.org/10.3145/epi.2021.sep.16

I.S.S.T, Fraunhofer. (2009). Guidelines and good practices for taxonomies (1.3). Semantic Interoperability Centre Europe. https://joinup.ec.europa.eu/sites/default/files/document/2011-12/guidelines-and-good-practices-for-taxonomies-v1.3a.pdf

Kaffee, L-A., Piscopo, A., Vougiouklis, P., Simperl, E., Carr, L. & Pintscher,L. (2017). A Glimpse into Babel: An Analysis of Multilinguality inWikidata. Proceedings of the 13th International Symposium on Open Collaboration, 1–5. https://doi.org/10.1145/3125433.3125465

Maciá, Y. (2022). Mujeres de categoría: Utilización de los principios y estándares de los datos enlazados (linked open data) paravisualizar las biografías de mujeres en la Viquipèdia [Master’s Degree, Universitat de Barcelona]. https://diposit.ub.edu/dspace/handle/2445/189877

Piscopo, A., Phethean, C. & Simperl, E. (2017). What Makes aGood Collaborative Knowledge Graph: Group Compositionand Quality in Wikidata. In Giovanni Luca Ciampaglia, Afra Mashhadi, and Taha Yasseri (Eds.), Social Informatics (pp. 305–322). Springer International Publishing. https://doi.org/10.1007/978-3-319-67217-5_19

Quintarelli, E. (2005). Power to the people. ISKO Italy-UniMIB Meeting,Milan, June 24, 2005.

Singer, P, Lemmerich, F, West, R, Zia, L, Wulczyn, E, Strohmaier, M. & Leskovec, J. (2017). Why We Read Wikipedia. Proceedings of the 26th International Conference on World Wide Web, 1591–1600. https://doi. org/10.1145/3038912.3052716

Soler-Adillon, J, Pavlovic, D. & Freixa, P. (2018). Wikipedia in higher education: Changes in perceived value through content contribution. Comunicar, 26(54), 39–48. https://doi.org/10.3916/C54-2018-04

Vrandečić, D. & Krötzsch, M. (2014). Wikidata: A free collaborative knowledgebase. Communications of the ACM, 57(10), 78–85. https://doi.org/10.1145/2629489

Wikimedia. (2016). Help:Label/ca. https://www.wikidata.org/wiki/Help:Label/ca

Wikimedia. (2021). Categoria:Infermers. In Viquipèdia, l’enciclopèdia lliure. https://ca.wikipedia.org/w/index.php title=Categoria:Infermers&oldid=26858616

Wikimedia. (2022a). Wikidata:WikiProject Biography. https://www.wikidata.org/wiki/Wikidata:WikiProject_Biography

Wikimedia. (2022b). Wikidata:WikiProject Ontology/Classes. https://www.wikidata.org/wiki/Wikidata:WikiProject_Ontology/Classes

Wikimedia. (2022c). Wikipedia:Propuesta de política de categorización. In Wikipedia, la enciclopedia libre. https://es.wikipedia.org/w/index.php?title=Wikipedia:Propuesta_de_pol%C3%ADtica_de_categorizaci%C3%B3n&oldid=142571261

Wikimedia. (2023a). Help:Basic membership properties. https://www.wikidata.org/wiki/Help:Basic_membership_properties

Wikimedia. (2023b). Help:Property constraints portal. https://www.wikidata.org/wiki/Help:Property_constraints_portal

Wikimedia. (2023c). Help:Property constraints portal/list of constraints. https://www.wikidata.org/wiki/Help:Property_constraints_portal/list_of_constraints

Wikimedia. (2023d). Wikidata:Creators de proprietats. https://www.wikidata.org/wiki/Wikidata:Property_creators/oc

Wikimedia. (2023e). Wikipedia:Categorización. In Wikipedia, la enciclopedia libre. https://es.wikipedia.org/w/index.php?title=Wikipedia:-Categorizaci%C3%B3n&oldid=152512537

Wikimedia. (2023f). Viquipèdia:Categorització. In Viquipèdia, l’enciclopèdia lliure. https://ca.wikipedia.org/w/index.php?title=Viquip%-C3%A8dia:Categoritzaci%C3%B3&oldid=32543623

Yedid, N. (2013). Introducción a las Folksonomías: Definición, Características y Diferencias con los Modelos Tradicionales de Indización. Información, cultura y sociedad, 29, Article 29. http://revistascientificas.filo.uba.