Abstract
The underrepresentation of women in STEM fields needs to be highlighted through data to help decision-makers and public policy creators address the issue effectively. However, structured, organized, openly published data in this domain is still lacking. To address this problem, a Latin American research network called ELLAS was created. The project's goal is to develop a platform based on Semantic Web technologies to structure and concentrate data, initially from Brazil, Peru, and Bolivia. This paper presents the processes defined for the collection and curation of both unstructured and structured data, sourced from scientific articles, social networks, and existing open data. We describe the architecture design in a way that clarifies the details of the processes and the actors involved for each data source. We present the preliminary results from applying these processes and the strategies for future work, which include data extraction and curation and the development of the ontology and knowledge graph. We also present ongoing work, such as the development and application of the survey, and identify what has not yet been done, such as the platform development.
Publisher
Sociedade Brasileira de Computação - SBC