Manifestation and sociodemographic microdata of Brazil’s Unified Health System Ombudsman
BMC Research Notes volume 17, Article number: 18 (2024)
This article presents the process of extraction and treatment of two datasets from the General Ombudsman of the Brazilian Unified Health System (OUVSUS). The resulting datasets allow the analysis of manifestation characteristics and sociodemographic profile of the citizens that performed these manifestations.
The first dataset depicts the characteristics of the manifestations registered by the General Ombudsman. Each row represents an individual manifestation and contains information such as the registration date, classification, input channel, and subject, among others. The second dataset is constituted of sociodemographic information for each citizen that performed a manifestation, and characteristics such as sexual orientation, race, age, and geographic location of the citizen are presented, among others.
This article presents the process of creation, treatment, and availability of two datasets from the General Ombudsman of the Brazilian Unified Health System (OUVSUS).
The General Ombudsman works permanently, through direct communication with citizens, to enable social participation, the dissemination of health information, and formal mediation between the needs of users and the managers of the Brazilian Unified Health System (SUS) .
Different channels of entry are available for citizens to express themselves. The manifestations are digitally registered, containing information about the characteristics of the user’s profile and the manifestations presented.
The first dataset presents the different characteristics of the manifestations carried out by the population, while the second presents the sociodemographic profile of the users that carried out these manifestations.
The data were obtained through a partnership between the General Ombudsman of the Brazilian Unified Health System (Ministry of Health of Brazil), the Regional Office of Aggeu Magalhães Institute, and the Platform of Data Science applied to Health (both from the Oswaldo Cruz Foundation).
Ombudsman’s offices are institutional bodies that aim to make public services responsive to citizens’ demands. Through the ombudsman, the citizen ceases to act as a mere public service user to exercise the role of controller and evaluator of public policies .
These data are relevant to public administration and academic research, providing a temporal portrait of the Brazilian population’s aspirations concerning public health.
This data paper presents two datasets: the first depicts the characteristics of the manifestations registered by the General Ombudsman, where each row represents an individual manifestation and contains information such as registration date, classification, input channel, and subject, among others; the second dataset is constituted of sociodemographic information for each citizen that performed a manifestation, and characteristics such as the sexual orientation, race, age, and geographic location of the citizen, among others, are presented.
These datasets are generated by acquiring the original microdata and applying specific data construction steps. A detailed description of the methodology can be found in the file etl_methodology. The datasets represent information collected by the Genaral Ombudsman from 2010–01-01 to the day that the microdata is extracted from the source database, which is performed almost on a daily basis. These datasets and related files are shown in Table 1.
One of the main contributions of this data paper is facilitating access to OUVSUS microdata. Due to security reasons, accessing the DATASUS databases that store this microdata is impossible. Therefore, it was necessary to develop an indirect approach to acquire the data, where an Ombudsman technical staff, with access privileges to the databases, is responsible for manually extracting the desired information. This process is performed on an (almost) daily basis and results in two files: OUVIDORIA_MANIFESTACOES_<extraction_date>.csv and OUVIDORIA_PERFIL_<extraction_date>.csv, which can be obtained in the links manifestation_updated_dataset and citizen_profile_updated_dataset, respectively.
After the data acquisition, the original microdata is cleansed, transformed, and enriched by applying different data construction operations. The definition of the operations is based on detailed studies of the datasets and feedback received from the Ombudsman’s technical staff and specialists in the field.
A general overview of the data construction operations performed for each dataset is presented below:
Manifestations replacement of values in date-typed columns with NULL values when the format is not correct or when the values represent dates outside the coverage period; generation of new columns derived from date columns; fixing incorrect values based on feedback obtained from specialists.
Citizen Sociodemographic Profile renaming of columns due to wrong or archaic titles; fixing values through the application of mapping operations; replacement of invalid age values with NULL; generation of new columns derived from date columns; enrichment with municipality info, such as municipality name, area, coordinates, geographic regions, among others. The information used for the municipality info can be found in the file municipalities.
A detailed description of the data construction process may be found in the notebooks data_construction_citizen_profile and data_construction_manifestations, which exemplify the application to microdata spanning the period from 2010–01-01 to 2022–07-19. The resulting datasets obtained after the data construction process can be found in the files CITIZEN_PROFILE_20100101_20220719_T.zip and MANIFESTATIONS-20,100,101_20220719_T.zip. For reproducibility purposes, the original datasets are found in files CITIZEN_PROFILE_20100101_20220719 and MANIFESTATIONS_20100104_20220719.
A complete description of the variables for each dataset generated after the data construction process can be found in the files dict_citizen_profile.csv and dict_manifestations.csv.
The most recent versions of original and derived datasets for manifestations and sociodemographic profiles can be found in the links manifestation_updated_dataset and citizen_profile_updated_dataset, respectively.
The updated datasets in the links manifestation_updated_dataset and citizen_profile_updated_dataset might contain data extracted from the databases on a previous date to the download process. This occurs due to the indirect extraction approach, where an Ombudsman’s technical staff might not be able to generate the original datasets for a given date.
There are two reasons for the high number of NULL values in the datasets regarding the characteristics of manifestations. First, some information is acquired only after a registered manifestation has concluded some stages. For example, the variable “DATA DO FECHAMENTO” (conclusion date) is filled only when the corresponding manifestation has gone through all phases necessary to close a register. The second reason for the high number of NULL values is the result of users not providing optional information. This is the case for the variable “BAIRRO DO CIDADAO” (the user’s district that performed the manifestation).
As the filling of the citizen sociodemographic profile data is the user’s own responsibility, some informations may not have been filled.
Availability of data and materials
The data described in this Data note (CITIZEN_PROFILE_20100101_20220719, CITIZEN_PROFILE_20100101_20220719_T, citizen_profile_updated_dataset, MANIFESTATIONS-20,100,101_20220719, MANIFESTATIONS-20,100,101_20220719_T and manifestations_updated_dataset) as well as its auxiliary files can be freely and openly accessed on Synapse under the same persistent identifier: https://doi.org/10.7303/syn31945851.
Brazilian Unified Health System
Department of informatics of the Unified Health System of Brazil
General Ombdusman of the Brazilian Unified Health System
Fernandez MV, Junior GDG, de Sá DA, de Medeiros KR, Caliari RV. As ouvidorias públicas na democracia brasileira: o caso das ouvidorias do sus. In: UFPE (eds) Ouvidoria do SUS: a Voz do Cidadão e Resultados de Pesquisas. SÁ, D. A.; GURGEL JUNIOR, G. D.; FERNANDEZ, M. V.; MOREIRA, R. S., Recife; 2019.
General Ombudsman of the Brazilian Unified Health System (SUS). 2022. https://www.synapse.org/#!Synapse:syn31945851/wiki/617929 Accessed 24 Jan 2023.
We would like to thank the Brazilian Health Ministry, the Aggeu Magalhães Institute - Fiocruz PE and the Health Information and Communication Institute - ICICT - Fiocruz.
Brazilian Ministry of Health
Ethics approval and consent to participate
Following Brazilian federal regulations on ethics approval, the datasets presented in this paper are dispensed of ethics approval.
The authors declare that they have no competing interests.
Consent for publication
The Brazilian Health Ministry, through its General Ombudsman, consents to the publication of the data presented in this paper. Declaration of Consent n. 25000.202960/2019-51.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
de Freitas Saldanha, R., Kreischer, V., Gritz, R. et al. Manifestation and sociodemographic microdata of Brazil’s Unified Health System Ombudsman. BMC Res Notes 17, 18 (2024). https://doi.org/10.1186/s13104-023-06639-x