Reproducible pipeline with PYTHON/PYSUS for Departamento de Informática do Sistema Único de Saúde (DATASUS) extraction and analysis: Time series and sex profile of UTI in the SIH-RD (MG and AC states), 2024
DOI:
https://doi.org/10.33448/rsd-v15i3.50704Keywords:
Hospital Information Systems, Urinary Tract Infections, Hospitalization, Unified Health System, Reproducibility of Tests.Abstract
Reproducible access to DATASUS microdata is critical for health research and management, yet it often faces operational barriers, particularly when dealing with large datasets. This study aimed to demonstrate the feasibility of a reproducible Python/PySUS pipeline to extract and analyze SIH/SUS data (RD files), using hospitalizations for urinary tract infection (UTI) as a use case. UTI was defined as ICD-10 N39.0 recorded as the primary diagnosis (DIAG_PRINC = N390) in Minas Gerais (MG) and Acre (AC), Brazil, in 2024. The pipeline was executed in Google Colab, performing month-by-month downloads (by competence), DataFrame conversion, and Parquet caching to improve robustness and enable reruns. Descriptive analyses included monthly time series and sex distribution. A total of 29,720 UTI hospitalizations were identified in 2024 (MG: 28,860; AC: 860), with a peak in January (MG: 2,690; AC: 95). Female hospitalizations predominated in both states (AC: 75.9%; MG: 65.2%). The findings indicate that PySUS enables automated and reproducible access to DATASUS, efficiently producing tables and figures and supporting secondary-data studies across settings with different data volumes.
References
AlertaDengue. (n.d.). PySUS [Software]. GitHub. https://github.com/AlertaDengue/PySUS, Recuperado em 27/02/2026.
Benchimol, E. I., Smeeth, L., Guttmann, A., Harron, K., Moher, D., Petersen, I., Sørensen, H. T., von Elm, E., Langan, S. M., & RECORD Working Committee. (2015). The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLOS Medicine, 12(10), e1001885. https://doi.org/10.1371/journal.pmed.1001885
Bisong, E. (2019). Google Colaboratory. In Building Machine Learning and Deep Learning Models on Google Cloud Platform (pp. 59–64). Apress.
Bittencourt, S. A., Camacho, L. A. B., & Leal, M. C. (2006). O Sistema de Informação Hospitalar e sua aplicação na saúde coletiva. Cadernos de Saúde Pública, 22(1), 19–30.
Czajkowski, K., Broś-Konopielko, M., & Teliga-Czajkowska, J. (2021). Urinary tract infection in women. Menopause Review, 20(1), 40–50.
Flores-Mireles, A. L., Walker, J. N., Caparon, M., & Hultgren, S. J. (2015). Urinary tract infections: Epidemiology, mechanisms of infection and treatment options. Nature Reviews Microbiology, 13(5), 269–284.
Foxman, B. (2014). Urinary tract infection syndromes. Infectious Disease Clinics of North America, 28(1), 1–13.
He, Q., Yang, L., & Zeng, G. (2025). Global burden of urinary tract infections: A systematic analysis. Journal of Global Health, 15, 04012.
Lessa, F., Mendes, A. C. G., Farias, S. F., Sá, D. A., Duarte, P. O., & Melo Filho, D. A. (2000). Novas metodologias para vigilância epidemiológica: utilização do sistema de informações hospitalares do SUS. Informe Epidemiológico do SUS, 9(1), 3–8.
Machado, J. P., Martins, M., & Leite, I. C. (2016). Qualidade das bases de dados hospitalares no Brasil: alguns elementos. Revista Brasileira de Epidemiologia, 19(3), 567–581.
Malta, D. C., et al. (2018). Sistemas de informação em saúde no Brasil: avanços e desafios. Ciência & Saúde Coletiva, 23(6), 1905–1916.
Mendes, A. C. G., Sá, D. A., Miranda, G. M. D., Lyra, T. M., & Tavares, R. A. W. (2000). Avaliação do Sistema de Informações Hospitalares (SIH/SUS) como fonte complementar na vigilância e monitoramento de doenças. Epidemiologia e Serviços de Saúde.
Ministério da Saúde. (n.d.). Morbidade Hospitalar do SUS (SIH/SUS). DATASUS. https://datasus.saude.gov.br/acesso-a-informacao/morbidade-hospitalar-do-sus-sih-sus/ Recuperado em 27/02/2026
Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., et al. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 0021.
Peng, R. D. (2011). Reproducible research in computational science. Science, 334(6060), 1226–1227.
Pereira, A. S. et al. (2018). Metodologia da pesquisa científica. [Free ebook]. Santa Maria. Editora da UFSM.
Risemberg, R. I. C., Wakin, M., & Shitsuka, R. (2026). A importância da metodologia científica no desenvolvimento de artigos científicos. E-Acadêmica, 7(1), e0171675.
Shitsuka, R. et al. (2014). Matemática fundamental para tecnologia. (2ed). Editora Érica.
Simmering, J. E., Tang, F., Cavanaugh, J. E., Polgreen, L. A., & Polgreen, P. M. (2017). The increase in summer time urinary tract infections: A systematic review and meta-analysis. Epidemiology and Infection, 145(13), 2781–2795.
Vandenbroucke, J. P., von Elm, E., Altman, D. G., Gøtzsche, P. C., Mulrow, C. D., Pocock, S. J., & Egger, M. (2007). Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and elaboration. PLoS Medicine, 4(10), e297.
von Elm, E., Altman, D. G., Egger, M., Pocock, S. J., Gøtzsche, P. C., & Vandenbroucke, J. P. (2007). The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: Guidelines for reporting observational studies. The Lancet, 370(9596), 1453–1457.
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., et al. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018.
Yang, L., et al. (2022). Global epidemiology of urinary tract infections: A systematic review. The Lancet Regional Health.
Zeng, G., et al. (2022). Global, regional, and national burden of urinary tract infections. BMC Medicine, 20, 123.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Marcos Rodrigo de Oliveira, Camilo Amaro de Carvalho, Juliana Cantele Xavier

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
1) Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
2) Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
3) Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
