Reproducible pipeline with PYTHON/PYSUS for Departamento de Informática do Sistema Único de Saúde (DATASUS) extraction and analysis: Time series and sex profile of UTI in the SIH-RD (MG and AC states), 2024

Authors

DOI:

https://doi.org/10.33448/rsd-v15i3.50704

Keywords:

Hospital Information Systems, Urinary Tract Infections, Hospitalization, Unified Health System, Reproducibility of Tests.

Abstract

Reproducible access to DATASUS microdata is critical for health research and management, yet it often faces operational barriers, particularly when dealing with large datasets. This study aimed to demonstrate the feasibility of a reproducible Python/PySUS pipeline to extract and analyze SIH/SUS data (RD files), using hospitalizations for urinary tract infection (UTI) as a use case. UTI was defined as ICD-10 N39.0 recorded as the primary diagnosis (DIAG_PRINC = N390) in Minas Gerais (MG) and Acre (AC), Brazil, in 2024. The pipeline was executed in Google Colab, performing month-by-month downloads (by competence), DataFrame conversion, and Parquet caching to improve robustness and enable reruns. Descriptive analyses included monthly time series and sex distribution. A total of 29,720 UTI hospitalizations were identified in 2024 (MG: 28,860; AC: 860), with a peak in January (MG: 2,690; AC: 95). Female hospitalizations predominated in both states (AC: 75.9%; MG: 65.2%). The findings indicate that PySUS enables automated and reproducible access to DATASUS, efficiently producing tables and figures and supporting secondary-data studies across settings with different data volumes.

References

AlertaDengue. (n.d.). PySUS [Software]. GitHub. https://github.com/AlertaDengue/PySUS, Recuperado em 27/02/2026.

Benchimol, E. I., Smeeth, L., Guttmann, A., Harron, K., Moher, D., Petersen, I., Sørensen, H. T., von Elm, E., Langan, S. M., & RECORD Working Committee. (2015). The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLOS Medicine, 12(10), e1001885. https://doi.org/10.1371/journal.pmed.1001885

Bisong, E. (2019). Google Colaboratory. In Building Machine Learning and Deep Learning Models on Google Cloud Platform (pp. 59–64). Apress.

Bittencourt, S. A., Camacho, L. A. B., & Leal, M. C. (2006). O Sistema de Informação Hospitalar e sua aplicação na saúde coletiva. Cadernos de Saúde Pública, 22(1), 19–30.

Czajkowski, K., Broś-Konopielko, M., & Teliga-Czajkowska, J. (2021). Urinary tract infection in women. Menopause Review, 20(1), 40–50.

Flores-Mireles, A. L., Walker, J. N., Caparon, M., & Hultgren, S. J. (2015). Urinary tract infections: Epidemiology, mechanisms of infection and treatment options. Nature Reviews Microbiology, 13(5), 269–284.

Foxman, B. (2014). Urinary tract infection syndromes. Infectious Disease Clinics of North America, 28(1), 1–13.

He, Q., Yang, L., & Zeng, G. (2025). Global burden of urinary tract infections: A systematic analysis. Journal of Global Health, 15, 04012.

Lessa, F., Mendes, A. C. G., Farias, S. F., Sá, D. A., Duarte, P. O., & Melo Filho, D. A. (2000). Novas metodologias para vigilância epidemiológica: utilização do sistema de informações hospitalares do SUS. Informe Epidemiológico do SUS, 9(1), 3–8.

Machado, J. P., Martins, M., & Leite, I. C. (2016). Qualidade das bases de dados hospitalares no Brasil: alguns elementos. Revista Brasileira de Epidemiologia, 19(3), 567–581.

Malta, D. C., et al. (2018). Sistemas de informação em saúde no Brasil: avanços e desafios. Ciência & Saúde Coletiva, 23(6), 1905–1916.

Mendes, A. C. G., Sá, D. A., Miranda, G. M. D., Lyra, T. M., & Tavares, R. A. W. (2000). Avaliação do Sistema de Informações Hospitalares (SIH/SUS) como fonte complementar na vigilância e monitoramento de doenças. Epidemiologia e Serviços de Saúde.

Ministério da Saúde. (n.d.). Morbidade Hospitalar do SUS (SIH/SUS). DATASUS. https://datasus.saude.gov.br/acesso-a-informacao/morbidade-hospitalar-do-sus-sih-sus/ Recuperado em 27/02/2026

Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., et al. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 0021.

Peng, R. D. (2011). Reproducible research in computational science. Science, 334(6060), 1226–1227.

Pereira, A. S. et al. (2018). Metodologia da pesquisa científica. [Free ebook]. Santa Maria. Editora da UFSM.

Risemberg, R. I. C., Wakin, M., & Shitsuka, R. (2026). A importância da metodologia científica no desenvolvimento de artigos científicos. E-Acadêmica, 7(1), e0171675.

Shitsuka, R. et al. (2014). Matemática fundamental para tecnologia. (2ed). Editora Érica.

Simmering, J. E., Tang, F., Cavanaugh, J. E., Polgreen, L. A., & Polgreen, P. M. (2017). The increase in summer time urinary tract infections: A systematic review and meta-analysis. Epidemiology and Infection, 145(13), 2781–2795.

Vandenbroucke, J. P., von Elm, E., Altman, D. G., Gøtzsche, P. C., Mulrow, C. D., Pocock, S. J., & Egger, M. (2007). Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and elaboration. PLoS Medicine, 4(10), e297.

von Elm, E., Altman, D. G., Egger, M., Pocock, S. J., Gøtzsche, P. C., & Vandenbroucke, J. P. (2007). The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: Guidelines for reporting observational studies. The Lancet, 370(9596), 1453–1457.

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., et al. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018.

Yang, L., et al. (2022). Global epidemiology of urinary tract infections: A systematic review. The Lancet Regional Health.

Zeng, G., et al. (2022). Global, regional, and national burden of urinary tract infections. BMC Medicine, 20, 123.

Published

2026-03-06

Issue

Section

Health Sciences

How to Cite

Reproducible pipeline with PYTHON/PYSUS for Departamento de Informática do Sistema Único de Saúde (DATASUS) extraction and analysis: Time series and sex profile of UTI in the SIH-RD (MG and AC states), 2024. Research, Society and Development, [S. l.], v. 15, n. 3, p. e1315350704, 2026. DOI: 10.33448/rsd-v15i3.50704. Disponível em: https://rsdjournal.org/rsd/article/view/50704. Acesso em: 24 mar. 2026.