English
 
Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Predicting temperatures in Brazilian states capitals via Machine Learning

Authors

da Silva,  Sidney T.
External Organizations;

Gabrick,  Enrique C.
External Organizations;

de Moraes,  Ana Luiza R.
External Organizations;

Viana,  Ricardo L.
External Organizations;

Batista,  Antonio M.
External Organizations;

Caldas,  Iberê L.
External Organizations;

/persons/resource/Juergen.Kurths

Kurths,  Jürgen
Potsdam Institute for Climate Impact Research;

External Resource
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

da Silva, S. T., Gabrick, E. C., de Moraes, A. L. R., Viana, R. L., Batista, A. M., Caldas, I. L., Kurths, J. (2025 online): Predicting temperatures in Brazilian states capitals via Machine Learning. - European Physical Journal - Special Topics.
https://doi.org/10.1140/epjs/s11734-025-01710-z


Cite as: https://publications.pik-potsdam.de/pubman/item/item_32766
Abstract
Climate change refers to substantial long-term variations in weather patterns. In this work, we employ a Machine Learning (ML) technique, the Random Forest (RF) algorithm, to forecast the monthly average temperature for Brazilian’s states capitals (27 cities) and the whole country, from January 1961 to December 2022. To forecast the temperature at k-month, we consider as features in RF: (i) global emissions of carbon dioxide (CO2), methane (CH4), and nitrous oxide (NO) at k-month; (ii) temperatures from the previous three months, i.e., (k - 1), (k-2) and (k-3) -month; (iii) combination of i and ii. By investigating breakpoints in the times series, we discover that 24 cities and the gases present breakpoints in the 80’s and 90’s. After the breakpoints, we find an increase in the temperature and the gas emission. Thereafter, we separate the cities according to their geographical position and employ the RF algorithm to forecast the temperature from 2010–08 until 2022–12. Based on i, ii, and iii, we find that the three inputs result in a very precise forecast, with a normalized root mean squared error (NMRSE) less than 0.083 for the considered cases. From our simulations, the better forecasted region is Northeast through iii (NMRSE = 0.012). Furthermore, we also investigate the forecasting of anomalous temperature data by removing the annual component of each time series. In this case, the best forecasting is obtained with strategy i, with the best region being Northeast (NRMSE = 0.090).