Use of the generalized linear model for forecasting the number of visits to museums in Mexico: comparison between ordinary linear and Poisson regression
PDF (Español (España))


generalized linear model
ordinary lineal regression
Poisson regression
goodness of fit
cultural offer
cultural consumption
variables museos
modelo lineal generalizado
regresión lineal ordinaria
regresión de Poisson
bondad de ajuste
oferta cultural
consumo cultural

How to Cite

Guzmán Chávez, A. D., Guerrero González, F., & Vargas Rodríguez, E. . (2022). Use of the generalized linear model for forecasting the number of visits to museums in Mexico: comparison between ordinary linear and Poisson regression. Nova Scientia, 14(28).


It is of great interest for establishments that offer a product or service, for example museums, to know the number of possible visits as a function of other variables that customers will make in a specific period. This to evaluate the degree of demand that exists and at the same time, evaluate if the dissemination strategies are working or otherwise, make more appropriate decisions to improve the quality of attention and satisfaction with visitors. To implement the generalized linear model to estimate the number of annual visits to museums in Mexican territory as a function of predictive variables, a database of 110 museums taken from the Instituto Nacional de Estadística y Geografía (INEGI) for the years 2017 and 2018 was used. The models used were regression ordinary linear (RLO) and Poisson regression (RP) and these were applied to all the principal components of eleven predictor variables (web page, reception and service capacity, main theme, permanent collection, ownership, entry, adult fee, discounts, type of visits, open days of the year and temporary exhibitions) to counteract multicollinearity. Besides, the coefficient of determination ( ) was measured between the observed and estimated data to determine the method with the best fit. The functions were used to estimate the number of visits for the years 2019 and 2020. With the model that best fit, an analysis of estimation errors was performed. Comparing the observed data for the years 2017 and 2018 with the results estimated with the characteristic functions of each model, the determination coefficients were  0.61 for the RLO and 0.86 for the PR, respectively. For the years 2019 and 2020, the coefficients of determination obtained were 0.68 and 0.31 for the RLO, and 0.87 and 0.84 for the RP, respectively. The maximum error of estimation registered was between 10 001 and 20 000 annual visits. It is shown that the most suitable model to forecast future annual visits to any museum in Mexico is the Poisson regression. It is believed that the high adjustment to the data observed with the PR method is because these did not have an excess of zeros and fulfilled the assumption of equidispersion. Finally, with the proposed characteristic function, the number of annual visits can be estimated with a maximum error of around 10 %, which is low compared to the maximum number of annual visits that a museum receives.
PDF (Español (España))


Azhari, H. R., Widyaningsih, Y., y Lestari, D. (2018). Predicting Final Result of Football Match Using Poisson Regression Model. Journal of Physics: Conference Series (pp. 1-5). Depok: IOPscience.

Gabda, D., Jubok, Z. H., Budin, K., y Hassan, S. (2008, June). Multiple Linear Regression in Forecasting The Number of Asthmatics. WSEAS Transactions on Information Science & Applications, 5(6), 972-977.

Gardner, W., Mulvey, E. P., y Shaw, E. C. (1995). Regression Analyses of Counts and Rates: Poisson, Oversipersed Poisson, and Negative Binomial Modelos. Psychological Bulletin, 118(3), 392-404.

Glenn, D. (2002). Multivariate Regression Trees: A new Technique for Modeling Species-Environment Relationships. Ecology, 83(4), 1105-1117.[1105:MRTANT]2.0.CO;2

Graefe, A., Armstrong, J. S., Jones, R. J., y Cuzán, A. G. (2014, March). Combining Forecasts: An Application to Elections. International Journal of Forecasting, 30(1), 43-54.

Graff, C. A., Coffield, S. R., Chen, Y., Foufoula-Georgiou, E., Randerson, J. T., y Smyth, P. (2020, July). Forecasting Daily Wildfire Activity Using Poisson Regression. IEEE Transactions on Geoscience and Remote Sensing, 58(7), 4837-4851.

Ismail, Z., Yahya, A., y Shabri, A. (2009). Forecasting Gold Prices Using Multiple Linear Regression Method. American Journal of Applied Sciences, 6(8), 1509-1514.

Montgomery, D. C., Peck, E. A., y Vining, G. G. (2021). Introduction to Linear Regression Analysis. New Jersey: John Wiley & Sons.

Uyanik, G. K., y Güler, N. (2013, December 10). A Study on Multiple Linear Regression Analysis. Precedia - Social and Behavioral Sciences, 106, 234-240.

Vicuña, M. I., C. V., y Quiroga, B. F. (2021, April 23). Forecasting the 2020 COVID-19 Epidemic: A Multivariate Quasi-Poisson Regression to Model the Evolution of New Cases in Chile. Frontiers in Public Health, 9, 416-422.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2022 Nova Scientia