Use of the generalized linear model for forecasting the number of visits to museums in Mexico: comparison between ordinary linear and Poisson regression
PDF (Español (España))

Keywords

museums
forecasting
generalized linear model
ordinary lineal regression
Poisson regression
Newthon-Raphson
goodness of fit
culture
cultural offer
diffusion
cultural consumption
visitors
variables museos
pronóstico
modelo lineal generalizado
regresión lineal ordinaria
regresión de Poisson
Newthon-Raphson
bondad de ajuste
cultura
oferta cultural
difusión
consumo cultural
visitantes
variables

How to Cite

Guzmán Chávez, A. D., Guerrero González, F., & Vargas Rodríguez, E. . (2022). Use of the generalized linear model for forecasting the number of visits to museums in Mexico: comparison between ordinary linear and Poisson regression. Nova Scientia, 14(28). https://doi.org/10.21640/ns.v14i28.2999

Abstract

It is of great interest for establishments that offer a product or service, for example museums, to know the number of possible visits as a function of other variables that customers will make in a specific period. This to evaluate the degree of demand that exists and at the same time, evaluate if the dissemination strategies are working or otherwise, make more appropriate decisions to improve the quality of attention and satisfaction with visitors. To implement the generalized linear model to estimate the number of annual visits to museums in Mexican territory as a function of predictive variables, a database of 110 museums taken from the Instituto Nacional de Estadística y Geografía (INEGI) for the years 2017 and 2018 was used. The models used were regression ordinary linear (RLO) and Poisson regression (RP) and these were applied to all the principal components of eleven predictor variables (web page, reception and service capacity, main theme, permanent collection, ownership, entry, adult fee, discounts, type of visits, open days of the year and temporary exhibitions) to counteract multicollinearity. Besides, the coefficient of determination ( ) was measured between the observed and estimated data to determine the method with the best fit. The functions were used to estimate the number of visits for the years 2019 and 2020. With the model that best fit, an analysis of estimation errors was performed. Comparing the observed data for the years 2017 and 2018 with the results estimated with the characteristic functions of each model, the determination coefficients were  0.61 for the RLO and 0.86 for the PR, respectively. For the years 2019 and 2020, the coefficients of determination obtained were 0.68 and 0.31 for the RLO, and 0.87 and 0.84 for the RP, respectively. The maximum error of estimation registered was between 10 001 and 20 000 annual visits. It is shown that the most suitable model to forecast future annual visits to any museum in Mexico is the Poisson regression. It is believed that the high adjustment to the data observed with the PR method is because these did not have an excess of zeros and fulfilled the assumption of equidispersion. Finally, with the proposed characteristic function, the number of annual visits can be estimated with a maximum error of around 10 %, which is low compared to the maximum number of annual visits that a museum receives.

https://doi.org/10.21640/ns.v14i28.2999
PDF (Español (España))

References

Azhari, H. R., Widyaningsih, Y., y Lestari, D. (2018). Predicting Final Result of Football Match Using Poisson Regression Model. Journal of Physics: Conference Series (pp. 1-5). Depok: IOPscience. https://doi.org/10.1088/1742-6596/1108/1/012066

Gabda, D., Jubok, Z. H., Budin, K., y Hassan, S. (2008, June). Multiple Linear Regression in Forecasting The Number of Asthmatics. WSEAS Transactions on Information Science & Applications, 5(6), 972-977.

Gardner, W., Mulvey, E. P., y Shaw, E. C. (1995). Regression Analyses of Counts and Rates: Poisson, Oversipersed Poisson, and Negative Binomial Modelos. Psychological Bulletin, 118(3), 392-404. https://doi.org/10.1037/0033-2909.118.3.392

Glenn, D. (2002). Multivariate Regression Trees: A new Technique for Modeling Species-Environment Relationships. Ecology, 83(4), 1105-1117. https://doi.org/10.1890/0012-9658(2002)083[1105:MRTANT]2.0.CO;2

Graefe, A., Armstrong, J. S., Jones, R. J., y Cuzán, A. G. (2014, March). Combining Forecasts: An Application to Elections. International Journal of Forecasting, 30(1), 43-54. https://doi.org/10.1016/j.ijforecast.2013.02.005

Graff, C. A., Coffield, S. R., Chen, Y., Foufoula-Georgiou, E., Randerson, J. T., y Smyth, P. (2020, July). Forecasting Daily Wildfire Activity Using Poisson Regression. IEEE Transactions on Geoscience and Remote Sensing, 58(7), 4837-4851. https://doi.org/10.1109/TGRS.2020.2968029

Ismail, Z., Yahya, A., y Shabri, A. (2009). Forecasting Gold Prices Using Multiple Linear Regression Method. American Journal of Applied Sciences, 6(8), 1509-1514.

Montgomery, D. C., Peck, E. A., y Vining, G. G. (2021). Introduction to Linear Regression Analysis. New Jersey: John Wiley & Sons.

Uyanik, G. K., y Güler, N. (2013, December 10). A Study on Multiple Linear Regression Analysis. Precedia - Social and Behavioral Sciences, 106, 234-240. https://doi.org/10.1016/j.sbspro.2013.12.027

Vicuña, M. I., C. V., y Quiroga, B. F. (2021, April 23). Forecasting the 2020 COVID-19 Epidemic: A Multivariate Quasi-Poisson Regression to Model the Evolution of New Cases in Chile. Frontiers in Public Health, 9, 416-422. https://doi.org/10.3389/fpubh.2021.610479

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2022 Nova Scientia