Predicting Project Delays Using New Trended Regression Tree Method

Document Type : Research Paper

Author

Jundi-Shapur University of Technology, Dezful, Iran

10.22070/jqepo.2023.16409.1237

Abstract

gas distribution projects in Iran between 2015 and 2020. A series of predictive models have been reviewed and evaluated for delay risk prediction such as k-Nearest Neighbor (k-NN) Regression, Regression Trees (RT), Support Vector Machine Regression (SVMR), and Artificial Neural Network (ANN). Computational results based on cross-validation revealed that when delays follow a rational pattern it could be predicted by our developed Trended Regression Tree (TRT) method and k-NN regression method. These novel methods are effective and provide practitioners with significantly more reliable predictions and applied insight into the delay causes. The notion of Trended Regression Trees is developed for the first time. Project delays are modeled based on project specifications and therefore there is no need to make any extra data gathering to predict project delays. Based on the research findings, we recommended that the management team focus their quest on the most effective factors to reduce project delays.

Keywords


Adam, A., Josephson, P., & Lindahl, G. (2017). Aggregation of factors causing cost overruns and time delays in large public construction projects: Trends and implications. Engineering, Construction and Architectural Management, 24, 393-406.
Al‐Kharashi, A., & Skitmore, M. (2009). Causes of delays in Saudi Arabian public sector construction projects. Construction Management and Economics, 27, 3-23.
Alshboul, O., Alzubaidi, M.A., Mamlook, M.E., Almasabha, G., Almuflih, A.S., & Shehadeh, A. (2022a). Forecasting Liquidated Damages via Machine Learning-Based Modified Regression Models for Highway Construction Projects. Sustainability, 14(10), 5835.
Alshboul, O., Shehadeh, A., Al Mamlook, R.E., Almasabha, G., Almuflih, A.S., & Alghamdi, S.Y. (2022). Prediction Liquidated Damages via Ensemble Machine Learning Model: Towards Sustainable Highway Construction Projects. Sustainability, 14(15), 9303.
Alshboul, O., Shehadeh, A., Almasabha, G., & Almuflih, A.S. (2022b). Extreme Gradient Boosting-Based Machine Learning Approach for Green Building Cost Prediction. Sustainability, 14(11), 6651.
Borovsky, A., Thal, D., & Leonard, L.B. (2021). Moving towards accurate and early prediction of language delay with network science and machine learning approaches. Scientific Reports (11), 8136.
Breiman, L. (2001). Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Statistical Science, 16(3), 199-231.
Breiman, L., Friedman, J.H., Olshen, R.A., & Stone, C.J. (1983). Classification and Regression Trees, Taylor & Francis.
Chan, D.W., & Kumaraswamy, M.M. (2002). Compressing construction durations: lessons learned from Hong Kong building projects. International Journal of Project Management, 20, 23-35.
 
Banerjee Chattapadhyay, D., Putta, J., & Rao P, R. M. (2021). Risk identification, assessments, and prediction for mega construction projects: A risk prediction paradigm based on cross analytical-machine learning model. Buildings11(4), 172.
 
Cozad, A., Sahinidis, N.V., & Miller, D.C. (2014). Learning surrogate models for simulation‐based optimization. Aiche Journal, 60(6), 2211-2227.
Cui, Y., Liu, H., Wang, Q., Zheng, Z., Wang, H., Yue, Z., ... & Yao, M. (2022). Investigation on the ignition delay prediction model of multi-component surrogates based on back propagation (BP) neural network. Combustion and Flame237, 111852.
Davoudabadi, R., Mousavi, S.M., Šaparauskas, J., & Gitinavard, H. (2019). Solving construction project selection problem by a new uncertain weighting and ranking based on compromise solution with linear assignment approach. Journal of Civil Engineering and Management, 25(3), 241-251.
Derakhshanfar, H., Ochoa, J. J., Kirytopoulos, K., Mayer, W., & Langston, C. (2020). A cartography of delay risks in the Australian construction industry: impact, correlations and timing. Engineering, Construction and Architectural Management, 28(7), 1952–1978.
Diebold, F. X., & Mariano, R. S. (2002). Comparing predictive accuracy. Journal of Business & economic statistics20(1), 134-144.
Doraisamy, S. V., Akasah, Z. A., & Yunus, R. (2015). An overview on the issue of delay in the construction industry. In InCIEC 2014: Proceedings of the International Civil and Infrastructure Engineering Conference 2014 (pp. 313-319). Springer Singapore.
Durdyev, S., & Hosseini, M.R. (2020). Causes of delays on construction projects: a comprehensive list. International Journal of Managing Projects in Business, 13(1), 20-46.
 
Egwim, C.N., Alaka, H., Toriola-Coker, L.O., Balogun, H., & Sunmola, F. (2021). Applied artificial intelligence for predicting construction projects delay. Machine Learning with Applications, Volume 6, 100166.
Abd El-Razek, M. E., Bassioni, H. A., & Mobarak, A. M. (2008). Causes of delay in building construction projects in Egypt. Journal of construction engineering and management134(11), 831-841.
Fallahnejad, M. (2013). Delay causes in Iran gas pipeline projects. International Journal of Project Management, 31(1), 136-146.
Friedman, J.H. (1991). Multivariate Adaptive Regression Splines. Annals of Statistics, 19(1), 1-67.
Ghazal, M.M., & Hammad, A. (2022). Application of knowledge discovery in database (KDD) techniques in cost overrun of construction projects. International Journal of Construction Management, 22(9), 1632-1646.
Gitinavard, H. (2019). Strategic evaluation of sustainable projects based on hybrid group decision analysis with incomplete information. Journal of Quality Engineering and Production Optimization, 4(2), 17-30.
Gitinavard, H., & Mousavi, S. M. (2015). Evaluating construction projects by a new group decision-making model based on intuitionistic fuzzy logic concepts. International Journal of Engineering28(9), 1312-1319.
Gitinavard, H., Mousavi, S., Vahdani, B., & Siadat, A. (2020). Project safety evaluation by a new soft computing approach-based last aggregation hesitant fuzzy complex proportional assessment in construction industry. Scientia Iranica, 27(2), 983-1000.
Gunduz, M., Nielsen, Y., & Ozdemir, M. (2015). Fuzzy Assessment Model to Estimate the Probability of Delay in Turkish Construction Projects. Journal of Management in Engineering, 31(4), 04014055.
Gurgun, A.P., Koc, K., & Kunkcu, H. (2022). Exploring the adoption of technology against delays in construction projects, Engineering, Construction and Architectural Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/ECAM-06-2022-0566
Hamzeh, A. M., Mousavi, S. M., & Gitinavard, H. (2020). Imprecise earned duration model for time evaluation of construction projects with risk considerations. Automation in Construction, 111, 102993.
Ilic, I., Görgülü, B., Cevik, M., & Baydogan, M.G. (2021). Explainable boosted linear regression for time series forecasting. Pattern Recognition, 120, 108144.
Islam, M.S., & Trigunarsyah, B. (2017). Construction Delays in Developing Countries: A Review. Journal of Construction Engineering and Project Management, 7(1), 1-12.
A Kassem, M., Khoiry, M. A., & Hamzah, N. (2021). Theoretical review on critical risk factors in oil and gas construction projects in Yemen. Engineering, Construction and Architectural Management28(4), 934-968.
Kleijnen, J.P. (2017). Regression and Kriging metamodels with their experimental designs in simulation: A review. European Journal of Operational Research, 256, 1-16.
Klumpenhouwer, W., & Shalaby, A. (2022). Using Delay Logs and Machine Learning to Support Passenger Railway Operations. Journal of the Transportation Research Board, 2676(9). https://doi.org/10.1177/03611981221085
Korhonen, K.T., & Kangas, A.S. (1997). Application of nearest-neighbour regression for generalizing sample tree information. Scandinavian Journal of Forest Research, 12, 97-101.
Li, M., Vanberkel, P., & Zhong, X. (2022). Predicting ambulance offload delay using a hybrid decision tree model. Socio-Economic Planning Sciences, 80, 101146.
Lin, C., & Fan, C. (2019). Evaluation of CART, CHAID, and QUEST algorithms: a case study of construction defects in Taiwan. Journal of Asian Architecture and Building Engineering, 18, 539 - 553.
Loh, W. (2011). Classification and regression trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1. 14–23.
Mahmoodzadeh, A., Nejati, H.R., & Mohammadi, M. (2022). Optimized machine learning modelling for predicting the construction cost and duration of tunneling projects. Automation in Construction, Volume 139, 104305.
Mehrabi Sharafabadi, H., & Movafaghpour, M.A. (2021). Investigating Causes of Delay in Natural Gas Distribution Pipeline Projects: a Correlation Analysis (Case Study: Khuzestan Province of Iran). Journal of Applied Research on Industrial Engineering, 9(1), 68-77.
Mittas, N., & Mitropoulos, A. (2022). A Data-Driven Framework for Probabilistic Estimates in Oil and Gas Project Cost Management: A Benchmark Experiment on Natural Gas Pipeline Projects. Computation, 10(5), 75. https://doi.org/10.3390/computation10050075
Mohammed, R.M., & Suliman, S.M. (2019). Delay in Pipeline Construction Projects in the Oil and Gas Industry: Part 1 (Risk Mapping of Delay Factors). International Journal of Construction Engineering and Management, 8(1), 24-35.
National Iranian Gas Company Website, https://nigc.ir/index.aspx?siteid= 1&&site ID=1&pageid=172
R Development Core Team (2008). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria.
RuleQuest (2016). Data mining with cubist. https://www.rulequest.com/cubist-info.html.
Sambasivan, M., Deepak, T.J., Salim, A., & Ponniah, V. (2017). Analysis of delays in Tanzanian construction industry: Transaction cost economics (TCE) and structural equation modeling (SEM) approach, Engineering, Construction and Architectural Management, 24 (2), 308-325.
Sanni-Anibire, M.O., Zin, R.M. & Olatunji, S.O. (2022). Machine learning model for delay risk assessment in tall building projects. International Journal of Construction Management, Volume 22(11).
Seber, G., & Lee, A. (2012). Linear regression analysis. Wiley Series in Probability and Statistics. Wiley.
Sen, A., & Srivastava, M. (2012). Regression analysis: Theory, methods, and applications. Springer New York.
Shoar, S., Chileshe, N., & Edwards, J.D. (2022). Machine learning-aided engineering services' cost overruns prediction in high-rise residential building projects: Application of random forest regression. Journal of Building Engineering, Volume 50, 104102.
Smola, A., & Schölkopf, B. (2004). A tutorial on support vector regression. Statistics and Computing, 14(3), 199-222.
Taleongpong, P., Hu, S., Jiang, Z., Wu, C., Popo-Ola, S., & Han, K. (2021). Machine learning techniques to predict reactionary delays and other associated key performance indicators on British railway network. Journal of Intelligent Transportation Systems, 26(3), 311-329. https://doi.org/10.1080/15472450.2020.1858822
Türkakin, O. H., Manisali, E., & Arditi, D. (2020). Delay analysis in construction projects with no updated work schedules. Engineering, Construction and Architectural Management, 27(10), 2893–2909.
Yang, J. B., & Wei, P. R. (2010). Causes of delay in the planning and design phases for construction projects. Journal of Architectural Engineering16(2), 80-83.
Yang, L., Liu, S., Tsoka, S., & Papageorgiou, L.G. (2017). A regression tree approach using mathematical programming. Expert Systems With Applications 78, 347–357.
Zhang, N., & Wei, G. (2013). Extension of VIKOR method for decision making problem based on hesitant fuzzy set, Applied Mathematical Modelling, 37(7), 4938-4947.
Zhang, Y., & Sahinidis, N.V. (2013). Uncertainty Quantification in CO2 Sequestration Using Surrogate Models from Polynomial Chaos Expansion. Industrial & Engineering Chemistry Research, 52(9), 3121-3132.