论文标题

使用多元线性回归进行生化氧的需求预测

Using Multivariate Linear Regression for Biochemical Oxygen Demand Prediction in Waste Water

论文作者

Mutai, Isaiah K., Van Laerhoven, Kristof, Karuri, Nancy W., Tewo, Robert K.

论文摘要

在废水中预测废水中的生化氧需求(BOD)时,存在多元线性回归(MLR)的机会,将各种水质参数作为输入变量。这项工作的目的是通过四个输入变量检查MLR在废水中预测BOD中的能力:溶解氧(DO),氮,粪便大肠菌群和总大肠菌群。这四个输入变量具有更高的相关强度,即在检查相关强度的七个参数中BOD。机器学习(ML)的数据集的80%和90%作为培训集完成,分别为20%和10%作为测试集。通过相关系数(R),均方根误差(RMSE)和BOD预测的百分比精度来评估MLR性能。 BOD预测的溶解氧,氮,粪便肠结肠和总大肠菌的输入变量的性能指数为:RMSE = 6.77mg/L,R = 0.60,精度为70.3%,训练数据集的80%和RMSE = 6.74mg/L,R = 0.60和精确度的训练数据集的训练数据集和87.5%的精度为87.5%。发现将训练集的百分比提高到80%的数据集以上,仅提高了模型的准确性,但对模型的预测能力没有重大影响。结果表明,使用适当选择的输入参数可以成功地用于废水中BOD的MLR模型。

There exist opportunities for Multivariate Linear Regression (MLR) in the prediction of Biochemical Oxygen Demand (BOD) in waste water, using the diverse water quality parameters as the input variables. The goal of this work is to examine the capability of MLR in prediction of BOD in waste water through four input variables: Dissolved Oxygen (DO), Nitrogen, Fecal Coliform and Total Coliform. The four input variables have higher correlation strength to BOD out of the seven parameters examined for the strength of correlation. Machine Learning (ML) was done with both 80% and 90% of the data as the training set and 20% and 10% as the test set respectively. MLR performance was evaluated through the coefficient of correlation (r), Root Mean Square Error (RMSE) and the percentage accuracy in prediction of BOD. The performance indices for the input variables of Dissolved Oxygen, Nitrogen, Fecal Coliform and Total Coliform in prediction of BOD are: RMSE=6.77mg/L, r=0.60 and accuracy 70.3% for training dataset of 80% and RMSE=6.74mg/L, r=0.60 and accuracy of 87.5% for training set of 90% of the dataset. It was found that increasing the percentage of the training set above 80% of the dataset improved the accuracy of the model only but did not have a significant impact on the prediction capacity of the model. The results showed that MLR model could be successfully employed in the estimation of BOD in waste water using appropriately selected input parameters.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源