Original Research

A new imputation method for population mean in the presence of missing data based on a transformed variable with applications to air pollution data in Chiang Mai, Thailand

Abstract

Introduction: Chiang Mai’s air pollution has risen to number one in the world for the highest level of fine particulate matter which further exacerbates the damage to human health. Fine particulate matter can enter the human body and blood circulation, destroying organ systems, increasing the risk for chronic disease and cancer, despite not having smoking habits or other morbidities. The Thai government must sort out this issue before it is too late as the whole nation’s health is at risk due to excessive dust levels higher than standard guidelines. Collection of pollution data can help us to come up with solutions and prevent it from turning into a hazardous situation. Unfortunately, pollution data are missing and need to be dealt with before analysis to obtain accurate results.
Materials and methods: A new method of imputation for estimating population mean based on a transformed variable has been suggested under simple random sampling without replacement and the uniform nonresponse mechanism. The bias and mean square error of the proposed estimator are investigated up to the first order of approximation. The performance of the proposed estimator is studied via applications to air pollution data in Chiang Mai, Thailand.
Results: The proposed estimator shows the best performance, giving the least bias and mean square error for all levels of sampling fractions. For the results from application the estimated value of sulfur dioxide from Particulate Matter 2.5 (PM2.5), the Percentage Relative Efficiency (PRE) is higher than all existing estimators by at least 16%. For the estimated PM2.5 from PM10 the PRE is higher than all existing estimators by at least 1600%, an extremely significant difference exhibiting similarity to real values.
Conclusion: The proposed imputation technique based on the transformed auxiliary variable can be helpful for imputing missing values and improving the efficiency of the estimators.

1. Singh S, Horn S. Compromised imputation in survey sampling. Metrika. 2000 Sep 7;51(3):267–76. Available from: https://doi.org/10.1007/s001840000054.
2. Chodjuntug K, Lawson N. Imputation for estimating the population mean in the presence of nonresponse, with application to fine particle density in Bangkok. Mathematical Population Studies. 2022 Oct 2;29(4):204-25. Available from: https://doi.org/10.1080 /08898480.2021.
1997466.
3. Chodjuntug K, Lawson N. A Chain regression exponential type imputation method for mean
estimation in the presence of missing data. Songklanakarin Journal of Science and Technology, 2022
Jul; 44(4): 1109-1118.
4. Ponkaew C, Lawson N. A New Estimator for Population Total in the Presence of Missing Data Under Unequal Probability Sampling Without Replacement: A Case Study on Fine Particulate Matter in the North of Thailand. Songklanakarin Journal of Science and Technology. 2023.
5. Srivenkataramana T. A dual to ratio estimator in sample surveys. Biometrika. 1980 April 1;
67(1): 199-204. Available from: https://doi.org/10.2307/2335334
6. Thongsak N, Lawson N. Bias and mean square error reduction by changing the shape
of the distribution of an auxiliary variable: application to air pollution data in Nan, Thailand. Mathematical Population Studies. 2023; 30(3): 180–194. Available from: https://doi.org/10.1080/08898480.2022.2145790
7. Thongsak N, Lawson N. Classes of combined population mean estimators utilizing transformed variables under double sampling: an application to air pollution in Chiang Rai, Thailand. Songklanakarin. Journal of Science and Technology. 2022 Sep;44(5): 1390-1398.
8. Lawson N. An improved family of estimators for estimating population mean using a transformed auxiliary variable under double sampling. Songklanakarin Journal of Science and Technology. 2023.
9.Thongsak N, Lawson N. Classes of Population Mean Estimators using Transformed Variables in Double Sampling. Gazi University Journal of Science. 2023; 36(4): 1834-1852.
10. Team RC. R: The R project for statistical computing. https. www. r-project. org. 2016.
11. Pollution Control Department: Air4Thai [Internet]. Air4thai.pcd.go.th. [cited 2023 Jul 19]. Available from: http://air4thai.pcd.go.th/webV2/history
Files
IssueVol 8 No 3 (2023): Summer 2023 QRcode
SectionOriginal Research
DOI https://doi.org/10.18502/japh.v8i3.13786
Keywords
Imputation method; Missing data; Transformed variable; Air pollution data; Mean square error

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
1.
Thongsak N, Lawson N. A new imputation method for population mean in the presence of missing data based on a transformed variable with applications to air pollution data in Chiang Mai, Thailand. JAPH. 2023;8(3):285-298.