Data Article

Developing codes for validation of PM10, PM2.5, and O3 datasets using Rprogramming language

Abstract

Introduction: The wide range of studies on air pollution requires accurate and reliable datasets. However, due to many reasons, the measured concentrations may be incomplete or biased. The development of an easy-to-use and reproducible exposure assessment method is required for researchers. Therefore, in this article, we describe and present a series of codes written in R Programming Language for data handling, validating and averaging of PM10, PM2.5, and O3 datasets.
Findings: These codes can be used in any types of air pollution studies that seek for PM and ozone concentrations that are indicator of real concentrations. We used and combined criteria from several guidelines proposed by US EPA and APHEKOM project to obtain an acceptable methodology. Separate .csv files for PM10, PM2.5 and O3 should be prepared as input file. After the file was imported to the R Programming software, first, negative and zero values of concentrations within all the dataset will be removed. Then, only monitors will be selected that have at least 75% of hourly concentrations. Then, 24-h averages and daily maximum of 8-h moving averages will be calculated for PM and ozone, respectively. For output, the codes create two different sets of data. One contains the hourly concentrations of the interest pollutant (PM10, PM2.5, or O3) in valid stations and their average at city level. Another is the final 24-h averages of city for PM10 and PM2.5 or the final daily maximum 8-h averages of city for O3.
Conclusion: These validated codes use a reliable and valid methodology, and eliminate the possibility of wrong or mistaken data handling and averaging. The use of these codes are free and without any limitation, only after the citation to this article.

1. Forouzanfar MH, Afshin A, Alexander LT, Anderson HR, Bhutta ZA, Biryukov S, et al. Global, regional, and
national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990–2015: a systematic analysis for the Global burden of disease study 2015. The Lancet. 2016;388(10053):1659-724.
2. WHO. WHO’s urban ambient air pollution database. 2016 update 2016.
3. Yarahmadi M, Hadei M, Nazari SSH, Conti GO, Alipour MR, Ferrante M, et al. Mortality assessment attributed to long-term exposure to fine particles in ambient air of the megacity of Tehran, Iran. Environmental science and pollution research. 2018;25(14):14254-62.
4. Hopke PK, Hashemi Nazari SS, Hadei M, Yarahmadi M, Kermani M, Yarahmadi E, et al. Spatial and temporal trends of short-term health impacts of PM2.5 in Iranian cities; a modelling approach (2013-2016). Aerosol and air quality research. 2018.
5. Hassanvand MS, Naddafi K, Malek M, Valojerdi AE, Mirzadeh M, Samavat T, et al. Effect of long-term exposure to ambient particulate matter on prevalence of type 2 diabetes and hypertension in Iranian adults: an ecologic study. Environmental science and pollution research.
2017:1-6.
6. Beelen R, Raaschou-Nielsen O, Stafoggia M, Andersen ZJ, Weinmayr G, Hoffmann B, et al. Effects of long-term
exposure to air pollution on natural-cause mortality: an analysis of 22 European cohorts within the multicentre ESCAPE project. The lancet. 2014;383(9919):785-95.
7. Hadei M, Hopke PK, Nazari SSH, Yarahmadi M, Shahsavani A, Alipour MR. Estimation of mortality and hospital admissions attributed to criteria air pollutants in Tehran metropolis, Iran (2013–2016). Aerosol and air quality research. 2017;17:2474-81.
8. Pascal M, Corso M, Ung A. Guidelines for assessing the health impacts of air pollution in European cities. Aphekom project, Work package 5, deliverable D5, April 2011. 2011.
9. EPA. Quality assurance handbook for air pollution measurement systems; Ambient air quality monitoring program
2013.
10. Liu M, Huang Y, Ma Z, Jin Z, Liu X, Wang H, et al. Spatial and temporal trends in the mortality burden of
air pollution in China: 2004–2012. Environment International. 2017;98:75-81.
11. USEPA. Chapter 11: Valid data and completeness requirements. In: Agency USEP, editor. USA2015.
12. Hadei M, Nazari SSH, Eslami A, Khosravi A, Yarahmadi M, Naghdali Z, et al. Distribution and number of
ischemic heart disease (IHD) and stroke deaths due to chronic exposure to PM2.5 in 10 cities of Iran (2013-2015); and AIRQ+ modelling. Journal of air pollution and health. 2018;2(3):129-36.
13. Hadei M, Nazari SSH, Yarahmadi E, Kermani M, Yarahmadi M, Naghdali Z, et al. Estimation of lung cancer mortality attributed to long-term exposure to PM2.5 in 15 cities during 2015-2016; an AIRQQ+ modelling. Journal of air pollution and health. 2017;2(1):19-26.
14. World Health Organization. Monitoring ambient air quality for health impact assessment. 1999.
15. Brauer M, Freedman G, Frostad J, van Donkelaar A, Martin RV, Dentener F, et al. Ambient air pollution exposure estimation for the global burden of disease 2013. Environmental Science & Technology. 2016;50(1):79-88.
Files
IssueVol 4 No 1 (2019): Winter 2019 QRcode
SectionData Article
DOI https://doi.org/10.18502/japh.v4i1.604
Keywords
Exposure assessment; Particulate matter; Air pollution; Epidemiology; Health impact assessment

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
1.
Nabizadeh Nodehi R, Hadei M. Developing codes for validation of PM10, PM2.5, and O3 datasets using Rprogramming language. JAPH. 2019;4(1):73-80.