MURAL - Maynooth University Research Archive Library



    Statistical and Machine Learning Models for Multivariate Sensor Data with Application to Environmental Monitoring


    Shoari Nejad, Amin (2023) Statistical and Machine Learning Models for Multivariate Sensor Data with Application to Environmental Monitoring. PhD thesis, National University of Ireland Maynooth.

    [thumbnail of PhD_Thesis_Amin_Shoari_Nejad.pdf]
    Preview
    Text
    PhD_Thesis_Amin_Shoari_Nejad.pdf

    Download (3MB) | Preview

    Abstract

    The increased availability and use of sensor data in environmental monitoring has led to a vastly increased demand for tools that can process, analyze and report on important environmental events in real or near-real time. These data have special characteristics, being high dimensional, recorded in space and time, and with potential missing values / sparsity issues. This thesis concerns itself with the building of statistical and machine learning methods for data of this type. Statistical methods can play an important role in inferring valuable features of the latent data-generating processes of the sensor data, while properly taking into account the uncertainty. Machine learning methods can play a significant role by offering an automatic framework to unearth complex patterns underlying the data, being scalable and dealing with challenges such as sparsity in an innovative way. This dissertation contributes to our understanding of how to monitor the environment, focusing on Dublin Bay as a proof of concept. It examines sea-level rise, water turbidity, and how to manage large environmental datasets that change over time and space with missing values. Firstly, it updates us on Dublin’s sea level record to produce a processed data product from 1938 to 2016. It utilizes a new statistical approach to make better estimates of average sea levels and finds that sea levels have been rising more quickly in recent times. Then it investigates how human activities, such as dredging and dumping, affect water turbidity in Dublin Bay. It employs a new statistical model called VARICH to model the variability in water turbidity over two years and at different locations, finding that weather conditions like wind speed significantly impact most locations. Meanwhile, dredging operations show lower impacts, and dumping operations have a significant impact only at greater water depths. Lastly, the dissertation addresses the challenge of missing values in large-scale environmental datasets by proposing two innovative models for multivariate spatio-temporal forecasting that offer competitive performance without the need for imputation. These models are a transformer-based model, SERT, and a simpler, interpretable model, SST-ANN. Altogether, the chapters of this dissertation provide new insights into environmental trends and offer novel methods for analysing environmental data. This is all taking place against the backdrop of significant global issues such as climate change, loss of biodiversity, and increasing pollution. This thesis stands as an example of how data analysis is advancing in the field of environmental monitoring.
    Item Type: Thesis (PhD)
    Keywords: Statistical and Machine Learning Models; Multivariate Sensor Data; Application to Environmental Monitoring;
    Academic Unit: Faculty of Science and Engineering > Research Institutes > Hamilton Institute
    Item ID: 18315
    Depositing User: IR eTheses
    Date Deposited: 26 Mar 2024 12:37
    URI: https://mu.eprints-hosting.org/id/eprint/18315
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Repository Staff Only (login required)

    Item control page
    Item control page

    Downloads

    Downloads per month over past year

    Origin of downloads