MURAL - Maynooth University Research Archive Library



    Generalizing Gain Penalization for Feature Selection in Tree-Based Models


    Wundervald, Bruna, Parnell, Andrew and Domijan, Katarina (2020) Generalizing Gain Penalization for Feature Selection in Tree-Based Models. IEEE Access, 8. pp. 190231-190239. ISSN 2169-3536

    [thumbnail of KD_generalizing.pdf]
    Preview
    Text
    KD_generalizing.pdf

    Download (4MB) | Preview

    Abstract

    We develop a new approach for feature selection via gain penalization in tree-based models. First, we show that previous methods do not perform sufficient regularization and often exhibit sub-optimal out-of-sample performance, especially when correlated features are present. Instead, we develop a new gain penalization idea that exhibits a general local-global regularization for tree-based models. The new method allows for full flexibility in the choice of feature-specific importance weights, while also applying a global penalization. We validate our method on both simulated and real data, exploring how the hyperparameters interact and we provide the implementation as an extension of the popular R package ranger.
    Item Type: Article
    Keywords: Dimensionality reduction; feature selection; gain penalization; tree-models;
    Academic Unit: Faculty of Science and Engineering > Research Institutes > Hamilton Institute
    Item ID: 15346
    Identification Number: 10.1109/ACCESS.2020.3032095
    Depositing User: Katarina Domijan
    Date Deposited: 25 Jan 2022 17:00
    Journal or Publication Title: IEEE Access
    Publisher: IEEE
    Refereed: Yes
    Related URLs:
    URI: https://mu.eprints-hosting.org/id/eprint/15346
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Repository Staff Only (login required)

    Item control page
    Item control page

    Downloads

    Downloads per month over past year

    Origin of downloads