A data imputation method with support vector machines for activity-based transportation models

Banghua Yang, Davy Janssens, Da Ruan, Mario Cools, Tom Bellemans, Geert Wets

    Research output: peer-review

    Abstract

    In this paper, a data imputation method based on Support Vector Machines (SVMs) is proposed to address missing data in activity-based diaries. Two SVM models are established to predict the missing elements 'number of cars' and 'driver license'. The inputs of the former model comprise five variables (household composition, household income, age of the oldest household member, children age class and number of household members). The inputs of the latter model comprise three variables (personal age, work status and gender). The models predicting 'number of cars' and 'driver license' achieve accuracies of 69% and 83%, respectively. These initial experimental results show that missing elements of observed activity diaries can be accurately inferred by relating different pieces of information. The proposed SVM method therefore offers an effective means of imputing missing information.
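    The general idea of classifier-based imputation described in the abstract can be illustrated with a minimal sketch: train on the records where 'driver license' is observed, then predict it where it is missing. Everything specific here is an assumption and not taken from the paper: the toy records are invented, the function names (`min_max_scaler`, `train_linear_svm`, `impute_license`) are hypothetical, and a plain linear SVM trained by subgradient descent on the hinge loss stands in for whatever kernel, solver and encoding the authors actually used.

```python
# Hypothetical toy person records: (age, employed, is_male, has_license).
# None marks a missing 'driver license' entry; all values are invented.
RECORDS = [
    (45, 1, 1, 1), (38, 1, 0, 1), (52, 1, 1, 1), (29, 1, 0, 1), (61, 0, 1, 1),
    (16, 0, 1, 0), (15, 0, 0, 0), (17, 0, 1, 0), (14, 0, 0, 0), (19, 0, 0, 0),
    (41, 1, 0, None), (13, 0, 1, None),
]


def min_max_scaler(rows):
    """Return a function that rescales features using the ranges found in rows."""
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    span = [(max(c) - min(c)) or 1 for c in cols]  # avoid division by zero
    return lambda row: [(v - l) / s for v, l, s in zip(row, lo, span)]


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=2000):
    """Full-batch subgradient descent on the L2-regularised hinge loss.

    Labels y must be -1 or +1. Returns the weight vector and the bias.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # only margin violators contribute to the hinge term
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def impute_license(records):
    """Fill missing has_license values with the SVM's prediction (0 or 1)."""
    complete = [r for r in records if r[3] is not None]
    scale = min_max_scaler([r[:3] for r in complete])
    X = [scale(r[:3]) for r in complete]
    y = [1 if r[3] == 1 else -1 for r in complete]
    w, b = train_linear_svm(X, y)
    filled = []
    for r in records:
        if r[3] is None:
            score = sum(wj * xj for wj, xj in zip(w, scale(r[:3]))) + b
            r = r[:3] + (1 if score >= 0 else 0,)
        filled.append(r)
    return filled


imputed = impute_license(RECORDS)
```

    Observed entries are left untouched and only the `None` fields are replaced. The multi-class 'number of cars' target would need an extension such as one-vs-rest, which this sketch does not cover.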

    Original language: English
    Title of host publication: Foundations of Intelligent Systems
    Subtitle of host publication: Proceedings of the Sixth International Conference on Intelligent Systems and Knowledge Engineering, Shanghai, China, Dec 2011 (ISKE2011)
    Editors: Yinglin Wang, Tianrui Li
    Pages: 249-257
    Number of pages: 9
    State: Published - 2011

    Publication series

    Name: Advances in Intelligent and Soft Computing
    Volume: 122
    ISSN (Print): 1867-5662

    ASJC Scopus subject areas

    • General Computer Science
