Tracking multiple pedestrians from monocular videos in an interacting multiple model framework

Zhengqiang Jiang

    Research output: ThesisDoctoral Thesis

    591 Downloads (Pure)


    [Truncated abstract] Detecting and tracking pedestrians in videos have many important computer vision applications including visual surveillance, people recognition, smart environments and human-machine interaction. An automatic pedestrian tracking system comprises 2 main stages: (1) detecting pedestrians in the initial frame or every frame; (2) maintaining the identities of pedestrians in video sequences. Tracking pedestrians in videos is a challenging problem because of background clutters, lighting variation, and occlusion. A reliable pedestrian tracking system should be capable of maintaining the identities of pedestrians in the presence of partial or even, occasionally, full occlusion in video sequences. The appearance model of a pedestrian is widely used in pedestrian tracking. For the appearance model, I compute the colour histograms for the upper and lower bodies of each pedestrian detected by the Histogram of Oriented Gradient (HOG) human detector. Each of these colour histograms is a 3-dimensional entity in the L*a*b colour space and the concatenation of them yields a 4-dimensional tensor. This allows spatial information to be incorporated into the appearance model while the computation cost is kept to its minimum. To get a better estimate of the colour histogram, the kernel density estimation is used to smooth the histogram of the appearance of each pedestrian. The Hellinger distance is used for histogram matching. In this thesis, a multiple pedestrian tracking method for monocular videos captured by a fixed camera in an interacting Multiple Model (IMM) framework is presented. This tracking method involves multiple IMM trackers running in parallel which are tied together by a robust data association component.
    Original languageEnglish
    QualificationDoctor of Philosophy
    Publication statusUnpublished - 2013


    Dive into the research topics of 'Tracking multiple pedestrians from monocular videos in an interacting multiple model framework'. Together they form a unique fingerprint.

    Cite this