Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN)

Kok Meng Ong, Supheakmungkol Sarin, Wataru Kameyama

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Citation (Scopus)

    Abstract

    This paper reports our experiments for TRECVID 2010 task: Semantic Indexing. We present two approaches namely, Affective and Holistic. In the first approach, we have used combination of affective features from image, video and audio trained with neural network algorithm. Image features employed are color histogram and face detection from the keyframe. The number of face is also used in one of the runs. Video features include the motion activity and shot duration. Additionally, the audio power is included as feature. For the second approach, color, texture and scene features are extracted from the whole keyframe image as well as its background and saliency regions. Genetic algorithm is used to find the weight of each feature for effective combination. Then, KNN is used to propagate the annotation. We have submitted 4 runs where we distinguish the first two as affective category and the the last two as holistic ones. The summary is as follows: • kmlabGITS1-color histogram, motion, rhythm, sound and face number trained using neural network • kmlabGITS2-color histogram, motion, rhythm, sound and without face number trained using neural network • kmlabGITS3-combination of 5 image features (hsv bg, gabor, haar, gist and lab bg) using Genetic Algorithm and KNN • kmlabGITS4-combination of 5 image features (hsv, hsv bg, haar, haar roi and gist) using Genetic Algorithm and KNN.

    Original languageEnglish
    Title of host publication2010 TREC Video Retrieval Evaluation Notebook Papers
    PublisherNational Institute of Standards and Technology
    Publication statusPublished - 2010
    EventTREC Video Retrieval Evaluation, TRECVID 2010 - Gaithersburg, MD
    Duration: 2010 Nov 152010 Nov 17

    Other

    OtherTREC Video Retrieval Evaluation, TRECVID 2010
    CityGaithersburg, MD
    Period10/11/1510/11/17

    Fingerprint

    Semantics
    Color
    Genetic algorithms
    Neural networks
    Acoustic waves
    Face recognition
    Textures
    Experiments

    ASJC Scopus subject areas

    • Computer Graphics and Computer-Aided Design
    • Computer Vision and Pattern Recognition
    • Human-Computer Interaction
    • Software

    Cite this

    Ong, K. M., Sarin, S., & Kameyama, W. (2010). Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN). In 2010 TREC Video Retrieval Evaluation Notebook Papers National Institute of Standards and Technology.

    Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN). / Ong, Kok Meng; Sarin, Supheakmungkol; Kameyama, Wataru.

    2010 TREC Video Retrieval Evaluation Notebook Papers. National Institute of Standards and Technology, 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Ong, KM, Sarin, S & Kameyama, W 2010, Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN). in 2010 TREC Video Retrieval Evaluation Notebook Papers. National Institute of Standards and Technology, TREC Video Retrieval Evaluation, TRECVID 2010, Gaithersburg, MD, 10/11/15.
    Ong KM, Sarin S, Kameyama W. Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN). In 2010 TREC Video Retrieval Evaluation Notebook Papers. National Institute of Standards and Technology. 2010
    Ong, Kok Meng ; Sarin, Supheakmungkol ; Kameyama, Wataru. / Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN). 2010 TREC Video Retrieval Evaluation Notebook Papers. National Institute of Standards and Technology, 2010.
    @inproceedings{6a5c0686976241279b08089e2c21ed5e,
    title = "Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN)",
    abstract = "This paper reports our experiments for TRECVID 2010 task: Semantic Indexing. We present two approaches namely, Affective and Holistic. In the first approach, we have used combination of affective features from image, video and audio trained with neural network algorithm. Image features employed are color histogram and face detection from the keyframe. The number of face is also used in one of the runs. Video features include the motion activity and shot duration. Additionally, the audio power is included as feature. For the second approach, color, texture and scene features are extracted from the whole keyframe image as well as its background and saliency regions. Genetic algorithm is used to find the weight of each feature for effective combination. Then, KNN is used to propagate the annotation. We have submitted 4 runs where we distinguish the first two as affective category and the the last two as holistic ones. The summary is as follows: • kmlabGITS1-color histogram, motion, rhythm, sound and face number trained using neural network • kmlabGITS2-color histogram, motion, rhythm, sound and without face number trained using neural network • kmlabGITS3-combination of 5 image features (hsv bg, gabor, haar, gist and lab bg) using Genetic Algorithm and KNN • kmlabGITS4-combination of 5 image features (hsv, hsv bg, haar, haar roi and gist) using Genetic Algorithm and KNN.",
    author = "Ong, {Kok Meng} and Supheakmungkol Sarin and Wataru Kameyama",
    year = "2010",
    language = "English",
    booktitle = "2010 TREC Video Retrieval Evaluation Notebook Papers",
    publisher = "National Institute of Standards and Technology",

    }

    TY - GEN

    T1 - Affective and holistic approach at TRECVID 2010 task - Semantic Indexing (SIN)

    AU - Ong, Kok Meng

    AU - Sarin, Supheakmungkol

    AU - Kameyama, Wataru

    PY - 2010

    Y1 - 2010

    N2 - This paper reports our experiments for TRECVID 2010 task: Semantic Indexing. We present two approaches namely, Affective and Holistic. In the first approach, we have used combination of affective features from image, video and audio trained with neural network algorithm. Image features employed are color histogram and face detection from the keyframe. The number of face is also used in one of the runs. Video features include the motion activity and shot duration. Additionally, the audio power is included as feature. For the second approach, color, texture and scene features are extracted from the whole keyframe image as well as its background and saliency regions. Genetic algorithm is used to find the weight of each feature for effective combination. Then, KNN is used to propagate the annotation. We have submitted 4 runs where we distinguish the first two as affective category and the the last two as holistic ones. The summary is as follows: • kmlabGITS1-color histogram, motion, rhythm, sound and face number trained using neural network • kmlabGITS2-color histogram, motion, rhythm, sound and without face number trained using neural network • kmlabGITS3-combination of 5 image features (hsv bg, gabor, haar, gist and lab bg) using Genetic Algorithm and KNN • kmlabGITS4-combination of 5 image features (hsv, hsv bg, haar, haar roi and gist) using Genetic Algorithm and KNN.

    AB - This paper reports our experiments for TRECVID 2010 task: Semantic Indexing. We present two approaches namely, Affective and Holistic. In the first approach, we have used combination of affective features from image, video and audio trained with neural network algorithm. Image features employed are color histogram and face detection from the keyframe. The number of face is also used in one of the runs. Video features include the motion activity and shot duration. Additionally, the audio power is included as feature. For the second approach, color, texture and scene features are extracted from the whole keyframe image as well as its background and saliency regions. Genetic algorithm is used to find the weight of each feature for effective combination. Then, KNN is used to propagate the annotation. We have submitted 4 runs where we distinguish the first two as affective category and the the last two as holistic ones. The summary is as follows: • kmlabGITS1-color histogram, motion, rhythm, sound and face number trained using neural network • kmlabGITS2-color histogram, motion, rhythm, sound and without face number trained using neural network • kmlabGITS3-combination of 5 image features (hsv bg, gabor, haar, gist and lab bg) using Genetic Algorithm and KNN • kmlabGITS4-combination of 5 image features (hsv, hsv bg, haar, haar roi and gist) using Genetic Algorithm and KNN.

    UR - http://www.scopus.com/inward/record.url?scp=84905223164&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=84905223164&partnerID=8YFLogxK

    M3 - Conference contribution

    AN - SCOPUS:84905223164

    BT - 2010 TREC Video Retrieval Evaluation Notebook Papers

    PB - National Institute of Standards and Technology

    ER -