Publications

Publications by category in reverse chronological order.

2022

  1. CVPR
    Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
    Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, and Hilde Kuehne
    In Computer Vision and Pattern Recognition (CVPR) 2022

2021

  1. arXiv
    PreViTS: Contrastive Pretraining with Video Tracking Supervision
    Brian Chen, Ramprasaath R Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, and Nikhil Naik
    In arXiv preprint arXiv:2112.00804 2021
  2. EMNLP
    Joint Multimedia Event Extraction from Video and Article
    Brian Chen, Xudong Lin, Christopher Thomas, Manling Li, Shoya Yoshida, Lovish Chum, Heng Ji, and Shih-Fu Chang
    In Findings of Empirical Methods in Natural Language Processing (EMNLP) 2021
  3. ICCV
    Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
    Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, and others
    In International Conference on Computer Vision (ICCV) 2021
  4. Interspeech
    AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
    Andrew Rouditchenko, Angie Boggust, David Harwath, Brian Chen, and others
    In Proceedings of Interspeech 2021
  5. Interspeech
    Cascaded Multilingual Audio-Visual Learning from Videos
    Andrew Rouditchenko, Angie Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, and others
    In Proceedings of Interspeech 2021
  6. NAACL
    RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
    Haoyang Wen, Ying Lin, Tuan Lai, ..., Brian Chen, ..., and others
    In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations (NAACL) 2021

2020

  1. AAAI
    General Partial Label Learning via Dual Bipartite Graph Autoencoder
    Brian Chen, Bo Wu, Alireza Zareian, Hanwang Zhang, and Shih-Fu Chang
    In AAAI Conference on Artificial Intelligence (AAAI) 2020
  2. ACL
    GAIA: A Fine-grained Multimedia Knowledge Extraction System
    Manling Li, Alireza Zareian, Ying Lin, Xiaoman Pan, Spencer Whitehead, Brian Chen, Bo Wu, Heng Ji, Shih-Fu Chang, Clare Voss, and others
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations (ACL) 2020

2019

  1. CVPR
    Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding
    Hassan Akbari, Svebor Karaman, Surabhi Bhargava, Brian Chen, Carl Vondrick, and Shih-Fu Chang
    In Computer Vision and Pattern Recognition (CVPR) 2019

2018

  1. TAC
    GAIA - A Multi-media Multi-lingual Knowledge Extraction and Hypothesis Generation System
    Tongtao Zhang, Ananya Subburathinam, ..., Brian Chen, ..., and others
    In Text Analysis Conference (TAC) 2018