Technical Report

  • ZM-Net: Real-time Zero-shot Image Manipulation Network[PDF]
    Hao Wang, Xiaodan Liang, Hao Zhang, Dit-Yan Yeung, Eric P. Xing. 2017.

Journal Articles

  • Look into Person: Joint Human Parsing and Pose Estimation Network and a New Benchmark
    Xiaodan Liang, Ke Gong, Xiaohui Shen, Liang Lin. TPAMI, 2018.

  • Proposal-free Network for Instance-level Object Segmentation [PDF]
    Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Jianchao Yang, Liang Lin, Shuicheng Yan. TPAMI, 2018.

  • Scale-aware Fast R-CNN for Pedestrian Detection [PDF]
    Jianan Li, Xiaodan Liang, ShengMei Shen, Tingfa Xu, Shuicheng Yan. TIP, 2017

  • Multi-stage Object Detection with Group Recursive Learning [PDF]
    Jianan Li, Xiaodan Liang, Jianshu Li, Tingfa Xu, Jiashi Feng, Shuicheng Yan. IEEE Transactions on Multimedia (T-MM), 2017

  • Attentive Contexts for Object Detection [PDF]
    Jianan Li, Yunchao Wei, Xiaodan Liang, Jian Dong, Tingfa Xu, Jiashi Feng, Shuicheng Yan. 2016.
    IEEE Transactions on Multimedia (T-MM), 2017

  • STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation [PDF]
    Yunchao Wei, Xiaodan Liang, Yunpeng Chen, Xiaohui Shen, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan. 2016
    IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI).

  • Learning to Segment Human by Watching YouTube [PDF]
    Xiaodan Liang, Yunchao Wei, YunPeng Chen, Jianchao Yang, Liang Lin, Shuicheng Yan
    IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), DOI: 10.1109/TPAMI.2016.2598340, 2016.

  • Human Parsing with Contextualized Convolutional Neural Network [PDF][Page with Data]
    Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Jinhui Tang, Liang Lin, Shuicheng Yan
    IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), DOI: 10.1109/TPAMI.2016.2537339, 2016.

  • Recognizing Focal Liver Lesions in CEUS with Dynamically Trained Latent Structured Models [PDF][Page with Data]
    Xiaodan Liang, Liang Lin, Qingxing Cao, Rui Huang, and Yongtian Wang
    IEEE Transactions on Medical Imaging (T-MI), 35(3): 713-727, 2016

  • Scale-aware Pixelwise Object Proposal Network
    Zequn Jie, Xiaodan Liang, Jiashi Feng, Wen Feng Lu, Eng Hock Francis Tay, Shuicheng Yan
    IEEE Transactions on Image Processing (TIP), 2016

  • Clothes Co-Parsing via Joint Image Segmentation and Labeling with Application to Clothing Retrieval [PDF][Page with Data]
    Xiaodan Liang, Liang Lin, Wei Yang, Ping Luo, Junshi Huang, and Shuicheng Yan
    IEEE Transactions on Multimedia (T-MM), 18(6): 1175-1186, 2016

  • Learning to Segment with Image-level Annotations
    Yunchao Wei, Xiaodan Liang, Yunpeng Chen, Zequn Jie, Yanhui Xiao, Yao Zhao, Shuicheng Yan
    Pattern Recognition, 2016

  • Deep Human Parsing with Active Template Regression [PDF]
    Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu, Jian Dong, Liang Lin, Shuicheng Yan
    IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), Volume 37, Issue 12, 2015

  • Transferred Human Parsing with Video Context [PDF]
    Si Liu, Xiaodan Liang, Luoqi Liu, Ke Lu, Liang Lin, Xiaochun Cao, and Shuicheng Yan (Corresponding Author)
    IEEE Transactions on Multimedia (T-MM), 17(8): 1347-1358, 2015

  • Multi-Loss Regularized Deep Neural Network
    Chunyan Xu, Canyi Lu, Xiaodan Liang, Junbin Gao, Wei Zheng, Tianjiang Wang, Shuicheng Yan
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2015

  • Complex Background Subtraction by Pursuing Dynamic Spatio-Temporal Models [PDF]
    Liang Lin, Yuanlu Xu, Xiaodan Liang, and Jianhuang Lai
    IEEE Transactions on Image Processing (T-IP), 23(7): 3191-3202, 2014

Conference Papers
2019

  • Knowledge-Driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
    Christy Y. Li, Xiaodan Liang, Zhiting Hu, Eric Xing. AAAI 2019

  • End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis
    Lin Xu, Qixian Zhou, Ke Gong, Xiaodan Liang, Jianheng Tang, Liang Lin. AAAI 2019

2018

  • Symbolic Graph Reasoning Meets Convolutions
    Xiaodan Liang, Zhiting Hu, Hao Zhang, Liang Lin, Eric P. Xing. NIPS 2018

  • Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
    Christy Y. Li, Xiaodan Liang, Zhiting Hu, Eric Xing. NIPS 2018

  • Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis
    Haoye Dong, Xiaodan Liang, Ke Gong, Hanjiang Lai, Jia Zhu, Jian Yin. NIPS 2018

  • Deep Generative Models with Learnable Knowledge Constraints
    Zhiting Hu, Zichao Yang, Ruslan Salakhutdinov, Xiaodan Liang, Lianhui Qin, Haoye Dong, Eric Xing. NIPS 2018

  • Hybrid Knowledge Routed Modules for Large-scale Object Detection
    ChenHan Jiang, Hang Xu, Xiaodan Liang, Liang Lin. NIPS 2018

  • Reinforced Auto-Zoom Net: Towards Accurate and Fast Breast Cancer Segmentation in Whole-slide Images
    Nanqing Dong, Michael C. Kampffmeyer, Xiaodan Liang, Zeya Wang, Wei Dai, Eric P. Xing. DLMIA/MICCAI 2018. (Oral)

  • Generative Semantic Manipulation with Mask-Contrasting GAN [PDF]
    Xiaodan Liang, Hao Zhang, Eric P. Xing. ECCV 2018.

  • CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving
    Xiaodan Liang, Tairui Wang, Luona Yang, Eric P. Xing. ECCV 2018.

  • Instance-level Human Parsing via Part Grouping Network (Oral)
    Ke Gong, Xiaodan Liang, Yicheng Li, Yimin Chen, Liang Lin. ECCV 2018.

  • Adversarial Geometry-Aware Human Motion Prediction (Oral)
    Liangyan Gui, Xiaodan Liang, Yuxiong Wang, Jose M.F. Moura. ECCV 2018.

  • Real-to-Virtual Domain Unification for End-to-End Autonomous Driving [PDF]
    Luona Yang, Xiaodan Liang, Eric P. Xing. ECCV 2018.

  • Toward Characteristic-Preserving Image-based Virtual Try-On Network
    Bochao Wang, Huabin Zheng, Xiaodan Liang, Yimin Chen, Liang Lin. ECCV 2018.

  • RCAA: Relational Context-Aware Agents for Person Search
    Xiaojun Chang, Po-Yao Huang, Xiaodan Liang, Yi Yang, Alexander Hauptmann. ECCV 2018.

  • A Modulation Module for Multi-task Learning with Applications in Image Retrieval
    Xiangyun Zhao, Haoxiang Li, Xiaohui Shen, Xiaodan Liang, Ying Wu. ECCV 2018.

  • Adaptive Temporal Encoding Network for Video Instance-level Human Parsing
    Qixian Zhou, Xiaodan Liang, Ke Gong, Liang Lin. ACM MM 2018.

  • Teaching Robots to Predict Human Motion
    Liangyan Gui, Kevin Zhang, Yuxiong Wang, Xiaodan Liang, Jose M.F. Moura, Manuela Veloso. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018. (Oral)

  • Unsupervised Domain Adaptation for Automatic Estimation of Cardiothoracic Ratio
    Nanqing Dong, Michael C. Kampffmeyer, Xiaodan Liang, Zeya Wang, Wei Dai, Eric P. Xing. MICCAI 2018

  • StepDeep: A Novel Spatial-temporal Mobility Event Prediction Framework based on Deep Neural Network
    Bilong Shen, Xiaodan Liang, Yufeng Ouyang, Miaofeng Liu, Weimin Zheng, Kathleen M. Carley. KDD 2018

  • Dynamic-structured Semantic Propagation Network[PDF]
    Xiaodan Liang, Hongfei Zhou, Eric P. Xing. CVPR 2018

  • Visual Question Reasoning on General Dependency Tree[PDF], [Pytorch code]
    Qingxing Cao, Xiaodan Liang, Bailing Li, Guanbin Li, Liang Lin. CVPR 2018. (Spotlight)

  • Reinforcement Cutting-Agent Learning for Video Object Segmentation
    Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang. CVPR 2018. (Spotlight)

2017

  • Structured Generative Adversarial Networks
    Hao Zhang, Zhijie Deng, Xiaodan Liang, Jun Zhu, Eric P. Xing. NIPS 2017 (Nvidia Pioneer Research Award)

  • Dual Motion GAN for Future-Flow Embedded Video Prediction
    Xiaodan Liang, Lisa Lee, Wei Dai, Eric P. Xing. ICCV 2017.

  • Recurrent Topic-Transition GAN for Visual Paragraph Generation[PDF]
    Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing. ICCV 2017.

  • Temporal Dynamic Graph LSTM for Action-driven Video Object Detection
    Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta. ICCV 2017.

  • Nonparametric Variational Auto-encoders for Hierarchical Representation Learning[PDF]
    Prasoon Goyal, Zhiting Hu, Xiaodan Liang, Chenyu Wang, Eric P. Xing. ICCV 2017.

  • Deep Attribute-preserving Metric Learning for Natural Language Object Retrieval
    Jianan Li, Yunchao Wei, Xiaodan Liang, Fang Zhao, Jianshu Li, Tingfa Xu, Jiashi Feng. ACM MM 2017.

  • Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters [PDF]
    Hao Zhang, Zeyu Zheng, Shizhen Xu, Wei Dai, Qirong Ho, Xiaodan Liang, Zhiting Hu, Jinliang Wei, Pengtao Xie, and Eric P. Xing.
    ATC (Oral), 2017.

  • Controllable Text Generation[PDF]
    Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, Eric P. Xing. ICML 2017. (Oral)

  • Deep learning based subdivision approach for large scale macromolecules structure recovery from electron cryo tomograms [PDF]
    Min Xu, Xiaoqi Chai, Hariank Muthakana, Xiaodan Liang, Ge Yang, Tzviya Zeev-Ben-Mordehai, Eric P. Xing. ISMB 2017, Bioinformatics doi:10.1093/bioinformatics/btx230 2017.

  • Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection[PDF]
    Xiaodan Liang, Lisa Lee, Eric P. Xing
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Spotlight), 2017

  • Interpretable Structure-Evolving LSTM[PDF]
    Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Spotlight), 2017

  • Recurrent 3D Pose Sequence Machines[PDF]
    Mude Lin, Xiaodan Liang, Keze Wang, Liang Lin
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Oral), 2017

  • Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach[PDF]
    Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Oral), 2017

  • Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing[PDF]
    Ke Gong, Xiaodan Liang, Xiaohui Shen, Liang Lin
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, equal contribution with first author

  • Perceptual Generative Adversarial Networks for Small Object Detection[PDF]
    Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, Shuicheng Yan
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

  • Attention-Aware Face Hallucination via Deep Reinforcement Learning[PDF]
    Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

2016

  • Tree-structured Reinforcement Learning for Sequential Object Localization [PDF]
    Zequn Jie, Xiaodan Liang, Jiashi Feng, Xiaojie Jin, Wen Feng Lu, Shuicheng Yan
    Neural Information Processing Systems (NIPS), 2016

  • Semantic Object Parsing with Graph LSTM [PDF]
    Xiaodan Liang, Xiaohui Shen, Jiashi Feng, Liang Lin, Shuicheng Yan
    European Conference on Computer Vision (ECCV) (Spotlight), 2016

  • Peak-Piloted Deep Network for Facial Expression Recognition [PDF]
    Xiangyun Zhao, Xiaodan Liang, Luoqi Liu, Teng Li, Yugang Han, Nuno Vasconcelos, Shuicheng Yan
    European Conference on Computer Vision (ECCV), 2016

  • LSTM-CF: Unifying Context Modeling and Fusion with LSTMs for RGB-D Scene Labeling [PDF][Page with Code]
    Zhen Li, Yukang Gan, Xiaodan Liang, Yizhou Yu, Hui Cheng, and Liang Lin
    European Conference on Computer Vision (ECCV), 2016

  • Is Faster R-CNN Doing Well for Pedestrian Detection? [PDF][Page with Code]
    Liliang Zhang, Liang Lin, Xiaodan Liang, Kaiming He
    European Conference on Computer Vision (ECCV), 2016

  • Semantic Object Parsing with Local-Global Long Short-Term Memory [PDF]
    Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, Shuicheng Yan
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Spotlight), 2016

  • Reversible Recursive Instance-level Object Segmentation [PDF]
    Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Zequn Jie, Jiashi Feng, Liang Lin, Shuicheng Yan
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

  • Deep Structured Scene Parsing by Learning with Image Descriptions [PDF][Page with Data]
    Liang Lin, Guangrun Wang, Rui Zhang, Ruimao Zhang, Xiaodan Liang, and Wangmeng Zuo
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Oral), 2016

  • Geometric Scene Parsing with Hierarchical LSTM [PDF]
    Zhanglin Peng, Ruimao Zhang, Xiaodan Liang, Xiaobai Liu, and Liang Lin
    Proc. of International Joint Conference on Artificial Intelligence (IJCAI), 2016

  • Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning [PDF]
    Keze Wang, Shengfu Zhai, Hui Cheng, Xiaodan Liang, and Liang Lin
    Proc. of ACM International Conference on Multimedia (ACM MM) (Oral), 2016

2015

  • Human Parsing with Contextualized Convolutional Neural Network [PDF]
    Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan
    IEEE International Conference on Computer Vision (ICCV) (Oral), 2015

  • Towards Computational Baby Learning: A Weakly-supervised Approach for Object Detection [PDF]
    Xiaodan Liang, Si Liu, Yunchao Wei, Luoqi Liu, Liang Lin, and Shuicheng Yan
    IEEE International Conference on Computer Vision (ICCV), 2015

  • Matching-CNN Meets KNN: Quasi-Parametric Human Parsing [PDF]
    Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Liang Lin, Xiaochun Cao, Shuicheng Yan
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

2014

  • Fashion Parsing with Video Context [PDF]
    Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Liang Lin, Xiaochun Cao, Shuicheng Yan (Equal Contribution)
    Proc. of ACM International Conference on Multimedia (ACM MM), (Oral), 2014

  • Recognizing Focal Liver Lesions in Contrast-Enhanced Ultrasound with Discriminatively Trained Spatio-Temporal Model [PDF]
    Xiaodan Liang, Qingxing Cao, Rui Huang, and Liang Lin
    IEEE International Symposium on Biomedical Imaging (ISBI), 2014

2013

  • Learning Latent Spatio-Temporal Compositional Model for Human Action Recognition [PDF]
    Xiaodan Liang, Liang Lin, and Liangliang Cao
    Proc. ACM International Conference on Multimedia (ACM MM), (Oral), 2013