Selected publications Conference paper Journal article
2021
(2021). Open-book video captioning with retrieve-copy-generate network. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2021). DPFPS: Dynamic and progressive filter pruning for compressing convolutional neural networks from scratch. AAAI Conference on Artificial Intelligence (AAAI).

Code

2020
(2020). Web objectionable video recognition based on deep multi instance learning with representative prototypes selection. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT).

PDF

(2020). STA-CNN: Convolutional spatial-temporal attention learning for action recognition. IEEE Transactions on Image Processing (TIP).

PDF

(2020). Recursive least-squares estimator-aided online learning for visual tracking. IEEE Conference on Computer Vision and Pattern Recognition (CVPR Oral).

PDF

(2020). RDSNet: A new deep architecture for reciprocal object detection and instance segmentation. AAAI Conference on Artificial Intelligence (AAAI).

PDF Code

(2020). Ocean: Object-aware anchor-free tracking. European Conference on Computer Vision (ECCV).

PDF Code

(2020). Object relational graph with teacher-recommended learning for video captioning. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2020). Multi-cue semi-supervised color constancy with limited training samples. IEEE Transactions on Image Processing (TIP).

PDF

(2020). Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model. European Conference on Computer Vision (ECCV).

PDF

(2020). Graph convolutional network with structure pooling and joint-wise channel attention for action recognition. Pattern Recognition (PR).

PDF

(2020). EDP: An efficient decomposition and pruning scheme for convolutional neural network compression. IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

PDF

(2020). Dual L1-normalized context aware tensor power iteration and its applications to multi-object tracking and multi-graph matching. International Journal of Computer Vision (IJCV).

PDF

(2020). Distractor-aware discrimination learning for online multiple object tracking. Pattern Recognition (PR).

PDF

(2020). Anisotropic convolution for image classification. IEEE Transactions on Image Processing (TIP).

PDF

(2020). Anchor-free one-stage online multi-object tracking. Pattern Recognition and Computer Vision (PRCV best paper).

2019
(2019). Tangent Fisher vector on matrix manifolds for action recognition. IEEE Transactions on Image Processing (TIP).

PDF

(2019). Rank-1 tensor approximation for high-order association in multi-target tracking. International Journal of Computer Vision (IJCV).

PDF

(2019). Multimodal semantic attention network for video captioning. IEEE International Conference on Multimedia and Expo (ICME).

PDF

(2019). Knowledge distillation via instance relationship graph. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2019). Fast online object tracking and segmentation: A unifying approach. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Code

(2019). Deeper and wider siamese networks for real-time visual tracking. IEEE Conference on Computer Vision and Pattern Recognition (CVPR Oral).

PDF Code

(2019). Asymmetric 3d convolutional neural networks for action recognition. Pattern recognition (PR).

PDF

(2019). Anchor diffusion for unsupervised video object segmentation. IEEE International Conference on Computer Vision (ICCV).

PDF

2018
(2018). Visual tracking via spatially aligned correlation filters network. European Conference on Computer Vision (ECCV).

PDF

(2018). Tracking-by-fusion via Gaussian process regression extended to transfer learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2018). Learning attentions: residual attentional siamese network for high performance online visual tracking. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2018). Interaction-aware spatio-temporal pyramid attention networks for action classification. European Conference on Computer Vision (ECCV).

PDF

(2018). Hierarchical nonlinear orthogonal adaptive-subspace self-organizing map based feature extraction for human action recognition. AAAI Conference on Artificial Intelligence (AAAI).

PDF

(2018). Dual sticky hierarchical Dirichlet process hidden Markov model and its application to natural language description of motions. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2018). Do not lose the details: reinforced representation learning for high performance visual tracking. International Joint Conference on Artificial Intelligence (IJCAI).

PDF

(2018). Distractor-aware siamese networks for visual object tracking. European Conference on Computer Vision (ECCV).

PDF Code

(2018). Deep cost-sensitive and order-preserving feature learning for cross-population age estimation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2018). Deep constrained siamese hash coding network and load-balanced locality-sensitive hashing for near duplicate image detection. IEEE Transactions on Image Processing (TIP).

PDF

(2018). Context-dependent random walk graph kernels and tree pattern graph matching kernels with applications to action recognition. IEEE Transactions on Image Processing (TIP).

PDF

(2018). Anomaly detection using local kernel density estimation and context-based regression. IEEE Transactions on Knowledge and Data Engineering (TKDE).

PDF

2017
(2017). Towards robust and accurate multi-view and partially-occluded face alignment. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2017). Spatio-temporal self-organizing map deep network for dynamic object detection from videos. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2017). Robust visual object tracking with top-down reasoning. ACM international conference on Multimedia (ACM MM).

PDF

(2017). Multi-view multi-instance learning based on joint sparse representation and multi-view dictionary learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2017). Diagnosing deep learning models for high accuracy age estimation from a single image. Pattern Recognition (PR).

PDF

(2017). D2C: Deep cumulatively and comparatively learning for human age estimation. Pattern Recognition (PR).

PDF

2016
(2016). Tensor power iteration for multi-graph matching. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2016). Semi-supervised tensor-based graph embedding learning and its application to visual discriminant tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2016). Salient object detection via structured matrix decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI featured paper).

PDF

(2016). Online human action detection using joint classification-regression recurrent neural networks. European Conference on Computer Vision (ECCV).

PDF

(2016). Multimodal web aesthetics assessment based on structural SVM and multitask fusion learning. IEEE Transactions on Multimedia (TMM).

PDF

(2016). Multi-instance multi-label learning combining hierarchical context and its application to image annotation. IEEE Transactions on Multimedia (TMM).

PDF

(2016). Multi-cue illumination estimation via a tree-structured group joint sparse representation. International Journal of Computer Vision (IJCV).

PDF

(2016). Listwise learning to rank from crowds. ACM Transactions on Knowledge Discovery from Data (TKDD).

PDF

(2016). Listwise learning to rank by exploring structure of objects. IEEE Transactions on Knowledge and Data Engineering (TKDE).

PDF

(2016). Graph based skeleton motion representation and similarity measurement for action recognition. European Conference on Computer Vision (ECCV).

PDF

(2016). Fusing R features and local features with context-aware kernels for action recognition. International Journal of Computer Vision (IJCV).

PDF

(2016). FatRegion: A fast adaptive tree-structured region extraction approach. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT).

PDF

2015
(2015). Predicting image memorability by multi-view adaptive regression. ACM international conference on Multimedia (ACM MM).

PDF

(2015). Optimizing locally linear classifiers with supervised anchor point learning. International Joint Conference on Artificial Intelligence (IJCAI).

PDF

(2015). Multi-perspective cost-sensitive context-aware multi-instance sparse coding and its application to sensitive video recognition. IEEE Transactions on Multimedia (TMM).

PDF

(2015). Multi-feature max-margin hierarchical Bayesian model for action recognition. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF

(2015). Local subspace collaborative tracking. IEEE International Conference on Computer Vision (ICCV).

PDF

(2015). Learning a superpixel-driven speed function for level set tracking. IEEE Transactions on Cybernetics.

PDF

(2015). Joint scale-spatial correlation tracking with adaptive rotation estimation. IEEE International Conference on Computer Vision Workshops (ICCVW).

PDF

(2015). A robust tracking system for low frame rate video. International Journal of Computer Vision (IJCV).

PDF

~2014
(2014). Transfer learning based visual tracking with gaussian processes regression. European Conference on Computer Vision (ECCV).

PDF

(2014). Single and multiple object tracking using a multi-feature joint sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2014). Learning human actions by combining global dynamics and local appearance. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI featured paper).

PDF

(2014). Image classification using multiscale information fusion based on saliency driven nonlinear diffusion filtering. IEEE Transactions on Image Processing (TIP).

PDF

(2014). Bin ratio-based histogram distances and their application to image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2013). Discriminant tracking using tensor representation with semi-supervised improvement. IEEE International Conference on Computer Vision (ICCV).

PDF

(2013). An incremental DPMM-based method for trajectory clustering, modeling, and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2013). An improved hierarchical Dirichlet process-hidden Markov model and its application to trajectory modeling and retrieval. International Journal of Computer Vision (IJCV).

PDF

(2012). Single and multiple object tracking using log-Euclidean Riemannian subspace and block-division appearance model. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

PDF

(2012). Active contour-based visual tracking by integrating colors, shapes, and motions. IEEE Transactions on Image Processing (TIP).

PDF

(2011). Incremental tensor subspace learning and its applications to foreground segmentation and tracking. International Journal of Computer Vision (IJCV).

PDF