Home /  About Me /  Team /  Collaborators /  Research


2024

AI4Medicine

  1. PMC-LLaMA: Towards Building Open-source Language Models for Medicine.
    Chaoyi Wu*, Weixiong Lin*, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
    In: Journal of the American Medical Informatics Association, 2024. (JAMIA, Impact Factor: ~7.9)   (NEW)
    Arxiv | Model | Code
  2. Towards Building Multilingual Language Model for Medicine.
    Pengcheng Qiu*, Chaoyi Wu*, Xiaoman Zhang, Weixiong Lin, Haicheng Wang, Ya Zhang, Yanfeng Wang, Weidi Xie
    Technical Report, 2024.   (NEW)
    Arxiv | Code | Model | Dataset
  3. One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts.
    Ziheng Zhao, Yao Zhang, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
    Technical Report, 2024.   (NEW)
    Project Page | Paper
  4. Large-scale Long-tailed Disease Diagnosis on Radiology Images.
    Qiaoyu Zheng, Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
    Under Review, 2024.   (NEW)
    Project Page | Paper

Computer Vision

  1. OV-VIS: Open-Vocabulary Video Instance Segmentation.
    Haochen Wang, Shuai Wang, Cilin Yan, Xiaolong Jiang, Xu Tang, Yao Hu, Weidi Xie^†, Efstratios Gavves
    In: International Journal of Computer Vision, 2024. (IJCV, Impact Factor: ~19.5, Corr Author)   (NEW)
    Code | Journal Version
  2. Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models.
    Chang Liu*, Haoning Wu*, Yujie Zhong, Xiaoyun Zhang, Yanfeng Wang, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Arxiv
  3. AutoAD III: The Prequel -- Back to the Pixels.
    Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Paper
  4. InstaGen: Enhancing Object Detection by Training on Synthetic Dataset.
    Chengjian Feng, Yujie Zhong, Zequn Jie^†, Weidi Xie^†, Lin Ma
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Paper
  5. Retrieval-Augmented Egocentric Video Captioning.
    Jilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Paper
  6. A Strong Baseline for Temporal Video-Text Alignment.
    Zeqian Li*, Qirui Chen*, Tengda Han, Ya Zhang, Yanfeng Wang, Weidi Xie
    Technical Report, 2024.   (NEW)
    Project Page | Paper
  7. Grounded Question-Answering in Long Egocentric Videos.
    Shangzhe Di, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Paper
  8. Amodal Ground Truth and Completion in the Wild.
    Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2024.   (New)
    Project Page | Paper
  9. Appearance-based Refinement for Object-Centric Motion Segmentation.
    Junyu Xie, Weidi Xie, Andrew Zisserman
    Technical Report, 2024.   (NEW)
    Paper

2023

AI4Medicine

  1. Can GPT-4V(ision) Serve Medical Applications ? Case Studies on GPT-4V for Multimodal Medical Diagnosis.
    Chaoyi Wu*, Jiayu Lei*, Qiaoyu Zheng*, Weike Zhao*, Weixiong Lin*, Xiaoman Zhang*, Xiao Zhou*, Ziheng Zhao*,
    Ya Zhang, Yanfeng Wang, Weidi Xie
    Technical Report, 2023.
    Project Page | Paper
  2. Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D & 3D Medical Data.
    Chaoyi Wu*, Xiaoman Zhang*, Ya Zhang, Yanfeng Wang, Weidi Xie
    Under Review, 2023.
    Project Page | Code & Model | Paper
  3. Knowledge-enhanced Pre-training for Auto-diagnosis of Chest Radiology Images.
    Xiaoman Zhang, Chaoyi Wu, Ya Zhang, Yanfeng Wang, Weidi Xie
    In: Nature Communications, 2023. (Impact Factor: ~18)
    Project Page | Code & Model | Paper
  4. MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training.
    Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
    In: International Conference on Computer Vision (ICCV) , 2023.
    Project Page | Code & Model | Arxiv
  5. PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents.
    Weixiong Lin*, Ziheng Zhao*, Xiaoman Zhang, Chaoyi Wu, Ya Zhang, Yanfeng Wang, Weidi Xie
    In: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2023.
    Project Page | Code & Model | Arxiv
  6. PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering.
    Xiaoman Zhang, Chaoyi Wu, Ziheng Zhao, Weixiong Lin, Ya Zhang, Yanfeng Wang, Weidi Xie
    Under Review, 2023.
    Project Page | Arxiv
  7. Deep Facial Phenotyping with Mixup Augmentation.
    Jonathan Campbell, Mitchell Dawson, Andrew Zisserman, Weidi Xie, Christoffer Nellåker
    In: Annual Conference on Medical Image Understanding and Analysis.
    Paper
  8. K-Diag: Knowledge-enhanced Disease Diagnosis in Radiographic Imaging.
    Chaoyi Wu*, Xiaoman Zhang*, Yanfeng Wang, Ya Zhang, Weidi Xie
    In: Big Task Small Data, 1001-AI, MICCAI 2023 Workshop (Oral).
    Project Page | Arxiv
  9. Self-supervised Tumor Segmentation with Sim2Real Adaptation.
    Xiaoman Zhang, Weidi Xie, Chaoqin Huang, Ya Zhang, Xin Chen, Qi Tian, Yanfeng Wang
    In: IEEE Journal of Biomedical and Health Informatics, 2023. (Impact Factor: ~7)
    Project Page | Arxiv

Computer Vision

  1. Self-supervised Object-Centric Learning for Videos.
    Görkay Aydemir, Weidi Xie, Fatma Güney
    In: Conference on Neural Information Processing Systems (NeurIPS) , 2023.
    Project Page | Arxiv
  2. What Does Stable Diffusion Know about the 3D Scene?
    Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman
    Technical Report, 2023.
    Project Page | Arxiv
  3. A Large-scale Dataset for Audio-Language Representation Learning.
    Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie
    Technical Report, 2023.
    Project Page | Arxiv
  4. Zero-shot Composed Text-Image Retrieval.
    Yikun Liu, Jiangchao Yao, Yanfeng Wang, Ya Zhang, Weidi Xie
    In: British Machine Vision Conference (BMVC) , 2023.
    Project Page | Arxiv
  5. Boost Video Frame Interpolation via Simple Motion Adaptation.
    Haoning Wu, Xiaoyun Zhang, Weidi Xie, Ya Zhang, Yanfeng Wang
    In: British Machine Vision Conference (BMVC) , 2023. (Oral)
    Project Page | Arxiv
  6. Annotation-free Audio-Visual Segmentation.
    Jinxiang Liu, Yu Wang, Chen Ju, Ya Zhang, Weidi Xie
    In: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023.
    Project Page | Arxiv
  7. Open-vocabulary Object Segmentation with Diffusion Models.
    Ziyi Li*, Qinye Zhou*, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
    In: International Conference on Computer Vision (ICCV) , 2023.
    Project Page | Code & Model | Arxiv
  8. AutoAD II: The Sequel – Who, When, and What in Movie Audio Description.
    Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman 
    In: International Conference on Computer Vision (ICCV) , 2023.
    Project Page | Paper
  9. The Making and Breaking of Camouflage.
    Hala Lamdouar,  Weidi Xie, Andrew Zisserman
    In: International Conference on Computer Vision (ICCV) , 2023.
    Paper
  10. Towards Open-Vocabulary Video Instance Segmentation.
    Haochen Wang, Shuai Wang, Cilin Yan, Xiaolong Jiang, Xu Tang, Yao Hu, Weidi Xie*, Efstratios Gavves
    In: International Conference on Computer Vision (ICCV) , 2023.
    Project Page | Arxiv
  11. Joint-Relation Transformer for Multi-person Motion Prediction.
    Qingyao Xu, Weibo Mao, Jingze Gong, Chenxin Xu, Siheng Chen, Weidi Xie, Ya Zhang, Yanfeng Wang
    In: International Conference on Computer Vision (ICCV) , 2023.
    Arxiv
  12. Multi-Modal Classifiers for Open-Vocabulary Object Detection.
    Prannay Kaul, Weidi Xie, Andrew Zisserman
    In: International Conference on Machine Learning (ICML) , 2023.
    Project Page | Arxiv
  13. Diagnosing Human-object Interaction Detectors.
    Fanrui Zhu, Fangrui Zhu, Yiming Xie, Weidi Xie, Huaizu Jiang
    Technical Report, 2023.
    Code | Arxiv
  14. arXiVeri: Automatic Table Verification with GPT.
    Gyungin Shin, Weidi Xie, Samuel Albanie
    Technical Report, 2023.
    Project Page | Arxiv
  15. arXiVeri: Automatic Table Verification with GPT.
    Gyungin Shin, Weidi Xie, Samuel Albanie,
    In: NeurIPS AI4Science Workshop , 2023.
    Arxiv
  16. Namedmask: Distilling Segmenters from Complementary Foundation Models.
    Gyungin Shin, Weidi Xie, Samuel Albanie,
    In: CVPR Workshop , 2023.
    Project Page | Arxiv
  17. Zero-shot Unsupervised Transfer Instance Segmentation.
    Gyungin Shin, Samuel Albanie, Weidi Xie
    In: CVPR Workshop , 2023.   (Best Paper Award)
    Project Page | Arxiv
  18. AutoAD: Movie Description in Context.
    Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2023.   (Highlight)
    Project Page | Arxiv
  19. Collaboration Helps Camera Overtake LiDAR in 3D Detection.
    Yue Hu, Yifan Lu, Runsheng Xu, Weidi Xie, Siheng Chen, Yanfeng Wang
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2023.
    Arxiv | Dataset | Code
  20. OvarNet: Towards Open-vocabulary Object Attribute Recognition.
    Keyan Chen*, Xiaolong Jiang*, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2023.
    Project Page | Arxiv
  21. Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision.
    Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2023.
    Project Page | Arxiv
  22. Multi-modal Prompting for Low-Shot Temporal Action Localization.
    Chen Ju, Zeqian Li, Peisen Zhao, Ya Zhang, Xiaopeng Zhang, Qi Tian, Yanfeng Wang, Weidi Xie
    Technical Report, 2023.
    Arxiv
  23. Aerial Monocular 3d Object Detection.
    Yue Hu, Shaoheng Fang, Weidi Xie, Siheng Chen
    In: IEEE Robotics and Automation Letters (RA-L), 2023. (Impact Factor: ~4)
    Project Page | Arxiv

2022

  1. Turbo Training with Token Dropout.
    Tengda Han, Weidi Xie, Andrew Zisserman
    In: British Machine Vision Conference (BMVC) , 2022.
    Project Page | Arxiv
  2. A Simple Plugin for Transforming Images to Arbitrary Scales.
    Qinye Zhou, Ziyi Li, Weidi Xie^†, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang†
    In: British Machine Vision Conference (BMVC) , 2022.
    Project Page | Arxiv
  3. Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors.
    Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman
    In: British Machine Vision Conference (BMVC) , 2022.   (Spotlight)
    Project Page | Arxiv
  4. CounTR: Transformer-based Generalised Visual Counting.
    Chang Liu, Yujie Zhong, Andrew Zisserman, Weidi Xie
    In: British Machine Vision Conference (BMVC) , 2022.
    Project Page | Arxiv
  5. K-Space Transformer for Fast MRI Reconstruction.
    Ziheng Zhao, Tianjiao Zhang, Weidi Xie†, Yanfeng Wang†, Ya Zhang
    In: British Machine Vision Conference (BMVC) , 2022.
    Project Page | Arxiv
  6. Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models.
    Chaofan Ma, Yuhuan Yang, Yanfeng Wang, Ya Zhang, Weidi Xie
    In: British Machine Vision Conference (BMVC) , 2022.   (Oral Presentation)
    Arxiv | Code
  7. A Tri-Layer Plugin to Improve Occluded Detection.
    Guanqi Zhan, Weidi Xie, Andrew Zisserman
    In: British Machine Vision Conference (BMVC) , 2022.   (Oral Presentation)
    Project Page | Arxiv
  8. Associating Objects and Their Effects in Video through Coordination Games.
    Erika Lu, Forrester Cole, Weidi Xie, Tali Dekel, William T. Freeman, Andrew Zisserman, Michael Rubinstein
    In: Conference on Neural Information Processing Systems (NeurIPS) , 2022.
    Project Page | Paper
  9. ReCo: Retrieve and Co-segment for Zero-shot Transfer.
    Gyungin Shin, Weidi Xie, Samuel Albanie
    In: Conference on Neural Information Processing Systems (NeurIPS) , 2022.
    Project Page | Arxiv
  10. Segmenting Moving Objects via an Object-Centric Layered Representation.
    Junyu Xie, Weidi Xie, Andrew Zisserman
    In: Conference on Neural Information Processing Systems (NeurIPS) , 2022.
    Project Page | Arxiv
  11. Prompting Visual-Language Models for Efficient Video Understanding.
    Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie
    In: European Conference on Computer Vision (ECCV) , 2022
    Project Page | Arxiv
  12. PromptDet: Expand Your Detector Vocabulary with Uncurated Images.
    Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie†, Lin Ma
    In: European Conference on Computer Vision (ECCV) , 2022
    Project Page | Arxiv
  13. Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation.
    Jinxiang Liu, Chen Ju, Weidi Xie, Ya Zhang
    In: ACM Multimedia , 2022.
    Project Page | Arxiv
  14. Adaptive 3D Localization of 2D Freehand Ultrasound Brain Images.
    Pak-Hei Yeung, Moska Aliasi, Monique Haak, the INTERGROWTH-21, Weidi Xie, Ana I.L. Namburete
    In: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022.
    Project Page | Arxiv
  15. Transforming the Interactive Segmentation for Medical Imaging.
    Wentao Liu, Chaofan Ma, Yuhuan Yang, Weidi Xie, Ya Zhang
    In: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022.   (Early Accept)
    Project Page | Arxiv
  16. Temporal Alignment Networks for Long-term Video.
    Tengda Han, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2022.   (Oral Presentation)
    Project Page | Arxiv
  17. Label, Verify, Correct: A Simple Few Shot Object Detection Method.
    Prannay Kaul, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2022.
    Project Page | Arxiv
  18. It's About Time: Analog Clock Reading in the Wild.
    Charig Yang, Weidi Xie, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR) , 2022.
    Project Page | Arxiv
  19. Unsupervised Salient Object Detection with Spectral Cluster Voting.
    Gyungin Shin, Samuel Albanie, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition, L3D-IVU Workshop , 2022.
    Code | Arxiv
  20. Quantum Self-supervised Learning.
    Ben Jaderberg, Lewis W. Anderson, Weidi Xie, Samuel Albanie, Martin Kiffner, Dieter Jaksch
    In: Quantum Science and Technology, 2022 (Impact Factor: ~5.2)
    Code | Arxiv
  21. Self-supervised Tumor Segmentation through Layer Decomposition.
    Xiaoman Zhang, Weidi Xie, Chaoqin Huang, Ya Zhang, Yanfeng Wang
    Project Page | Arxiv
  22. Subcortical Segmentation Of The Fetal Brain in 3D Ultrasound Using Deep Learning.
    Linde S.Hesse, Moska Aliasi, Felipe Moser, the INTERGROWTH-21st Consortium, Monique C. Haak, Weidi Xie, Mark Jenkinson, Ana I.L. Namburete
    In: NeuroImage, Volume 254, July, 2022. (Impact Factor: ~6.5)
    Link

2021

  1. ImplicitVol: Sensorless 3D Ultrasound Reconstruction with Deep Implicit Representation.
    Pak-Hei Yeung, Linde Hesse, Moska Aliasi, Monique Haak, the INTERGROWTH-21st Consortium, Weidi Xie*, Ana I.L. Namburete*
    Project Page | Arxiv
  2. Segmenting Invisible Moving Objects.
    Hala Lamdouar, Weidi Xie, Andrew Zisserman
    In: British Machine Vision Conference (BMVC), 2021.
    Project Page | Paper

  3. Audio-Visual Synchronisation In the Wild.
    Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman
    In: British Machine Vision Conference (BMVC), 2021.
    Project Page | Paper

  4. All You Need Are a Few Pixels: Semantic Segmentation with PixelPick.
    Gyungin Shin, Weidi Xie, Samuel Albanie
    In: International Conference on Computer Vision (ICCV), ILDAV Workshop , 2021.   (Best Paper Award)
    Project Page | Arxiv

  5. NeRF--: Neural Radiance Fields Without Known Camera Parameters.
    Zirui Wang, Shangzhe Wu, Weidi Xie, Min Chen, Victor Adrian Prisacariu
    Project Page | Arxiv

  6. Self-supervised Video Object Segmentation by Motion Grouping.
    Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie
    In: International Conference on Computer Vision (ICCV), 2021.
    Project Page | Arxiv

  7. Sli2Vol: Annotate a 3D Volume from a Single Slice with Self-Supervised Learning.
    Pak Hei Yeung, Ana I.L. Namburete, Weidi Xie
    In: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021.
    Project Page | Arxiv

  8. Self-supervised Video Object Segmentation by Motion Grouping (Short Version).
    Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR), RVSU Workshop , 2021.   (Best Paper Award)
    Project Page | Arxiv

  9. Localizing Visual Sounds the Hard Way.
    Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman
    In: Conference on Computer Vision and Pattern Recognition (CVPR), 2021
    Project Page | Arxiv
  10. Learning to Map 2D Ultrasound Images into 3D Space with Minimal Human Annotation.
    Pak-Hei Yeung, Moska Aliasi, Aris T. Papageorghiou, Monique Haak, Weidi Xie, Ana I.L. Namburete.
    In: Medical Image Analysis, February 2021. (Impact Factor: ~14)
    Project Page | Paper

2020

  1. VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge.
    Arsha Nagrani, Joon Son Chung, Jaesung Huh, Andrew Brown, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A Reynolds, Andrew Zisserman.
    Tech Report
  2. Self-supervised Co-training for Video Representation Learning.
    Tengda Han, Weidi Xie, Andrew Zisserman
    In: Conference on Neural Information Processing Systems (NeurIPS) , 2020.
    Arxiv | Project Page | Code & Model
  3. Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation.
    Hala Lamdouar, Charig Yang, Weidi Xie, Andrew Zisserman
    In: Asian Conference on Computer Vision (ACCV), 2020.
    Arxiv | PDF | Project Page
  4. Layered Neural Rendering for Retiming People in Video.
    Erika Lu, Forrester Cole, Tali Dekel, Weidi Xie, Andrew Zisserman, David Salesin, William T. Freeman, Michael Rubinstein
    In: ACM Transactions on Graphics (TOG). Proc. SIGGRAPH Asia , 2020
    Arxiv | Project Page
  5. Inducing Predictive Uncertainty Estimation for Face Recognition.
    Weidi Xie, Jeffrey Byrne, Andrew Zisserman
    In: British Machine Vision Conference (BMVC) , 2020
    Arxiv | PDF
  6. Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval.
    Andrew Brown, Weidi Xie, Vicky Kalogeiton, Andrew Zisserman
    In: European Conference on Computer Vision (ECCV) , 2020
    Arxiv | Project Page | Code & Model
  7. Memory-augmented Dense Predictive Coding for Video Representation Learning.
    Tengda Han, Weidi Xie, Andrew Zisserman
    In: European Conference on Computer Vision (ECCV) , 2020   (Spotlight Presentation)
    Arxiv | Project Page | Code & Model
  8. MAST: A Memory-Augmented Self-Supervised Tracker.
    Zihang Lai, Erika Lu, Weidi Xie
    In: Conference on Computer Vision and Pattern Recognition (CVPR), 2020
    Arxiv | Project Page | Code & Model
  9. VGG-Sound: A Large-Scale Audio-Visual Dataset.
    Honglie Chen, Weidi Xie, Andrea Vedaldi, Andrew Zisserman
    In: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
    Arxiv | PDF | Project Page | Code & Model
  10. Low-Memory CNNs Enabling Real-Time Ultrasound Segmentation Towards Mobile Deployment.
    Sagar Vaze, Weidi Xie, Ana Namburete.
    In: IEEE Journal of Biomedical and Health Informatics, 2020. (Impact Factor: ~7)
    Project Page | Code
  11. VoxCeleb: Large-scale Speaker Verification in the Wild.
    Arsha Nagrani*, Joon Son Chung*, Weidi Xie*, Andrew Zisserman. (* indicates equal contribution)
    In: Computer Speech & Language, 2020. (Impact Factor: ~1.8)
    Paper

2019

  1. VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge.
    Joon Son Chung, Arsha Nagrani, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A Reynolds, Andrew Zisserman.
    Tech Report
  2. Video Representation Learning by Dense Predictive Coding.
    Tengda Han, Weidi Xie, Andrew Zisserman
    In: 1st International Workshop on Large-scale Holistic Video Understanding, ICCV, 2019.   (Oral Presentation)
    Arxiv | Project Page | Code
  3. Self-supervised Learning for Video Correspondence Flow.
    Zihang Lai, Weidi Xie
    In: British Machine Vision Conference (BMVC), 2019.   (Oral Presentation)
    Arxiv | Project Page
  4. AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations.
    Honglie Chen, Weidi Xie, Andrea Vedaldi, Andrew Zisserman.
    In: British Machine Vision Conference (BMVC), 2019.   (Spotlight Presentation)
    Arxiv | PDF
  5. Geometry-Aware Corner Network for Video Object Detection from Static Cameras.
    Dan Xu, Weidi Xie, Andrew Zisserman.
    In: British Machine Vision Conference (BMVC), 2019.   (Oral Presentation)
    Arxiv | PDF
  6. Utterance-level Aggregation for Speaker Recognition in the Wild.
    Weidi Xie, Arsha Nagrani, Joon Son Chung, Andrew Zisserman.
    In: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019.   (Oral Presentation)
    Arxiv | Project Page | Code & Model

2018

  1. Comparator Networks.
    Weidi Xie, Li Shen, Andrew Zisserman
    In: European Conference on Computer Vision (ECCV), 2018.
    Arxiv | PDF
  2. Multicolumn Networks on Face Recognition.
    Weidi Xie, Andrew Zisserman
    In: British Machine Vision Conference (BMVC), 2018.
    Arxiv | PDF | Code & Model | Bibtex
  3. Class-Agnostic Counting.
    Erika Lu, Weidi Xie, Andrew Zisserman
    In: Asian Conference on Computer Vision (ACCV), 2018.
    Arxiv | Project Page | Bibtex
  4. VGGFace2: A Dataset for Recognising Faces Across Pose and Age.
    Qiong Cao, Li Shen, Weidi Xie, Omkar M. Parkhi and Andrew Zisserman
    In: IEEE International Conference on Automatic Face and Gesture Recognition (F&G), 2018.   (Oral Presentation)
    Arxiv | PDF | Project Page | Bibtex
  5. Omega-Net: Fully Automatic, Multi-View Cardiac MR Detection, Orientation, and Segmentation with Deep Neural Networks.
    Weidi Xie*, Davis M. Vigneault*, Carolyn Y. Ho, David A. Bluemke and J. Alison Noble (*joint first author)
    In: Medical Image Analysis, Volume 48, Pages 95, August 2018. (Impact Factor: ~14)
    Arxiv | Paper
  6. VP-Nets: Efficient Automatic Localization of Key Brain Structures in 3D Fetal Neurosonography.
    Ruobing Huang, Weidi Xie and J. Alison Noble
    In: Medical Image Analysis, Volume 47, Pages 127, July 2018. (Impact Factor: ~14)
    Paper
  7. Fully-Automated Alignment of 3D Fetal Brain Ultrasound to a Canonical Reference Space Using Multi-task Learning.
    Weidi Xie*, Ana I.L. Namburete*, Mohammad Yaqub, Andrew Zisserman and J. Alison Noble (*joint first author)
    In: Medical Image Analysis, Volume 46, Pages 1, May 2018. (Impact Factor: ~14)
    Paper

2017

  1. Feature Tracking Cardiac Magnetic Resonance via Deep Learning and Spline Optimization.
    Davis M. Vigneaulta, Weidi Xie, David A. Bluemke and J. Alison Noble
    In: Functional Imaging and Modelling of the Heart (FIMH), 2017.   (Best Poster Award)
    Arxiv | Paper
  2. Robust Regression of Brain Maturation from 3D Fetal Neurosonography using CRNs.
    Ana I.L. Namburete, Weidi Xie and J. Alison Noble
    In: MICCAI Workshop on Fetal and InFant Image analysis (FIFI), 2017.   (Best Paper Award)
    Paper

2016

  1. Microscopy Cell Counting and Detection with Fully Convolutional Regression Networks.
    Weidi Xie, J. Alison Noble and Andrew Zisserman
    In: MICCAI 1st Deep Learning Workshop, 2015.
    In: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 2016.   (Biannual Best Journal Article)
    Paper | Code | Award

Based on a template by Jon Barron