A modular benchmark for rare disease facial image analysis.
This repository provides a modular framework for facial image analysis in the context of rare disease diagnosis, specifically tailored for use with the RDFace dataset. It supports data preprocessing, supervised classification, few-shot learning, synthetic image generation, and LLM-based report generation.
Data preprocessing
Utilities for preparing pediatric facial images and analyzing synthetic images for downstream benchmarking.
Classification
Supports conventional supervised learning and few-shot classification for rare disease facial phenotype recognition.
Synthetic generation
Includes synthetic image generation modules for identity- and phenotype-consistent data generation in low-data settings.
Hybrid release model for synthetic and real facial images.
RDFace includes both real pediatric facial images and synthetic data. Due to privacy considerations, the dataset is released under a hybrid access model.
RDFace-Syn
The synthetic dataset is freely available for research and benchmarking.
RDFace-Syn on Kaggle- Freely available for research use
- Designed for benchmarking under data scarcity
- Useful for synthetic-only and synthetic-assisted experiments
RDFace-Real
The real dataset contains identifiable facial images and is distributed under controlled access.
Request RDFace-Real- Approved users will receive a Research ID
- Access is granted only for approved research purposes
- Ethics approval, such as REB or IRB approval, is required
Watch the RDFace overview video.
This video introduces the motivation, dataset construction, benchmark design, and synthetic data generation components of RDFace.
Cite RDFace.
Please cite our paper if you use RDFace, its synthetic data, benchmark splits, or related code.
@inproceedings{rdface2026,
title = {RDFace: A Benchmark Dataset for Rare Disease Facial Image Analysis under Extreme Data Scarcity and Phenotype-Aware Synthetic Generation},
author = {Ganlin Feng and Yuxi Long and Hafsa Ali and Erin Lou and Fahad Butt and Qian Liu and Yang Wang and Pingzhao Hu},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2026}
}