Motivation: The performance of typical classifiers significantly drops due to unknown degradation. Hence, we employ DDPM to adjust degraded images towards the domain of clean images. We inherently assume that the adapted images domain is better than directly using unknown degraded images for classification. Indeed, previous studies like DDA have shown that DDPM helps improve the performance of classifying unknown degraded images. The DDA method applies an ensemble of classifiers trained on clean images to the input of degraded and adapted images to resolve imperfect adapted images' limitations. Contrastingly, we train two separate classifiers on adapted and degraded images that substantially improve classification performance for both adapted and degraded images. In particular, a classifier trained on adapted images with a limited set of known degradations anticipates imperfections in the image, thereby contributing to the robustness of our proposed method. Similarly, a classifier trained on degraded images of a few dissimilar known degradations helps our proposed method handle the degraded images directly.
Notations: We categorize three types of training images, i.e., clean, degraded, and adapted, denoted as , , and , respectively. Clean images are natural images without degradation; degraded images undergo synthesis using a specific degradation model, and the adapted images are sampled by applying DDPM on degraded images. Furthermore, there are two types of classifier in our study, i.e., simple classifier and distilled classifier, denoted as and . and are trained using image and label pairs , where represents clean, degraded, and adapted images respectively. We represent classifiers trained with clean, adapted, and degraded images as , , and , respectively. Likewise, distilled classifiers trained using , , and images are represented as , , and respectively. Besides, there are two other symbols utilized in our study, i.e., describes the DDPM process for adaptation such as the one described in DDA and denotes the ensemble, which comprises a set of distinct classifiers defined as .
Proposed Method: We propose DiffAUD, i.e., diffusion-based adaptation for unknown degraded images as described in Figure 1, where the top block shows the overall process for the classification of degraded images, which constitutes applying a diffusion model and an ensemble of distilled classifiers and to get the final classification prediction. Furthermore, to apply ensemble, we take the sum of logits from the two classifiers before the softmax function and apply to predict the input image class.
Our proposed method is split into three steps as follows:
- Apply DDPM on the degraded images to yield adapted images .
- Feed adapted images to a distilled classifier trained on adapted images from known degradations, i.e., and in parallel, we input degraded images directly to a distilled classifier trained on known degradation images, i.e., .
- Apply ensemble on the outputs of two distilled classifiers to output .