DASH: Visual Analytics for Debiasing Image Classification via User-Driven Synthetic Data Augmentation

April 23, 2022· Bum Chul Kwon , Jungsoo Lee , Chaeyeon Chung , Nyoungwoo Lee , Ho-Jin Choi , Jaegul Choo

PDF Video

Abstract

Image classification models often learn to predict a class based on irrelevant co-occurrences between input features and an output class in training data. We call the unwanted correlations “data biases,” and the visual features causing data biases “bias factors.” It is challenging to identify and mitigate biases automatically without human intervention. Therefore, we conducted a design study to find a human-in-the-loop solution. First, we identified user tasks that capture the bias mitigation process for image classification models with three experts. Then, to support the tasks, we developed a visual analytics system called DASH that allows users to visually identify bias factors, to iteratively generate synthetic images using a state-of-the-art image-to-image translation model, and to supervise the model training process for improving the classification accuracy. Our quantitative evaluation and qualitative study with ten participants demonstrate the usefulness of DASH and provide lessons for future work.

Type

Conference paper

Publication

Eurographics Conference on Visualization (EuroVis) Short Papers

Last updated on April 23, 2022

← Human-Centered Explainability For Life Sciences, Healthcare, And Medical Informatics May 13, 2022

Progression Of Type 1 Diabetes From Latency To Symptomatic Disease Is Predicted By Distinct Autoimmune Trajectories March 21, 2022 →