TY - GEN
T1 - Training a single multi-class convolutional segmentation network using multiple datasets with heterogeneous labels
T2 - 39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019
AU - Kong, Fanjie
AU - Chen, Cheng
AU - Huang, Bohao
AU - Collins, Leslie M.
AU - Bradbury, Kyle
AU - Malof, Jordan M.
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - Segmentation convolutional neural networks (CNNs) are now popular for the semantic segmentation (i.e., dense pixel-wise labeling) of remote sensing imagery, such as color or hyperspectral satellite imagery. In recent years a large number of hand-labeled datasets of overhead imagery have emerged, leading to breakthrough performance for CNNs. However, these datasets are typically used in isolation of one another because they are either (i) annotated with heterogeneous object type labels, or (ii) they are collected over different geographic areas. This imposes a major bottleneck on the value of these datasets. In this work we present what we call a class-asymmetric loss function that makes it possible to train a single multi-class network using multiple datasets that are heterogeneously-labeled. We show, for example, that it is possible to train a segmentation algorithm for Buildings, roads, and background using two datasets: one annotated with buildings and one annotated with buildings. We propose a class asymmetric loss that under certain common conditions, allows for one to train models on datasets in which the target class is unlabeled.
AB - Segmentation convolutional neural networks (CNNs) are now popular for the semantic segmentation (i.e., dense pixel-wise labeling) of remote sensing imagery, such as color or hyperspectral satellite imagery. In recent years a large number of hand-labeled datasets of overhead imagery have emerged, leading to breakthrough performance for CNNs. However, these datasets are typically used in isolation of one another because they are either (i) annotated with heterogeneous object type labels, or (ii) they are collected over different geographic areas. This imposes a major bottleneck on the value of these datasets. In this work we present what we call a class-asymmetric loss function that makes it possible to train a single multi-class network using multiple datasets that are heterogeneously-labeled. We show, for example, that it is possible to train a segmentation algorithm for Buildings, roads, and background using two datasets: one annotated with buildings and one annotated with buildings. We propose a class asymmetric loss that under certain common conditions, allows for one to train models on datasets in which the target class is unlabeled.
KW - aerial imagery
KW - building detection
KW - convolutional neural networks
KW - deep learning
KW - semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85077704204&partnerID=8YFLogxK
U2 - 10.1109/IGARSS.2019.8898617
DO - 10.1109/IGARSS.2019.8898617
M3 - Conference contribution
AN - SCOPUS:85077704204
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 3903
EP - 3906
BT - 2019 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 28 July 2019 through 2 August 2019
ER -