TY - GEN
T1 - If training data appears to be mislabeled, should we relabel it? Improving supervised learning algorithms for threat detection in ground penetrating radar data
AU - Reichman, Daniël
AU - Collins, Leslie M.
AU - Malof, Jordan M.
N1 - Publisher Copyright:
© 2018 SPIE.
PY - 2018
Y1 - 2018
N2 - This work focuses on the development of automatic buried threat detection (BTD) algorithms using ground penetrating radar (GPR) data. Buried threats tend to exhibit unique characteristics in GPR imagery, such as high energy hyperbolic shapes, which can be leveraged for detection. Many recent BTD algorithms are supervised, and therefore they require training with exemplars of GPR data collected over non-threat locations and threat locations, respectively. Frequently, data from non-threat GPR examples will exhibit high energy hyperbolic patterns, similar to those observed from a buried threat. Is it still useful therefore, to include such examples during algorithm training, and encourage an algorithm to label such data as a non-threat? Similarly, some true buried threat examples exhibit very little distinctive threat-like patterns. We investigate whether it is beneficial to treat such GPR data examples as mislabeled, and either (i) relabel them, or (ii) remove them from training. We study this problem using two algorithms to automatically identify mislabeled examples, if they are present, and examine the impact of removing or relabeling them for training. We conduct these experiments on a large collection of GPR data with several state-of-the-art GPR-based BTD algorithms.
AB - This work focuses on the development of automatic buried threat detection (BTD) algorithms using ground penetrating radar (GPR) data. Buried threats tend to exhibit unique characteristics in GPR imagery, such as high energy hyperbolic shapes, which can be leveraged for detection. Many recent BTD algorithms are supervised, and therefore they require training with exemplars of GPR data collected over non-threat locations and threat locations, respectively. Frequently, data from non-threat GPR examples will exhibit high energy hyperbolic patterns, similar to those observed from a buried threat. Is it still useful therefore, to include such examples during algorithm training, and encourage an algorithm to label such data as a non-threat? Similarly, some true buried threat examples exhibit very little distinctive threat-like patterns. We investigate whether it is beneficial to treat such GPR data examples as mislabeled, and either (i) relabel them, or (ii) remove them from training. We study this problem using two algorithms to automatically identify mislabeled examples, if they are present, and examine the impact of removing or relabeling them for training. We conduct these experiments on a large collection of GPR data with several state-of-the-art GPR-based BTD algorithms.
KW - buried threat detection
KW - ground penetrating radar
KW - mislabeled data
KW - supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85048031514&partnerID=8YFLogxK
U2 - 10.1117/12.2305881
DO - 10.1117/12.2305881
M3 - Conference contribution
AN - SCOPUS:85048031514
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Detection and Sensing of Mines, Explosive Objects, and Obscured Targets XXIII
A2 - Bishop, Steven S.
A2 - Isaacs, Jason C.
PB - SPIE
T2 - Detection and Sensing of Mines, Explosive Objects, and Obscured Targets XXIII 2018
Y2 - 16 April 2018 through 18 April 2018
ER -