Learning I/O Variables from Scientific Software’s User Manuals

  • Zedong Peng
  • , Xuanyi Lin
  • , Sreelekhaa Nagamalli Santhoshkumar
  • , Nan Niu
  • , Upulee Kanewala

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Scientific software often involves many input and output variables. Identifying these variables is important for such software engineering tasks as metamorphic testing. To reduce the manual work, we report in this paper our investigation of machine learning algorithms in classifying variables from software’s user manuals. We identify thirteen natural-language features, and use them to develop a multi-layer solution where the first layer distinguishes variables from non-variables and the second layer classifies the variables into input and output types. Our experimental results on three scientific software systems show that random forest and feedforward neural network can be used to best implement the first layer and second layer respectively.

Original languageEnglish
Title of host publicationComputational Science - ICCS 2022, 22nd International Conference, Proceedings
EditorsDerek Groen, Clélia de Mulatier, Valeria V. Krzhizhanovskaya, Peter M.A. Sloot, Maciej Paszynski, Jack J. Dongarra
PublisherSpringer Science and Business Media Deutschland GmbH
Pages503-516
Number of pages14
ISBN (Print)9783031087592
DOIs
StatePublished - 2022
Event22nd Annual International Conference on Computational Science, ICCS 2022 - London, United Kingdom
Duration: Jun 21 2022Jun 23 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13353 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd Annual International Conference on Computational Science, ICCS 2022
Country/TerritoryUnited Kingdom
CityLondon
Period06/21/2206/23/22

Keywords

  • Classification
  • Machine learning
  • Scientific software
  • Software documentation
  • User manual

Fingerprint

Dive into the research topics of 'Learning I/O Variables from Scientific Software’s User Manuals'. Together they form a unique fingerprint.

Cite this