TY - GEN
T1 - Application of back-translation
T2 - 2021 ACM Southeast Conference, ACMSE 2021
AU - Subedi, Ishan Mani
AU - Singh, Maninder
AU - Ramasamy, Vijayalakshmi
AU - Walia, Gursimran Singh
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/4/15
Y1 - 2021/4/15
N2 - Ambiguous requirements are problematic in requirements engineering, as various stakeholders can debate the interpretation of the requirements, leading to a variety of issues in the development stages. Since requirement specifications are usually written in natural language, analyzing ambiguous requirements is currently a manual process, as it has not been fully automated to meet industry standards. In this paper, we used transfer learning with ULMFiT, pre-training our model on a general-domain corpus and then fine-tuning it to classify ambiguous vs. unambiguous requirements (the target task). We then compared its accuracy with machine learning classifiers such as SVM (Support Vector Machines), Logistic Regression, and Multinomial Naive Bayes. We also used back translation (BT) as a text augmentation technique to see if it improved the classification accuracy. Our results showed that ULMFiT achieved higher accuracy than SVM, Logistic Regression, and Multinomial Naive Bayes for our initial data set. Further, by augmenting requirements using BT, ULMFiT again achieved higher accuracy than SVM, Logistic Regression, and Multinomial Naive Bayes, improving on the initial performance by 5.371%. Our proposed research provides some promising insights into how transfer learning and text augmentation can be applied to small data sets in requirements engineering.
AB - Ambiguous requirements are problematic in requirements engineering, as various stakeholders can debate the interpretation of the requirements, leading to a variety of issues in the development stages. Since requirement specifications are usually written in natural language, analyzing ambiguous requirements is currently a manual process, as it has not been fully automated to meet industry standards. In this paper, we used transfer learning with ULMFiT, pre-training our model on a general-domain corpus and then fine-tuning it to classify ambiguous vs. unambiguous requirements (the target task). We then compared its accuracy with machine learning classifiers such as SVM (Support Vector Machines), Logistic Regression, and Multinomial Naive Bayes. We also used back translation (BT) as a text augmentation technique to see if it improved the classification accuracy. Our results showed that ULMFiT achieved higher accuracy than SVM, Logistic Regression, and Multinomial Naive Bayes for our initial data set. Further, by augmenting requirements using BT, ULMFiT again achieved higher accuracy than SVM, Logistic Regression, and Multinomial Naive Bayes, improving on the initial performance by 5.371%. Our proposed research provides some promising insights into how transfer learning and text augmentation can be applied to small data sets in requirements engineering.
KW - Machine learning
KW - Neural networks
KW - Requirement engineering and quality
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85106428321&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106428321&partnerID=8YFLogxK
U2 - 10.1145/3409334.3452068
DO - 10.1145/3409334.3452068
M3 - Conference contribution
AN - SCOPUS:85106428321
T3 - Proceedings of the 2021 ACMSE Conference - ACMSE 2021: The Annual ACM Southeast Conference
SP - 130
EP - 137
BT - Proceedings of the 2021 ACMSE Conference - ACMSE 2021
PB - Association for Computing Machinery, Inc
Y2 - 15 April 2021 through 17 April 2021
ER -