CPL - Chalmers Publication Library

Using Machine Learning to Design a Flexible LOC Counter

M. Ochodek ; Miroslaw Staron ; D. Bargowski ; Wilhelm Meding (Department of Computer Science and Engineering (Chalmers)) ; R. Hebig
2017 IEEE International Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE) p. 14-20. (2017)
[Conference paper, peer-reviewed]

The result of counting the size of a program in Lines of Code (LOC) depends on the counting rules, i.e. the definition of which lines should be counted. In the majority of measurement tools these rules are hard-coded, and the users do not know which lines were counted and which were not. The goal of our research is to investigate how to use machine learning to teach a measurement tool which lines should be counted and which should not. Our interest is to identify which parameters of the learning algorithm can be used to classify lines to be counted. Our research follows the design science research methodology: we construct a measurement tool based on machine learning and evaluate it on open source programs. To build the training set, industry professionals classify which lines should be counted. The results show that classifying lines as to be counted or not has an average accuracy varying between 0.90 and 0.99 measured as the Matthews Correlation Coefficient, and between 95% and nearly 100% measured as the percentage of correctly classified lines. Based on the results we conclude that using machine learning algorithms as the core of modern measurement instruments has great potential and should be explored further.
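The two evaluation measures named in the abstract can be illustrated with a short sketch. It is not the authors' tool or evaluation code. It only shows how the Matthews Correlation Coefficient and the percentage of correctly classified lines are computed from binary "count / do not count" labels. The function names and the example labels are hypothetical.

```python
import math


def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for binary line labels.

    True means "this line should be counted", False means it should not.
    """
    tp = tn = fp = fn = 0
    for a, p in zip(actual, predicted):
        if a and p:
            tp += 1
        elif not a and not p:
            tn += 1
        elif p and not a:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of correctly classified lines."""
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 0.0


def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient, in the range [-1, 1].

    Returns 0.0 when the denominator is zero, a common convention.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Hypothetical labels: what a professional marked vs. what a classifier predicted.
    actual = [True, True, True, False, False, True, False, True]
    predicted = [True, True, False, False, False, True, True, True]
    counts = confusion_counts(actual, predicted)
    print("accuracy:", accuracy(*counts))
    print("MCC:", round(mcc(*counts), 4))
```

Unlike the plain percentage of correct lines, MCC also accounts for how the errors are distributed over the two classes, which matters when most lines in a file belong to one class.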

Keywords: software size estimation



This record was created 2017-07-13.
CPL Pubid: 250689

 

Departments (Chalmers)

Department of Computer Science and Engineering (GU)
Department of Computer Science and Engineering (Chalmers)

Subject areas

Language technology (computational linguistics)

Chalmers infrastructure