CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

Automatic classification of UML Class diagrams from images

Truong Ho-Quang (Institutionen för data- och informationsteknik, Software Engineering (Chalmers)) ; Michel Chaudron ; Ingimar Samúelsson ; Jóel Hjaltason ; B. Karasneh ; H. Osman
Proceedings of the 21st Asia-Pacific Software Engineering Conference, APSEC 2014 (1530-1362). Vol. 1 (2014), p. 399-406.
[Konferensbidrag, refereegranskat]

- Graphical modelling of various aspects of software and systems is a common part of software development. UML is the de-facto standard for various types of software models. To be able to research UML, academia needs to have a corpus of UML models. For building such a database, an automated system that has the ability to classify UML class diagram images would be very beneficial, since a large portion of UML class diagrams (UML CDs) is available as images on the Internet. In this study, we propose 23 image-features and investigate the use of these features for the purpose of classifying UML CD images. We analyse the performance of the features and assess their contribution based on their Information Gain Attribute Evaluation scores. We study specificity and sensitivity scores of six classification algorithms on a set of 1300 images. We found that 19 out of 23 introduced features can be considered as influential predictors for classifying UML CD images. Through the six algorithms, the prediction rate achieves nearly 96% correctness for UML-CD and 91% of correctness for non-UML CD.

Nyckelord: Classification, Feature extraction, Machine learning, Software Engineering, UML, UML class diagram

Denna post skapades 2016-01-15. Senast ändrad 2017-06-28.
CPL Pubid: 230706


Läs direkt!

Länk till annan sajt (kan kräva inloggning)

Institutioner (Chalmers)

Institutionen för data- och informationsteknik, Software Engineering (Chalmers)
Institutionen för data- och informationsteknik (GU) (GU)


Systemvetenskap, informationssystem och informatik

Chalmers infrastruktur