Perbaikan Metode Rekomendasi Diskusi Pemrograman dengan Normalisasi Identifier Menggunakan Lingua::IdSplitter

Nanang Fakhrur Rozi, Daniel Oranova Siahaan, Fajar Baskoro

Abstract


Situs tanya-jawab Stack Overflow telah sering digunakan sebagai acuan oleh programmer. Informasi atau solusi dalam proses pengembangan perangkat lunak dapat dicari dengan bantuan mesin pencari pada situs. Namun, perbedaan dalam gaya penulisan, terutama pada penulisan identifier program, sering menyebabkan rekomendasi (pencarian) menjadi tidak sesuai dengan kebutuhan programmer. Beberapa programmer menulis identifier dalam bentuk singkatan sementara yang lain tidak sehingga menurunkan kinerja rekomendasi. Penelitian ini mengadopsi Lingua::IdSplitter untuk menormalkan identifier pada data diskusi Stack Overflow. Proses normalisasi dilakukan dengan memisahkan identifier yang terdiri atas komposisi term serta memperluas singkatan yang ada pada identifier ke bentuk penuh. Hasil penelitian menunjukkan bahwa normalisasi identifier menggunakan Lingua::IdSplitter hanya mampu meningkatkan performa sistem rekomendasi ketika identifier dengan unsur singkatan mendominasi pada data diskusi.

References


Cegielski, C. G. and Hall, D. J., 2006. What Makes A Good Programmer? Communications of the ACM, 49(10), pp.73-75.

LaToza, T. D., Venolia, G., and DeLine, R., 2006. Maintaining Mental Models: A Study of Developer Work Habits. Proceedings of the 28th international conference on Software engineering, ACM, pp.492-501.

Anderson, A., Huttenlocher, D., Kleinberg, J., and Leskovec, J., 2012. Discovering Value from Community Activity on Focused Question Answering Sites: A Case Study of Stack Overflow. Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp.850-858.

Manning, C. D., Raghavan, P., and Schütze, H., 2008. Introduction to Information Retrieval. Cambridge: Cambridge University Press.

Cordeiro, J., Antunes, B., and Gomes, P., 2012. Context-Based Recommendation to Support Problem Solving in Software Development. Recommendation Systems for Software Engineering (RSSE), 2012 Third International Workshop, IEEE, pp.85-89.

Ponzanelli, L., Bacchelli, A., and Lanza, M., 2013. Leveraging Crowd Knowledge for Software Comprehension and Development. Software Maintenance and Reengineering (CSMR), 2013 17th European Conference, IEEE, pp.57-66.

Krauthammer, M. and Nenadic, G., 2004. Term identification in the biomedical literature. Journal of biomedical informatics, 37(6), pp.512-526.

Arwan, A., Rochimah, S., and Akbar, R. J., 2015. Source Code Retrieval on StackOverflow Using LDA. Information and Communication Technology (ICoICT), 2015 3rd International Conference, IEEE, pp.295-299.

Carvalho, N. R., Almeida, J. J., Henriques, P. R., and Varanda, M. J., 2015. From Source Code Identifiers to Natural Language Terms. Journal of Systems and Software, 100, pp.117-128.

Lawrie, D., Morrell, C., Feild, H., and Binkley, D., 2007. Effective identifier names for comprehension and memory. Innovations in Systems and Software Engineering, 3(4), pp.303-318.

Hu, D. J., 2009. Latent Dirichlet Allocation for Text, Images, and Music. University of California, San Diego.

Adomavicius, G. and Tuzhilin, A., 2005. Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions. Knowledge and Data Engineering, IEEE Transaction, 17(6), pp.734-749.

Robillard, M. P., Walker, R. J., and Zimmermann, T., 2010. Recommendation Systems for Software Engineering. Software, IEEE, 27(4), pp.80-86.

Gosling, J., Joy, B., Steele, G., Bracha, G., and Buckley, A., 2015. Chapter 3. Lexical Structure, (online), (https://docs.oracle.com/javase/specs/jls/se8/html/jls-3.html, diakses 20 Juli 2016).

Vohra, D., Baesens, B., Backiel, A. and Vanden Broucke, S., 2015. Beginning Java Programming: The Object-oriented Approach. John Wiley & Sons.

Sun Microsystems, 1997. Java Code Conventions. Sun Microsystems, Inc.

Mahmoud, A. and Niu, N., 2011. Source Code Indexing for Automated Tracing. Proceedings of the 6th International Workshop on Traceability in Emerging Forms of Software Engineering, ACM, pp.3-9.

Jivani, A. G., 2011. A Comparative Study of Stemming Algorithms. Int. J. Comp. Tech. Appl, 2(6), pp.1930-1938.




DOI: https://doi.org/10.31284/j.iptek.2016.v20i2.42

Refbacks

  • There are currently no refbacks.