Join Raymond J. Mooney, professor in the Department of Computer Science at the University of Texas at Austin, for an upcoming talk in the College of IST Distinguished Lecture Series. This lecture is free and open to the Penn State community. A brief virtual reception will take place at 3:45 p.m. with the lecture to begin at 4:00 p.m.
"Deep Learning for Automating Software Documentation Maintenance"
Applying deep learning to large open-source software repositories offers the potential to develop many useful tools for aiding software development, including automated program synthesis and documentation generation. Specifically, we have developed methods that learn to automatically update existing natural language comments based on changes to the body of code they accompany. Developers frequently forget to update comments when they change code, which is Join detrimental to the software development cycle, causing confusion and bugs. First, we use methods for "just in time" comment/code inconsistency detection which learn to recognize when changes to code render it incompatible with its existing documentation. We then learn a model that appropriately updates a comment when it is judged to be inconsistent. Our approach learns to correlate changes across two distinct language representations, generating a sequence of edits that are applied to an existing comment to reflect source code modifications. We train and evaluate our model using a large dataset collected from commit histories of open-source Java software projects, with each example consisting of an update to a method and any concurrent edit to its corresponding comment. We compare our approach against multiple baselines using both automatic metrics and human evaluation. Results reflect the challenge of this task and that our model outperforms many baselines with respect to detecting inconsistent comments and appropriately updating them.
About the Speaker
Raymond J. Mooney is a Professor in the Department of Computer Science at the University of Texas at Austin. He received his Ph.D. in 1988 from the University of Illinois at Urbana/Champaign. He is an author of over 180 published research papers, primarily in the areas of machine learning and natural language processing. He was the President of the International Machine Learning Society from 2008-2011, program co-chair for AAAI 2006, general chair for HLT-EMNLP 2005, and co-chair for ICML 1990. He is a Fellow of AAAI, ACM, and ACL and the recipient of the Classic Paper award from AAAI-19 and best paper awards from AAAI-96, KDD-04, ICML-05 and ACL-07.
About the Distinguished Lecture Series
The College of Information Sciences and Technology’s Distinguished Lecture Series connects researchers, experts, and thought leaders with the college community to share perspectives and insights on a variety of topics. Our goal is to enrich the experience of IST students, faculty, and staff by inspiring thought-provoking conversations and exposing them to a diverse array of people, backgrounds, and ideas in the information sciences and related domains.
All lectures are free and open to the Penn State community unless otherwise noted.