Software Development Made Easier: U of T Researcher Improves Source Code Summarization
Eldan Cohen, an assistant professor at the University of Toronto, is spearheading a research team dedicated to enhancing the laborious process of summarizing source code. By developing human-centered machine learning algorithms, Cohen aims to automatically condense snippets of code into clear and concise language, providing a more efficient and effective approach to software development.
Code summarization plays a crucial role in understanding and managing software projects. It offers developers insights into the purpose and functionality of code, facilitating maintenance and collaboration in large-scale projects. Leveraging natural language processing techniques and machine learning, Cohen and his team are working towards automating this process and revolutionizing the way developers work with code.
Existing AI models have made strides in generating natural language summaries of code but often fall short in accuracy and relevance. Cohen acknowledges the limitations and proposes a human-in-the-loop technique to refine automated code summarization. By incorporating developers’ knowledge, preferences, and insights, this approach allows software engineers to actively participate in the generation of code summaries. It creates a feedback loop between developers and machine learning algorithms, enabling the system to learn from mistakes and produce higher-quality summaries.
To address the shortcomings of existing approaches, Cohen is also developing specialized machine learning algorithms. By introducing interactive methods that present developers with a selection of diverse and high-quality code summaries to choose from, the risk of generating incorrect summaries is significantly reduced. This approach ensures that developers have a range of suitable options available, tailored to their specific needs.
The ultimate goal of Cohen’s research is to enhance the effectiveness of automatic source code summarization. By implementing human-in-the-loop techniques and leveraging developer insights, Cohen aims to improve the quality of code summaries generated by state-of-the-art deep learning models.
Apart from its potential impact on software engineering, Cohen’s work is expected to have significant scholarly implications. It has the potential to spur further research and commercial activity, particularly in the domain of human-in-the-loop automation in software development.
Supported by the Connaught New Researcher Awards, Cohen and his team have actively involved students in all stages of the project, providing them with valuable research experience. The funds from the award will primarily support the students’ involvement and contribute to the advancement of the research.
Cohen’s research is one of four projects from U of T Engineering to receive support from the Connaught New Researcher Awards. This recognition highlights the university’s commitment to fostering innovative research and facilitating the growth of early-career faculty members.
With the development of human-centered machine learning algorithms and the integration of developer insights, Eldan Cohen and his team are poised to reshape the landscape of source code summarization. Their work promises to streamline software development processes, ultimately making them more accessible, efficient, and cost-effective.