Please use this identifier to cite or link to this item:
https://ruomoplus.lib.uom.gr/handle/8000/2040| Title: | A Comparison of the Effectiveness of ChatGPT and Co-Pilot for Generating Quality Python Code Solutions | Authors: | Nikolaidis, Nikolaos Flamos, Karolos Gulati, Khanak Feitosa, Daniel Ampatzoglou, Apostolos Chatzigeorgiou, Alexander |
Author Department Affiliations: | Department of Applied Informatics Department of Applied Informatics Department of Applied Informatics |
Author School Affiliations: | School of Information Sciences School of Information Sciences |
Subjects: | FRASCATI__Natural sciences__Computer and information sciences FRASCATI__Engineering and technology__Electrical engineering, Electronic engineering, Information engineering |
Keywords: | code quality GitHub Copilot OpenAI ChatGPT |
Issue Date: | 15-Aug-2024 | Publisher: | IEEE | Volume Title: | 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering - Companion (SANER-C) | Start page: | 93 | End page: | 101 | Conference: | 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering - Companion (SANER-C) | Abstract: | Artificial intelligence (AI) has become increasingly popular in software development to automate tasks and improve efficiency. AI has the potential to help while developing or maintaining software, in the sense that it can produce solutions out of a textual requirement specification, and understand code to provide suggestion on how a new requirement could be implemented. In this paper, we focus on the first scenario. Two AI-powered tools that have the potential to revolutionize the way software is developed are OpenAI’s ChatGPT and GitHub’s Copilot. In this paper, we used LeetCode, a popular platform for technical interview preparation and personal upskilling (self-learning), to evaluate the effectiveness of ChatGPT and Copilot on a set of coding problems, along with ChatGPT’s ability to correct itself when provided with feedback. The analysis of the effectiveness can lead to various conclusions, such as on if these solutions are ready to take over coding roles, and to what extent several parameters (difficulty and quality requirements) influence this result. Solutions have been generated for 60 problems using ChatGPT and Copilot, for the Python programming language. We investigated the performance of the models, the recurrent kinds of errors, and the resulting code quality. The evaluation revealed that ChatGPT and Copilot can be effective tools for generating code solutions for easy problems while both models are prone to syntax and semantic errors. Small improvements are observed for ode quality metrics across iterations, although the improvement pattern is not consistently monotonic, questioning ChatGPT’s awareness of the quality of its own solutions. Nevertheless, the improvement that was found along iterations, highlights the potential of AI and humans, acting as partners, in providing the optimal combination. The two models demonstrate a limited capacity for understanding context. Although AI-powered coding tools driven by large language models have the potential to assist developers in their coding tasks, they should be used with caution and in conjunction with human coding expertise. Developer intervention is necessary not only to debug errors but also to ensure high-quality and optimized code. |
URI: | https://ruomoplus.lib.uom.gr/handle/8000/2040 | ISBN: | [9798350351576] | DOI: | 10.1109/SANER-C62648.2024.00018 | Rights: | CC0 1.0 Παγκόσμια Attribution-NonCommercial-NoDerivatives 4.0 Διεθνές |
Corresponding Item Departments: | Department of Applied Informatics Department of Applied Informatics Department of Applied Informatics |
| Appears in Collections: | Conference proceedings |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| nikolaidis2024saner.pdf | 480,77 kB | Adobe PDF | View/Open |
SCOPUSTM
Citations
7
checked on Apr 16, 2026
Page view(s)
125
checked on Apr 18, 2026
Download(s)
57
checked on Apr 18, 2026
Google ScholarTM
Check
Altmetric
Altmetric
This item is licensed under a Creative Commons License