Search for Articles

Article

6,461 Views

17 Pages

Comparative Analysis of AI Models for Python Code Generation: A HumanEval Benchmark Study

Ali Bayram,
Gonca Gokce Menekse Dalveren and
Mohammad Derawi

Appl. Sci.2025, 15(18), 9907;https://doi.org/10.3390/app15189907

-

10 September 2025

This study conducts a comprehensive comparative analysis of six contemporary artificial intelligence models for Python code generation using the HumanEval benchmark. The evaluated models include GPT-3.5 Turbo, GPT-4 Omni, Claude 3.5 Sonnet, Claude 3....

225 Results Found

Comparative Analysis of AI Models for Python Code Generation: A HumanEval Benchmark Study

Program Code Generation with Generative AIs

Generative AI for Code Translation: A Systematic Mapping Study

A Threshold Selection Method in Code Plagiarism Checking Function for Code Writing Problem in Java Programming Learning Assistant System Considering AI-Generated Codes

Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review

Beyond Snippet Assistance: A Workflow-Centric Framework for End-to-End AI-Driven Code Generation

Evaluating Generative AI for HTML Development

Enhancing Security in Industrial Application Development: Case Study on Self-Generating Artificial Intelligence Tools

From Pilots to Practices: A Scoping Review of GenAI-Enabled Personalization in Computer Science Education

SCEditor-Web: Bridging Model-Driven Engineering and Generative AI for Smart Contract Development

Refactoring Loops in the Era of LLMs: A Comprehensive Study

EX-CODE: A Robust and Explainable Model to Detect AI-Generated Code

JorGPT: Instructor-Aided Grading of Programming Assignments with Large Language Models (LLMs)

Studying the Quality of Source Code Generated by Different AI Generative Engines: An Empirical Evaluation

AI-Driven Code Documentation: Comparative Evaluation of LLMs for Commit Message Generation

Exploring the Boundaries Between LLM Code Clone Detection and Code Similarity Assessment on Human and AI-Generated Code

Large Language Models for C Test Case Generation: A Comparative Analysis

Analysis of Concrete Air Voids: Comparing OpenAI-Generated Python Code with MATLAB Scripts and Enhancing 2D Image Processing Using 3D CT Scan Data

Comparative Analysis of Chatbots Using Large Language Models for Web Development Tasks

ChatGeoAI: Enabling Geospatial Analysis for Public through Natural Language, with Large Language Models

Challenges in Algorithmic Implementation: The FLoCIC Algorithm as a Case Study in Technology-Enhanced Computer Science Education

How Low-Code Tools Contribute to Diversity, Equity, and Inclusion (DEI) in the Workplace: A Case Study of a Large Japanese Corporation

Automated Test Generation Using Large Language Models

Contribution of Artificial Intelligence (AI) to Code-Based 3D Modeling Tasks

Assessing the Impact of Prior Coding and Artificial Intelligence Learning on Non-Computing Majors’ Perception of AI in a University Context

Impact of Adapting the Abbreviated Injury Scale (AIS)-2005 from AIS-1998 on Injury Severity Scores and Clinical Outcome

Evaluating Large Language Models in Code Generation: INFINITE Methodology for Defining the Inference Index

CNC Milling Optimization via Intelligent Algorithms: An AI-Based Methodology

Supporting Serious Game Development with Generative Artificial Intelligence: Mapping Solutions to Lifecycle Stages

ChatGPT Code Detection: Techniques for Uncovering the Source of Code

AI-Driven Innovations in Software Engineering: A Review of Current Practices and Future Directions

Use of Generative AI by Higher Education Students

Enhancing Program Synthesis with Large Language Models Using Many-Objective Grammar-Guided Genetic Programming

The Neurophysiological Paradox of AI-Induced Frustration: A Multimodal Study of Heart Rate Variability, Affective Responses, and Creative Output

The Human-Centred Design of a Universal Module for Artificial Intelligence Literacy in Tertiary Education Institutions

A Grammar of Speculation: Learning Speculative Design with Generative AI in Biodesign Education

ChatGPT: Challenges and Benefits in Software Programming for Higher Education

Code Word Cloud in Franz Kafka’s “Beim Bau der Chinesischen Mauer” [“The Great Wall of China”]

Exploring Factors Influencing Pre-Service Teachers’ Intention to Use GenAI for Instructional Design: A Grounded Theory Study

A Data-Centric AI Paradigm for Socio-Industrial and Global Challenges

Mapping Tomorrow’s Teaching and Learning Spaces: A Systematic Review on GenAI in Higher Education

AI-Enabled Sacramento Public Health (SACPH) App: A Reproducible AI-Based Method for Population-to-Practice Reasoning in Foundational Sciences in Pharmacy Education

Pushing the Limits of Large Language Models in Quantum Operations

Texture-Image-Oriented Coverless Data Hiding Based on Two-Dimensional Fractional Brownian Motion

Prevalence of POC5 Coding Variants in French-Canadian and British AIS Cohort

Creating Choropleth Maps by Artificial Intelligence—Case Study on ChatGPT-4

Artificial Intelligence in Digital Marketing: Towards an Analytical Framework for Revealing and Mitigating Bias

Writing Is Coding for Sustainable Futures: Reimagining Poetic Expression Through Human–AI Dialogues in Environmental Storytelling and Digital Cultural Heritage

Applications of Neural Network-Based AI in Cryptography

The Critical Impact and Socio-Ethical Implications of AI on Content Generation Practices in Media Organizations