Evaluating Generative AI for HTML Development
Abstract
1. Introduction
2. Related Work
3. Conceptual Framework
4. Methodology
- HTML Generation
- 2.
- Evaluation ProcessThe generated HTML code was assessed using a comprehensive rubric covering six core dimensions: validation, semantic accuracy, accessibility, efficiency, readability, and SEO optimization. This process consists of sub-processes to evaluate as follows:
- (1)
- HTML Validation
The generated code was tested using the W3C Markup Validation Service to identify errors and warnings, ensuring compliance with established HTML standards.- (2)
- Semantic Accuracy Check
We employed Google Lighthouse (via Chrome DevTools) to assess the semantic structure and proper usage of HTML5 elements.- (3)
- Accessibility Compliance Check
Accessibility was evaluated according to WCAG 2.1 guidelines, using automated tools.- (4)
- Efficiency Evaluation
Efficiency was analyzed across three sub-metrics:- (i)
- Lines of Code (LoC): Measuring code length to assess verbosity.
Note: LoC counts were taken after ignoring blank lines and pure closing tags such as </div>. HTML was evaluated in the same format as generated by each model, without additional minification, ensuring that all counts were based on consistent output conditions.- (ii)
- Redundancy Check: Identifying repeated or unnecessary markup.
- (iii)
- DOM Depth: Evaluating the nesting structure to detect overly complex hierarchies.
- (5)
- Readability Check
Code readability was examined through the following:- (i)
- Indentation Quality: Consistency and clarity of visual structure.
- (ii)
- Commenting: Measured by the number of code comments present in the HTML output, if any.
- (6)
- SEO Evaluation
- A.
- Proposed Redundancy Metrics:
- 1.
- Style Redundancy Score: Count duplicate CSS properties (e.g., repeated padding, margin, color).
CSS Redundancy (%) = (Number of duplicate properties)/(Total unique properties) × 100CSS Score = 100 − CSS Redundancy (%)- 2.
- HTML Structure Redundancy: Count unnecessary wrapper elements.
HTML Redundancy (%) = (Redundant wrappers)/(Total structural tags) × 100HTML Score = 100 − HTML Redundancy (%)Redundancy Score: (CSS Score + HTML Score)/2 - B.
- Proposed length of code after ignoring blank lines and pure closing tags like </div>:LoC (%) = (LoC − Lowest LoC)/(Highest LoC − Lowest LoC) × 100LoC Score = 100 − LoC (%)
- C.
- Proposed DOM Depth score:DOM Depth (%) = ((Highest DOM Depth − Lowest DOM Depth)/(DOM Depth − Lowest DOM Depth)) ×100DOM Depth Score = 100 − DOM Depth (%)
- D.
- Proposed Indentation Quality score:Indentation Error (%) = ((Highest Errors − Lowest Errors)/(Indentation Errors − Lowest Errors)) ×100Indentation Quality Score = 100 − Indentation Error (%)
- E.
- Proposed Commenting score:
5. Results
- Generate a valid, well-structured HTML webpage for a [specific use case]. The HTML must follow best practices and include proper semantic elements. The page should be structured as follows: [The structure from Table 2].
6. Discussion
7. Limitations
8. Conclusions & Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Sauvola, J.; Tarkoma, S.; Klemettinen, M.; Riekki, J.; Doermann, D. Future of software development with generative AI. Autom. Softw. Eng. 2024, 31, 26. [Google Scholar] [CrossRef]
- Abu Doush, I.; Kassem, R. Can generative AI create accessible web code? A benchmark analysis of AI-generated HTML against accessibility standards. Univers. Access Inf. Soc. 2025, 1–24. [Google Scholar] [CrossRef]
- AlDahoul, N.; Hong, J.; Varvello, M.; Zaki, Y. Towards a World Wide Web powered by generative AI. Sci. Rep. 2025, 15, 7251. [Google Scholar] [CrossRef] [PubMed]
- Palmer, Z.B.; Oswal, S.K. Constructing websites with generative AI tools: The accessibility of their workflows and products for users with disabilities. J. Bus. Tech. Commun. 2025, 39, 93–114. [Google Scholar] [CrossRef]
- Feuerriegel, S.; Hartmann, J.; Janiesch, C.; Zschech, P. Generative ai. Bus. Inf. Syst. Eng. 2024, 66, 111–126. [Google Scholar] [CrossRef]
- Fui-Hoon Nah, F.; Zheng, R.; Cai, J.; Siau, K.; Chen, L. Generative AI and ChatGPT: Applications, challenges, and AI-human collaboration. J. Inf. Technol. Case Appl. Res. 2023, 25, 277–304. [Google Scholar] [CrossRef]
- Lim, C.V.; Zhu, Y.-P.; Omar, M.; Park, H.-W. Decoding the relationship of artificial intelligence, advertising, and generative models. Digital 2024, 4, 244–270. [Google Scholar] [CrossRef]
- Binhammad, M.H.Y.; Othman, A.; Abuljadayel, L.; Al Mheiri, H.; Alkaabi, M.; Almarri, M. Investigating how generative AI can create personalized learning materials tailored to individual student needs. Creat. Educ. 2024, 15, 1499–1523. [Google Scholar] [CrossRef]
- Al-Ahmad, A.; Kahtan, H.; Tahat, L.; Tahat, T. Enhancing Software Engineering with AI: Key Insights from ChatGPT. In Proceedings of the 2024 International Conference on Decision Aid Sciences and Applications (DASA), Manama, Bahrain, 11–12 December 2024; pp. 1–5. [Google Scholar]
- Goldstein, J.A.; Sastry, G.; Musser, M.; DiResta, R.; Gentzel, M.; Sedova, K. Generative language models and automated influence operations: Emerging threats and potential mitigations. arXiv 2023, arXiv:2301.04246. [Google Scholar] [CrossRef]
- Yadav, S.S.; Kumar, Y. Generative AI in Streamlining Software Development Life Cycles Within Programming Environments. Int. J. Sci. Innov. Eng. 2024, 1, 29–36. [Google Scholar]
- EliteBrains. AI-Generated Code Statistics 2025. Available online: https://www.elitebrains.com/blog/aI-generated-code-statistics-2025 (accessed on 25 May 2025).
- Novet, J. Satya Nadella Says as Much as 30% of Microsoft Code Is Written by AI. Available online: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-as-30percent-of-microsoft-code-is-written-by-ai.html (accessed on 30 April 2025).
- Overflow, S. Stack Overflow Annual Developer Survey 2024: AI Results. Available online: https://survey.stackoverflow.co/2024/ai (accessed on 12 January 2025).
- AIPRM. AI Statistics: Latest Trends, Usage, and Adoption Rates. Available online: https://www.aiprm.com/ai-statistics/ (accessed on 23 May 2025).
- Kerimbayev, N.; Menlibay, Z.; Garvanova, M.; Djaparova, S.; Jotsov, V. A Comparative Analysis of Generative AI Models for Improving Learning Process in Higher Education. In Proceedings of the 2024 International Conference Automatics and Informatics (ICAI), Varna, Bulgaria, 10–12 October 2024; pp. 271–276. [Google Scholar]
- Cho, K.; Park, Y.; Kim, J.; Kim, B.; Jeong, D. Conversational AI forensics: A case study on ChatGPT, Gemini, Copilot, and Claude. Forensic Sci. Int. Digit. Investig. 2025, 52, 301855. [Google Scholar] [CrossRef]
- Reyero Lobo, P.; Daga, E.; Alani, H.; Fernandez, M. Semantic Web technologies and bias in artificial intelligence: A systematic literature review. Semant. Web 2023, 14, 745–770. [Google Scholar] [CrossRef]
- Lizcano, D.; Martínez-Ortíz, A.L.; López, G.; Grignard, A. End-user modeling of quality for web components. J. Softw. Evol. Process 2023, 35, e2256. [Google Scholar] [CrossRef]
- Li, T.; Huang, R.; Cui, C.; Towey, D.; Ma, L.; Li, Y.-F.; Xia, W. A Survey on Web Application Testing: A Decade of Evolution. arXiv 2024, arXiv:2412.10476. [Google Scholar] [CrossRef]
- Mahon, J.; Mac Namee, B.; Becker, B.A. Guidelines for the Evolving Role of Generative AI in Introductory Programming Based on Emerging Practice. In Proceedings of the 2024 on Innovation and Technology in Computer Science Education V. 1, Milan, Italy, 8–10 July 2024; pp. 10–16. [Google Scholar]
- Devadas, R.M.; Hiremani, V.; Gujjar, P.; Sapna, R. Unveiling the Potential of Generative Approaches in AI-Infused Web Development for Design, Testing, and Maintenance. In Generative AI for Web Engineering Models; IGI Global: Hershey, PA, USA, 2025; pp. 107–128. [Google Scholar]
- Augustine, N.; Md, A.B.; Sultan, M.H.O.; Sharif, K.Y.; Sokolova, Y.; Katunina, O.; Pysarenko, N.; Klimova, I.; Kovalchuk, O.; Parikesit, B.A.B. Enhancing SQL injection (SQLi) mitigation by removing malicious SQL parameter values using long short-term memory (LSTM) neural networks. J. Theor. Appl. Inf. Technol. 2025, 103, 3070–3083. [Google Scholar]
- Islam, F. AI Driven Development of Modern Web Application. Bachelor’s Thesis, Tampere University of Applied Sciences, Tampere, Finland, 2024. [Google Scholar]
- APVisor. Choosing the Right AI Tool: A Comparison of DeepSeek, Claude, ChatGPT, Gemini, Copilot, and Perplexity. Available online: https://apvisor.com/?p=117 (accessed on 30 April 2025).
- Auger, T.; Saroyan, E. Generative AI for Web Development: Building Web Applications Powered by OpenAI APIs and Next.js; Springer Nature: Berlin/Heidelberg, Germany, 2024. [Google Scholar]
- Nettur, S.B.; Karpurapu, S.; Nettur, U.; Gajja, L.S. Cypress copilot: Development of an AI assistant for boosting productivity and transforming web application testing. IEEE Access 2024, 13, 3215–3229. [Google Scholar] [CrossRef]
- Tsuchiya, M. Performance impact caused by hidden bias of training data for recognizing textual entailment. arXiv 2018, arXiv:1804.08117. [Google Scholar] [CrossRef]
- ISO (2011) ISO/IEC 25010; Systems and Software Engineering—Systems and Software Quality—Requirements and Evaluation (SQuaRE)—System and Software Quality Models. International Organization for Standardization: Geneva, Switzerland, 2011. Available online: https://www.iso.org/standard/35733.html (accessed on 3 April 2025).
- Vinčević, V.; Zajmović, M. Quality models and web application validation methods. Ann. Fac. Eng. Hunedoara-Int. J. Eng. 2024, 22, 13–21. [Google Scholar]
- Preston, M.; Rezwana, J. The Accessibility Landscape of Co-Creative AI Systems: Analysis, Insights and Recommendations. In Proceedings of the ACM IUI Workshops 2025, Cagliari, Italy, 24–27 March 2025. [Google Scholar]
- Web Content Accessibility Guidelines (WCAG) 2.1, World Wide Web Consortium Recommendation. Available online: https://www.w3.org/TR/WCAG21/ (accessed on 8 September 2025).
- Gupta, V.; Singh, H. Web Content Accessibility Evaluation of Universities’ Websites-A Case Study for Universities of Punjab State in India. In Proceedings of the 2021 8th International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 17–19 March 2021; pp. 546–550. [Google Scholar]
- Król, K. Between Truth and Hallucinations: Evaluation of the Performance of Large Language Model-Based AI Plugins in Website Quality Analysis. Appl. Sci. 2025, 15, 2292. [Google Scholar] [CrossRef]
- Macakoğlu, Ş.S.; Peker, S.; Medeni, İ.T. Accessibility, usability, and security evaluation of universities’ prospective student web pages: A comparative study of Europe, North America, and Oceania. Univers. Access Inf. Soc. 2023, 22, 671–683. [Google Scholar] [CrossRef]
- Margea, R. Content Management Systems for Institutional Academic Websites. The Feaa Timişoara Experience. Ann. ‘Constantin Brancusi’ Univ. Targu-Jiu 2017, 2, 41–51. Available online: https://www.utgjiu.ro/revista/ec/pdf/2017-02/05_Margea.pdf (accessed on 25 April 2025).
- Kim, M.C. Learning Early-Stage Web Development at Scale: Exploring Methods to Assess Learning Through Analysis of HTML and CSS. Ph.D. Thesis, Drexel University, Philadelphia, PA, USA, 2021. [Google Scholar]
- Nuñez-Varela, A.S.; Pérez-Gonzalez, H.G.; Martínez-Perez, F.E.; Soubervielle-Montalvo, C. Source code metrics: A systematic mapping study. J. Syst. Softw. 2017, 128, 164–197. [Google Scholar] [CrossRef]
- Hague, M.; Lin, A.W.; Ong, C.-H.L. Detecting redundant CSS rules in HTML5 applications: A tree rewriting approach. In Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, Pittsburgh, PA, USA, 25–30 October 2015; pp. 1–19. [Google Scholar]
- Deng, X.; Shiralkar, P.; Lockard, C.; Huang, B.; Sun, H. Dom-lm: Learning generalizable representations for html documents. arXiv 2022, arXiv:2201.10608. [Google Scholar] [CrossRef]
- Moreno-Lumbreras, D. Enhancing HTML Structure Comprehension: Real-Time 3D/XR Visualization of the DOM. In Proceedings of the 2024 IEEE Working Conference on Software Visualization (VISSOFT), Flagstaff, AZ, USA, 6–7 October 2024; pp. 127–132. [Google Scholar]
- Buse, R.P.; Weimer, W.R. Learning a metric for code readability. IEEE Trans. Softw. Eng. 2009, 36, 546–558. [Google Scholar] [CrossRef]
- Elnaggar, M. Measuring the Readability of Web Documents. Bachelor’s Thesis, Duisburg-Essen University, Duisburg, Germany, 7 September 2023. [Google Scholar]
- Musciano, C.; Kennedy, B. HTML & XHTML: The Definitive Guide; O’Reilly Media, Inc.: Santa Rosa, CA, USA, 2002. [Google Scholar]
- Hanenberg, S.; Morzeck, J.; Werger, O.; Gries, S.; Gruhn, V. Indentation and Reading Time: A Controlled Experiment on the Differences Between Generated Indented and Non-indented JSON Objects. In Proceedings of the International Conference on Software Technologies, Rome, Italy, 10–12 July 2023; pp. 50–75. [Google Scholar]
- Tavosi, M.; Naghshineh, N. Google SEO score and accessibility rank on the American University Libraries’ websites: One comparative analysis. Inf. Discov. Deliv. 2023, 51, 241–251. [Google Scholar] [CrossRef]
- Csontos, B.; Heckl, I. Five years of changes in the accessibility, usability, and security of Hungarian government websites. Univers. Access Inf. Soc. 2025, 24, 2757–2781. [Google Scholar] [CrossRef]
- Brown, D.M. Communicating Design: Developing Web Site Documentation for Design and Planning; New Riders: Los Angeles, CA, USA, 2010. [Google Scholar]
- Zuga-Divre, L. Creating a Modern and Responsive Customer Tracking Dashboard. 2016. Available online: https://www.theseus.fi/handle/10024/119779 (accessed on 25 April 2025).
- Hristov, I. Web Design for Small Businesses: Enhancing User Experience. 2024. Available online: https://www.theseus.fi/handle/10024/869156 (accessed on 20 April 2025).
Metric | Tool(s) | Details/How It’s Measured |
---|---|---|
HTML Validation | W3C Markup Validation Service (https://validator.w3.org accessed on 15 April 2025) [33] | Number of errors and warnings. Types of common issues (missing tags, incorrect nesting, obsolete elements). |
Semantic Accuracy | Lighthouse (Chrome DevTools) [34,35] | Checks semantic errors and warnings. Best Practices score |
Accessibility Compliance | Lighthouse (Chrome DevTools) [34,35] | Accessibility score. |
Efficiency | Lines of Code (LoC) [36,37,38] | Count the total number of lines. Count the total number of tags used. |
Redundancy Check [39] | Count the number of repeated code blocks (e.g., repeated styles or elements that could be simplified). | |
DOM Depth [40,41] | Measure the maximum depth of the DOM tree (easy via browser DevTools). Shallow DOM = more efficient. | |
Readability | Indentation Quality [42] | Check if proper consistent indentation is used (spaces/tabs per level). |
Commenting [43,44,45] | Count the number of code comments (if any). Comments in generated HTML are rare, but if they exist = plus. | |
SEO Evaluation | Lighthouse (Chrome DevTools) [46,47] | Presence of meta tags. Correct heading structure (h1, h2, etc.). Link text meaningfulness. |
Case Study | The Structure |
---|---|
A simple contact form | Fields for name, email, message, and a submit button Link to home page Logo Map for the address General details about the website |
A navigation menu | Logo Horizontal navigation bar with links (e.g., Home, About, Services, Contact) Dropdown menu under one link (e.g., Services) Responsive design |
A blog post layout | Blog title Author name and post date Featured image Blog content with headings, paragraphs, and images Comment section at the bottom |
A product listing page | Grid of products Each product with image, name, price, and “Add to Cart” button Filter/sort options (e.g., by category or price) Pagination controls at the bottom |
A dashboard layout | Sidebar navigation (e.g., Dashboard, Profile, Settings) Top navigation bar with user profile picture and logout button Main area with cards showing KPIs (e.g., Sales, Users, Revenue) Embedded charts or graphs Notifications panel |
Case | Category | ChatGPT | DeepSeek | Gemini | Copilot | Claude | |
---|---|---|---|---|---|---|---|
1 | HTML Validation | 1—Warning: Section lacks heading. | No errors or warnings. | 1—Error: Bad value 100% for attribute width on element iframe: Expected a digit but saw % instead. | No errors or warnings. | No errors or warnings. | |
Semantic | Checks semantic errors and warnings. | 1 <frame> or <iframe> elements do not have a title. | 1 <frame> or <iframe> elements do not have a title. | 1 <frame> or <iframe> elements do not have a title. | 1 <frame> or <iframe> elements do not have a title. | No errors or warnings. | |
Best Practices score | 74 | 74 | 74 | 74 | 96 | ||
Accessibility | 92 | 96 | 92 | 92 | 96 | ||
Efficiency | Code Length | 100 | 73.42 | 65.82 | 83.54 | 0 | |
Redundancy Check | 97.22 | 87.12 | 84 | 94.38 | 75.67 | ||
DOM Depth | 100 | 50 | 50 | 100 | 0 | ||
Readability | Indentation Quality | 40 | 80 | 0 | 60 | 100 | |
Commenting | 0 | 0 | 0 | 0 | 0 | ||
SEO Optimization | 91 | 100 | 91 | 91 | 91 | ||
2 | HTML Validation | 1—Warning: Trailing slash on void elements has no effect and interacts badly with unquoted attribute values. | No errors or warnings. | No errors or warnings. | No errors or warnings. | No errors or warnings. | |
Semantic | Checks semantic errors and warnings. | No errors or warnings. | No errors or warnings. | No errors or warnings. | No errors or warnings. | No errors or warnings. | |
Best Practices score | 96 | 78 | 78 | 96 | 96 | ||
Accessibility | 100 | 100 | 100 | 100 | 100 | ||
Efficiency | Code Length | 85.2 | 0 | 63 | 100 | 74.1 | |
Redundancy Check | 88.3 | 92.9 | 86.3 | 92.2 | 82.6 | ||
DOM Depth | 50 | 0 | 50 | 100 | 50 | ||
Readability | Indentation Quality | 40 | 100 | 0 | 60 | 20 | |
Commenting | 0 | 100 | 0 | 0 | 0 | ||
SEO Optimization | 90 | 90 | 91 | 90 | 90 | ||
3 | HTML Validation | 1—Warning: Consider using the h1 element as a top-level heading only (all h1 elements are treated as top-level headings by many screen readers and other tools). 2—Warning: Trailing slash on void elements has no effect and interacts badly with unquoted attribute values. | 1—Error: End of file seen when looking for tag name. Ignoring tag. 2—Error: End of file seen, and there were open elements. 3—Error: Unclosed element form. 4—Error: Unclosed element section. 5—Error: Unclosed element main. | No errors or warnings. | No errors or warnings. | 1—Warning: Consider using the h1 element as a top-level heading only (all h1 elements are treated as top-level headings by many screen readers and other tools). | |
Semantic | Checks semantic errors and warnings. | No errors or warnings. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | |
Best Practices score | 96 | 96 | 78 | 96 | 96 | ||
Accessibility | 100 | 96 | 95 | 96 | 96 | ||
Efficiency | Code Length | 77.0 | 23.0 | 100 | 87.8 | 0 | |
Redundancy Check | 90.3 | 84.1 | 95.5 | 93.2 | 83.4 | ||
DOM Depth | 66.7 | 33.3 | 100 | 100 | 0 | ||
Readability | Indentation Quality | 20 | 100 | 0 | 40 | 60 | |
Commenting | 0 | 0 | 100 | 0 | 0 | ||
SEO Optimization | 91 | 100 | 91 | 91 | 91 | ||
4 | HTML Validation | 1—Warning: Trailing slash on void elements has no effect and interacts badly with unquoted attribute values. | No errors or warnings. | 1—Warning: Section lacks heading. Consider using h2–h6 elements to add identifying headings to all sections, or else use a div element instead for any cases where no heading is needed. | 1—Warning: Section lacks heading. Consider using h2–h6 elements to add identifying headings to all sections, or else use a div element instead for any cases where no heading is needed. | 13 | |
Semantic | Checks semantic errors and warnings. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | |
Best Practices score | 96 | 96 | 96 | 96 | 96 | ||
Accessibility | 96 | 96 | 94 | 94 | 96 | ||
Efficiency | Code Length | 0 | 100 | 90.6 | 64.9 | 0 | |
Redundancy Check | 83.7 | 95.6 | 93.5 | 90.1 | 86.4 | ||
DOM Depth | 0 | 100 | 100 | 50 | 0 | ||
Readability | Indentation Quality | 80 | 40 | 60 | 20 | 100 | |
Commenting | 60 | 0 | 0 | 100 | 100 | ||
SEO Optimization | 91 | 91 | 91 | 91 | 91 | ||
5 | HTML Validation | 1—Warning: Trailing slash on void elements has no effect and interacts badly with unquoted attribute values. | No errors or warnings. | 1—Warning: Trailing slash on void elements has no effect and interacts badly with unquoted attribute values. | No errors or warnings. | 8 | |
Semantic | Checks semantic errors and warnings. | Background and foreground colors do not have a sufficient contrast ratio. | Background and foreground colors do not have a sufficient contrast ratio. | Buttons do not have an accessible name. | Heading elements are not in a sequentially descending order | Background and foreground colors do not have a sufficient contrast ratio. | |
Best Practices score | 96 | 74 | 96 | 96 | 96 | ||
Accessibility | 96 | 92 | 88 | 98 | 95 | ||
Efficiency | Code Length | 84.9 | 0 | 44.6 | 100 | 0 | |
Redundancy Check | 90.4 | 90.9 | 83.2 | 100 | 94.8 | ||
DOM Depth | 75 | 25 | 50 | 100 | 0 | ||
Readability | Indentation Quality | 80 | 100 | 0 | 60 | 90 | |
Commenting | 0 | 0 | 0 | 0 | 0 | ||
SEO Optimization | 91 | 91 | 91 | 91 | 91 |
Case | ChatGPT | DeepSeek | Gemini | Copilot | Claude |
---|---|---|---|---|---|
1 | 1 warning | 0 | 1 error | 0 | 0 |
2 | 1 warning | 0 | 0 | 0 | 0 |
3 | 1 warning | 1 error | 2 errors | 0 | 0 |
4 | 1 warning | 0 | 1 warning | 1 warning | 0 |
5 | 1 warning | 0 | 1 warning | 0 | 0 |
Case | ChatGPT | DeepSeek | Gemini | Copilot | Claude |
---|---|---|---|---|---|
1 | 92 | 96 | 92 | 92 | 96 |
2 | 100 | 100 | 100 | 100 | 100 |
3 | 100 | 96 | 95 | 96 | 96 |
4 | 96 | 96 | 94 | 94 | 96 |
5 | 96 | 92 | 88 | 98 | 95 |
Case | ChatGPT | DeepSeek | Gemini | Copilot | Claude |
---|---|---|---|---|---|
1 | 100 | 50 | 50 | 100 | 0 |
2 | 50 | 0 | 50 | 100 | 50 |
3 | 66.7 | 33.3 | 100 | 100 | 0 |
4 | 0 | 100 | 100 | 50 | 0 |
5 | 75 | 25 | 50 | 100 | 0 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alahmad, A.S.; Kahtan, H. Evaluating Generative AI for HTML Development. Technologies 2025, 13, 445. https://doi.org/10.3390/technologies13100445
Alahmad AS, Kahtan H. Evaluating Generative AI for HTML Development. Technologies. 2025; 13(10):445. https://doi.org/10.3390/technologies13100445
Chicago/Turabian StyleAlahmad, Ahmad Salah, and Hasan Kahtan. 2025. "Evaluating Generative AI for HTML Development" Technologies 13, no. 10: 445. https://doi.org/10.3390/technologies13100445
APA StyleAlahmad, A. S., & Kahtan, H. (2025). Evaluating Generative AI for HTML Development. Technologies, 13(10), 445. https://doi.org/10.3390/technologies13100445