Next Article in Journal
Mathematical Modeling and Simulation of a Gas Emission Source Using the Network Simulation Method
Next Article in Special Issue
Comparing Deep-Learning Architectures and Traditional Machine-Learning Approaches for Satire Identification in Spanish Tweets
Previous Article in Journal
A Robust and Reversible Watermarking Algorithm for a Relational Database Based on Continuous Columns in Histogram
Previous Article in Special Issue
Author Identification Using Chaos Game Representation and Deep Learning
Article

AlgoLabel: A Large Dataset for Multi-Label Classification of Algorithmic Challenges

Department of Computer Science and Engineering, Faculty for Automatic Control and Computers, University Politehnica of Bucharest Splaiul Independentei 313, Sector 6, 060042 Bucharest, Romania
*
Author to whom correspondence should be addressed.
Mathematics 2020, 8(11), 1995; https://doi.org/10.3390/math8111995
Received: 5 October 2020 / Revised: 29 October 2020 / Accepted: 3 November 2020 / Published: 9 November 2020
(This article belongs to the Special Issue Recent Advances in Deep Learning)
While semantic parsing has been an important problem in natural language processing for decades, recent years have seen a wide interest in automatic generation of code from text. We propose an alternative problem to code generation: labelling the algorithmic solution for programming challenges. While this may seem an easier task, we highlight that current deep learning techniques are still far from offering a reliable solution. The contributions of the paper are twofold. First, we propose a large multi-modal dataset of text and code pairs consisting of algorithmic challenges and their solutions, called AlgoLabel. Second, we show that vanilla deep learning solutions need to be greatly improved to solve this task and we propose a dual text-code neural model for detecting the algorithmic solution type for a programming challenge. While the proposed text-code model increases the performance of using the text or code alone, the improvement is rather small highlighting that we require better methods to combine text and code features. View Full-Text
Keywords: text classification; code labeling; multi-modal dataset; multi-label classification; deep learning text classification; code labeling; multi-modal dataset; multi-label classification; deep learning
Show Figures

Figure 1

MDPI and ACS Style

Iacob, R.C.A.; Monea, V.C.; Rădulescu, D.; Ceapă, A.-F.; Rebedea, T.; Trăușan-Matu, Ș. AlgoLabel: A Large Dataset for Multi-Label Classification of Algorithmic Challenges. Mathematics 2020, 8, 1995. https://doi.org/10.3390/math8111995

AMA Style

Iacob RCA, Monea VC, Rădulescu D, Ceapă A-F, Rebedea T, Trăușan-Matu Ș. AlgoLabel: A Large Dataset for Multi-Label Classification of Algorithmic Challenges. Mathematics. 2020; 8(11):1995. https://doi.org/10.3390/math8111995

Chicago/Turabian Style

Iacob, Radu C.A., Vlad C. Monea, Dan Rădulescu, Andrei-Florin Ceapă, Traian Rebedea, and Ștefan Trăușan-Matu. 2020. "AlgoLabel: A Large Dataset for Multi-Label Classification of Algorithmic Challenges" Mathematics 8, no. 11: 1995. https://doi.org/10.3390/math8111995

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop