Next Article in Journal
Numeracy for Adults with Learning Disabilities: A Focus on Concepts of Time
Previous Article in Journal
Counselling Services and Mental Health for International Chinese College Students in Post-Pandemic Thailand
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Influence of Problem Construction on Undergraduates’ Success with Stoichiometry Problems

Department of Chemistry, University of California, One Shields Avenue, Davis, CA 95616, USA
*
Author to whom correspondence should be addressed.
Educ. Sci. 2022, 12(12), 867; https://doi.org/10.3390/educsci12120867
Submission received: 11 October 2022 / Revised: 19 November 2022 / Accepted: 23 November 2022 / Published: 26 November 2022
(This article belongs to the Section STEM Education)

Abstract

:
Although there are numerous studies that aim to reveal the source of student failure in problem solving in STEM fields, there is a lack of attention on testing different methods to identify what works best in improving students’ problem-solving performance. In this study, the authors examined the influence of the type of problem construction intervention and compared it to the effect of traditional practice on 38 general chemistry students’ comprehension of problem-solving process as well as overall success with given stoichiometric problems. To determine students’ success with each subtopic involved in stoichiometric problems and to better understand the source of difficulty at a finer level rather than focusing on the end product as practiced in most studies, students’ solutions were examined using the COSINE (Coding System for Investigating Sub-problems and Network) method. The findings revealed that students who practiced the problem-construction method outperformed their counterparts in the control group who followed a traditional approach during their study session. An in-depth analysis also showed that the experimental group improved their success with seven out of nine subtopics while three topics observed an increase in the control group. The practical implication of the problem-construction method was discussed for a wider adoption by textbook publishers and educators across different disciplines.

1. Introduction

As the information age advances, problem solving is becoming an increasingly vital skill set required in every stage of our academic and professional lives [1]. Businesses are becoming less interested in employees specializing in one specific area but instead prefer hiring individuals who have interdisciplinary experience and excellent problem-solving abilities [2]. Introductory science courses, including general chemistry courses, could contribute to developing this important 21st century skill if students are taught using research-based effective methods. In these courses, a lack of an in-depth understanding of the presented topics encourages students to memorize the facts and use the same strategies for a wide range of problems [3]. Mikula and Heckler [4] pointed out this common issue observed in many STEM classes and mentioned that “many students have difficulties with basic, [but very] essential skills in STEM courses”. Stoichiometry is one of the first topics in chemistry curriculum that challenges students’ problem-solving ability and understanding of chemical concepts [5]. Some of the challenges students face can be explained by exploring students’ chemistry self-efficacy and motivation levels [3,6,7]. Although it is important to target and examine as many of these variables as possible to better identify the source and the nature of the challenges, the goal of this study was to investigate students’ conceptual understanding of chemistry topics, as measured by their ability to connect knowledge pieces and changes in students’ problem-solving performances associated with the presented methods. Students’ epistemological beliefs and their perception of learning and conceptual change impact the way that they learn and approach their education [1,3]. Students have different conceptions about the structure of knowledge. Many view it as one large block and some perceive it as a highly complex system built with many richly connected small pieces [3,5]. As students’ beliefs and perceptions on how learning occurs and knowledge is constructed are critical in developing problem-solving skill, this study focused on making the knowledge assembly process more explicit by encouraging the students to investigate the connections between different subtopics of stoichiometry. As emphasized in the constructivist theory, improving students’ ability of connecting all the studied concepts, not only within the same chapter but across the whole curriculum and even over different courses helps them develop a better conceptual understanding of chemistry topics, and, in turn, excel at solving problems [5]. Differences in students’ success with solving chemistry problems are related to the idea of concept building and how students navigate conceptual tasks [5,6]. The Complete Success Rate (CSR) formula generated by Gulacar et al. [8] and tested in previous studies [9,10,11,12] to reveal the connection between students’ conceptual understanding and problem-solving performance was utilized in this study as well. The authors shared the details regarding the determination of the CSR scores and their analysis in the methods section.
One of the biggest conceptual barriers in solving stoichiometry problems among all subtopics presented in stoichiometry, the mole concept, has been identified as a major source of struggle yielding unsuccessful attempts in solving related questions [13]. By spending their attention and time trying to memorize how to find moles from given examples and worksheets, students are not able to build their problem-solving skills because they are not spending time understanding the complex interactions amongst chemistry concepts and how they are connected in a solution. Rather, they tend to focus on superficial elements that distract them from the actual task when dealing with problems [14]. This aspect clearly differentiates novices from experts, who have a more thorough understanding of the subject and categorize questions based on underlying principles without being barred by the surface elements of the questions [15,16,17]. Mikula and Heckler [4] believe that this difference is one of the fundamental causes behind novice’s poor performance in STEM fields as the mastery of this elementary skill is vital for meaningful learning. When students identify the type of problem, they become more successful with the use of proper strategies and are able to put the given information together to generate correct solutions instead of being distracted by surface elements presented in the problem [14]. Gulacar et al. [5] and Day et al. [18] found that the main difference between low- and high-achieving students in stoichiometry problems was their ability to interpret the given information in the question body and connect the necessary steps to solve the problem. In Day et al.’s study [18], eye-tracking technology was used to discover which part of the stoichiometry problems students were struggling most to solve. What they discovered was that low-achieving students could accurately identify the useful information within a problem but could not figure out how to combine the given information to compute an answer [18]. After analyzing the data collected through think-aloud protocols, Gulacar et al. [5] also discovered that students struggle to link the necessary subproblems together. For example, in several instances, students remembered how to convert the grams of reactant into moles and find the mass of desired product; however, they forgot that they needed to test for the limiting reagent between these two steps [5]. These studies indicate that students need more help with developing a better conceptual understanding and bridging the pieces of information given in the problem. In other words, they need to see how the subproblems come together to generate a successful solution.
Special attention and time need to be allotted to improve students’ problem-solving strategies, which is not readily happening in today’s classrooms. Instead, educators rely heavily on standardized tests, which mainly measure student memorization of content knowledge [19]. Therefore, students are not focusing on developing a strong conceptual understanding but, instead, are prioritizing receiving a good grade on their tests; in essence, high scores are not always associated with a meaningful understanding [19,20]. Standardized tests are ineffective at revealing students’ true understanding of concepts and pinpointing weaknesses in their conceptual knowledge or problem-solving abilities [20]. Many students do not realize that problem solving is a great learning tool because it is mostly used as part of summative assessments to measure their level of understanding [21]. Therefore, they think their sole goal is to get an answer without worrying about the process used to get the answer or reflecting on how the solutions are obtained at a deeper level [22]. They do not realize that problem solving is what we do when we do not know what to do [23]. While solving problems, consciously or unconsciously, the problem solver starts to make connections between different concepts involved in the questions, which is seen as synonymous with the process of learning defined by many learning theorists as the reorganization of information we have in our minds [24,25]. As students solve problems, they become more aware of the discrepancies that exist between their own understanding of topics and scientifically accepted theories and refine their knowledge [26]. At the same time, problem solving encourages students to retrieve needed information from their long-term memory and process it every time, which promotes a better understanding of topics and helps them learn new information by molding it into a frame that makes sense to them. However, research shows that many students struggle to retrieve needed pieces of information during problem solving [27]. There is a need for more effective strategies assisting students to create rich, well-connected, and organized knowledge structures, so they can become more successful at applying their knowledge to various problem situations.
In a study conducted by Angawi [28] to improve student’s understanding of proton NMR, a method called skillful thinking approach was used to teach the students the thought-process behind each step which was taken to solve the problem instead of just providing a set of cues and signals to use in interpreting NMR spectroscopy data. Here, the researchers aimed to identify if students have the conceptual framework to identify the steps needed to solve problems. Another study also examined conceptual frameworks of students by analyzing the questions students asked out loud during classroom activities with a goal of understanding student’s conceptual comprehension and to provide insight to teachers about specific topics students needed help [29]. Chen et al. [27] specifically investigated the difficulties students experience in drawing connections between problem-solving and conceptual thinking. They provided students with solutions which incorporated mistakes to help them identify errors in the solutions and improve their problem-solving ability. Researchers in these two studies pointed out the importance of providing students with the conceptual tool set so that they can identify sub questions that they should be considering and figure out why they are following certain steps in solving a problem [27,28,30].
In addition, there are studies looking into the effects of students grading their peer’s work on their ability to identify the required knowledge and strategies in solving problems. For example, by evaluating their peers’ work, Scott [31] determined that the students were able to improve their overall problem-solving skills as measured with pre- and post-tests which included questions similar to the ones found in the peer-assessment activities. This study also concluded that the students avoided common sources of error by engaging in these activities [31]. Learning about the benefits of evaluating other students’ work encouraged several researchers [32,33] to study the effects of students’ generating their own questions on their problem-solving performance. They particularly aimed to determine how these interventions help them think through the conceptual framework and retrieve the concepts they learned in the classroom to generate a successful solution. Botelho et al. [34], in this specific study conducted in Hong Kong, asked freshman to create multiple-choice questions with the intention of helping them see the connections among different concepts in classes and develop a stronger conceptual understanding. However, the study did not result in strong conclusions on the effects of these practices due to some observed limitations such as the students’ unfamiliarity with the nature of multiple-choice questions and in developing them, thus they failed to see the purpose behind their own and peer generated questions [33]. In addition, this study only analyzed post surveys completed by students to determine how the students perceived this activity, heavily emphasizing the student’s opinions, thus, providing limited data on the effect of creating multiple-choice questions on their problem-solving achievement.
Overall, these studies [27,28,31,32,33] have evaluated the efficacy of various methods in enhancing students’ conceptual understanding of chemistry topics along with disclosing the benefits of evaluating peer’s work and multiple-choice question creation to some extent. Considering the advantages and the limitations of the methodologies presented in these studies, a new approach, problem construction, also called reverse engineering, was proposed and tested in this study. In this approach, students were provided with solutions to chemistry questions and asked to create a question that corresponds with that solution.
Angawi [28] and Chen et al. [27] aimed to help students develop conceptual understanding which is seen as synonymous to being able to connect different knowledge components as practiced in problem solving [34,35,36]. It was predicted that creating open-ended questions as practiced in problem construction would be a more effective way to help students see the connections between different chemistry concepts assessed with a wide range of subproblems in stoichiometry. On the other hand, Bothelo et al. [33] investigated the effects of creation of multiple-choice questions and Scott [31] examined the influence of peer evaluation on students’ problem-solving performance. Considering the complexities with generating multiple-choice questions, our intervention required that students generate open-ended questions using the given solution as a guide. In our study, it was hypothesized that problem construction would provide students a chance to evaluate previously developed solutions, mimicking peer-evaluation, and go beyond multiple-choice question creation and encouraging them to be more creative with constructing open-ended questions, which in turn is expected to be more effective in helping them build stronger connections between chemistry concepts and applying them across different stoichiometry problems. It was also noted that the impact of open-ended question creation has not received sufficient attention from the scientific community yet. Therefore, the goal of our investigation was to compare the effects of a common practice method that students are most familiar and use to get an answer to given problems to those of a naïve approach, problem construction, on their success with solving stoichiometry problems.
The aforementioned studies also lack an in-depth analysis on the effects of their methods on students’ overall problem-solving performance and almost totally ignore the interaction between students’ success with individual subproblems and getting a correct answer to the overall problem in their analysis. The analysis of the student-generated questions in our study was done at a finer scale; instead of looking at how problem construction improved student’s scores, we studied how it helped improve each sub step within the problems and their ability in connecting them.

2. Methods

2.1. Research Questions

Among the many effective methods used in problem solving studies, this study focuses on problem construction to answer the following questions:
  • How does the problem-construction intervention affect student’s performance with problem solving?
  • How do high- and low- achieving students benefit from the implementation of problem-construction intervention? Do their performances with stoichiometry problems change differently?

2.2. Participants

The study was conducted at a public research university that is on a quarter system and located in Northern California. After getting the IRB approval, students registered for the second course (CHE 2B) in the General Chemistry series were invited to complete an online survey to determine their grades in CHE 2A, the previous course. Out of 854 students, 99 indicated an interest to participate in the research project. Considering their performance in CHE 2A, 70 of them were invited to join the study and split into experimental and control groups with very similar grade distribution. Since the students who earn a grade lower than a C in CHE 2A cannot move to CHE 2B, in this study, we only had students who earned an A, B, or C. The team did everything with its power to achieve homogenous sampling. However, 32 students quit at different stages of the study due to various reasons, which changed homogeneity of the groups and limited the investigations of differences between control and experimental groups. Some of the common reasons for quitting included having discomfort with revealing their weaknesses during the think-aloud protocols, the time pressure in a fast-paced quarter system, and health issues. Therefore, their data were excluded from the analysis. While the study started with an equal number of participants in each group, these separations mostly affected the control group.

2.3. Design and Instrument

The study took place in the Winter Quarter about ten weeks after the topic of interest, stoichiometry, was covered. The data collection was fulfilled in three stages consisting of pre-test, intervention, and post-test as shown in Figure 1. All the participants in each group, the experimental and control groups, completed all three stages, but the control group did not have the same intervention.
The students in both groups solved the same pre- and post-test questions while thinking aloud during the weeks of 6–7 and weeks of 9–10, respectively. In these sessions, the participants were asked to make their thinking process audible as much as possible. When they forget to speak, they were reminded to reveal what is going on in their mind. The total time allotted for them to complete the questions was 30 min. During the interviews, if participants got stuck at the first step and could not determine a chemical equation, they were provided standardized hints on chemical formulas and symbols to help them move forward. At the end of the pre-test session, the participants were invited to come back after a week for the intervention without giving much detail about the nature of exercises they would complete.
During the intervention session, instead of presenting questions to work on, the students in the experimental group were given solutions accompanied by detailed steps and asked to construct and write questions that would generate those expected solutions. The goal of this reverse-engineering problem set was to challenge students to creatively devise questions that frame the numerical values, steps, and elemental substances included in the questions. Similar to the pre-test, certain hints were provided as necessary to allow the students to move forward with their work. At the end of the session, each participant wrote 11 questions within 75 min. On the other hand, the control group intervention mimicked what is commonly performed by students during their study time. They were provided with the same number of questions and asked to solve them. After completing each question, they were provided with the corresponding solutions and encouraged to compare their own solutions to those provided by the research team before moving to the next question.
Due to students’ limited availability, the post-test had to be scheduled one week after the interventions were implemented. The post-test included modified versions of the questions used in the pre-test. To minimize and eliminate the effect of memorization on their ability to solve problems, some of the chemicals and the numbers in the questions were changed. It was ensured that these modifications did not cause any additional difficulty to the questions. All the sessions were audio recorded and transcribed for coding and analysis. Think-aloud protocols led the team to capture participants’ thinking process, struggles, and unique challenges [27]. These transcripts were triangulated with participants’ solutions and interviewers’ observations.
The pre- and post-test included five stoichiometry questions with varying levels of difficulty. As a topic, stoichiometry was selected because its mastery is imperative in understanding the relationship between reactants and products in a reaction and learning how to manipulate the given information to determine a successful solution is a prerequisite to following topics covered in general and advanced chemistry courses [27]. Since stoichiometry is a multi-faceted topic that necessitates understanding of several sub-topics ranging from conversion of mass to limiting reagents, the questions were designed in a way that students can show their performance with each subtopic, and the research team can identify their challenges with them. Considering all these factors, understanding students’ performance and challenges with solving overall stoichiometric problems and the subproblems involved was deemed to be critical to help a large group of students struggling in the general chemistry series [12,34,35].

2.4. Data Analysis

Upon the completion of the interviews, students’ solutions were scanned digitally and uploaded to Gradescope, an online grading site, while all the audio recordings were transcribed verbatim.5 A sample coding screen from Gradescope was shared in Figure 2. After organizing all the data, the team applied the COSINE (Coding System for Investigating Sub-Problems and Network) [8,10,11] to systematically analyze students’ success with each subproblem and pinpoint the exact steps that students got lost and frustrated.
In this study, a subproblem was defined as one-step calculation that usually involves one stoichiometric subtopic (e.g., mole concept, limiting reagent, or percent yield). There was a total number of 29 subtopics from all of the questions. Since each student did the same question twice, both in pre- and post-tests, 58 data points were collected for each participant regardless of their groups.
The COSINE comprises 8 codes: S (successful), NR (Not Required), DD (Did not Know to Do), DSE (Did Something Else), CD (Could Not Do), UG (Unsuccessful-Guessed), URH (Unsuccessful-Received Hint), UDI (Unsuccessful-Did Incorrectly) [37]. Broken up into three different sections, the 8 codes of COSINE are categorized as successful, neutral, and unsuccessful codes. Within the successful codes category, only one code exists: S (Successful). Of the remaining seven codes, NR (Not Required), DD (Did Not Know to Do), and DSE (Did Something Else) fall into the neutral codes. On the other hand, the unsuccessful codes include CD (Could Not Do), UG (Unsuccessful-Guessed), URH (Unsuccessful-Received Hint), and UDI (Unsuccessful-Did Incorrectly). Organizing each code into its respective category allows us to identify the nature and the sources of students’ challenges.
Examples of how each code was applied are included below in Table 1 and Table 2. The team developed a detailed key for the questions used in the think-aloud interviews and used it as a rubric to assign the codes. The answer key showed the most common way to approach and solve each sub-problem and helped the coders compare students’ solutions to the expected solutions and assign the codes. While coding the subproblems, the coders checked students’ solutions and transcripts at the same time to determine the best code for the work completed. Each subproblem was coded independently without interference from the preceding and subsequent subproblems. In several cases, regardless of the correctness of the final answer, the intermediate steps were coded as successful when found appropriate. The coders not only evaluated the overall answer students reported but also scrutinized their performance with the individual sub-problems. Before the three coders started coding assigned students’ solutions, the inter-rater reliability was determined by utilizing Krippendorf’s alpha, [37] which was determined to be 0.73.
Upon finishing the coding both the pre- and post-tests, the codes were used in two formulas, Attempt Success Rate (ASR) and the Complete Success Rate (CSR), to enhance the analysis of hundreds of assigned codes and make the changes in students’ performances clearer.
The Attempt Success Rate (ASR) takes students’ scores into consideration and removes the codes assigned when students completely forgot or did not know to do a specific step to continue to the correct solution (DD) or did an irrelevant calculation (DSE) which was not appropriate to get the right answer. In other words, the ASR formula does not factor in the neutral codes (DD and DSE). On the other hand, the Complete Success Rate (CSR) takes all possible mistakes into consideration and factors in all the codes of the COSINE. It should be noted that code NR was not included in either formula because this code was only applied when the students used a new subproblem within a correct method different from the one used in the rubric. To keep the coding consistent across all participants, their presence was not involved in the calculations, but novelty in their methods was documented for further analysis.
ASR = S ( S + CD + UG + UDI + URH )
CSR = S ( S + DD + DSE + CD + UG + UDI + URH )
The values obtained from the ASR formula usually appear to be higher than the values obtained from the CSR formula [36]. It is hypothesized that after a successful intervention the gap between ASR and CSR will be narrowed down and even closed, which means students become more successful with connecting sub-problems without doing any unnecessary calculations indicated by low number of the code DSE and with generating a problem schema that has all the sub-problems in the best network possible. The difference between the ASR and CSR formula is crucial in determining if there is a correlation and increase in success rate from the intervention. Additionally, the difference provides substantial data in revealing whether students are connecting sub-problems in stoichiometry and understanding the relationship. If the difference between the ASR and CSR is close to 0 for problems, it is interpreted that the students are becoming more successful in figuring out what subproblems required for the given problem and putting them in a correct schema to get an accurate answer [36]. In other words, students are becoming more successful with doing individual sub-problems and connecting them. If the difference between the ASR and CSR is far from 0 for a specific problem, the student has gone off track and either done random calculations (DSE) or could not figure out what exactly needs to be done (DD), which could be an important sign for poor conceptual understanding. To help the reader find the meaning of the acronyms used in this paper easily, Table 3 was prepared.

3. Results and Discussions

The analysis of our findings focused on examining the impact of the problem-construction method on students’ problem-solving performance and to determine if any student group (e.g., low- and high-achievers) benefited from this method more than any other group did. In doing so, a special emphasis was placed on the variations of students’ success with individual subtopics that make up a subproblem and their achievement with the overall solutions. This approach was taken because a student may arrive at an incorrect answer for as many reasons as the number of sub steps involved in the questions or even more. Doing an in-depth analysis should reveal more about the students’ learning deficiencies and challenges with individual subtopics. Additionally, examining the success of different student groups would enable us to determine more efficient methods to close the achievement gap between the students in STEM fields.

3.1. Examining the Effect of Problem Construction on Student’s Problem-Solving Performance

The first part of our analysis utilized Attempt Success Rate (ASR) and Complete Success Rate (CSR) scores obtained by using the data generated with the application of COSINE codes to each subproblem in students’ solutions. Investigating the trends in changes between the pre- to the post-test scores enabled us to determine how students improved their understanding of individual stoichiometry topics in detail and how their success changed with the integration, synthesis, and use of all these topics to generate a correct solution to the given problems.

3.1.1. Evaluating the Changes in the Complete Success Rate (CSR) Scores

To determine the overall effectiveness of the intervention on students’ problem-solving achievement with stoichiometry problems, the CSR scores were first determined for the students in both experimental and control groups considering their performance with all 64 subproblems involved in the pre- and post-tests. The total data points for control group and experimental group were 448 and 1984, respectively. These numbers should be kept in mind while reading the analysis. Even though the sample size was small and not suitable for running widely acceptable statistical tests, it is believed that the differences between the groups presented in the results and discussion section are meaningful and give some interesting facts about the nature of interventions and show where they were most helpful and what the limitations were associated with them. Table 4 summarizes the changes in overall CSR scores, Δ CSR, for both groups. The Δ CSR value was calculated for each of the students for all five problems and averaged to give the Δ value of 0.04 for the experimental group and –0.07 for the control group.
The experimental group overall improved on the post-test, signified by the positive Δ CSR. On the other hand, the control group’s Δ value was determined to be negative indicating that the participants did worse on the post-test than the pre-test. When the changes in individual students’ CSR scores were examined, it was determined that, in experimental group, 32% of students did worse on the post-test, 55% improved their scores, and the rest did not show either advancement or decline in their scores. However, in the control group, 71% of students did poorly after the traditional practice while half of the remaining students showed a better performance and half of them protected their original score. Although the sample size is relatively small and the difference between delta values is not huge, these variances could be important evidence revealing the effectiveness of the intervention on students’ overall success with getting the correct solutions for stoichiometric problems.

3.1.2. Interpreting the Variations in the Attempt Success Rate (ASR) Scores

To further understand where the greatest improvements occurred to a student’s problem solving ability, a closer subproblem analysis was performed using ASR scores for each stoichiometric subtopic. Figure 3 shows the pre- and post- ASR values for the experimental group, categorized by each subtopic involved in the stoichiometry problems. It was determined that ASR values went up from pre- to post-test for 7 out of 9 subtopics. Thus, the difference between pre-test and post-test indicates that the intervention was effective in helping students perform better in the majority of the subproblems tested. Considering the documented students’ challenges with many of these subtopics including BEQ (Balancing Chemical Equations) [38], LR (Limiting Reagent) [39,40], MC (Mole Concept) [41], and SR (Stoichiometric Ratio) [42,43], these results could be considered as promising. Problem-construction has potential in helping students see how different components related to these topics are put together to create successful solutions.
Although the experimental group overall showed great progress with regard to their problem solving ability, this was not the case for the control group. While the experimental group was able to improve their performance with 7 subtopics, this number stayed at 3 for the control group. Figure 4 compares delta ASR values of the control and experimental groups. It should be noted that there are three types of trends in this figure for different groups of subtopics. The first trend indicates that the experimental group improved their performance with three subtopics, WEQ (Writing Equations), LR (Limiting Reactant), and SR (Stoichiometric Ratio), while the control group’s success with these subtopics worsened after the traditional study session.
It can be argued that the problem-construction method helps students more with these subtopics as each of them demands an analysis of multiple components, evaluating their interactions, and carefully connecting them to each other. Additionally, it was noticed that these topics require different types of knowledge and abilities. For example, WEQ is more about declarative knowledge and students’ ability of memorization [11]. So, it may not necessitate higher-order thinking skills as much as other subtopics. The subproblems about WEQ can be categorized under Remembering or Understanding level at maximum according to Bloom’s Taxonomy [44]. On the other hand, LR would require application level of thinking and proportional reasoning ability so that determined moles of substances are assessed and the LR is identified before moving to the next step in a stoichiometry problem [40,45]. In a similar way, the success with the SR would be related to students’ proportional reasoning ability to some extent. While some students are accustomed to using the dimensional analysis without thinking much about the conceptual background of the method, some use mole ratio deliberately to solve the problems [46,47].
The second trend involves three subtopics, PY (Percent Yield), BEQ (Balancing Equations), and MC (Mole Concept). The ASR scores went up for these three subtopics for both experimental and control groups. Out of these three topics, the control group improved their performance in MC and PY more than the experimental group did. Even though the differences for MC and BEQ are minimal, 0.01 and 0.04, respectively, the PY showed the greatest difference between two groups. It should be noted that, in this study, the PY subtopic is defined as being able to identify actual yield from the given question and plug it together with theoretical yield determined from another sub step into the PY formula. It could be argued that the problem construction method mainly aiming to show how different subproblems or subtopics are connected to each other in a solution is not very effective in helping students do algorithmic steps. It is also possible that this naïve approach caused retroactive interference [48]. Being exposed to a new way of thinking about stoichiometric problem solving could possibly resulted in relatively bigger changes in their knowledge system, which happens in conceptual change [49,50,51]. This gaining of these new memories may have in turn resulted in the inability of the experimental students to recall previously learned information, such as the PY subtopic (REF). On a deeper level, an explanation for this phenomenon could be due to negative priming. The experimental group was subjected to a more recent exposure of the PY subtopic in the intervention than the control group, for whom the most recent exposure for the PY subtopic was on the pre-test. The more recent exposure of the PY subtopic could have negatively influenced the future response to the same stimulus, resulting in a greater error rate for the experimental group compared to the control group. On the other hand, for the experimental group, the second greatest change was observed in the ASR score of the MF (Molecular Formula) with a delta value of 0.13 while the control group’s pre- and post-test scores stayed the same. In our coding, the MF subtopic was related to being able to recognize the relationship between the EF (Empirical Formula) and the MF and carry out the calculations to determine the new chemical formula with the correct molar mass. Although it can be argued that this operation also requires significant algorithmic problem-solving skills, it was somehow more challenging for the control group. The difference between the delta values of PY and MF needs to be investigated further to find out other possible variables affecting students’ scores here.
Finally, the last category highlights subtopics that both the control and experimental groups showed poorer performance in the post-test. The EF (Empirical Formula) and CM (Conservation of Mass) are two subtopics that the experimental group did worse after the intervention. The same trend was observed for the control group. It is speculated that students struggled with Conservation of Mass because it is not a conceptual topic, but instead requires algorithms and the use of calculations to solve for the correct answer. So, either traditional practice or problem-construction is not effective in improving students’ algorithmic skills. A result of this weak math foundational understanding is that students are not able to apply advanced problem-solving methods and instead rely on novice methods [52]. Additionally, it should be noted that this topic is mainly associated with figuring out the mass differences among the steps in a solution. So, another reason for this decrease may be due to the lack of attention to details. The steps that involve subtopic CM could be perceived to be trivial steps compared to other major steps in getting the correct answer for the given problems. While students are focusing on getting other relatively more complex calculations done, their working memory could be overloaded, [53,54] resulting in failure at acknowledging the need for doing simple math involving the subtopic CM. Otherwise, it is difficult to claim that students are not able to do basic addition or subtraction. Furthermore, students typically do not have a strong conceptual understanding of conservation of mass because they are accustomed to simply following an algorithm or equation to solve a stoichiometric problem without understanding the conceptual reasoning behind it [38]. In this study, the subtopic EF is defined as dividing the mole numbers of the elements by the smallest value and determining the simplest ratio among them. So, finding an empirical formula successfully strongly correlates with one’s proportional reasoning skills [46]. Starting from elementary schools, educators aim to promote the development of these skills as they affect their performance not only in math but also science [55]. Despite all these efforts, this study shows that students still have not developed their proportional reasoning skills to tackle the problems presented in chemistry classes. Heavy dependence on factor-label methods or dimensional analysis in traditional education could also contribute to the lack of desired improvements in students’ skills [46,55].

3.1.3. Bringing Hidden Differences to Daylight: Inspecting the Changes in the Codes

Following the analysis of students’ performance with the subtopics by utilizing the changes in their ASR scores, a deeper analysis was executed with the help of the codes assigned to each step in the experimental group’s solutions. Figure 5 highlights the specific changes seen in three categories of the COSINE codes from the pre- to the post-test for all the subtopics investigated in this study.
With this analysis, it was intended to find out how exactly problem-construction practice helps students and whether unsuccessful or neutral codes decrease more if the students improve their scores after the intervention. As described in the methodology section, there are three categories of codes, successful (S), unsuccessful (CD, UDI, UG, and URH), and neutral codes (DSE and DD). A close examination of the changes in unsuccessful codes indicates the presence of three categories. In the first category, there are 6 stoichiometric topics, which saw a decrease in the number of unsuccessful codes. Within the second group, there are only two stoichiometry topics, Stoichiometric Ratio (SR) and Conservation of Mass (CM), with the same number of unsuccessful codes before and after the study. So, the overall increase in ASR score of SR is explained with the transformation of neutral codes into successful codes. On the other side, the overall negative delta CSR value of CM was the result of the conversion of successful codes into neutral codes. It seems that the participants either forgot to carry out this step and received DD code or did inappropriate calculations and received DSE code, which support our previous hypothesis related to working memory capacity being an important factor in determination of students’ performance with this subtopic. It is obvious that neutral codes play an important role in determining students’ overall success with problem solving, and they should be considered in the studies examining students’ challenges with problem solving and in everyday grading practices. These findings should encourage educators to modify current assessment methods and go beyond the use of simple points that do not clearly reveal the nature of mistakes made in solving problems. Finally, the third category includes only one subtopic, EF, that observed an increase in unsuccessful codes after the intervention. In addition to this increase, it was also noted that the number of neutral codes went up. This finding points out the urgent need to address the weakness in students’ proportional reasoning abilities and employ effective techniques to improve it [46]. This observation agrees with the trends depicted in Figure 3 and Figure 4 where the overall ASR score decreased after the intervention.

3.2. Examining the Influence of the Intervention on High and Low Achieving Students

3.2.1. An In-Depth Analysis of Changes in CSR and ASR Scores

In the previous section, a thorough analysis of changes in CSR and ASR scores and variations in the code distribution was performed to determine the impact of problem-construction practice on all experimental group’s success with individual stoichiometry topics and overall problems. Here, the goal was to find out if the problem-construction intervention helped any specific student group more than others. First, students were grouped based on success levels in their general chemistry course. Table 5 displays the descriptive data for each student group. Although these delta values are small, for comparison purposes, they still provide a good method for judging the effectiveness of the intervention for students who performed differently in the prior chemistry course.
A and B students have an average mean of 0.01 for the delta CSR indicating overall small improvement in their problem-solving performance after the intervention. Interestingly, the greatest delta value, 0.09, was observed for the C students, which could be interpreted as the C students received the greatest benefit of participating in the problem-construction intervention compared to the students of stronger academic backgrounds. A possible explanation is that the lower performing students are the ones who may have been struggling with chemistry topics more as evidenced by their low grades, and the intervention helped them develop their problem-solving performance the most and possibly showed them what was lacking in their approaches to the problems [56]. Likewise, in the study conducted by Hardy et al. [32] using peer generated multiple-choice questions to improve understanding, lower intermediate achieving students improved more than the higher achieving students by engaging in the PeerWise activities.

3.2.2. Looking into the Distribution of Unsuccessful, Neutral, and Successful Codes

To better understand how exactly the intervention helped each group of students, particularly C students, Figure 6 was prepared with all the code breakdowns for each subtopic. Like the previous code analysis, the primary focus was placed on the changes in the unsuccessful and neutral codes. The changes in the successful code were not directly investigated but interpreted indirectly in relation to the changes in unsuccessful and neutral codes. The ideal scenario would compromise situations where both neutral and unsuccessful codes decrease or disappear after the intervention, resulting in a higher number of successful codes and, in turn, greater CSR and ASR scores. Each grade level’s performance was investigated in depth and compared to that of other grade levels. In the analysis of code breakdown of the students with a grade A, the first thing was noticed that out 9 topics, in only one, PY, a great transformation was observed, all neutral codes (8%) transformed to successful ones. Another interesting finding regarding the changes in A students’ codes is that 5 topics saw a decrease in the number of neutral codes while only two topics had a lower number of unsuccessful codes in the post-test. It seems that the intervention is more effective in addressing issues captured by DD and DSE codes than those marked by UDI, CD, UG, or URH codes.
Additionally, it should be noted there were a few occasions like in subtopics EF and LR, the number of neutral codes were transformed into unsuccessful ones after the intervention. It is hard to argue whether this is a better or a worse thing, but this point deserves further investigation. Finally, it is worth mentioning that the insignificant progress depicted with a low mean delta value, 0.01, is not surprising after seeing these students’ high percentage of successful codes even before the intervention. It was obvious that there was little room, or no room like in the subtopic MF for the improvement.
Unlike A students, B students did not have a subtopic that had 100% successful codes before or after the intervention. However, this group of students showed much better improvement in their problem solving with the subtopic PY. The percentage of unsuccessful codes went from 33% to 0% and the number of neutral codes decreased by 50%, but it was not enough to give them perfect ASR score. The same decrease in the number of unsuccessful codes was observed for the subtopic MF, which was the second greatest decrease in the number of unsuccessful codes for the entire student population after the C students’ success with the subtopic LR, in which unsuccessful codes went from 70% to 30%. When comparing the changes in this group to those in the group of A students, a similar trend was observed. Out of 9 topics, three topics saw a decrease in the number of neutral codes versus four topics that observed a decrease in the unsuccessful codes. Again, the results indicate that the intervention is better at addressing two issues, forgetting to do the necessary steps and doing irrelevant calculations. Further investigation of the codes for B students indicated that both the number of unsuccessful and neutral codes of the subtopic SR went up after the intervention, which was not the case for any subtopic in the group of A students. On the other hand, the number of both codes decreased at the same time for one subtopic, MC, which was not the case for A students, either. Overall, B students had relatively less successful codes in total compared to those of A students, but higher than those documented for C Students. This finding supports the idea that there is a correlation between the students’ overall standing in the course and performance with problems presented in the study. It could be also used to endorse the use of COSINE codes in similar studies aiming to understand students’ challenges with problem solving.
The final portion of analysis involved the students who had relatively low achievement in the previous general chemistry course. It is correct that the number of successful codes overall for C students was fewer than those received by A and B students, but a closer analysis shows that they were the ones who benefited from the intervention most. The number of unsuccessful codes dropped for seven subtopics out of nine. For two subtopics, BEQ and MC, both the number of unsuccessful and neutral codes went down, which did not happen before in any other grade group. It should be noted that the number of neutral codes decreased only for three topics for C students. This number is the lowest one observed among the three groups. It can be argued that the intervention helps students with relatively low achievement in the course fix their shortcomings related to unsuccessful codes but assist those who do better in the course with their challenges documented by the neutral codes. This is another aspect of the study that needs further examination. It is interesting to see for the first time that the unsuccessful codes assigned to a subtopic, CM, completely become neutral after the intervention. It was observed that 10% of unsuccessful codes were transformed into neutral codes. Opposite transformations were documented earlier, but as said before, there is no clear evidence for either transformation being more favorable or showing a better progress in reaching 100% successful codes. This topic will be under the radar of the researchers as well.

4. Limitation

The first and the most critical limitation was the number of students in the control group. The team had planned carefully to have reasonably high and equal numbers of students in both control and experimental groups, so certain statistical tests can be run with sufficient power. However, the study did not go as planned, and several students had to quit. In future iterations, the team is planning to invite significantly more students to start the project. Another limitation can be mentioned about the number of interventions. If the real impact of the interventions needs to be assessed, students should be exposed to those methods for an extended period rather than one time. Possibly, a better trend would be observed if longer interventions, more than 75 min, are implemented involving more than one chapter. Although there is not any evidence for its impact, it is suggested that the post-test given right after the interventions are implemented to better determine their impact on students’ problem-solving abilities. It is almost impossible to control what students do between the intervention and the post-test and rule out the effects of different practices students do on their own. Among other limitations, it should be noted that Krippendorf’s alpha score is slightly lowered than the recommended value, 0.80, to minimize the negative influence of inter-rater coding process on the data analysis. The result could be higher if we had only two coders instead of three. Finally, it is recommended that visual data is collected during the think-aloud protocols so it will be easier to determine what the participants refer to when they use words like “this” or “that” in their explanations or thinking process.

5. Conclusions

Conducting studies to understand the nature of problem-solving processes and find where students struggle most has been an important goal in chemistry education since the field was established. Many researchers have worked in this area and helped the community advance their comprehension of challenges and their roots associated with problem solving involving a wide range of science topics. Even though many issues were well studied and described in previous studies, there is a shortage in the number of studies aiming to mitigate those common problems. In this study, the effects of a promising and relatively naïve method, problem construction, were investigated and compared to those of a traditional practice method on improving college students’ problem-solving skills. The differences in the CSR values pointed out that the problem-construction intervention was more effective in improving students’ success with solving overall stoichiometry problems. Higher CSR scores obtained in the post-test indicate that not only did students recognize the proper steps to solving the problem, but also their relation to other sub steps. Additionally, increased ASR values revealed that the same intervention is efficient in helping students do better with subproblems involving different stoichiometric subtopics such as Percent Yield and Limiting Reagent as well as decreasing the number of unsuccessful and neutral codes for many subtopics. This change would reduce the gap between the ASR and CSR scores, indicating that students become proficient at doing individual subproblems and overall problems at the same time. This finding is encouraging as the previous studies [5,57] found out that students are, in general, successful with doing subproblems, but fail to figure out the correct schema that maps out every subproblem needed for getting the right answer to the given problems. A zero difference between ASR and CSR scores could also be interpreted as the students are getting a more organized knowledge system, which should make the knowledge retrieval process more efficient and increase the chance of getting the correct answers [10,11,12]. These positive changes are better understood when the nature of reverse problem solving is considered. During that process, students challenge themselves by taking a step further in the problem by using the given information and manipulating it in such a way that they can formulate a specific question with all the required stoichiometric subtopics, which are linked to each other considering the conceptual framework of the question. Using the method of reverse problem solving, instructors can help students connect concepts more successfully and see the underlying principles and schemas that certain types of problems carry. Sufficient problem-construction practice should also teach students how to break down the information given in the question, relate them to appropriate topics learned, and then complete the puzzle by putting all the pieces back together. Examining the differences in student groups based on overall performance in the previous chemistry course also discloses that problem-construction method helps all, but low-performing students benefit most. If educators want to close the achievement gap in stoichiometric problem solving and in other areas, they should design homework assignments utilizing problem-construction method and especially use it as a form of intervention to those who show relatively poorer performance in the class. Problem construction method can also be turned into a group activity as well. Students in the group can interpret the pieces of the given solutions and step by step create the questions collectively that will result in the same solution.
As highlighted in the limitation and participants sections, the number of students in the experimental group changed abruptly after the study began. It is planned to use the problem construction method with a larger and a more diverse group of students. In this study, with limited power, the effects of this method on students of different achievement groups were investigated. In the new studies, more detailed information on students’ demographics (e.g., socioeconomic and ethnic backgrounds) will be collected to bring depth to the analysis and find out if the method benefits a specific group more than others. Additionally, future studies will focus not only on stoichiometry but also on other general chemistry and advanced topics such as acid-base chemistry [58,59], chemical kinetics [60,61], NMR spectroscopy [28], which are known to be challenging to many students.

Author Contributions

O.G. came up with the research plan, O.G. and B.J.V. recruited participants, and H.K.M. and S.S.M. interviewed with participants and coded their data. B.J.V. organized the data for analysis. All the authors contributed to the analysis of the data under the guidance of O.G. Paper was written by all the authors, O.G., H.K.M., S.S.M. and B.J.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of University of California, Davis (IRB ID is 922670-3 and date of approval is 5 November 2017).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ogilvie, C. Changes in students’ problem-solving strategies in a course that includes context-rich, multifaceted problems. Phys. Rev. Spec. Top. -Phys. Educ. Res. 2009, 5, 020102. [Google Scholar] [CrossRef] [Green Version]
  2. Ederer, P.; Nedelkoska, L.; Patt, A.; Castellazzi, S. What do employers pay for employees’ complex problem solving skills? Int. J. Lifelong Educ. 2015, 34, 430–447. [Google Scholar] [CrossRef]
  3. Frey, R.F.; Cahill, M.J.; McDaniel, M.A. Students’ Concept-Building Approaches: A Novel Predictor of Success in Chemistry Courses. J. Chem. Educ. 2017, 94, 1185. [Google Scholar] [CrossRef]
  4. Mikula, B.D.; Heckler, A.F. Framework and implementation for improving physics essential skills via computer-based practice: Vector math. Phys. Review. Phys. Educ. Res. 2017, 13, 010122. [Google Scholar] [CrossRef] [Green Version]
  5. Gulacar, O.; Eilks, I.; Bowman, C.R. Differences in General Cognitive Abilities and Domain-Specific Skills of Higher- and Lower-Achieving Students in Stoichiometry. J. Chem. Educ. 2014, 91, 961–968. [Google Scholar] [CrossRef]
  6. Sinapuelas, M.L.S. Why Do Some Students Struggle while others Succeed in Chemistry? A Study of the Influence of Undergraduate Student Beliefs, Perceptions, and Use of Resources on Performance in Introductory Chemistry. Ph.D. Thesis, University of California, Oakland, CA, USA, 2011. Available online: https://escholarship.org/uc/item/7nx2f1fx (accessed on 19 September 2022).
  7. Larson, L.; Stephen, A.; Bonitz, V.; Wu, T. Predicting Science Achievement in India: Role of Gender, Self-Efficacy, Interests, and Effort. J. Career Assess. 2014, 22, 89–101. [Google Scholar] [CrossRef] [Green Version]
  8. Gulacar, O. Investigation of Successful and Unsucccessful Students’ Problem Solving in Stoichiometry. Ph.D. Thesis, Western Michigan University, Kalamazoo, MI, USA, 2007. Available online: https://scholarworks.wmich.edu/dissertations/870 (accessed on 11 November 2022).
  9. Gulacar, O.; Wu, A.; Prathikanti, V.H.; Vernoy, B.J.; Kim, H.; Bacha, T.A.; Oentoro, T.C.; Reedy, K.J.; Navarrete-Pleitez, M. Benefits of Desirable Difficulties: Comparing the Influence of Mixed Practice to that of Categorized Sets of Questions on Students’ Problem-Solving Performance in Chemistry. Chem. Educ. Res. Pract. 2022, 23, 422–435. [Google Scholar] [CrossRef]
  10. Gulacar, O.; Cox, C.; Fynewever, H. Deconstructing the problem-solving process: Beneath assigned points and beyond traditional assessment. In Problems and Problem Solving in Chemistry Education: Analysing Data, Looking for Patterns and Making Deductions; Tsaparlis, G., Ed.; The Royal Society of Chemistry: London, UK, 2021; pp. 68–92. [Google Scholar]
  11. Gulacar, O.; Cox, C.; Tribble, E.; Rothbart, N.; Cohen-Sandler, R. Investigation of the correlation between college students’ success with stoichiometry subproblems and metacognitive awareness. Can. J. Chem. 2020, 98, 676–682. [Google Scholar] [CrossRef]
  12. Gulacar, O.; Tan, A.; Cox, C.T.; Bloomquist, J.; Jimmy, O.; Cao, N. Analyzing Characteristics of Experts in the Context of Stoichiometric Problem-Solving. Educ. Sci. 2019, 9, 219. [Google Scholar] [CrossRef] [Green Version]
  13. Bopegedera, A.M.R.P. Preventing mole concepts and stoichiometry from becoming "gatekeepers" in first year chemistry courses. ACS Symp. Ser. 2019, 1330, 121–136. [Google Scholar]
  14. Chi, M.T.H.; Feltovich, P.J.; Glaser, R. Categorization and Representation of Physics Problems by Experts and Novices. Cogn. Sci. 1981, 5, 121–152. [Google Scholar] [CrossRef]
  15. Taasoobshirazi, G.; Glynn, S.M. College students solving chemistry problems: A theoretical model of expertise. J. Res. Sci. Teach. 2009, 46, 1070–1089. [Google Scholar] [CrossRef]
  16. Persky, A.M.; Robinson, J.D. Moving from Novice to Expertise and Its Implications for Instruction. Am. J. Pharm. Educ. 2017, 81, 6065. [Google Scholar] [CrossRef] [PubMed]
  17. Bransford, J.D.; Brown, A.L.; Cocking, R.R. How experts differ from novices. In How People Learn: Brain, Mind, Experience, and School; Bransford, J.D., Brown, A.L., Cocking, R.R., Eds.; National Academy Press: Washington, DC, USA, 1999; pp. 31–50. [Google Scholar]
  18. Day, E.L.; Tang, H.; Kendhammer, L.K.; Pienta, N.J. Sequence Analysis: Use of scanpath patterns for analysis of students’ problem-solving strategies. In Eye Tracking for the Chemistry Education Researcher; American Chemical Society: New York, NY, USA, 2018; Volume 1292, pp. 73–97. [Google Scholar]
  19. Mehta, J. Search of Deeper Learning: The Quest to Remake the American High School; Fine, S.M., Ed.; Harvard University Press: Cambridge, MA, USA, 2019. [Google Scholar]
  20. Koretz, D.M. Measuring up What Educational Testing Really Tells Us; Harvard University Press: Cambridge, MA, USA, 2008. [Google Scholar]
  21. Dixson, D.D.; Worrell, F.C. Formative and Summative Assessment in the Classroom. Theory Into Pract. 2016, 55, 153–159. [Google Scholar] [CrossRef]
  22. Smith, K.J.; Metz, P.A. Evaluating Student Understanding of Solution Chemistry through Microscopic Representations. J. Chem. Educ. 1996, 73, 233–235. [Google Scholar] [CrossRef]
  23. Gilbert, J. Chemical Education: Towards Research-Based Practice; Kluwer Academic Publishers: Dordrecht, The Netherlands; Boston, MA, USA, 2002. [Google Scholar]
  24. diSessa, A.A.; Gillespie, N.M.; Esterly, J.B. Coherence versus fragmentation in the development of the concept of force. Cogn. Sci. 2004, 28, 843–900. [Google Scholar] [CrossRef]
  25. Vosniadou, S. Capturing and modeling the process of conceptual change. Learn. Instr. Learn. Instr. 1994, 4, 45–69. [Google Scholar] [CrossRef]
  26. Lee, C.; Rooney, P.; Parada, R.H. Fostering intentional learning with systems dynamic modeling. Aust. J. Educ. 2014, 58, 89–103. [Google Scholar] [CrossRef]
  27. Chen, X.; Mitrovic, A.; Mathews, M. Learning From Worked Examples, Erroneous Examples, and Problem Solving: Toward Adaptive Selection of Learning Activities. IEEE Trans. Learn. Technol. 2020, 13, 135–149. [Google Scholar] [CrossRef]
  28. Angawi, R.F. Using a problem solving-cooperative learning approach to improve students’ skills for interpreting 1H NMR spectra of unknown compounds in an organic spectroscopy course. J. Chem. Educ. 2014, 91, 823–829. [Google Scholar] [CrossRef]
  29. Chin, C.; Brown, D.E. Student-generated questions: A meaningful aspect of learning in science. Int. J. Sci. Educ. 2002, 24, 521–549. [Google Scholar] [CrossRef]
  30. Gilbert, G.L. How do I get the answer? Problem solving in chemistry. J. Chem. Educ. 1980, 57, 79–81. [Google Scholar] [CrossRef]
  31. Scott, F.J. A simulated peer-assessment approach to improving student performance in chemical calculations. Chem. Educ. Res. Pract. 2014, 15, 568–575. [Google Scholar] [CrossRef]
  32. Hardy, J.; Bates, S.P.; Casey, M.M.; Galloway, K.W.; Galloway, R.K.; Kay, A.E.; Kirsop, P.; McQueen, H.A. Student-Generated Content: Enhancing learning through sharing multiple-choice questions. Int. J. Sci. Educ. 2014, 36, 2180–2194. [Google Scholar] [CrossRef]
  33. Botelho, M.; Lam, O.; Watt, R.; Leung, D.; Kember, D. Evaluation of peer-generated MCQ s to assess and support learning in a problem-based learning programme. Eur. J. Dent. Educ. 2018, 22, 358–363. [Google Scholar] [CrossRef]
  34. Arasasingham, R.D.; Taagepera, M.; Potter, F. Assessing the Effect of Web-Based Learning Tools on Student Understanding of Stoichiometry Using Knowledge Space Theory. J. Chem. Educ. 2005, 82, 1251–1262. [Google Scholar] [CrossRef]
  35. Okanlawon, A.E. Teaching reaction stoichiometry: Exploring and acknowledging Nigerian chemistry teachers’ pedagogical content knowledge. Cypriot J. Educ. Sci. 2010, 5, 107–129. [Google Scholar]
  36. Gulacar, O.; Overton, T.L.; Bowman, C.R.; Fynewever, H. A novel code system for revealing sources of students’ difficulties with stoichiometry. Chem. Educ. Res. Pract. 2013, 14, 507–515. [Google Scholar] [CrossRef]
  37. Krippendorff, K. Reliability in Content Analysis: Some Common Misconceptions and Recommendations. Hum. Commun. Res. 2004, 30, 411–433. [Google Scholar] [CrossRef]
  38. Agung, S.; Schwartz, M.S. Students’ Understanding of Conservation of Matter, Stoichiometry and Balancing Equations in Indonesia. Int. J. Sci. Educ. 2007, 29, 1679–1702. [Google Scholar] [CrossRef]
  39. Ault, A. How to Say How Much: Amounts and Stoichiometry. J. Chem. Educ. 2001, 78, 1347. [Google Scholar] [CrossRef]
  40. Chandrasegaran, A.L.; Treagust, D.F.; Waldrip, B.G.; Chandrasegaran, A. Students’ Dilemmas in Reaction Stoichiometry Problem Solving: Deducing the Limiting Reagent in Chemical Reactions. Chem. Educ. Res. Pract. 2009, 10, 14–23. [Google Scholar] [CrossRef]
  41. Ralph, V.R.; Lewis, S.E. Chemistry topics posing incommensurate difficulty to students with low math aptitude scores. Chem. Educ. Res. Pract. 2018, 19, 867–884. [Google Scholar] [CrossRef]
  42. Bapu Ramesh, V.; Selvam, A.A.A.; Kulkarni, S.; Dattatreya Manganahalli, A.; Bettadapur, K.R. Designing and Using an Atomic Model Kit with H, C, N, and O Model Atoms Having a Mass Ratio of 1:12:14:16 to Teach the Concept of Mole and Associated Stoichiometric Relationships. J. Chem. Educ. 2020, 97, 986–991. [Google Scholar] [CrossRef]
  43. Chonkaew, P.; Sukhummek, B.; Faikhamta, C. STEM Activities in Determining Stoichiometric Mole Ratios for Secondary-School Chemistry Teaching. J. Chem. Educ. 2019, 96, 1182–1186. [Google Scholar] [CrossRef]
  44. Diab, S.; Sartawi, B. Classification of Questions and Learning Outcome Statements (LOS) Into Blooms Taxonomy (BT) By Similarity Measurements Towards Extracting Of Learning Outcome from Learning Material. Int. J. Manag. Inf. Technol. 2017, 9, 1–12. [Google Scholar] [CrossRef]
  45. Chandran, S.; Treagust, D.F.; Tobin, K.G. The role of cognitive factors in chemistry achievement. J. Res. Sci. Teach. 1987, 24, 145–160. [Google Scholar] [CrossRef]
  46. Guthrie, J.M. Proportional Reasoning in the Solution of Problems in High School Chemistry and Its Impact on Developing Critical Thinking Skills; ERIC: Washington, DC, USA, 1991. [Google Scholar]
  47. Kilner, W.C. The Chem-Math Project: Enhancing Success in General Chemistry through the Integration of Mathematics, Problem-Solving and Conceptual Understanding. An Action-Research Study. Ph.D. Thesis, University of New Hampshire, Durham, NH, USA, 2014. [Google Scholar]
  48. Darby, K.P.; Sloutsky, V.M. The Cost of Learning: Interference Effects in Memory Development. J. Exp. Psychology. Gen. 2015, 144, 410–431. [Google Scholar] [CrossRef]
  49. Simonsmeier, B.A.; Mayer, A.-K.; Rosman, T.; Gorges, J.; Flaig, M.; Schneider, M. Conceptual change and knowledge integration as learning processes in higher education: A latent transition analysis. Learn. Individ. Differ. 2018, 62, 49–61. [Google Scholar]
  50. Gulacar, O.; Sinan, O.; Bowman, C.; Yildirim, Y. Exploring the Changes in Students’ Understanding of the Scientific Method Using Word Associations. Res. Sci. Educ. 2015, 45, 717–726. [Google Scholar] [CrossRef]
  51. DiSessa, A.A. A History of Conceptual Change Research: Threads and Fault Lines; Cambridge University Press: Cambridge, MA, USA, 2014. [Google Scholar]
  52. Jeon, K.; Huffman, D.; Noh, T. The Effects of Thinking Aloud Pair Problem Solving on High School Students’ Chemistry Problem-Solving Performance and Verbal Interactions. J. Chem. Educ. 2005, 82, 1558. [Google Scholar]
  53. Clair-Thompson, H.S.; Overton, T.; Bugler, M. Mental capacity and working memory in chemistry: Algorithmic versus open-ended problem solving. Chem. Educ. Res. Pract. 2012, 13, 484–489. [Google Scholar] [CrossRef]
  54. Fayol, M.; Abdi, H.; Gombert, J.-E. Arithmetic Problems Formulation and Working Memory Load. Cogn. Instr. 1987, 4, 187–202. [Google Scholar] [CrossRef]
  55. Wagner, E.P. A Study Comparing the Efficacy of a Mole Ratio Flow Chart to Dimensional Analysis for Teaching Reaction Stoichiometry. Sch. Sci. Math. 2001, 101, 10–22. [Google Scholar] [CrossRef]
  56. Giuliodori, M.J.; Lujan, H.L.; DiCarlo, S.E. Collaborative group testing benefits high- and low-performing students. Adv. Physiol. Educ. 2008, 32, 274–278. [Google Scholar] [CrossRef]
  57. Gulacar, O.; Fynewever, H. A Research Methodology for Studying What Makes Some Problems Difficult to Solve. Int. J. Sci. Educ. 2010, 32, 2167–2184. [Google Scholar] [CrossRef]
  58. Romine, W.L.; Todd, A.N.; Clark, T.B. How Do Undergraduate Students Conceptualize Acid-Base Chemistry? Measurement of a Concept Progression. Sci. Educ. 2016, 100, 1150–1183. [Google Scholar] [CrossRef]
  59. Kousathana, M.; Demerouti, M.; Tsaparlis, G. Instructional Misconceptions in Acid-Base Equilibria: An Analysis from a History and Philosophy of Science Perspective. Sci. Educ. 2005, 14, 173–193. [Google Scholar] [CrossRef]
  60. Rodriguez, J.-M.G.; Bain, K.; Hux, N.P.; Towns, M.H. Productive features of problem solving in chemical kinetics: More than just algorithmic manipulation of variables. Chem. Educ. Res. Pract. 2019, 20, 175–186. [Google Scholar] [CrossRef]
  61. Marzabal, A.; Delgado, V.; Moreira, P.; Barrientos, L.; Moreno, J. Pedagogical Content Knowledge of Chemical Kinetics: Experiment Selection Criteria To Address Students’ Intuitive Conceptions. J. Chem. Educ. 2018, 95, 1245–1249. [Google Scholar] [CrossRef]
Figure 1. Highlights of the study design.
Figure 1. Highlights of the study design.
Education 12 00867 g001
Figure 2. A screenshot of coding page on Gradescope for a sample problem.
Figure 2. A screenshot of coding page on Gradescope for a sample problem.
Education 12 00867 g002
Figure 3. Experimental group’s pre- and post-ASR average values for the subtopics involved in the questions.
Figure 3. Experimental group’s pre- and post-ASR average values for the subtopics involved in the questions.
Education 12 00867 g003
Figure 4. The changes in the ASR values after the intervention for the students in the experimental and control groups.
Figure 4. The changes in the ASR values after the intervention for the students in the experimental and control groups.
Education 12 00867 g004
Figure 5. The code distributions per subtopic before (measured in the pretest) and after (measured in the posttest) the intervention.
Figure 5. The code distributions per subtopic before (measured in the pretest) and after (measured in the posttest) the intervention.
Education 12 00867 g005
Figure 6. The code distributions per subtopic for the experimental students grouped by success levels in general chemistry.
Figure 6. The code distributions per subtopic for the experimental students grouped by success levels in general chemistry.
Education 12 00867 g006
Table 1. Examples for the application of the COSINE codes *.
Table 1. Examples for the application of the COSINE codes *.
CodesSubtopicStudents’ Thinking ProcessStudent Work **
SMC
(Finding Moles of Carbon)
In this solution, the participant correctly identifies that he needs to use stoichiometry to go from grams to moles. The participant says, “I am going to convert everything I see here into moles. So, 40 g of carbon, I am going to use stoichiometry. Carbon is 12.011 g and then that equals to 1 mole. I am going to take 40 and divide that by 12.01 to get 3.33 moles of Carbon.”Education 12 00867 i001
DDLR
(Identifying Limiting Reactant)
The participant states he does not know how to do this problem and he also does not realize that he needed to calculate the limiting reactant using the two compounds. He thinks both would equally contribute the overall mass of N2. He states, “I started by writing down the two compounds given […] the question says it wants you to find the grams of N2 gas […] and now I am trying to find the grams of Nitrogen in this compound, so the 100 g is the total mass of N2H4 and I am trying to find Nitrogen [from it]. I am dividing the mass of nitrogen by the total mass to get the percentage of the mass of nitrogen in N2H4”. “I don’t know how to do this, but with my math knowledge it makes sense to say that there is the same amount of nitrogen in both of them [the two compounds].”Education 12 00867 i002
DSEEF
(Empirical Formula Ratio)
The participant multiplies the mole ratios by a common multiple to change the fraction into a whole number. His reasoning for manipulating the mole quantities is not the way specified in the traditional method. In doing so, he does not realize that he is changing the empirical formula he calculated. As they state, “carbon turned out to be 2.67 moles, we know that that is equal to two and two thirds, so we multiply by three to get rid of the fraction. So, you have the formula should be C8H12O12.”Education 12 00867 i003
* The detailed explanations for each code can be found in Gulacar et al. [10]. ** Highlighted region represents the subproblem being coded for that question.
Table 2. Examples for the application of the COSINE codes *.
Table 2. Examples for the application of the COSINE codes *.
CodesSubtopicStudents’ Thinking ProcessStudent Work **
CDMC
(Finding Moles of N2H4)
The participant clearly recognizes that this is a limiting reactant problem by stating “limiting reactant” when comprehending the question. She says she needs to “figure out which is the limiting” however she has a hard time recalling the exact steps. It is observed that she knows what needs to be done and as she mentions how there was a “proper procedure” however it “does not stick in my head.”Education 12 00867 i004
UGPY
(Calculating Percent Yield)
In this subproblem, the participant clearly guesses. He states, “I don’t know, I’m just gonna say 100% yield”.Education 12 00867 i005
URHWEQ
(Writing the Equation)
The participant is struggling to write the formula for strontium halide and the interviewer actually provides it. The interviewer states, “But I’ll just help you a little bit. […] the charge of strontium is plus two so in the halide it is a negative one so it’ll be a SrX2.”Education 12 00867 i006
UDIMC
(Finding Moles of Carbon)
The participant does the incorrect stoichiometric calculation to go from grams to moles carbon. Instead of dividing the grams of carbon they state that “I have to multiply the mass by the percentage and then add them up to see if it makes the total mass”.Education 12 00867 i007
* The detailed explanations for each code can be found in Gulacar et al. [10]. ** Highlighted region represents the subproblem being coded for that question.
Table 3. Acronyms for stoichiometric concepts and COSINE codes.
Table 3. Acronyms for stoichiometric concepts and COSINE codes.
AcronymStoichiometry TopicsAcronymCOSINE CODES and Formulas
BEQBalancing Chemical EquationSSuccessful
WEQWriting Chemical EquationUGUnsuccessful Guessed
MCMole ConceptUDIUnsuccessful Did Incorrectly
SRStoichiometric RatioURHUnsuccessful Received Hint
PYPercent YieldCDCould not Do
CMConservation of MassDSEDid Something Else
EFEmpirical FormulaDDDid not know to Do
MFMolecular FormulaNRNot Required
LRLimiting ReagentASRAttempt Success Rate
CSRComplete Success Rate
Table 4. Descriptive data for the changes in the CSR scores after the interventions.
Table 4. Descriptive data for the changes in the CSR scores after the interventions.
Experimental (N = 31)Control (N = 7)
MSDMinMaxMSDMinMax
∆ CSR0.040.12−0.170.44−0.070.08−0.190.00
Table 5. Descriptive data of students grouped by success level in General Chemistry I.
Table 5. Descriptive data of students grouped by success level in General Chemistry I.
A Students (N = 12)B Students (N = 9)C Students (N = 10)
MSDMinMaxMSDMinMaxMSDMinMax
∆ CSR0.010.06−0.140.100.010.15−0.170.260.090.14−0.090.44
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gulacar, O.; Mann, H.K.; Mann, S.S.; Vernoy, B.J. The Influence of Problem Construction on Undergraduates’ Success with Stoichiometry Problems. Educ. Sci. 2022, 12, 867. https://doi.org/10.3390/educsci12120867

AMA Style

Gulacar O, Mann HK, Mann SS, Vernoy BJ. The Influence of Problem Construction on Undergraduates’ Success with Stoichiometry Problems. Education Sciences. 2022; 12(12):867. https://doi.org/10.3390/educsci12120867

Chicago/Turabian Style

Gulacar, Ozcan, Harjeet Kaur Mann, Sukhdev Singh Mann, and Brandon James Vernoy. 2022. "The Influence of Problem Construction on Undergraduates’ Success with Stoichiometry Problems" Education Sciences 12, no. 12: 867. https://doi.org/10.3390/educsci12120867

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop