Achieving and Monitoring Education for Sustainable Development and Global Citizenship: A Systematic Review of the Literature

This paper presents the results of a systematic review of literature (56 studies) related to Sustainable Development Goal 4.7. The goal of the research reported on here is to contribute to the discussion around strategies for working towards and monitoring SDG4.7 at the institutional level. Within this overarching focus, our review of the literature was designed to identify studies that have looked at both student learning and teacher education related to SDG4.7. This twin focus stems from the recognition that achieving the SDGs will be particularly difficult if policymakers are not attentive to both sides of the learning equation—that is, first, to the ways that teachers learn to teach about issues related to SDG4.7 and, then, the ways that students acquire this knowledge and are assessed. The five findings sections of this review correspond to the five areas of emphasis embedded in the language of SDG4.7, namely, education for (a) sustainable development, (b) human rights, (c) gender equality, (d) promoting of a culture of peace and non-violence, and (e) appreciation of cultural diversity. In accordance with the purpose of this review, the synthesis for each area of emphasis digs into the details of the educational interventions, monitoring and evaluation strategies, and results that are documented in the publications analyzed. Thus, this review can be useful for informing educational or pedagogical approaches related to SDG4.7, as well as for designing monitoring and evaluation tools for the SDGs.


Introduction
The Sustainable Development Goals (SDGs), approved by the United Nations in 2015, include a focus on education for sustainable development (ESD) and global citizenship education (GCEd). Specifically, by 2030, SDG target 4.7 seeks to "ensure that all learners acquire the knowledge and skills needed to promote sustainable development, including, among others, through education for sustainable development and sustainable lifestyles, human rights, gender equality, promoting of a culture of peace and non-violence, global citizenship and appreciation of cultural diversity and culture's contribution to sustainable development" [1]. However, what stands out is that the indicators that accompany different levels of monitoring of SDG target 4.7 do not focus on the institutional level. That is, to this point (writing in 2019), what has been identified are global and thematic level indicators-with the implication being that these indicators may not shed light on teaching and learning

Literature Characterization
Among the 56 publications retained in this study, 57% focused on student assessment of ESD or GCED at the institutional level, while 41% focused on the assessment of teacher education in the areas of ESD and GCEd, and only 2% on both. Thematically, the content of the programs assessed focused mainly on education for sustainable development and sustainable lifestyles (39%) and education for global citizenship and appreciation of cultural diversity (35%), followed by peace education (12%), human rights (11%) and gender equality (4%), as seen in Table 1. A large majority of the programs in these studies focused on formal education: 88% of the programs were implemented either in university (52%) or in the traditional k-12 classroom (36%). While the level of program implementation varied from kindergarten to adult education, it was most common for programs to focus on the university level (53%), followed by secondary school (28%), and then primary school (19%). Geographically, the programs were implemented in 18 different countries. There was a lack of variation as programs tended to be implemented in either Europe (50%) or in North America (38%). Fewer studies focused on other regions, and no publications were from Central or South America, East Asia or South Asia. The table in Appendix D indicates the geographic focus of each study.
In the identified literature, the majority of the programs were implemented by teachers (66%) in various educational settings, such as schools and universities, while other programs were implemented by different actors (e.g., researchers, volunteers, NGO employees, etc.).
With regards to methods, quantitative and qualitative assessment tools were used to understand the associated outcomes. Specifically, 41% (n = 23) of the studies utilized qualitative data, 39% (n = 29) used quantitative data and 20% (n = 11) utilized mixed methods approach. Over half of the studies used non-experimental methods (66%, n = 37), followed by quasi-experimental (25%, n = 14) and experimental methods (9%, n = 5). Further, 29% of the programs claimed statistically significant effects, of which only 2 studies applied an experimental design. About half of the studies (50%) made available their assessment tools or instruments.
Finally, the majority of the assessed programs claimed or showed positive outcomes or effects (86%), and over a half of studies claimed that the programs sought to be transformative (64%). All studies claimed useful pedagogical/policy implications for further implementation of the SDGs.

Literature Syntheses and Key Insights
The presentation of findings from the literature proceeds in a way that reflects the frequency with which each theme is covered in the publications we reviewed. First, we discuss education for sustainable development, followed by education for diversity, and then education for peace education, human rights, and gender equality. We begin with those themes for which there are more studies. Note that the references for to all the studies cited in the findings sections presented below can be found in the master list of references for all retained literature, found in Appendix B.

Sustainable Development
Setting: As seen in Figure 1, roughly one third of the programs (36%) were implemented in a traditional classroom and over half of the programs (60%) were implemented in university contexts. Other programs (4%) were either conducted as informal after-school or summer activities, or as an NGO program. As can be seen from Figure 2, outside of higher education contexts, the most common level for implementation was middle school. The majority of the programs were conducted in Western countries, i.e., Europe (68%) and North America (14%). Other ESD programs were conducted in Oceania (4%), South-East Asia (9%), and in the Middle East (4%). There were no articles found to be focused on East Asia, South Asia, or Central-Latin America. .

Figure 2. Educational Level of ESD Programs.
Objective: The literature revealed most efforts and objectives of ESD initiatives are focused on promoting and enhancing teacher and student knowledge and understanding of, experiences with, and sensitivity toward sustainable development and global citizenship. The majority of the programs aimed at transforming participants' perception of sustainable development and promoting understanding of related ESD issues as well as supporting the development of key competencies, attitudes, skills, behaviors and dispositions appropriate to the goals of ESD. While there are various frameworks seen in the literature to define the key competencies, the key ESD-related competencies described in the literature include: .

Figure 2. Educational Level of ESD Programs.
Objective: The literature revealed most efforts and objectives of ESD initiatives are focused on promoting and enhancing teacher and student knowledge and understanding of, experiences with, and sensitivity toward sustainable development and global citizenship. The majority of the programs aimed at transforming participants' perception of sustainable development and promoting understanding of related ESD issues as well as supporting the development of key competencies, attitudes, skills, behaviors and dispositions appropriate to the goals of ESD. While there are various frameworks seen in the literature to define the key competencies, the key ESD-related competencies described in the literature include: Objective: The literature revealed most efforts and objectives of ESD initiatives are focused on promoting and enhancing teacher and student knowledge and understanding of, experiences with, and sensitivity toward sustainable development and global citizenship. The majority of the programs aimed at transforming participants' perception of sustainable development and promoting understanding of related ESD issues as well as supporting the development of key competencies, attitudes, skills, behaviors and dispositions appropriate to the goals of ESD. While there are various frameworks seen in the literature to define the key competencies, the key ESD-related competencies described in the literature include: The key ESD teaching competencies for educators were described by the study of de Kraker et al. [21] in accordance with the report of the United Nations Economic Commission for Europe (2011) "Learning for the future: Competences in education for sustainable development". The authors explained that these competencies are categorized as "learning to know, learning to do, learning to live together and learning to be" and include: • Integrative thinking (including insights from different disciplines, cultures and world views; • Inclusivity (embracing a wide range of perspectives); • Dealing with complexities (providing opportunities to engage with and create bridges across a range of concepts and ideas) [21] (pp. 763-764).
More specifically, these skills, behaviors and dispositions include critical, creative and innovative reflection and the ability to analyze national and local issues for their relevance to global sustainability, as well as a willingness and motivation to be actively involved in and engaged with local issues, and to make positive contributions to real-world sustainability problems (e.g., [22][23][24][25]). One program aimed at reconciliation between communities for protecting their common natural resource (i.e., [26]).
Description: Many of the ESD programs used a participatory approach, meaning, e.g., discussion and in-class group exercises. The constructivist pedagogical approach was also used in that students were encouraged to be creative and to apply their own knowledge to solve specific sustainable development issues. As a typical example of a group exercise, students were asked to use problem-solving methodologies learned in their courses to analyze real-world sustainability problems and to propose their own solutions. Usually in this approach, students have "relatively large autonomy" while teachers' role is mainly to support and facilitate the learning process [21] (p. 761). However, it is sometimes seen that the issues to solve are first described by teachers even though students are expected to be engaged in the problem-solving process by using their own knowledge and specializations (e.g., [27]). Some programs were activity based, like organizing an outdoor program to promote ESD [25], or a summer camp to provide ESD and build sound relationships between communities [26].
In efforts to assess the effectiveness of teacher education, it was common among the programs not only to introduce pre-service teachers to the teaching method to foster students' competencies contributing to ESD, but also to introduce the basic values and lifestyles of sustainable development. Teacher education for sustainable development tended to feature interactive pedagogies, e.g., group discussion and experiential learning (hazard mapping and creating teaching materials) [28]. In contrast, some of the articles explored the relationship between certain in-school factors (e.g., teachers' knowledge and their introduction of ESD into the curriculum), with the implication being that these publications did not look at the implementation of a particular intervention.
Duration: The duration of the programs varied from very short (e.g., one day workshop) to long (e.g., one semester), though many of the articles focused on programs lasting more than 2 weeks. There are also longitudinal studies which looked at the effectiveness of ESD programs implemented over a number of years.
Implementer: The majority of the programs were implemented by teachers (64%) in various educational contexts such as schools and universities while some programs were implemented by others, such as researchers and graduate students (32%) or NGO employees (4%), as shown in Figure 3.  Transformative orientation: Nearly half of the ESD programs and initiatives studied (55%) sought to be transformative. Among articles focused on teacher education programs, some of them investigated change in teachers' perceptions of sustainable development or ESD while some others were interested in how teachers' knowledge and sense of moral obligations affect ESD teaching or curriculum development. There was often the assumption that understanding ESD is an evolving process. In general, many of the programs aimed not only to increase student knowledge for sustainable development but also to raise awareness and promote students' behavior change. For instance, some programs aimed to transform students' behaviors to be environmentally friendly, e.g., changing daily habits to reduce consumption of water and energy (e.g., [21,27]).
Assessment: Evaluations of the ESD programs were composed of both quantitative (46%) and qualitative research methods (36%), as seen in Figure 4. Among the quantitative research methods, as Figure 5 shows, 43% were quasi-experimental, 7% were experimental, and 50% were neither of them. Program participants were usually asked to answer the questionnaires with Likert scales. The questionnaires included items to identify how participants' knowledge, attitudes and perceptions toward ESD changed. In the assessment of teacher education programs, the participants were additionally asked through Likert-type questions about their opinion, perception and attitudes toward ESD, as well as their conceptualization and approaches to teaching ESD. Sometimes in the questionnaire, there was also an option provided for additional comments [21]. In qualitative research methods, semi-structured interviews and focus group discussions were mainly used. Questions were often centered on participants' awareness of ESD, teacher perspectives, knowledge, and understanding of ESD in addition to beliefs and opinions regarding sustainable development, and how participants were affected by ESD initiatives, experiences or programs. Student diaries, personal reports, observations of classroom interactions, exams, and reflections were also used. Transformative orientation: Nearly half of the ESD programs and initiatives studied (55%) sought to be transformative. Among articles focused on teacher education programs, some of them investigated change in teachers' perceptions of sustainable development or ESD while some others were interested in how teachers' knowledge and sense of moral obligations affect ESD teaching or curriculum development. There was often the assumption that understanding ESD is an evolving process. In general, many of the programs aimed not only to increase student knowledge for sustainable development but also to raise awareness and promote students' behavior change. For instance, some programs aimed to transform students' behaviors to be environmentally friendly, e.g., changing daily habits to reduce consumption of water and energy (e.g., [21,27]).
Assessment: Evaluations of the ESD programs were composed of both quantitative (46%) and qualitative research methods (36%), as seen in Figure 4. Among the quantitative research methods, as Figure 5 shows, 43% were quasi-experimental, 7% were experimental, and 50% were neither of them. Program participants were usually asked to answer the questionnaires with Likert scales. The questionnaires included items to identify how participants' knowledge, attitudes and perceptions toward ESD changed. In the assessment of teacher education programs, the participants were additionally asked through Likert-type questions about their opinion, perception and attitudes toward ESD, as well as their conceptualization and approaches to teaching ESD. Sometimes in the questionnaire, there was also an option provided for additional comments [21]. In qualitative research methods, semi-structured interviews and focus group discussions were mainly used. Questions were often centered on participants' awareness of ESD, teacher perspectives, knowledge, and understanding of ESD in addition to beliefs and opinions regarding sustainable development, and how participants were affected by ESD initiatives, experiences or programs. Student diaries, personal reports, observations of classroom interactions, exams, and reflections were also used.

Strengths and weaknesses of assessment approaches:
The strengths of the experimental research mainly lie in the large sample of program participants in the treatment group. Dividing ESDlearning-outcome scores by several main categories was helpful to understand in which specific areas students increased or decreased their score. When the study assesses various kinds of ESD-related courses and activities without such categorization, relationships between the outcomes and specific course or activity becomes unclear. As for the weakness of the quantitative research, it does not reveal the mechanisms behind this change [29]. For instance, how in-school factors of the program contributed to the ESD learning outcomes was not clear from the survey results. Also, challenges stem from the difficulty of defining outcomes and measuring success. As Sağdıç and Sahin [30] noted, some articles simply claim "success" as an outcome of the program without providing a clear definition of success and while indicators of "success" are not well developed.
As for the strengths of qualitative research, participants' opinions and comments added to the survey greatly enhanced the understanding of why participants scored various competencies in the way they did. Interviews with the participants after the survey would have allowed them to further explain what they wrote in the report. However, the feelings of participants about their experience   Strengths and weaknesses of assessment approaches: The strengths of the experimental research mainly lie in the large sample of program participants in the treatment group. Dividing ESDlearning-outcome scores by several main categories was helpful to understand in which specific areas students increased or decreased their score. When the study assesses various kinds of ESD-related courses and activities without such categorization, relationships between the outcomes and specific course or activity becomes unclear. As for the weakness of the quantitative research, it does not reveal the mechanisms behind this change [29]. For instance, how in-school factors of the program contributed to the ESD learning outcomes was not clear from the survey results. Also, challenges stem from the difficulty of defining outcomes and measuring success. As Sağdıç and Sahin [30] noted, some articles simply claim "success" as an outcome of the program without providing a clear definition of success and while indicators of "success" are not well developed.
As for the strengths of qualitative research, participants' opinions and comments added to the survey greatly enhanced the understanding of why participants scored various competencies in the way they did. Interviews with the participants after the survey would have allowed them to further explain what they wrote in the report. However, the feelings of participants about their experience Strengths and weaknesses of assessment approaches: The strengths of the experimental research mainly lie in the large sample of program participants in the treatment group. Dividing ESD-learning-outcome scores by several main categories was helpful to understand in which specific areas students increased or decreased their score. When the study assesses various kinds of ESD-related courses and activities without such categorization, relationships between the outcomes and specific course or activity becomes unclear. As for the weakness of the quantitative research, it does not reveal the mechanisms behind this change [29]. For instance, how in-school factors of the program contributed to the ESD learning outcomes was not clear from the survey results. Also, challenges stem from the difficulty of defining outcomes and measuring success. As Sagdıç and Sahin [30] noted, some articles simply claim "success" as an outcome of the program without providing a clear definition of success and while indicators of "success" are not well developed.
As for the strengths of qualitative research, participants' opinions and comments added to the survey greatly enhanced the understanding of why participants scored various competencies in the way they did. Interviews with the participants after the survey would have allowed them to further explain what they wrote in the report. However, the feelings of participants about their experience of the ESD program, which were often characterized using the results of semi-structured interviews, would not be enough to evaluate learning outcomes and to prove the effects of a program. For instance, evaluation of long-term impact and behavior change of the participants should be subject to more attention in program evaluation. To identify the long-term effects of ESD, such as sustainable behavior change, longitudinal studies would be helpful, as Brody and Ryu [27] suggest in their article. Whether quantitative or qualitative, self-reports cannot escape from self-bias. Thus, behavioral observation may have been helpful.
Implementation: Both student-focused programs and teacher-focused programs had challenges with implementation. The main factors hampering implementation were teaching methodology and teachers' capacity. For instance, the research of Kieu et al. [31] revealed that teaching methods were one of the main reasons for student negative feedback toward ESD at the university level. Students claimed the courses were boring due to the lack of interactive teaching and learning, excessive content and poor facilities. The authors pointed out that there remains a large gap between education reform to replace top-down teaching by more interactive pedagogies and the current status quo. Some cases suggested that even though teachers attempted to use interactive pedagogies, their efforts have not yet lead to a successful result. For instance, the research of Kaya [32] focused on the case of using an innovative teaching technique (i.e., the six thinking hats technique based on using six different thinking perspectives), but some students still characterized the courses as tiring and boring in their feedback. This case implies that interactive pedagogy remains a challenge among teachers.
In NGO programs that partner with universities, the factors hampering the program were found to be "overly ambitious course schedules, a lack of supervision during the follow-up activities, lengthy lectures and misleading trainee recruitment" [28] (p. 160). For instance, in some programs which did not coordinate their class schedules with those of the university, some students had to miss university classes without permission from their faculty [28].
In other cases, the authors pointed out the selection of language as a problematic aspect when the participants were from different language areas [26]. In the words of Kadis and Avraamidou [26], "even though there were only a few cases of individuals who were not fluent speakers of English, some problems of miscommunication did occur. Language was also a barrier in expressing personal feelings and having more meaningful conversations" [26] (p. 70).
Studies also reported other challenges and constraints which affected the implementation of sustainable development education programs, initiatives, and goals. Pre-service teacher education that prioritizes ESD, ongoing in-service teacher training and support related to the practical use of ESD tools, in addition to teachers' perceptions of the feasibility of implementing ESD into lesson times, may influence the implementation of ESD [24,33]. In one study, the amount of time and school support necessary to integrate new teaching methods and content related to ESD hindered teachers' implementation of an ESD toolkit [33]. Other studies found that the limited duration of ESD projects and programs may also play an important role in the extent to which ESD is integrated into future teaching and learning [24,25,34].
Outcomes: Most of the program evaluations included in the articles demonstrated or claimed positive effects on student/teacher learning related to ESD. Among these, five studies focused on teaching and learning assessment of ESD demonstrated a statistically significant effect [22,27,32,35,36], while three other studies focused on teacher education in the areas of ESD demonstrated the same [29,33,37]. For instance, some of these studies found that ESD courses strongly impacted participants' positive beliefs, norms and attitudes towards sustainable development [29,37]. Participant consciousness of nature, awareness of the need to save natural resources, more sustainable daily practices were also seen as positive outcomes. Core sustainability competencies such as systemic thinking, futures thinking, values thinking, and action-orientation have been also fostered. Teacher education for ESD showed a short-term increase in willingness and moral obligation to engage with the issue of sustainable development and to teach students with scientifically supported and ethically sound knowledge in ESD [29]. Similarly, the evaluation of ESD for students revealed its positive impact on the degree to which students make behavioral choices in a sustainable manner [27].
There were also more specific positive outcomes indicated by some programs, such as understanding of the concept of sustainable development [21] and creating positive feelings about community members and gaining knowledge about the nature of their community [26]. It was interesting to find that a program which succeeded in increasing participants' sustainability-related competencies did not necessarily increase their ESD teaching competencies [28]. In this program, the participants were provided with the opportunity for practical experiences on integrating disaster risk reduction into curricula and creating teaching materials. However, the author explained that this was mainly because the program design was not specifically for pre-service teachers [28]. It is also noteworthy that one assessment suggested that ESD-related competencies were developed differently among university students according to gender, disciplinary, affiliation and age [36]. For instance, as a result of the same ESD course, interpersonal competencies improved more among female students than male students, and normative orientation improved only among younger students [36].
Several studies reported positive learning outcomes from the implementation of ESD and global citizenship programs, especially when teachers are provided with professional development and training, ongoing support, and resources. In studies that focused on the efforts of teaching and learning of ESD, program outcomes indicated positive effects on student learning related to ESD and an increase in teachers' and students' understanding of and sensitivity toward ESD [22,34]. Some studies also reported improved student learning outcomes from teacher education coursework taken at the graduate and undergraduate levels, such as enhanced knowledge of ESD, systems thinking, decision making abilities, self-awareness, critical thinking, civic involvement, global competency and increased personal responsibility to be informed, use voice, and take action on ESD related issues [25,30,38].
Implications: Throughout the literature, there are many useful policy and pedagogical implications for further implementation of ESD. Regarding student-focused programs, many studies implied that teachers play a very important role in influencing future global citizens. As one study suggested, the ESD curriculum, supported by the national government, constitutes a crucial part of the modern education system in Sweden-a successful case. Some literature also suggests that, in order to be successful, implementation of ESD inside the classroom should aim to raise awareness of issues in the context of sustainability and sustainable development and that such awareness and consciousness should ultimately compel and motivate students to act and engage outside the classroom, as global citizens who meaningfully contribute to global issues [22,30]. Regarding teacher-focused programs, many authors emphasized the crucial role of higher education institutions. For instance, Andersson et al. [29] claim that teacher education programs that focus on ESD not only have the potential but also have the responsibility to provide future generations with knowledge and tools on tackling sustainable development issues.
For providing effective ESD, the role of NGOs was also highlighted in some articles. For instance, Kieu and Singer's [28] research in Vietnam revealed that students in a focus group saw NGOs as crucial educators in non-formal ESD. These authors consequently argue in favor of "using higher education such as teacher education to catalyze the necessary norm-based and behavioral changes concerning sustainable development in the next generation" [28] (p. 5147). NGOS can contribute to these efforts by fundraising, research funding, future employment, and reviewing curriculum. It is therefore suggested for universities to be more active in strengthening partnership with NGOs. At the same time, higher education's responsibility is to develop sustainable development curriculum and provide graduate-level courses on sustainable development.
However, transferring teachers' learning of ESD to classroom is another challenge. The study of Qablan et al. [39] shows that even though pre-service teachers' own moral attitudes are affected by the ESD course, they claim not to transfer these norms to students due to their beliefs against indoctrination. Their study implies that teachers felt more comfortable presenting objective facts but not giving students room to reflect or influence their choice [39]. Such teacher orientations represent a clear challenge to the achievement of the SDGs going forward.
At the university level, Zachariou and Valanides [25] suggest that teacher education programs need to be re-cultured to ensure that "future teachers see the benefits and possibilities of ESD" (p. 200). Spahiu and Lindemann-Matthies [33] write that supportive pre-service teacher education and in-service teacher training may increase teachers' confidence in integrating and implementing ESD, and may also increase teachers' capacity to create their own locally relevant ESD program and projects (p. 8062). Finally, Sims and Falkenberg [24] suggest inclusive collaboration with teachers in addition to supportive leadership may help the process of "reorienting teacher education towards sustainability" (p. 12).

Cultural Diversity
Setting: A large body of literature on appreciation of cultural diversity tends to focus on formal education. As Figures 6 and 7 show, while most of the programs were implemented at university level [40][41][42][43][44][45], only a handful of programs were conducted in a traditional classroom, mainly focused on primary schools [46]. 200). Spahiu and Lindemann-Matthies [33] write that supportive pre-service teacher education and in-service teacher training may increase teachers' confidence in integrating and implementing ESD, and may also increase teachers' capacity to create their own locally relevant ESD program and projects (p. 8062). Finally, Sims and Falkenberg [24] suggest inclusive collaboration with teachers in addition to supportive leadership may help the process of "reorienting teacher education towards sustainability" (p. 12).

Cultural Diversity
Setting: A large body of literature on appreciation of cultural diversity tends to focus on formal education. As Figures 6 and 7 show, while most of the programs were implemented at university level [40][41][42][43][44][45], only a handful of programs were conducted in a traditional classroom, mainly focused on primary schools [46].
Geographically, diversity programs tended to be implemented in either North America [40,41,43,44,46], or in Europe [45]. One program was implemented in Germany and the US in partnership between two universities [42], while another focused on the United States and South Africa [47]. Other countries that were noted to have had diversity programs were Ireland, Cyprus, Italy, Poland, and the United Kingdom, among others [48,49].   Objective: The goals and objectives of the programs ranged from immediate to far-reaching impacts on participants' understanding, awareness and attitudes towards diversity, though diversity was defined differently across different studies. With the diversity component infused in the curricula, most programs aimed to promote greater appreciation of cultural diversity on school campus as well as in the wider community [40,41,43].
Kubal et al. [40] refer to internal diversity pertinent to American society. Students should get exposed to different narratives along the lines of race, class, gender, ethnicity, age, sexual orientation, religion, and other social dimensions. Kvam, Considine and Palmeri [41] are concerned about the way students communicate about diversity in and outside of the classroom, since communication is inherently power-laden. Scott and Sims [43] take diversity outside of the classroom. Claiming that Geographically, diversity programs tended to be implemented in either North America [40,41,43,44,46], or in Europe [45]. One program was implemented in Germany and the US in partnership between two universities [42], while another focused on the United States and South Africa [47]. Other countries that were noted to have had diversity programs were Ireland, Cyprus, Italy, Poland, and the United Kingdom, among others [48,49].

Objective:
The goals and objectives of the programs ranged from immediate to far-reaching impacts on participants' understanding, awareness and attitudes towards diversity, though diversity was defined differently across different studies. With the diversity component infused in the curricula, most programs aimed to promote greater appreciation of cultural diversity on school campus as well as in the wider community [40,41,43].
Kubal et al. [40] refer to internal diversity pertinent to American society. Students should get exposed to different narratives along the lines of race, class, gender, ethnicity, age, sexual orientation, religion, and other social dimensions. Kvam, Considine and Palmeri [41] are concerned about the way students communicate about diversity in and outside of the classroom, since communication is inherently power-laden. Scott and Sims [43] take diversity outside of the classroom. Claiming that students lack competences for workforce diversity, they evaluate two impactful programs (Diverse Voices Conference (DVC) and Diverse Student Scholars (DSS) that let students transform their understanding of others from different identities and of workforce diversity.
One concept related to diversity that received significant attention was the idea of global citizenship. Hunt [48] specifically states, "Global citizenship education is expected to be transformative, building knowledge skills, values and attitudes that learners need to be able to contribute to a more inclusive, just and peaceful world" [48] (p. 10). In this vein, Sperandio, Grudzinski-Hall and Stewart-Gambino [44] describe a program that focuses on global citizenship education that provided students with the tools-such as information, experiences and perspectives-necessary to think consciously about their own responsibilities and perspectives, and discussed "global citizenship" in relation to the idea that "today's students will live in a diverse, global, and interconnected world whether they want to or not, whether they necessarily know it or not" [44] (p. 14). As these authors write, this idea of global citizenship leads to a series of important questions that educational programs must consider: What does it mean to ask of any student, regardless of major or intended career paths, that they become a global citizen? What is the difference between being a person who knows about non-U.S. cultures or languages and a global citizen? Is there a specific content, ideological perspective, or set of beliefs that are inherent in a citizen? What do students need in order to be able to determine, for themselves, their own relationship to the world? [44] (p. 14) On the other hand, diversity programs that were implemented among pre-service and in-service teachers were geared to foster the development of intercultural competence [42], to prepare teachers to work effectively with minority and low-income students [46], and to improve knowledge of discrimination by learning about the importance of differential treatment in achieving equal opportunities [45].
Leh, Grau and Guiseppe [42] describe intercultural competence based on Byram's (1997) model of intercultural communicative competence as it relates to "students' perceptions of culture after working as part of a multicultural and diverse team" [42] (p. 99). Intercultural competence includes knowledge about our own and other cultures, attitudes (e.g., curiosity, openness and respect towards other cultures) and skills (e.g., ability to interact and understand individuals from other cultures). With increased diversity in classroom demographics, this type of competence has become important to foster among pre-service teachers.
Youngs [46] and Turnsek [45] talk about diversity that is based on an anti-bias approach among teachers. Both authors argue that teachers are influenced by their beliefs and teaching experiences. Youngs [46] thinks that diversity should be an integral part in preparing teachers to work with students of various cultural backgrounds. Turnsek's [45] type of diversity can be described as proactive, value-based and activist. Teachers should be able to think critically to recognize the inequalities generated by power structures in society. As such, teachers should consciously reflect on their belief systems towards others. As can be seen, there are a variety of approaches to diversity across the studies retained.
Description: Some of the programs aimed to expose and engage students with speakers and groups from diverse cultural backgrounds within the community [40,41,43]. Kubal et al. [40] describe an artist in residency program. By inviting artists as guest speakers, race and ethnicity classes were developed around topics of immigration, assimilation and pluralism, prejudice and discrimination, and movement resistance. Artists addressed some of these issues by describing how they interpret their own ethnic experiences through art. Further, as noted in Scott and Sims's [43] study of University students in the United States and Germany, music, poetry, and dance can bring students, scholars, and the community closer together in a forum that supports human diversity and encourages dialogue on real world diversity issues among the community members.
Other programs utilized educational exchanges as a way of exposing students and teachers to real-world diversity. Somers [49] discusses Suas, an Irish-based INGO, festival that promoted educational opportunities for young people from disadvantaged settings and aimed to inspire, engage and educate young students as global citizens, as well as attempted to foster collaboration on projects focused on social change. Leh, Grau and Guiseppe [42] present a university exchange program where students from the US and Germany learned from each other firsthand through an online discussion of culture and language teaching, as well as in-person during a 21-day trip abroad. This excursion served as an educational field experience with visits to German elementary schools. In addition, the online course was accompanied by reflective tasks, such as surveys, debriefings and reflection papers regarding the relevance of the project experience and the effects of the project on the participant's personal and professional development. Sperandio, Grudzinski-Hall, and Stewart-Gambino [44] talk about the Global Citizenship Program (GCP) that engaged students through practical and experiential learning such as study abroad, summer opportunities to work with NGOs, as well as faculty and student exchanges. The GCP sought to provide students with the tools, such as information, experiences and perspectives necessary to think consciously about their own responsibilities and perspectives as global citizens.
The diversity programs among prospective teachers and in-service teachers focused on, both, independent study of diversity topics as well as experiential learning. For example, Turnsek [45] describes the Antidiscrimination and Diversity training of a total of 120 h. The first part of the program targeted participants' perceptions on the topic, followed by the exploration of written and spoken messages that influenced their thinking about minority groups. In Youngs [46], Teaching for Urban Contexts (TUC) was an experimental program that trained prospective teachers. The TUC program employed several strategies to prepare teachers to work with diverse populations, including readings on teachers who have been successful with minority students, and socio-emotional self-reflection. Some courses provided prospective teachers with strategies for learning about their students' cultural backgrounds and prior learning experiences. Finally, program participants did practical training in schools serving minority, low-income students, accompanied with field experiences in their students' communities. As will be further discussed, the various programs assessed in the literature tend to be experiential in nature and do not seem to lend themselves to easy assessment, nor to easy replicability for the purpose of achieving the SDGs.
Duration: The duration of cultural diversity programs varies greatly, as seen in Figure 8. While some of the programs were implemented over a short period of time, ranging from one day to five weeks [40,42,43], other programs took place over a duration of several months [44][45][46] or even years [50,51]. experiences in their students' communities. As will be further discussed, the various programs assessed in the literature tend to be experiential in nature and do not seem to lend themselves to easy assessment, nor to easy replicability for the purpose of achieving the SDGs.
Duration: The duration of cultural diversity programs varies greatly, as seen in Figure 8. While some of the programs were implemented over a short period of time, ranging from one day to five weeks [40,42,43], other programs took place over a duration of several months [44][45][46] or even years [50,51]. Implementer: Figure 9 shows that the vast majority of the programs were implemented by university professors [40,41,44,45], or by primary school teachers [46], though in some studies the programs were carried out by the researchers themselves [42,43]. Implementer: Figure 9 shows that the vast majority of the programs were implemented by university professors [40,41,44,45], or by primary school teachers [46], though in some studies the programs were carried out by the researchers themselves [42,43].

Duration:
The duration of cultural diversity programs varies greatly, as seen in Figure 8. While some of the programs were implemented over a short period of time, ranging from one day to five weeks [40,42,43], other programs took place over a duration of several months [44][45][46] or even years [50,51]. Implementer: Figure 9 shows that the vast majority of the programs were implemented by university professors [40,41,44,45], or by primary school teachers [46], though in some studies the programs were carried out by the researchers themselves [42,43]. Transformative orientation: While most literature does not focus on the impact of diversity programs, a few studies note the transformative nature that such programs may have on participants. Youngs [46] points out the importance of exposing pre-service teachers to the political and historical context of marginalized groups and the way they have been treated in the educational setting. Scott and Sims [43] talk about the annual Diverse Voices Conference (DVC) that provides higher education students, renowned scholars, professionals and community members an opportunity to speak out and to engage in dialogue on diversity issues. The goal of the student-centered DVC is to expand internationally. This would give an opportunity to students who present and attend the conference to be "more prepared to live, work and play more respectfully and productively in a multicultural world" [43] (p. 114). Likewise, Sperandio, Grudzinski-Hall and Stewart-Gambino [44] state that participants of Lehigh University's Global Citizenship Program, a program that was designed to develop tools to understand global responsibilities and for operating in a global environment, have been equipped with important information, experiences and perspectives that are necessary to take on the role of responsible global citizens.
Some studies focus on raising awareness while other studies specifically state they focused on transforming students' opinions or perspectives. A two-part exercise at West Chester University in Pennsylvania that focused on general value questions and involvement in community civic activities attempted to "raise awareness of the negative influence of oppression and stimulate interest in seeking greater knowledge about other cultures" [52] (p. 160). Nagengast [50] notes that in a study abroad program from Juniata College in Huntington, Pennsylvania, the courses were designed to change student perspectives and self-awareness with regards to intercultural sensitivity by focusing on the professor's impact on student attitudes on openness to diversity and global citizenship. Castellanos and Cole [53] argue that exposing students to diversity is not enough and that course content should strive to strengthen the students' community and to promote civic engagement. In a festival's learning events that mainly focused on workshops to examine how to critically examine media narratives and debates with a media focus [49], the goals were to help equip students with the skills and understanding of engaging with social media and global development issues by focusing on understanding different diverse stories that exist in the media narrative of the European "refugee crisis." Thus, many diversity-focused programs have a transformative orientation in that they seek to encourage students' development when it comes to their sensitivity to, understanding of, and action on diversity-related issues.
Assessment: Figure 10 shows that both quantitative and qualitative assessment tools were used to understand the associated outcomes to the appreciation of cultural diversity. Qualitative data mostly revolved around curriculum analysis, in-depth interviews, focus groups and observations with students, teachers or education professionals to capture participants experience and/or practice [41,46,54]. Other qualitative methods were used to capture participants' learning experience. For instance, in Kvam et al. [41] examination of self-reflective essays and focus-group interviews on university students' perceptions of diversity-focused learning, focus groups served as an opportunity for students to compare and contrast one another's perspectives. Additionally, students wrote self-reflective essays about each of the seven expected learning outcomes for their capstone portfolios for graduation. These essays, together with focus-group interviews conducted with current and former students, comprised the corpus of data for this study. In Youngs [46], teachers were asked to describe their backgrounds, motives for the teaching profession, and their primary goals as teachers in order to determine the impact of diversity programs. In Ball's [47] cross-national study that aimed to address how teachers can become effective when teaching students that are culturally and linguistically diverse, over one hundred oral and written texts were collected from American and South African teachers, including transcripts of classroom discussions, journal entries and reflection papers. Quantitative data (see Figure 11) were often collected in the form of Likert scale surveys [40][41][42][43]45,47] and reflected both student opinions and educator opinions about different programs. These surveys were administered both before and/or after program implementation. Leh, Grau and Guiseppe's [42] study focused on pre-service teachers' intercultural competence and understanding of diversity with American and German undergraduate students, with students from an American university completing open-ended pre-trip surveys. Students completed post-trip open-ended surveys and written reflections that aimed to assess cultural preconceptions and the effectiveness of the project methods regarding the use of technology to facilitate multicultural connections. In Turnsek's study [45] about antidiscrimination and diversity training (ADT) among prospective teachers a pre-program survey was administered to experimental and control groups to determine which independent variables are predictors of the respondents' knowledge and beliefs. The ADT was based on an anti-bias approach. To recognize the inequalities in the society, teachers should consciously reflect on their belief systems towards others. Out of all of the studies examined, there were only a few available quantitative studies that were longitudinal in nature (i.e., that gathered data over multiple years) and which included more than a few participants who had engaged in diversity programs [40,43]. Quantitative data (see Figure 11) were often collected in the form of Likert scale surveys [40][41][42][43]45,47] and reflected both student opinions and educator opinions about different programs. These surveys were administered both before and/or after program implementation. Leh, Grau and Guiseppe's [42] study focused on pre-service teachers' intercultural competence and understanding of diversity with American and German undergraduate students, with students from an American university completing open-ended pre-trip surveys. Students completed post-trip open-ended surveys and written reflections that aimed to assess cultural preconceptions and the effectiveness of the project methods regarding the use of technology to facilitate multicultural connections. In Turnsek's study [45] about antidiscrimination and diversity training (ADT) among prospective teachers a pre-program survey was administered to experimental and control groups to determine which independent variables are predictors of the respondents' knowledge and beliefs. The ADT was based on an anti-bias approach. To recognize the inequalities in the society, teachers should consciously reflect on their belief systems towards others. Out of all of the studies examined, there were only a few available quantitative studies that were longitudinal in nature (i.e., that gathered data over multiple years) and which included more than a few participants who had engaged in diversity programs [40,43].
consciously reflect on their belief systems towards others. Out of all of the studies examined, there were only a few available quantitative studies that were longitudinal in nature (i.e., that gathered data over multiple years) and which included more than a few participants who had engaged in diversity programs [40,43]. Strengths and weaknesses of the assessment approaches: As it is commonly found, self-assessment instruments may not necessarily report actual student or teacher characteristics. Also, it is important to look at the characteristics of selected participants that may limit the results of the program evaluation. For example, in Youngs [46], all four of the teacher participants in the study were white women, and all were in their first or second year of teaching in schools that serve significant numbers of minority, low-income students. Thus, this study and information may be quite narrow and not generalizable with regards to diversity or global citizenship education. For programs that administered surveys over multiple years [40,43], the strength of the instrument lies in the longevity.
Unlike Turnsek's [45] study that utilized a pre-program survey to determine which independent variables are predictors of the respondents' knowledge and beliefs, most programs did not have information on pre-exposure to diversity issues. As noted in Kubal et al. [40], when studying pre-service teachers, "future research should examine the effects of academic major, pre-exposure, and speaker upon student learning outcomes" [40] (p. 452).
Some studies suffered from a small sample size and thus the results cannot be generalized to the larger teacher or student population. Other times, the tool that was used for the study was not formally evaluated. Often times, studies did not include a description or a link to the survey questions, so it is difficult to determine the validity of the study. Some studies did not explicitly describe the conditions that the surveys were taken under. Some studies gave surveys that were even taken remotely [48], while other studies did not offer any demographic information on the students or teachers they were examining [50]. All of these factors make it difficult to judge the assessment tool.
Bias in multiple forms is also an issue that should be examined. In some cases, specific programs, like UNICEF UK's Rights Respecting Schools Award (RRSA), were heavily promoted in the article by the authors [51]. Self-reporting on a topic like "empathy" may bias the participants to say what they think they are supposed to say [55]. Geographically, the overwhelming majority of studies came from countries in Europe and in North America, and there was almost zero representation of Asian, African, South American or Middle Eastern countries. This ultimately led to a very specific idea of "diversity" and "global citizenship education" that mainly revolves around "Western" notions of diversity and what it means to be a "global citizen." In addition, those who take part in such programs may be more open-minded and therefore inclined to learn about such issues, which may make it hard to interpret the effect of these programs that try to change perceptions and attitudes of a wider community [44,51]. These issues should be taken into account when designing and assessing diversity programs for students around the world.

Implementation:
The implementation of diversity programs in formal education is often either incorporated in the existing curriculum, or offered in the form of trainings and workshops that may be accessible only to a small number of participants. The limitations of diversity program implementation may include financial constraints, political stance and leadership initiatives that may or may not recognize the importance of such programs in formal education. For example, Sperandio, Grudzinski-Hall and Stewart-Gambino [44] argue that a lot of the work for such programs is political, and is made harder when faculty are asked to debate, design and implement assessments for a new diversity-focused program. This is because, when defining outcomes, it is impossible for the faculty committee to not have a stance in and toward the world.
Elsewhere, Kvam et al. [41] stress the importance of including students in the disciplinary dialogue rather than trying to resolve the diversity debate for the students. Findings show that the term diversity had multiple meanings within and across student essays and focus-group interviews, however, it primarily evolved around race and ethnicity. The meanings that students ascribed to diversity were all rooted in white privilege. Thus, students should be part of the exploratory process of uncovering what diversity truly means.
Some authors question whether diversity programs go far enough in their design and focus. With regards to study abroad programs and the increase in respect for local cultures, Nagengast [50] surmises that intense, immersive study abroad experience can make students tolerant or respectful of any and all cultural practices. Separately, Somers [49] critiques diversity programs where students have to watch films, noting, "To watch and listen is not enough. To like and share on social media is not enough. We have to empower students to take action, and that action may start with the simple act of meeting someone new in person or seeking out authentic new voices online." [49] (p. 135). For one study that was focused solely on empathy, the author notes that empathy is not the exclusive property of the interculturalists and laments the lack of focus on social justice, empowerment and peace education in the program [55].
Outcomes: The documented outcomes of diversity programs were often noted as positive with regards to the importance of incorporating a diversity component into curriculum, as well as part of the pre-service teacher trainings to facilitate a deeper understanding of cultural diversity in education. Both students and teachers who partook in educational programs that incorporated appreciation of cultural diversity showed an increase in participants' attitudes, awareness and understanding of the importance of multiculturalism and diversity. Several programs improved participants' understanding and ability to identify discrimination as the program caused them to examine their own cultural biases [40,45,46]. It should be noted, however, that only 20% of the studies claimed statistically significant outcomes [45,48,53,55], of which only the study by Castellanos and Cole [53] applied experimental methods.
It is also possible to highlight individual studies. Although many programs evaluated are interesting, some seem more relevant in their potential to inform strategies to achieve and evaluate SDG4. 7. These examples include the following:

•
A study done by Bell [56] to evaluate the longitudinal effects of an in-school program addressing cultural diversity on the self-perception of student teachers with regards to their interpersonal competency found that there were large gains in teacher-student relationships, which focused on trustworthiness, comfort level, stereotypes of minority populations, and multiculturalism.

•
O'Neal's study [52] focused on helping students to appreciate diversity and to gain an understanding of multiple cultural backgrounds and found that the program self-report evaluation consistently showed high sensitivity to diversity.

•
Castellanos and Cole [53] studied how diversity course content influenced students' civic engagement and found that diversity courses that emphasize multicultural competence positively influence students regardless of race, and courses that emphasized society equity had a great impact on students of color.
• Sebba and Robinson [51] found that the UNICEF UK's Rights Respecting Schools Award (RRSA) had a profound impact on the majority of the schools involved in the program and that children in the program are more likely to acquire identities as active citizens and that students developed more positive attitudes toward diversity.

•
Carrell's study [55] noted that when cultural diversity was added to the communication curriculum, there was a strong impact on students' empathy. • Kvam et al. [41] found that most university students described diversity as something that they bring to communicative encounters, rather than as socially constructed within complex interactions embedded within social systems. For example, students described the need to open one's eyes/mind to different experiences, move outside of one's comfort zone, find common ground, avoid offensive behaviors, and confront and challenge assumptions. Students saw finding common ground as a skill for navigating diverse interactions, particularly those enacted in public-speaking contexts. • Leh et al. [42]: Pre-and post-surveys and reflective essays indicated that that online intercultural exchange reduced concerns before meeting face-to-face and the process successfully facilitated a deeper understanding of cultural diversity in education. Overall, results from both sites indicated greater comfort when initially meeting their culturally diverse peers, which were reflected in survey data and video data.
Although it is often noted that diversity programs were rather successful overall, this success may vary among participants [40,45]. Variables such as age, gender, and discipline were key points of variation. For instance, Kubal et al. [40] found that older undergraduate students found the program more valuable than younger students. Castellanos and Cole [53] found that diversity courses that emphasized multicultural competence positively influenced students, regardless of race. However, courses that focused on societal equity had a greater impact on students of color.
Implications: The implications of these studies highlight the importance of incorporating diversity in school curriculum and teacher training by educating pre-and in-service teachers and students about diversity in order to ensure the development of tolerance and empathy, and to reduce prejudice and discrimination, among others [55]. The objectives of diversity programs should be clearly defined and need to provide students with the tools (e.g., information, experiences, and different perspectives) that will make them think about their own responsibilities in relation to issues in the world as global citizens. Sperandio, Grudzinski-Hall and Stewart-Gambino [44] argue that it is crucial to differentiate one's knowledge of cultural diversity from being knowledgeable about the world and act as a global citizen. As such, the objectives of diversity programs in educational institutions should incorporate the three core dimensions of the GCED as defined the United Nations-meaning the cognitive, socio-emotional, and behavioral dimensions-in order to encourage the acquisition of skills, values, attitudes and behaviors. In this literature review, most studies fail to do so. Some of the studies focused on raising awareness while other studies specifically state they focused on transforming students' opinions or perspectives.
Pre-service teaching programs ought to prepare teachers to work effectively with culturally diverse students in an ever more diverse classrooms across the world [42,45,46]. Teachers' work is complex and difficult, and the studies examined show that teachers look for and desire the space and understanding to work well with a diverse group of students in an international classroom. Programs that help teachers understand the needs and issues that minority students face ultimately help teachers create a successful plan for students that may historically be pushed to the margins by the curriculum or the make-up of the classroom. Educational leaders that create both formal and informal opportunities for multicultural experiences, as well as teachers that have a grasp of basic educational issues and teaching styles, can spend energy focusing on attitudes, perceptions, and behaviors that will encourage a more diverse educational environment and one that focuses on creating global citizens.
In addition, faculty-driven diversity initiatives may prove effective at accomplishing workforce diversity, student engagement and transformative learning [43]. Exchanges between universities from different cultural backgrounds and countries help students to be more open-minded and more aware of a diverse set of issues and beliefs. Study abroad programs, that involve physical travel, also prove beneficial as students work toward understanding diversity and global citizenship. Training courses, either over a summer, during a semester or even over the course of a few years, were helpful for students to become more aware of the importance of diversity. Including students in the process of creating and implementing diversity programs is another way that students could be encouraged to grow with regards to diversity and global citizenship. These programs could be funded by educational institutions that are hoping to create a more diverse environment for learners of all backgrounds, and a more aware and conscious global citizen.

Peace Education
Setting: A large body of literature on peace education focuses on formal education. As seen in Figure 12, most of the programs were conducted in a traditional classroom setting, some at primary school [57][58][59], and others at secondary school level [60,61]. As shown in Figure 13, fewer programs were implemented at university level [62] or as an after-school activity in middle school and high school [63].
Geographically, more than half of the publications on peace education programs have origins in Europe [58,59,61,62] or in the US [57,63], while one assessment was implemented in Africa [60]. Geographically, more than half of the publications on peace education programs have origins in Europe [58,59,61,62] or in the US [57,63], while one assessment was implemented in Africa [60].  Objective: The objectives of peace education programs ranged from immediate to far-reaching impacts on participants' understanding of, awareness of and attitudes towards peace-building. However, peace education is defined in different terms depending on the national or regional   Objective: The objectives of peace education programs ranged from immediate to far-reaching impacts on participants' understanding of, awareness of and attitudes towards peace-building. However, peace education is defined in different terms depending on the national or regional contexts. The programs may pertain to either individual (e.g. [58,63]) or societal level (e.g. [60][61][62]). While some programs aimed to promote greater awareness, empathy and compassion among students [57,59] or among pre-service teachers [58,62], others were based on a peacekeeping strategy, which includes violence prevention and conflict resolution among youth for peaceful coexistence [60,61,63].
Sağkal et al. [61] discuss establishing and maintaining peace in schools in three different ways: • Peacekeeping strategy: to end conflicts and violence through strength through power, pressure and discipline, • Peacemaking strategy: to resolve conflicts in a constructive manner through conflict resolution and negotiation skills, and • Peacebuilding strategy: to build a culture of peace through transforming the culture of violence into a culture of peace [61]. Objective: The objectives of peace education programs ranged from immediate to far-reaching impacts on participants' understanding of, awareness of and attitudes towards peace-building. However, peace education is defined in different terms depending on the national or regional contexts. The programs may pertain to either individual (e.g., [58,63]) or societal level (e.g., [60][61][62]). While some programs aimed to promote greater awareness, empathy and compassion among students [57,59] or among pre-service teachers [58,62], others were based on a peacekeeping strategy, which includes violence prevention and conflict resolution among youth for peaceful coexistence [60,61,63].
Sagkal et al. [61] discuss establishing and maintaining peace in schools in three different ways: • Peacekeeping strategy: to end conflicts and violence through strength through power, pressure and discipline, • Peacemaking strategy: to resolve conflicts in a constructive manner through conflict resolution and negotiation skills, and • Peacebuilding strategy: to build a culture of peace through transforming the culture of violence into a culture of peace [61].
Sagkal et al. [61] argue that educators should not just focus on the peacekeeping, but rather utilize peacemaking and peacebuilding strategies which are more likely to lead to long-lasting peace.
In regard to pre-service teacher training, Coşkun [62] sees peace education as a means of peaceful co-existence among natives and refugees. It is educational institutions' responsibility to train teachers to contribute to societal peace by integrating peace education into the education system and the national curricula.
Description: Most of the programs aimed to expose students to the nature of peace and violence, and to teach youth the skills to resolve problems peacefully [59,61,63]. In Powers, Price-Johnson and Creative Research Associates [63], the In Pursuit of Peace Curriculum was designed to promote peace on the individual level. The main focus was to increase peaceful attitudes that would lead to violence prevention. In Baker [57], the objective of the World Peace game played by primary school children is to develop greater awareness of and a capacity for caring for others by building empathy and compassion. By being exposed to real-life problems, this game may prepare children for peaceful resolution of major world problems in the future. Basaran and Karakurt [58] argue that peace education is crucial to develop attitudes, skills and values among primary school students. Students will learn to develop violence handling skills. To achieve this, teachers need to be trained properly as they are normally the ones who deliver peace education to young learners in the classroom. Peace education in the context of Nigeria refers to peaceful and non-violent co-existence in the society as a whole. To tackle the ongoing political, religious and tribal tensions and violent conflicts, Olowo [60] proposes peace education as part of the national curriculum, to include "training on the avoidance and management of violence, conflict, better human relationships, unity and internal cooperation among various tribes" [60] (p. 9). Learning about peacebuilding can also be fun. For example, primary schoolers in Baker's article [57] tackle complex global issues in the World Peace game that has these young students use communication and collaborative skills with fellow students and teachers. Building trust among players is another important component of this game.
Sagkal et al. [59] describe a 24-class-hour peace education program among students in two middle schools in Izmir, Turkey. The program was applied on the experimental group two hours per week and took 12 weeks total. The program was divided in four parts: • Understanding the nature of peace and violence (7 h), • Elements that prevent and support peace (5 h), • Fundamental skills for a peaceful individual (7 h), and • Negotiation as a conflict resolution method (5 h).
The peace education program included techniques, such as discussion, pair and group work, and role-playing.
Several articles highlight the importance of promoting a culture of peace and non-violence in teacher education [15,58,60,62]. Basaran and Karakurt [58] describe the In-Service Training Program with the Peace Education for Primary Teachers (BEHEP) among primary school teachers who participated in practical activities on the topics of peace and violence, media and peace, human rights, and decision and conflict approaches in teaching based on cooperation. By utilizing experimental learning, BEHEP was teaching strategies, methods and techniques in course planning. The activities included discovery strategy, cooperative teaching, discussion, case study and dramatization. BEHEP was an education process encouraging active participation, based on active and experimental learning. Activities were based on cooperation in small groups of two, three or four teachers.
Coşkun [62] provides an example of a lesson plan that integrates peace education by utilizing the objective writing matrix. In planning daily lessons, teacher candidates should identify the objectives of each activity (at least five targets). The objectives should cover cognitive, affective and psychomotor domains.
Duration: The duration of peace education programs varies greatly, seen in Figure 14. While some of the programs were implemented over a short period of time, ranging from one day to several weeks [57][58][59]61], other programs took place for several months [63], or have established a tradition to be implemented for a number of years [62]. weeks [57][58][59]61], other programs took place for several months [63], or have established a tradition to be implemented for a number of years [62]. Implementer. While the majority of the peace education programs were implemented by classroom teachers [57,58,61], some of the programs were carried out by national research agencies or ministries of education [59,62] as seen in Figure 15. Implementer. While the majority of the peace education programs were implemented by classroom teachers [57,58,61], some of the programs were carried out by national research agencies or ministries of education [59,62] as seen in Figure 15. Implementer. While the majority of the peace education programs were implemented by classroom teachers [57,58,61], some of the programs were carried out by national research agencies or ministries of education [59,62] as seen in Figure 15. Transformative orientation: By promoting tolerance and empathy through educational practices and curriculum, peace education may contribute greatly to tranquility and peace in society. Most literature addresses the transformative effects that peace education may have on students and/or pre-and in-service teachers. Baker [57] states that through The World Peace game, students learn deeply and the newly acquired knowledge may be long-lasting. As a result, students may feel more motivated to save the world.
Some of the tools and measurement efforts sought to be transformative by suggesting individual actions and collective ways of engaging to address challenges faced by youth and adults as a result of conflict [61,62]. Because of its transformative potential, peace education, peace-building and reconciliation should be of particular interest to countries that deal with the refugee crisis or other types of conflict and violence [62,63].
Assessment: In most publications, quantitative and qualitative assessment tools were used to understand the associated outcomes of peace education and non-violence programs, seen in Figure   Figure 15. The Implementers of Peace Education Programs.
Transformative orientation: By promoting tolerance and empathy through educational practices and curriculum, peace education may contribute greatly to tranquility and peace in society. Most literature addresses the transformative effects that peace education may have on students and/or preand in-service teachers. Baker [57] states that through The World Peace game, students learn deeply and the newly acquired knowledge may be long-lasting. As a result, students may feel more motivated to save the world.
Some of the tools and measurement efforts sought to be transformative by suggesting individual actions and collective ways of engaging to address challenges faced by youth and adults as a result of conflict [61,62]. Because of its transformative potential, peace education, peace-building and reconciliation should be of particular interest to countries that deal with the refugee crisis or other types of conflict and violence [62,63].
Assessment: In most publications, quantitative and qualitative assessment tools were used to understand the associated outcomes of peace education and non-violence programs, seen in Figure 16. The vast majority of the programs utilized qualitative assessment in the form of interviews, lesson plan and document reviews, among others [58,61,62]. For example, Basaran and Karakurt [58] describe The In-Service Training Program with the theme of Peace Education for Primary Teachers (BEHEP) that utilized five different data collection tools: • Interviews: 14-item semi-structured interviews among teachers were conducted; • Peace Education concept forms: seven concept forms were developed and evaluated with three field experts to assess the influence of BEHEP implementation on conceptual knowledge of participants; • Course plans: participants developed a course plan that would reflect their learning and their conceptual comprehension levels; • Participant diaries: participants noted their views at the end of each training day; • In-service training evaluation scales: BEHEP Evaluation Scale with two dimensions and 28 items on a five-point Likert scale examined participants' views of teaching process and acquisitions and their views of organization design. Quantitative data (see Figure 17) were collected in the form of Likert scale surveys [59,60]. For example, Olowo [60] administered a 20-item questionnaire on a four-point scale to examine teachers' opinions on integrating peace education into Nigerian educational system in order to reduce violence and crime. Some of the programs administered pre-and post-surveys among program participants [61,63]. Sağkal et al.'s [61] study utilized mixed methods to examine the experiences of students who underwent peace education training. First, an aggression-focused pre-and post-test was conducted using a quasi-experimental design among 156 students in experimental and 106 students in control group. Second, 20 semi-structured interviews were conducted to better understand the students' experiences related to peace education. Quantitative data (see Figure 17) were collected in the form of Likert scale surveys [59,60]. For example, Olowo [60] administered a 20-item questionnaire on a four-point scale to examine teachers' opinions on integrating peace education into Nigerian educational system in order to reduce violence and crime. Some of the programs administered pre-and post-surveys among program participants [61,63]. Sagkal et al.'s [61] study utilized mixed methods to examine the experiences of students who underwent peace education training. First, an aggression-focused pre-and post-test was conducted using a quasi-experimental design among 156 students in experimental and 106 students in control group. Second, 20 semi-structured interviews were conducted to better understand the students' experiences related to peace education. Availability. The research instruments that were used to assess these programs are largely unavailable. Olowo's [60] study on the effects of integrating peace education in the Nigerian education system includes the questionnaire with 50 items on a 5-point Likert scale. Similarly, the study by Powers, Price-Johnson and Creative Research Associates [63] provides the pre-and postsurvey questions. The 3-point Likert scale survey ("agree", "undecided" and "disagree") covered eight main areas (eight items) of the program's impact on youth. Students could also comment in the open-ended written section of the survey.
Strengths and weaknesses of the assessment approaches: Often times, the assessment of peace education is limited to the implementation in selected schools or to specific areas and populations that are prone to conflict and/or violence. For example, Basaran and Karakurt [58] argue that the Availability. The research instruments that were used to assess these programs are largely unavailable. Olowo's [60] study on the effects of integrating peace education in the Nigerian education system includes the questionnaire with 50 items on a 5-point Likert scale. Similarly, the study by Powers, Price-Johnson and Creative Research Associates [63] provides the pre-and post-survey questions. The 3-point Likert scale survey ("agree", "undecided" and "disagree") covered eight main areas (eight items) of the program's impact on youth. Students could also comment in the open-ended written section of the survey.
Strengths and weaknesses of the assessment approaches: Often times, the assessment of peace education is limited to the implementation in selected schools or to specific areas and populations that are prone to conflict and/or violence. For example, Basaran and Karakurt [58] argue that the primary school selected for their study is known for a large migrant student population, which often times causes increased possibility of conflicts. Thus, the results of this study should be interpreted with caution. Similarly, Sagkal, Türnüklü and Totan [61] note that program implementation and assessment is limited to two middle schools located in a lower socio-economic region in the city of Izmir in Turkey. As with all studies, any extrapolations should be made with caution.
Outcomes: The documented outcomes of peace education and non-violence programs were often noted as positive with regards to the importance of incorporating peace education into curriculum or teacher training. Some of the major outcomes include positive changes to students' knowledge, skills, values, and socio-emotional engagement or orientation (i.e., empathy, compassion and tolerance) with other groups [57][58][59]. However, it should be noted that only two (or 29%) of the studies claimed statistically significant outcomes [59,61], and both of them applied quasi-experimental methods. One of the two studies [59] shows that a 24-class-hour peace education program among middle schoolers was effective in increasing students' empathy levels. While the other quasi-experimental study is further discussed below, we note here that both studies claim that children can be taught to be peaceful if peace education is a specific issue of focus in the classroom.
Peace education may also lead to a positive change in behaviors-including better problem-solving skills, better anger management skills, and better communication skills, among others. This may contribute to successful conflict resolution among youth [61,63]. In the African context, Olowo [60] found that peace education is overwhelmingly desired among secondary school teachers, where it can be integrated in the existing social studies curriculum. Teachers felt that incorporating peace education in the curriculum would contribute to peace and stability in the region. Based on the findings, Olowo [60] recommends that teacher training should be of vital importance to effectively teach peace education.
Despite predominantly positive outcomes following the assessment of peace education programs in various educational settings, small sample sizes at selected schools do not provide sufficient information to conclude what the quality of teaching is, and what some of the long-lasting impacts may be. Implementation: Since peace education is usually not a stand-alone course, but is rather incorporated in the existing curricula, Baker [57] argues that teachers may be hindered when it comes to implementation due to time restrictions and its less important status in the curriculum.
Olowo [60] stresses the importance of peace education and training in African countries that are still largely affected by conflict and violence. However, there is still insufficient capacity of teachers with expertise to support peace and non-violence subject areas. Educational institutions, national governments and NGOs should support conflict-sensitive teacher education in particular by creating guidelines to improve the teacher recruitment practices.
Implications: There are multiple implications that flow from the literature on peace education. First, these studies suggest the importance of incorporating peace education in school curriculum and teacher training. It is commonly agreed upon that peace education is not a stand-alone course, and thus, it needs to be incorporated into various curricula. However, peace education is often defined in broad terms that lack clear objectives. As recommended by Coşkun [62], teachers should be trained better to develop lesson plans that integrate peace education in daily lessons across various disciplines.
The objectives of each activity should be clearly defined, to include specific targets that need to be met. One goal of education across all subject disciplines should be to promote peace. To achieve that, Baker [57] suggests exposing youth to global issues early on, in primary school, as children tend to be more creative, curious and have a "can-do" attitude. This falls within the cognitive dimension of GCED that encourages the acquisition of knowledge about global issues, and the acquisition of skills on how to resolve these issues peacefully. Experimental learning is suggested to have students work collaboratively in small groups.
Second, peace education may contribute to the socio-emotional development of youth [61], which is the second dimension of GCED that fosters a sense of belonging, values, responsibilities, solidarity, tolerance, and respect for diversity. Cooperative learning and discussion would be some of the recommended strategies to use in a classroom.
Third, peace education must include the behavioral component that will prepare teachers and students to act responsibly towards a peaceful coexistence. Positive change in behavior may include better problem-solving skills, better anger management skills, better communication, and negotiation skills among others.
Finally, educators should be trained to use a peacebuilding strategy that, on a larger scale, aims to transform the culture of violence into a culture of peace. Basaran and Karakurt [58] claim that all primary school teachers should undergo peace education training which would enhance proficiencies and competencies of pre-and in-service teachers. This would contribute to the development of a social peace culture. This is particularly important in conflict regions where teachers do not possess sufficient capacity and expertise on the topics of peace and violence.

Human Rights
Setting: All of the literature examined in this study focused on programs that were implemented in formal education settings: two of the programs at the university level [50,64], one at the high school level [65], one at the middle school level [66] and two at the elementary school level [67,68], refer to Figures 18 and 19. Geographically the studies and reports originated mostly in Europe, though there were two in Turkey [66,67] and two others in Norway and Cyprus [65,68]. Two articles focus on programs in the United States [50], with one being a cross-cultural endeavor between universities in the U.S. and Mexico [64].     Objective: The objectives in most of the literature focused on human rights had a strong component of student or instructor self-awareness. The common thread running through these reports and programs was that greater self-awareness as a member of a global community would correlate to positive attitudes towards human rights [65]. Three of the articles focused on student awareness [64][65][66].
The literature also sought to identify the challenges teachers confront when attempting to use curriculum that addresses human rights and global citizenship. Teachers identified challenges in the form of comfort with the topic, knowledge around the subject of human rights, a lack of training in the content area and possible uncomfortable discussions with students [67,68]. It is interesting that the studies in Turkey and Cyprus were not focused on a specific program. The focus was on how human rights curriculum was being taught in the classroom and how teachers' understanding, perceptions and preparation impacted their ability to teach the curriculum.
The three remaining studies were program specific and occurred in the United States and Norway. In Norway, the objective was to determine students' adherence to International Objective: The objectives in most of the literature focused on human rights had a strong component of student or instructor self-awareness. The common thread running through these reports and programs was that greater self-awareness as a member of a global community would correlate to positive attitudes towards human rights [65]. Three of the articles focused on student awareness [64][65][66].
The literature also sought to identify the challenges teachers confront when attempting to use curriculum that addresses human rights and global citizenship. Teachers identified challenges in the form of comfort with the topic, knowledge around the subject of human rights, a lack of training in the content area and possible uncomfortable discussions with students [67,68]. It is interesting that the studies in Turkey and Cyprus were not focused on a specific program. The focus was on how human rights curriculum was being taught in the classroom and how teachers' understanding, perceptions and preparation impacted their ability to teach the curriculum.
The three remaining studies were program specific and occurred in the United States and Norway. In Norway, the objective was to determine students' adherence to International Baccalaureate Organization (IBO) human rights ideals. This was determined by measuring students' human rights competence, defined "as the ability to act in a way that promotes the inalienable rights and inherent dignity of all people regardless of ethnicity" [65] (p. 393). In a study conducted at Juanita College in Pennsylvania, short-term study abroad programs were evaluated for their link to student attitudes toward human rights. The researcher identified the tension between an ethnocentric versus ethnorelativist perspective in teaching human rights [50]. The third program investigated a human rights and disability course for social work and law students in the U.S. and Mexico. The objective was to determine the impact of a program that applied the perspectives of two fields, social work and the law, within two countries, on the understanding of the human rights of persons with disabilities [64].
Critelli et al. [64] describe a college level course, Human Rights and Disability from the Lens of Law and Social Work, that was designed with an interdisciplinary, multi-cultural approach to disability and a human rights framework. Human rights laws, treaties, and case studies were used to understand and examine the status of disability rights in Mexico and the U.S. Students worked with each other using technology to bridge the distance between New York and Mexico City.
Nagengast [50] investigates the impact of short-term study abroad programs on student attitudes towards human rights. Students from three university courses are included in the study. The control group had no study abroad component included. Student travel included trips to Gambia and Vietnam. Student intercultural sensitivity was measured to determine if they became more ethnocentric or ethnorelativist.
Transformative orientation: Human rights education can be presented in many ways and through a variety of topics. Overall, the programs are intended to be transformative for the individuals that participated in the studies by exposing teachers and students to new perspectives and experiences. Courses and curricula that include human rights topics can cause participants to think critically about social justice issues both in their community and globally [64,66]. The cross cultural, multidisciplinary course examined by Critelli, Lewis and Méndez-López [64] "facilitated greater knowledge of human rights principals and instruments, added to competence in applying a human tights framework to disability-related issues, and enhanced [students'] understanding of ways to integrate a human rights framework in social work practice" [64] (p. 133).
Duration: The duration of the programs varied widely. They ranged from as short as a single lesson [66] to as long as several years [50,68].

Strengths and weaknesses of the assessment approaches:
In most studies, quantitative and qualitative assessment tools were used to assess human rights education programs, as seen in Figure  21. Assessment tools typically were not available and in many cases the literature did not reference a specific assessment of student learning, program outcome or teacher effectiveness. However, the tools used for assessing human rights education that were described included surveys, student papers, and interviews [50,67].

Strengths and weaknesses of the assessment approaches:
In most studies, quantitative and qualitative assessment tools were used to assess human rights education programs, as seen in Figure 21. Assessment tools typically were not available and in many cases the literature did not reference a specific assessment of student learning, program outcome or teacher effectiveness. However, the tools used for assessing human rights education that were described included surveys, student papers, and interviews [50,67].

Strengths and weaknesses of the assessment approaches:
In most studies, quantitative and qualitative assessment tools were used to assess human rights education programs, as seen in Figure  21. Assessment tools typically were not available and in many cases the literature did not reference a specific assessment of student learning, program outcome or teacher effectiveness. However, the tools used for assessing human rights education that were described included surveys, student papers, and interviews [50,67]. Nagengast [50] provided the eight survey questions he developed to assess the learning outcomes of his courses. A five-point Likert scale was used to determine an overall change score between the pre-and post-trip survey for each student. A small number indicated a shift towards relativism while a large number meant a shift towards universalism. Open-ended, five-page essays were also used to further understand the quantitative data gathered from the survey. This study should be viewed cautiously; with an understanding that it applies to these courses alone. The Nagengast [50] provided the eight survey questions he developed to assess the learning outcomes of his courses. A five-point Likert scale was used to determine an overall change score between the pre-and post-trip survey for each student. A small number indicated a shift towards relativism while a large number meant a shift towards universalism. Open-ended, five-page essays were also used to further understand the quantitative data gathered from the survey. This study should be viewed cautiously; with an understanding that it applies to these courses alone. The differences in results between the courses that traveled to Gambia and students that traveled to Vietnam suggest that the design of the overall course may have more impact on learning outcomes than including a study abroad component [50]. In this case the key difference in the travel portion of course design was much more student/native population interaction in Gambia than in Vietnam.
The study of students in the IBO program in Norway used several surveys to measure ethno-cultural empathy, identification with all humanity, and human rights attitudes and intentions. While the surveys were not provided for reference, citations to publications were [65]. The author acknowledged that the three-component model and survey presented were "designed specifically to measure student competence in the context of the IBO and therefore the findings cannot be generalized beyond this context" [65] (p. 400).
Critelli et al. [64] provide survey items used to evaluate the U.S./Mexico university course, but do not provide open eneded questions used to further illuminate survey results. While the data collected provides insight into the method of human rights edcuation, the small sample size, and self-reporting of learning limit the understanding to this course alone.
Outcomes and Implementation: Two outcomes became apparent in the literature: (a) increased self-awareness correlated with increased awareness of others and (b) teacher confidence correlated with how the topic of human rights may impact student learning. The study by Parish [65] was the only study that presented statistically significant findings (see Figure 22). The first statistically significant findings presented was that an "identification with all humanity has a greater effect on the intention to act than either identification with those in [the] community or [the] nation" [65] (p. 398). Second, there was a strong, positive correlation between ethnocultural empathy-which refers, e.g., to the level to which one feels empathy towards people who are ethnically different from oneself-and the intention to act to promote human rights [65]. Lastly, a strong correlation was found between the positive attitudes of significant others (e.g., friends, colleagues, coworkers) towards human rights behavior, on one hand, and positive attitudes towards the "worthwhileness of the human rights behavior . . . and the intention to act to promote human rights," on the other [65] (p. 399). e.g., to the level to which one feels empathy towards people who are ethnically different from oneself-and the intention to act to promote human rights [65]. Lastly, a strong correlation was found between the positive attitudes of significant others (e.g., friends, colleagues, coworkers) towards human rights behavior, on one hand, and positive attitudes towards the "worthwhileness of the human rights behavior … and the intention to act to promote human rights," on the other [65] (p. 399). Students that studied human rights in an intercultural, interdisciplinary model reported a positive impact on their professional skill development, cultural understanding and broader views of the world [64]. Teachers that were not confident in their own understanding of human rights had more difficulty integrating the topic into their teaching [67,68]. This supports contextualized Students that studied human rights in an intercultural, interdisciplinary model reported a positive impact on their professional skill development, cultural understanding and broader views of the world [64]. Teachers that were not confident in their own understanding of human rights had more difficulty integrating the topic into their teaching [67,68]. This supports contextualized preparation and ongoing training so teachers can develop critical and transformative pedagogies in human rights education [68].
Implications: Implications of the studies in Turkey and Cyprus support the need for pre-and in-service teacher development in human rights concepts and pedagogies. Simply learning concepts without context and reflecting on teaching practices will not result in critical and transformative lessons [68]. Teachers should develop their ability to teach using discussion, group work and brain storming. These methods are outside the typical lecture with teacher directed question and answer sessions found in Turkey and Cyprus.
The three studies that focused on human rights education programs supported various implications that were specific to the area studied:

•
Critelli, Lewis, and Méndez-López [64] found that social workers can utilize their expertise to advocate for the people's rights to paid employment, access to food, education, chelter and health care. Also, using a human rights framework in social work education results in a more holistic approach to professional practice. • Nagengast [50] found that increased immersion in the local culture during a study abroad program would have a greater impact on students' views on human rights. However, the impact may be the unintentional acceptance of cutural practices that violate human rights. The later appears to be based on the influence of getting to know idividuals from the host culture well and empathizing with their cultural views of certain actions or traditions that would be considered violations of human rights.

•
Parish [65] investigated the human rights competency of students in the IBO using three surveys. These surveys allowed for a comparison of student competency between different locations. This could support the measurement of student competencies using other curricula and therefore provide for the measument of an area that is difficult to assess.
Overall, the literature supports human rights being taught using a contextualized approach to increase student engagement and connection with the topic. Students of all ages appear to benefit from pedagogies that increase interaction with the topic on an individual level so connections can be Sustainability 2020, 12, 1383 32 of 57 made with the topic. The small number of progamatic studies limits our understanding in the areas of teacher and student education in human rights.

Gender Equality
Setting: Literature that addresses the assessment of education programs in gender equality appears to be very limited. Of the two sources found for this investigation, one is based in primary grades 3, 4, 5, and 6 [69], and one is based at the university level [70]. Both programs are focused on activities in traditional classroom settings that were being provided by teachers. The two programs were studied in very different geographic locations. The first in the western United States [69] and the later in Turkey [70].
Objective: Grayson and Martin [69] present a professional development program, Gender Expectation and Student Achievement (GESA), that guides in-service teachers through a collegial process of self-evaluation intended to reduce gender bias in their teaching. They argue that gender bias is evident in five areas of classroom instruction: instructional contact; grouping and organization; discipline; self-concept; and evaluation. In these areas, the bias against girls is apparent and harmful. The teacher training aims to make teachers aware of this bias and once aware, they are willing to change practices to reduce the impact of bias on their students.
Acar-Erdol and Gözütok [70] aim to reduce the impact of gender inequality through a curriculum for pre-service teachers with the intention of improving their level of knowledge and awareness of gender equality. The research aims to evaluate the curriculum and propose adjustments based on reflective assessment. While the two studies use different vocabulary, they both identify gender equality as the equal treatment of students and non-discrimination in education regardless of gender.
Description: In 1984, GESA's professional development program was based on eight years of data collection and refinement [69]. The program included five monthly meetings of teachers in a collegial atmosphere that supported teacher awareness of gender equity issues within their individual classrooms. The organization of the program supported self-reflection and teacher practice through peer observations conducted between the monthly workshop. Teachers not only observed the practices of others, but also provided feedback based on data taken during the observations. The organization of the program was based on the theories of change management and staff development at the time. There are six components: Attitudinal change 3.
Climate for change 5.
Ownership 6. Dissemination The five workshops were based on five areas of gender disparity, mentioned previously: 1. Instructional contact 2.
Self-concept 5. Evaluation Following each workshop, teams of four teachers observed each other's classes three times. The observing teacher collects data on interactions between the teacher and students that can later be coded.
Turkey's Gender Equality Curriculum [70] for pre-service teachers was developed using the Taba Model. The model includes the following six steps:
Formulation of objectives 3.
Selection and organization of content 4.
Selection and organization of learning experiences 5.
Selection and organization of evaluation and measurement 6.
Control of the relationships of the curriculum's dimensions A group of eight experts, including curriculum developers, measurement and evaluation specialists and gender issue specialists, evaluated the curriculum in each of the six areas above.
Duration: The literature generally does not describe the implementation of programs. Therefore, the duration of a program does not apply. At the time of the report, the GESA professional development programs had been implemented for eight years. Each implementation lasted one school year [69].
Implementer: The GESA professional development programs were implemented by the researchers with the support of the Office of the Los Angeles County Superintendent of Schools [69]. The gender equality curriculum designed in Turkey was designed and the evaluation protocol implemented by the researchers [70].
Transformative orientation: The understanding of gender equality issues, a self-awareness of gender roles, and adding gender equality subject matter into curriculum is expected to be transformative. The GESA program intends to support changes in teacher practice. Grayson and Martin [69] state, "Once teachers have examined their own biases as demonstrated by their own behavior toward male and female students, curricular and other changes can be accepted more easily" [69] (p. 2).
The Gender Equity Curriculum in Turkey proposes to increase teachers' understanding of the issue and building social justice [70]. As a result of raising awareness, it is expected that gender discrimination will be reduced.
Strengths and weaknesses of the assessment approaches: Acar-Erdol and Gözütok [70] included the 29 item Reflective Evaluation Form (REF) completed by a relatively small group of eight experts to assess the curriculum in four areas: learning outcomes; content; learning experiences; and measurement and evaluation. This form included space for additional comments. The REF was based on current research and appeared to be thorough. The feedback gained from the results of the REF were specific and resulted in some adjustments being made to the curriculum. However, the group of experts that did the assessment was small and may have benefited from more diversity.
Grayson and Martin [69] used data from pre-and post-testing of the students in mathematics, reading and language arts along with pre-and post-gender expectations surveys taken by teachers and students, to assess the impact of their program. Unfortunately, the surveys were not included, and the academic tests were not specified. At the time of writing, the program was not complete, therefore no actual data was reported. Further investigation may result in finding the final published study and more details about the assessment tools.
Implementation: It is difficult to fully appreciate the effectiveness of Grayson and Martin's [69] study without final results. Although it is unlikely that the findings were drastically different, a final draft would resolve some questions about the data.
A follow-up to the curriculum developed by Acar-Erdol and Gözütok [70] includes analysis of its effectiveness and impact on new teachers' understandings of gender bias in themselves and their classrooms. Although not specifically discussed within the article, this implies that continued analysis of the impact of the curriculum on pre-service teachers should be conducted-the benefit being further refinement of the curriculum to attain the intended goals.
Outcomes: Acar-Erdol and Gözütok [70] state that teachers prepared in gender equity issues could reduce gender stereotypes and reduce gender inequality in educational settings. The curriculum that they developed and had analyzed was found to have learning outcomes, content, learning experiences and evaluation tools consistent with each other. The teaching principles and activities aligned with intended learning outcomes and the problems included in the evaluation tools were aligned. There were adjustments made to the curriculum based on the expert feedback.
While the program implemented by Grayson and Martin [69] was not complete at the time of writing, some results were shared from preliminary data. Teachers reduced the disparities in their interactions with male and female students. All participating teachers identified at least on relevant curricular change implemented during the program. They also reported benefiting from the collegial observations and positive changes in their attitudes and positive effects on their students. Each teacher identified at least one major area of disparity and a specific interaction that impacted their classroom.
Implications: The literature implies that gender equity is an important issue in education and in preparing global citizens that are equipped to reduce social injustice. This requires preparing teachers for the integration of gender equality into subject matter and to address it literally in the classroom [70]. Studies that include the assessment of programs or the effectiveness of programs about gender equality is very limited. There does seem to be some agreement that teachers, whether pre-service or in-service, can benefit from more personal awareness of their own understanding of gender norms and how gender inequality can impact the classroom. Despite the gains in awareness over the past few decades, there is still much to uncover. The studies support the need to guide teachers through a process to identify their own bias in order to move towards mitigating it in themselves and others. The added complexities of gender bias in the current understanding, or lack of understanding, of the LGBTQ community makes the topic of gender inequality in the classroom even more important. The lack of analysis of curriculum effectiveness leaves us depending upon an anecdotal understanding. This appears to be an area of potential future research.

Discussion and Conclusions
To be sure, there are many promising findings documented in the publications reviewed. The literature suggests effects, for example, on the knowledge, attitudes, values, beliefs, and behavior related to the five themes of interest. It has been repeatedly documented that courses dedicated to the themes addressed here can lead to outcomes that are relevant for SDG 4.7. Yet, there are a number of sensitive and politically contentious questions that must be answered going forward. For instance, one key question is how each of the five themes is defined and operationalized through the content included in the courses taught. As was discussed in the preceding sections, there is great variation across studies when it comes to how key terms are defined and what content/approaches are incorporated into learning activities. A further question is whether teachers will feel comfortable teaching content that they see as biased or that goes beyond what they understand to be a presentation of "the facts." And, finally, an important question is how each of these five issues will be evaluated, measured, and monitored for the purpose of achieving the SDGs. There are no easy answers to these questions, especially since they will require political negotiation among a range of actors at both the national and international levels. National actors are, of course, charged with developing and approving policy, curriculum, and national assessment strategies, but it is also true that a range of international experts and organizational actors are working together to define the SDG indicators and to design monitoring plans that will enable the collection of the necessary data to track progress on those indicators. The ways that actors from both the national and international levels engage in the coming years will determine the answers to the questions posed here.
The path forward is made more difficult by the fact that a relatively small percentage of the reviewed studies employed methods that are seen as credible, that is, seen as policy relevant. Only 19 of the 56 studies employed experimental (n = 5) or quasi-experimental (n = 14) methods. (See Appendix D to easily identify which studies employed these methods). While there are numerous limitations to both quasi-experimental and experimental methods [71], it remains true, in the current context, that these methods are seen to be more valid for the purpose of claiming effects, drawing lessons, and informing policy. Beyond an exclusive consideration of which studies used "rigorous" methods, we suggest that lessons be drawn based on whether the evaluation of the underlying education program provides practical insights into how the SDGs can be achieved, assessed, and monitored at various levels.
The above suggestion is more easily stated than realized. This is because the existing language of SDG4.7 is both very broad and very narrow. Of the five indicators for SDG 4.7 that have been defined, the first, that is, the "global indicator," should track the extent to which "(i) global citizenship and (ii) education for sustainable development, including gender equality and human rights, are mainstreamed at all levels in: (a) national education policies, (b) curricula, (c) teacher education and (d) student assessment" [3]. While is it not clear how a single indicator can track these various dimensions, it also stands out that this indicator does not depend on how each of its key terms are defined. On one hand, this is advantageous, since it allows for flexibility across countries that allows them to be contextually responsive; on the other hand, this is conceptually problematic, since it is not clear how comparisons of this indicator across countries should be interpreted, especially when there are different definitions and/or approaches to engaging with the relevant content.
Furthermore, the majority of the language of the other, "thematic" indicators currently associated with SDG4.7 does not seem to lend itself to ensuring that students engage deeply with the issues related to SDG4.7. The language for indicators 4.7.2-4.7.5 is as follows:  While the last two indicators do attempt to get at the issue of student knowledge, as opposed to simply student exposure to certain content, it is also important to note that there is yet to be agreement on the exact language for these indicators. The available documentation on the website for the Technical Cooperation Group-the group charged with discussing and developing indicators for SDG4-states that these two indicators are in need of further development [72]. Thus, it is not clear what the language will be, let alone how it will be operationalized in policy, curricula, or assessment.
In any event, when thinking practically about SDG4.7, one realizes that-past the political debates that beset agenda-setting and indicator elaboration-there is a real need for school systems to define and pursue more specific language and indicators for SDG4.7. It is here where this literature review can be useful. Those charged with, or interested in, developing policy, curricula, or assessments related to SDG4.7 can refer to the studies reviewed here. These studies characterize the various definitions, orientations, and foci that different educational interventions entail, as well as the associated outcomes. These studies can thus be used to draw lessons for how to achieve and assess progress related to 4.7. This is particularly true when it comes to making progress at the classroom or programmatic levels-those levels for which indicators have not yet been developed.
There is a clear emphasis in the Framework for Action that accompanies the education-related SDGs on the need for international organizations to assist countries with the technical and financial resources needed to achieve the internationally agreed upon goals. However, in the absence of clarity around what students are supposed to achieve and what countries are supposed to monitor, it seems unlikely that much support will be offered on this front. In the short-term, it is more probable that efforts related to SDG4.7 will be taken up by, both, countries who take it upon themselves and individual organizations that have an interest in this area.
Interested countries and organizations should not underestimate the planning and resources that are required in order to make progress on SDG4.7. For example, as part of their efforts, these actors should be sure not to overlook the need for teacher training to go along with new (or existing) curricula and programs for students [73]. At the same time, both governments and organizations should be attentive to the financial, personnel, and administrative resources that are required in order to simply communicate and to train teachers, principals, etc. on the plans, strategies, and curricula that are developed. If the research of Elaine Unterhalter and colleagues [74] is any indication, many actors at the subnational level are not even aware of global goals, let alone the corresponding plans, policies, etc. that have been developed at the national level. In part, this may be because insufficient resources have been invested in mobilizing support to ensure implementation and follow-up for the projects developed by governments and international organizations. At the same time, the lack of meaningful engagement with global goals at subnational levels may be a result of the macro nature of the indicators used to track progress on said goals. If the indicators reflect information that is relatively easy to collect but is mostly void of any meaningful insights about what is going on at the school level, how beneficial is the entire exercise of the global goals in the first place?
Perhaps a promising course of action could be to shift from a primary focus on learning outcomes to focus as well on the percentage of students who are engaged in school or community projects that attempt to make progress on the issues embedded in SDG4.7. After all, it is often by doing something in practice that we learn. Transformational experiences are more likely when we take students out of the classroom and put them in connection with the wider world. Following this line of thought, perhaps it is better to focus on teaching teachers both relevant content/curricula related to SDG4.7 as well as teaching them how to organize and implement projects and innovative pedagogical approaches that prompt students to do the following: take action connected to the SDGs, interact with each other across cultural lines, teach them how to get involved in their community, take away their reticence to engage in SDG issues, get them familiar with their environment, and, indeed, instill in them the skills and attitudes that GCEd calls for, so that they can start-or continue-to confront the challenges that affect us globally.
The indicators for SDG 4.7 could then focus on whether ESD/GCEd etc. has been incorporated into the curricula and whether students are engaged in relevant projects. We are not saying that none of the dimensions of 4.7 cannot be learned in the classroom. Certainly, as this review has shown, classroom learning is important for acquiring the knowledge, skills, values, etc. related to SDG4.7. What we are suggesting is that these efforts should be complemented by experiential learning that extends students' foundation of knowledge and gives them experience with the wider world-and the people in it-for which we all need to care.
At the sub-national or institutional level of the k-12 school, university, or community program, monitoring and evaluation strategies could then be multidimensional and could draw on the variety of assessment techniques documented in the studies reviewed here. As has been shown, a range of quantitative and qualitative methods were used to gauge the ways that educational programs and interventions related to SDG4.7 have affected relevant understandings, knowledge, skills, beliefs, and behaviors. While global and country-level indicators may necessarily be more summative in nature, we suggest that institutional-level indicators, evaluation, and monitoring should be more formative, nuanced, and qualitative in nature-in order to be more useful for providing feedback that can encourage growth at the individual and local level. Here, surveys or questionnaires, which are useful for assessing knowledge, beliefs, and values, can be complemented with such strategies as interviews, focus groups, classroom observations, curriculum and lesson plan analysis, participant diaries, and self-reflective essays. In a cyclical way, the insights generated from these strategies would not only serve as the basis for feedback to teachers and students engaging with the knowledge, skills, etc. that are necessary to achieve the SDGs, but could also inform the design of professional development and teacher training to address areas of need. In the ways described above, learning, monitoring, and evaluation can work together to make meaningful progress on multiple levels towards a future in which the world is characterized by sustainable development, acceptance of cultural diversity, peaceful co-existence, the protection of human rights, and gender equality.  IMPLICATIONS: Does the study identify/suggest any useful policy/pedagogical implications for further implementation of the SDGs? (These notes can be from the author or our own observations/thoughts.) 9.
IMPLEMENTATION: Were there any issues with the implementation of the program, curriculum, etc. that could have affected the results documented? (Here, the focus is on whether or not-and the extent to which-the results of the program may have been hampered by how the program was implemented, or the context into which it was implemented. We want to make notes about this so that we can carefully interpret the results/outcomes that we document in #7.) This study is an analysis of the U.S. and South African teachers' developing discourses in a teacher education program. Teachers were exposed to a course that would have them consider the role and function of literacies in their lives and the lives of others, especially among diverse students.   University students enrolled in "Fundamentals of Speech Communication," "TV Production and Direction," "Interpersonal Communication" (used for control purposes only), and "Intercultural Communication" courses. Two course sections for each participating basic course instructor were randomly selected as control groups, and two course sections were randomly selected as experimental groups. Two basic course instructors infused diversity into the public speaking portion of their experimental sections, and the other two basic course instructors infused diversity into the interpersonal instruction of their experimental sections.   The program intends to orient university students to their own acknowledgement of the appreciation of diversity and to gain an understanding of multiple cultural backgrounds. The course uses journals, a final exam, small group discussion, a group project, novel analysis and a short paper on oppressive conditions in a country of their choice. The International Baccalaureate (IB) was developed in the 1950s/1960s, for students aged 3-19, to help create a peaceful world through intercultural understanding and respect, and to address the pragmatic concerns of increasing numbers of mobile families. Three components of the IB human rights ideals are studied: identification of 'self' as a part of a common humanity; ethno-cultural empathy; and positive attitudes. The In Pursuit of Peace Curriculum was designed to teach youth the skills needed to resolve problems peacefully. It was implemented at the violence prevention "Peace camp" that included team building, guest speakers, group consensus, project planning and problem solving. Participants in grades 6-12 planned a peace project to benefit the community.  This study describes two programs: The Diverse Voices Conference (DVC) and The Diverse Student Scholars (DVS) Program. DVC is a one-day annual conference that uses a variety of cultural influences such as poetry, music, singing and dance to enhance the experience of attendees. DVS promotes community engagement, networking opportunities, critical thinking and professional presentation skills.

years and 10 years
Teacher(s) Quantitative: Survey (2) whether the toolkit and a one-day in-service workshop will help to improve high school teachers' environmental knowledge, understanding of ESD, and use of methodological approaches suitable for ESD in Kosovo; and (3) whether teacher-talk will decrease and pupil-talk will increase in class after introducing the toolkit. Turnsek Diversity This study describes the 120-hour long Antidiscrimination and Diversity training that took place during a two-year post graduate Early Childhood Education study program. The first part of the program targeted participants' perceptions on the topic, followed by the exploration of written and spoken messages that influenced their thinking about minority groups.