You are currently viewing a new version of our website. To view the old version click .
Journal of Intelligence
  • Article
  • Open Access

14 November 2022

WikiLink: An Encyclopedia-Based Semantic Network for Design Creativity

,
,
,
,
and
1
Dyson School of Design Engineering, Imperial College London, London SW7 2AZ, UK
2
Department of Computer Science and Technology, Zhejiang University, Hangzhou 310030, China
3
Zhejiang-Singapore Innovation and AI Joint Research Lab, Zhejiang University, Hangzhou 310030, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Understanding Creativity and Stimulating Creativity

Abstract

Data-driven design is a process to reuse data sources and provide valuable information to provoke creative ideas in the stages of design. However, existing semantic networks for design creativity are built on data sources restricted to technological and scientific information. Existing studies build the edges of a semantic network on statistical or semantic relationships, which are less likely to make full use of the benefits from both types of relationships and discover implicit knowledge for design creativity. Therefore, to overcome the gaps, we constructed WikiLink, a semantic network based on Wikipedia, which is an integrated source of general knowledge and specific knowledge, with broad coverage of disciplines. The weight in WikiLink fuses both the statistic and semantic weights between concepts instead of simply one type of weight, and four algorithms are developed for inspiring new ideas. Evaluation experiments are undertaken, and the results show that the network is characterised by high coverage of terms, relationships and disciplines, which demonstrates and supports the network’s effectiveness and usefulness. A demonstration and case study results indicate that WikiLink can serve as an idea generation tool for creativity in conceptual design. The source code of WikiLink and the backend data are provided open-source for more users to explore and develop.

1. Introduction

Design is a ubiquitous process that occurs throughout a variety of fields. Conceptual design is the early stage of design where an initial idea is formulated (Childs 2013). The progression of conceptual design development requires a designer to fully utilize their creativity capability and existing knowledge. In other words, the creative attributes of conceptual design depend highly on a designer’s ability to master, apply and utilize human-centred, scientific and technological knowledge according to the design problem to provoke design creativity. Researchers have utilized a large amount of imagery data or textual data available on the Internet to provide design intuition for novel ideas. This imposes a heavy challenge (Hao et al. 2014) for designers on how to effectively discover and acquire pertinent knowledge and information to promote design creativity.
With the advent of big data, semantic networks can represent associations well between ontology-based knowledge, making it easier and more intuitive to discover implicit knowledge for creativity in the early stage of design. The highly diverse nature of design suggests that design creativity can benefit from a multiplicity of distinct data. However, existing semantic networks for design creativity are built on data sources restricted to technological and scientific knowledge. Existing studies build the edges of a semantic network either with statistical or semantic relationships, which are less likely to make full use of the benefits from both types of relationships and discover implicit knowledge for design creativity.
To address the challenges highlighted, this study proposed an encyclopedia-based network called WikiLink for the creativity in the early stage of design. The source code of WikiLink is published on https://github.com/zjud3/WikiLink, accessed on 2 November 2022. The main contributions of this paper can be summarized as follows:
  • A semantic network for design creativity is constructed. Wikipedia is applied as the data source for the semantic network, which contains information from a wide range of fields and expands the data to a new boundary.
  • A combined weight is introduced for the relationship in the semantic network. The combined weight mixes the statistical relationship and semantic relationship which better captures the implicit connection between concepts for design creativity. Four algorithms are further developed for design which enables the retrieval with different levels and manners.
  • The constructed semantic network for design creativity is further developed as a tool. An evaluation and demonstration for the tool are subsequently conducted. The results show that WikiLink can effectively provide design stimuli for idea generation.
The paper is organised as follows: Section 2 describes the state of knowledge and background for the research, and Section 3 introduces the process of constructing WikiLink. Section 4 presents the experimentation including the results on coverage of concepts, coverage of relationships, coverage of disciplines and term-to-term relationships. Section 5 demonstrates the use of four functions in WikiLink and presents a design case with WikiLink. Finally, Section 6 concludes with limitations and suggestions for further research directions. It should be noted that the concepts “design” and “design creativity” in this paper refer to early stage design only.

4. Evaluation

In this section, we conduct four studies on WikiLink to demonstrate its effectiveness and usefulness. Some other semantic networks, which are publicly accepted or aiming for design creativity, are selected as benchmarks during the comparison, including B-link, WordNet, ConceptNet, Wikidata and DBpedia. The evaluation is conducted from four perspectives, i.e., coverage of concepts, coverage of relationships, coverage of disciplines, term-to-term evaluation and effectiveness of combined relationships to provide an overview of the strengths and weaknesses of WikiLink.

4.1. Coverage of Golden Concepts

In order to demonstrate the feasibility of WikiLink, golden concepts, which are composed of words and terms, are defined as the benchmark to evaluate WikiLink’s term coverage. To evaluate the coverage in disciplines and ensure the impartiality, the golden concepts should be collected from a data source different from Wikipedia but have the multi-disciplinary structure. The golden concepts are collected manually within an online source Encyclopedia Britannica through several steps. Firstly featured concepts are obtained from its website. There are several categories of topics available concerning different domains, including culture, science and technology. By gathering these classified words and terms, it is ensured that the collected data contains interdisciplinary knowledge. The original data is refined afterward by removing uncommon expressions and standardizing their formats. The aim of this step is to assure the precision of the following evaluations. Eventually, we obtain a list of 468 words and terms, covering knowledge in 8 domains, and part of the concepts are shown in Table 1.
Table 1. The overview of golden concepts.
With these golden concepts, we then evaluate how many concepts are contained in WikiLink. The retrieval rate C R , as shown in Equation (11), is applied as the metric of concept retrieval:
C R = n C N C
where n C means how many concepts are contained in the network, while N C represents the number of golden concepts, which is 468 is this case.
WordNet and ConceptNet are used as two benchmarks for evaluation. DBPedia and other Wikipedia-based network are not assessed since they are all extracted from Wikipedia which will lead to a same result as WikiLink theoretically. It is observed that WordNet only contains 209 concepts, resulting in a low C R rate of .449. The specific C R values of different categories are shown separately in Table 2 (the highest rate is bolded in each line), from which we notice that WikiLink gives the highest retrieval rate, indicating that our network has a wider coverage of concepts compared with the other tools considered.
Table 2. Retrieving results of golden concepts.
To be specific, our approach involves more concepts in most categories and achieves the highest retrieval rate. In comparison, WordNet shows overall weaknesses due to its inadequacy in processing two-word terms. ConceptNet has decent performance in the fields of art, science, sports and technology, but it lacks strengths in certain categories such as topics and events.
This result can be explained by the limitation of ConceptNet’s construction properties. Even though the data source of ConceptNet includes two-word terms, such as stained glass, chemical element and mental disorder, these terms are mostly composed of one adjective and one noun. Except for names of countries and regions, seldom are two-noun terms involved in ConceptNet. Based on our observation, plenty of concepts in those two categories, i.e., topics and events, are composed of more than one noun, e.g., teacher education, Paris agreement and Pacific crest trail, which are exactly situations that ConceptNet lacks a solution to. This explains ConceptNet’s low C R rate for those two categories. In contrast, our approach can deal with various kinds of terms, which explains its overall high coverage. This high coverage of concepts can support design creativity with a large concept space.

4.2. Coverage of Golden Relationships

A list of golden relationships is selected from the data source as the evaluation benchmark to quantitatively evaluate the performance of relationship coverage. Similar to the construction process of WikiLink, we extracted concept relationships from Encyclopedia Britannica’s spotlight articles. Only those which are composed of golden concepts are retained. We randomly picked 1000 concept pairs from the retained ones and defined as golden relationships.
Denoting golden relationships as set H, we compare the performance of WikiLink with other tools in terms of the coverage of golden relationships. In this process, we retrieve all relationships between golden concepts from each tool and denote these retrieved relationships as set V. The evaluation metric is defined as follows:
R = | V H | | H |
where R indicates the retrieving rate of relationships. WordNet, ConceptNet, Wikidata and DBpedia are chosen as benchmarks. The results are shown in Table 3.
Table 3. Evaluation results of golden relationships.
Specifically, 15 relationships are retrieved from WordNet, which belong to golden relationships, leading to a significantly low R value of only .015. This retrieving rate can be explained by WordNet’s data structure. To our knowledge, WordNet only retrieves specific relationships, including “synonyms”, “sister terms”, “hypernyms” and “hyponyms” between two concepts, which leads to its huge deficiency in context association and results in a low retrieving rate.
The web API of ConceptNet is used to retrieve concepts and relationships. It turns out that there are 170 relationships which are found in the golden relationships, resulting in an R value of .170. The retrieving rate can be understood from two perspectives. ConceptNet’s network contains more concepts than WordNet, which can be observed from its C R value. In addition, it provides richer explanations for “relationships”. In other words, as well as “synonyms” and “hypernyms”, ConceptNet is also able to retrieve “related terms” and “terms with this context” for an arbitrary single concept. These two reasons both contribute to its retrieving rate.
Both Wikidata and DBpedia are knowledge graphs based on Wikipedia and can be retrieved with a SPARQL query. It should be noted that in Wikidata, a unique identifier of a concept is required while retrieving the relationships between them. However, some concepts contain multiple semantics which lead to several identifiers. In order to balance the time complexity and performance of retrieval, we sort all the identifiers of each concept in ascending order and take the first three identifiers to form the identifier list of the concept. When retrieving the relationship between two concepts, the identifiers in the two lists are paired, and if any pair exists in the knowledge triple, then this pair of relationships will be considered as covered. Experimental results show that 178 golden relationships are found in Wikidata and 449 in DBpedia. Though DBpedia achieves high coverage since it covers the vast majority of Wikipedia entries, almost half of the golden relationships still cannot be retrieved. The potential reason is that the relationship between two concepts in DBpedia and Wikidata needs to be described with a particular property (e.g., “is instance of”), which means a solid and closely connected relationship and reduces the potential knowledge association for design creativity. For example, “Therapy”, which refers to the means used to solve a health problem, is related to “Public health”, but the relationship between them cannot be described by a particular property. Thus, the golden relationship “Therapy” and “Public health” can be retrieved in Wikilink but not in Wikidata and DBpedia. It should be noted that YAGO is not considered as a benchmark in this evaluation since it is constructed as a knowledge base for real-world named entities, such as person and cities (Pellissier Tanon et al. 2020). The relationships (e.g., “birthPlace”, “ofCountry”) are formed accordingly. These close relationships will not be covered in the golden relationships, and it is unfair to compare with WikiLink.
In the end, 721 relationships can be retrieved from the golden relationships within WikiLink. This can be explained by its largest number of concepts, and the relationships in our approach are defined differently, i.e., they are established between concepts that are shown on the same pages. To summarize, WikiLink achieves a retrieving rate of .721 and shows the best performance. This high retrieving rate of relationships builds enough associations which can potentially contribute to design creativity.

4.3. Coverage of Categories

To demonstrate that WikiLink covers a wide range of categories, we categorize and count all the nodes in WikiLink according to Wikipedia’s category rules. Wikipedia defines 13 main categories: cultural, geography, health, history, human, mathematics, natural, people, philosophy, religion, society, technology and reference. By traversing all the items’ categories in WikiLink, the distribution of the 13 categories is presented in Figure 3.
Figure 3. The distribution of concepts in WikiLink.
It can be seen from the graph that WikiLink’s data have a wide distribution among 13 main categories, and the count of a particular main category can reach up to 100,000 nodes. Especially, the natural, people and reference categories have the largest counts, which are 1,241,491, 1,161,583 and 1,222,966, respectively. Rather than focusing only on technological and scientific knowledge, WikiLink is a more generic semantic network, with knowledge from a wide coverage of disciplines, which can be used in daily design creativity activities to obtain inspiration. Specifically, the data source of B-link mainly comes from scientific papers, which leads to the uneven distribution of each discipline, while WikiLink has a wide range of information in different fields and disciplines. Compared with TechNet, the result of WikiLink shows higher diversity as the distribution of TechNet is highly correlated with the distribution of patents, which may affect the inspiration of the design because of the coverage limitation, even though it contains a large number of domains within technology fields.

4.4. Term-to-Term Evaluation

To evaluate whether the computed edge weights are consistent with human judgment, thirty term pairs (three groups and each ten as a group) representing various degrees of relevance were prepared by language experts, and ten students were employed to rate the relevance of each pair. The students scored semantic relevance and statistical relevance on a five-scale from one (not related) to five (highly related), and the average of scores is computed for each pair. The semantic relevance and statistical relevance are then combined as the weight in the “Explore-General” algorithm. In this evaluation, only the “Explore-General” edge weights in the four algorithms is evaluated since the weight calculation in the four algorithms is all similar.
With the evaluation results, Cronbach’s alpha is used to measure the inter-rater reliability which is 0.78 as an acceptable result. Spearman’s rank correlation coefficient is then used to assess the relationship between computed edge weights and human judgments. Table 4 shows the result of the Spearman rank correlation coefficients between the pairwise association values of the same term pairs.
Table 4. Term-to-term evaluation results.
The hypothesis of the Spearman correlation coefficient is then tested to determine whether the results are statistically significant. By checking the table of critical values, the three groups’ Spearman’s rho are all greater than the critical value .57 (one tail, α = .05), so the null hypothesis is rejected. This supports that there is a strong correlation between the computed edge weights and human judgments , upheld by a significance level of 95%.

4.5. Effectiveness of Combined Relationships

As introduced in Section 3, the statistical relationships between two concepts are established if they co-occur in the same article. Constructing the basic connection from a statistical perspective only could potentially lead to a phenomenon that the retrieval is dominated by some highly common concepts. These dominating common concepts decrease the retrieval probability of other useful concepts for design creativity. However, using semantic relationships only as the weight of edges is beneficial for design but might require longer association for implicit knowledge discovery. The semantic relationships are thus incorporated to balance the statistical relationship. To demonstrate the effectiveness of the proposed weight fusion, three types of retrieval results based on different relationships (networks with combined relationships, with statistical relationships, and with semantic relationships) are compared. The concept “health” is chosen for the “Explore” function, and the concept pair “health and 3d printing” is chosen for the “Search Path” function.
Figure 4 and Table 5 are the results of “Explore” and “Search Path”, respectively. It can be seen that the results of “Explore” and “Search Path” with statistical relationship have more concepts which contain common and general meaning but are irrelevant with “health” semantically, e.g., “United States” and “United Kingdom” which are dominant nodes in this case. Conversely, the results of the two functions with semantic relationships contain more relevant concepts but only show the semantic relevance to “health” (e.g., “environmental health” and “health care”). The combined relationship makes a balance between the statistical relationship and semantic relationship so that it produces a relatively positive result.
Figure 4. Retrieval results for “health” with three different relationships.
Table 5. The high-correlated knowledge associations between “health” and “3d printing” with three different relationships.
The node degree of a concept means the sum of weights of all edges incident to that node. The average node degree of concepts is calculated in combined relationships and statistical relationships to demonstrate whether the very common results are balanced quantitatively. Table 6 and Table 7 show that in four functions the average node degrees of concepts with combined relationship are all observably lower than that of concepts with statistical relationship, which imply that the semantic relationship balances the statistical relationships to retrieve valuable information. Both the quantitative and qualitative results indicate that the combined relationship is efficient to reduce the influence of dominant concepts with high node degree in retrieval results thus facilitating design creativity activities.
Table 6. The average node degree of retrieval results for “health” with two different relationships.
Table 7. The average node degree of knowledge associations between “health” and “3d printing” with two different relationships.

5. Demonstration

In this section, we showcase four functions in WikiLink for information retrieval and design creativity. Qualitative analysis of the results is performed to demonstrate the features of each function. In addition, the performance of WikiLink is compared with six state-of-the-art tools, and the corresponding results are also analyzed qualitatively.

5.1. The “Explore-General” and “Explore-Specific” Mode

To fairly compare the performance of “Explore-General” and “Explore-Specific” modes, two terms in the field of engineering design are chosen: “3d printing” and “fused deposition modeling”. 3D printing is a multi-faceted technology, has been employed across a broad range of applications (Berman 2012) and is a widely used term with general meanings. Fused deposition modeling (FDM) is a 3D printing method that heats a continuous thermoplastic filament and extrudes it for layer-by-layer deposition (Hamzah et al. 2018), which is also a widely used term with specific meanings. These two terms are input and explored in “general” and “specific” modes, respectively. Figure 5 shows the top 10 relevant terms in each retrieval. By comparing the “general” results (the first row) with the “specific” results (the second row), it can be seen that the terms in “general” results are more common and comprehensible, such as computer-aided design and artificial intelligence, while the terms in “specific” results, such as stl (file format) and polyetherimide, are normally very specific concepts in particular domains. Furthermore, as the figure shows, FDM’s specific result is centered scattering. This implies that primary terms in a particular domain are discrete and irrelevant to each other.
Figure 5. Retrieval results for “3d printing” and “fused deposition modeling” in “Explore-General” and “Explore-Specific” mode.

5.2. The “Search Path-Basic” and “Search Path-Professional” Mode

The “Search Path” function allows users to explore the implicit associations between two items even from different domains. It also has two modes that can return two types of associations. In order to test the above two modes, we used two pairs of terms, “brain” and “computer”, which are weakly related, and “avocado” and “chair”, which are seemingly unrelated. Table 8 shows the retrieved highest-correlated “basic” and “professional” knowledge associations of the two pairs. Obviously, the “basic” paths are shorter, and the “professional” paths are longer. Most of the nodes in “basic” paths are concepts with general meanings between the two domains, such as artificial intelligence, fruit and furniture, while the “professional” path is longer and the nodes are almost scientific terms or specific objects such as “neuroscience”, “xylitol” and “guacamole”. Some explicit associations are discovered in the results. For example, brain science drives the advance of computer science, especially artificial intelligence, which appears in the path “brain → artificial intelligence → computer”. In addition, more implicit associations are connected by some surprising concepts, such as “fruit”, “furniture” and “rocking chair”, which may provoke the idea of fruit-shaped furniture, such as an avocado-shaped rocking chair. It is found that in some cases purely statistical weights between edges result in a longer and more surprising path which may inspire more innovative ideas in design activities.
Table 8. The high-correlated two types of knowledge associations.

5.3. The “Explore” and “Search Path” Function

The above shows that the “Explore” function aims to discover the knowledge associations around a single term, while the “Search Path” function aims to search for the associations between two terms. To clarify the difference between them, a hot concept in engineering design, “metaverse”, was explored along with two weakly related terms separately: “shopping” and “meeting”. Retrieval experiments were conducted in “Explore-Specific” and “Search Path-Professional” mode, respectively. As shown in Figure 6, the retrieval results of “metaverse” cover a wide range of fields, including “virtual world”, “simulated reality”, “cyberspace” and related games, including “Second Life” and “Active Worlds”. These wide results can lead to comprehensive knowledge discovery and an open imagination about the target term. On the other hand, the paths between “metaverse” and a selected concept focus on bridging the fields that connect them, which leads to combinational ideas. For instance, the nodes linking “metaverse” and “shopping” are related to “virtual economy”, and the nodes linked “metaverse” and “meeting” are related to virtual society.
Figure 6. Comparisons of the results from “Explore” and “Search Path”: (a) the “Explore Specific” results for “metaverse”; (b) the “Search Path-Professional” results between “metaverse” and “shopping”; (c) The “Search Path-Professional” results between “metaverse” and “meeting”.

5.4. Comparison with Benchmark Tools

We undertook a retrieval comparison between WikiLink and the other four benchmark tools. The target terms are “neural network” in computer science and “trypsin” in medical physiology. This experiment aims to test whether our network can return a broad range of related terms which can stimulate creativity in the design process efficiently. Since the number and presentation of retrieval results vary from tool to tool, we selected the top 10 related terms for each tool to present in Table 9. Especially, the results of WikiLink and B-link (Shi et al. 2017) were obtained through their “Explore” function. The result of ConceptNet (Speer et al. 2017) was obtained from its “Related terms” category. The result of WordNet (Fellbaum 2010) was obtained from its “Synset” and “Example sentence” functions. The result of Wikidata was obtained from its “Search Wikidata” function, and since the term “neural network” in Wikidata refers specifically to the structure in biology, the search for this was redirected to the term “artificial neural network” in machine learning which more closely matches our expectations. The result of DBpedia was obtained from its “Keyword Search” function. In addition, we have also tried to compare with the large knowledge base “YAGO”, but the retrieval results are mostly translations of target terms in other languages (e.g., “Trypsiini” in Finish is obtained when searching for “Trypsin”), which does not make sense for stimulating creativity in the design process. The main reason is that YAGO is a knowledge base focusing on real-world named entities, such as people, cities and organizations, but is not suitable for the terms with semantics (Pellissier Tanon et al. 2020). Therefore, YAGO was not included in our comparative experiments.
Table 9. The top 10 related terms to “neural network” and “trypsin” in WikiLink and the 4 benchmark tools.
According to Table 9, the terms retrieved by WikiLink in the “general” and “specific” modes both demonstrate the effectiveness of the “Explore” function. For example, the retrieval results of “neural network” in the “specific” mode are all domain-specific terms related to the components (e.g., “artificial neuron”), functions (e.g., “cognitive model”) and applications (e.g., “deep learning”) of “neural network”. Since the “Explore” function of WikiLink is divided into the “Explore-General” and “Explore-Specific” modes, its results, containing common terms (from the “Explore-General” mode) and technical terms (from the “Explore-Specific” mode), cover a comprehensive range. In contrast, ConceptNet, WordNet, TechNet, Wikidata and DBpedia simply have only one retrieval mode, which leads to their retrieved results invariably focusing on some technical terms in a specific range. Furthermore, the retrieval result of “trypsin” from DBpedia contains some terms that are only lexically similar but semantically irrelevant (e.g., “Nybergsund IL-Trysil” and “Trysil”), which may mislead the user’s understanding of the target term. Even though B-Link retrieves in the two modes as WikiLink, its results are also limited by the data source which is engineering academic papers and design websites. It can be seen that the retrieved terms of B-Link tend to contain specific meanings. Instead, WikiLink applies Wikipedia as the data source for its semantic network, which covers information from a wide range of domains. The comparison suggests that WikiLink is more capable of retrieving terms in various domains, which is essential for knowledge discovery in the knowledge-intensive design creativity process.

5.5. A Design Case

A designer was recruited to conduct a design case and demonstrate the process of applying WikiLink for design creativity. Generally, the designer would be initially given a design question with a “Basic word” and then required to apply the “Explore” function and “Search Path” function in WikiLink to freely explore the related concepts around the “Basic word” which could potentially inspire the designer. By applying the “Explore function”, the designer could discover the knowledge concepts “C1”, “C2” and “C3” around the “Basic word” as denoted in Figure 7. The “Search Path” function provides the paths, e.g., PathC1C2 between two terms “C1” and “C2” for combinational creativity (Han et al. 2019). This process can be iteratively applied to discover knowledge associations and paths such as “C3” and PathC1C3. The related concepts obtained from WikiLink are then used to form design inspiration links such as “Basic_word-C1, PathC1C3, PathC2C3”, and some of them are eventually chosen for the design output of specific design ideas.
Figure 7. The flow of concepts exploration in WikiLink.
A real design case is conducted to illustrate how to facilitate design creativity with WikiLink. Since a “hair dryer” is a well-known product archetype with homogenization in the market and its creativity has encountered a bottleneck, the designer is required to generate ideas and provide innovative designs for a hair dryer. The concept “hair dryer” is chosen as the design query (also known as the basic word) in WikiLink in this case. The designer then starts with the “Explore” function by freely choosing several different step lengths and switching between general and specific mode for divergent and convergent thinking. Some screenshot examples are shown in Figure 8. It is noted that the designer is not restricted to using “hair dryer” as the only query. After the initial exploration in WikiLink, the designer obtained some interesting and inspiring concepts, such as “Entertainment weekly”, “Vacuum cleaner”, “Comb”, “Hair iron”, “Hair gel”, “Hair roller”, “Hot comb”, “Horn” and “Pyramid”. The next step is to apply the “Search Path” function by freely querying the paths between two concepts of the designer’s interests. Some retrieval results are shown in Table 10.
Figure 8. The examples of concepts retrieved by “Explore”: (a) “Explore-General” results with one step for “hair dryer”; (b) “Explore-General” results with two steps for “hair dryer”; (c) “Explore-Specific” results with two steps for “hair dryer”.
Table 10. The paths between the inspiring concepts and “hair dryer” retrieved by “Search Path”.
The designer continued to explore knowledge concepts for design creativity stimuli by iteratively using the “Explore” and “Search Path” functions. The “Explore” function helps discover the knowledge associations around a single term, while the “Search Path” function can potentially look for the associations between two terms. The designer recorded all the interesting and inspiring concepts and formed the “design inspiration links”, as shown in Figure 9, where the base of the link is “Hair Dryer”, and rest of the concepts were from WikiLink obtained by using “Explore” and “Search Path” functions. The above process was repeated to produce at least one design inspiration link and until the designer thought it is enough to formulate design ideas. Eventually, with the ideas originating from the concepts in the inspiration link, the designer produced the final complete design scheme and drew corresponding design sketches.
Figure 9. The example of “design inspiration link”.
In particular, we use Figure 10 and Figure 11 as the designs produced with the inspiration links “Hair dryer” , “Comb”, “Hairstyle”, “Tie-dye”, “Zardozi”.
Figure 10. The sketch of the hair dryer inspired by “Tie-dye” and “Zardozi”.
Figure 11. The sketch of hair dryer inspired by “Comb”.
In particular, two ideas were generated during the designer’s manipulation with WikiLink. The first design, as shown in Figure 10, is in appearance a design inspired by “Tie-dye” and “Zardozi”. Existing hair dryers in the market are mostly a single pure color with a smooth or frosted plastic shell. “Tie-dye”, the characteristic of the Bai nationality, has special patterns which are uneven in-depth and rich in layers, and overcomes the rigidity of pure color. “Zardozi”, a traditional Chinese craft, has a delicate touch feeling compared with plastic material. Thus “Tie-dye” and “Zardozi” inspire the designer to integrate traditional Chinese cultural elements into the design of a hair dryer to increase cultural connotation. The second design (Figure 11) is functional and inspired by “Hairstyle” and “Comb” in the design inspiration link. The idea is to design the replaceable hair dryer nozzle with the features of “Comb” so that users can comb their hair conveniently while drying the hair without searching it in a hurry.

6. Conclusions

A semantic network for design creativity has been constructed. Wikipedia is applied as the data source for the semantic network. During the construction, the Wikipedia items are regarded as the nodes, the interlinks between the items on the same page are regarded as the directly connected relationship (edges) between nodes. The evaluation result indicates that the network contains information from a wide range of fields and expands the data to a new boundary. Instead of simply one type of weight, a combined weight is introduced for the relationship in the semantic network. The combined weight fuses the statistical relationship and semantic relationship which better captures the implicit connection between concepts for design creativity. Four algorithms have been developed to retrieve relevant knowledge concepts and relationships with different levels and manners. The constructed semantic network for design creativity is further developed as a tool, called WikiLink. An evaluation and demonstration for WikiLink were conducted. Compared with other benchmarks, with the fusion of semantic meaning weight and statistical weight, WikiLink can well balance the breadth and depth in exploring knowledge for design creativity. A design case was conducted to demonstrate the process of how WikiLink can facilitate idea generation. The results indicate that WikiLink can serve as a design ideation tool for design creativity.
The study leaves space for future research although it does provide a functional panel for practical use. The weight strength fusing two types of weight is one of the main contributions in this research, but it only shows the numerical value and lacks explicit semantic meaning describing the relationship between two concepts. Thus, a semantic description is expected to be added to the edges in WikiLink and provide richer information for design creativity. The network visualization of WikiLink is currently designed on a two-dimensional scale, which might cause an information explosion when the retrieved network keeps growing. A three-dimensional scale network along with other information visualization techniques could be a solution and provide a more dynamic way for users to explore information and obtain inspiration more effectively.

Author Contributions

Conceptualization, H.Z. and L.C.; Methodology, H.Z., Q.J., L.C.; Software, Q.J., T.S.; Investigation, Q.J.; Data Curation, T.S.; Supervision—L.C.; Writing—Original Draft, H.Z.; Writing—Review & Editing, L.S., P.C., L.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China: No. 62207023; National Key R&D Program of China: 2021YFF0900602; The Ng Teng Fong Charitable Foundation in the form of ZJU-SUTD IDEA Grant.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The code and data are published on https://github.com/zjud3/WikiLink, accessed on 2 November 2022.

Acknowledgments

The authors would like to express our gratitude to The Ng Teng Fong Charitable Foundation in the form of ZJU-SUTD IDEA Grant.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ackoff, Russell L. 1989. From data to wisdom. Journal of Applied Systems Analysis 16: 3–9. [Google Scholar]
  2. Auer, Sören, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary Ives. 2007. Dbpedia: A nucleus for a web of open data. In The Semantic Web. Berlin/Heidelberg: Springer, pp. 722–35. [Google Scholar]
  3. Bae, Suyun Sandra, Oh-Hyun Kwon, Senthil Chandrasegaran, and Kwan-Liu Ma. 2020. Spinneret: Aiding creative ideation through non-obvious concept associations. Paper presented at the 2020 CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA, April 25–30; pp. 1–13. [Google Scholar]
  4. Berman, Barry. 2012. 3-d printing: The new industrial revolution. Business Horizons 55: 155–62. [Google Scholar] [CrossRef]
  5. Boden, Margaret A. 1998. Creativity and artificial intelligence. Artificial Intelligence 103: 347–56. [Google Scholar] [CrossRef]
  6. Bojanowski, Piotr, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5: 135–46. [Google Scholar] [CrossRef]
  7. Casakin, Hernan, and Georgi V. Georgiev. 2021. Design creativity and the semantic analysis of conversations in the design studio. International Journal of Design Creativity and Innovation 9: 61–77. [Google Scholar] [CrossRef]
  8. Chen, Ting-Ju, and Vinayak R. Krishnamurthy. 2020. Investigating a mixed-initiative workflow for digital mind-mapping. Journal of Mechanical Design 142: 101404. [Google Scholar] [CrossRef]
  9. Cheong, Hyunmin, Wei Li, Adrian Cheung, Andy Nogueira, and Francesco Iorio. 2017. Automated extraction of function knowledge from text. Journal of Mechanical Design 139: 111407. [Google Scholar] [CrossRef]
  10. Childs, Peter, Ji Han, Liuqing Chen, Pingfei Jiang, Pan Wang, Dongmyung Park, Yuan Yin, Elena Dieckmann, and Ignacio Vilanova. 2022. The creativity diamond—A framework to aid creativity. Journal of Intelligence 10: 73. [Google Scholar] [CrossRef]
  11. Childs, Peter R. N. 2013. Mechanical Design Engineering Handbook. Oxford: Butterworth-Heinemann. [Google Scholar]
  12. Ernst, Holger. 2003. Patent information for strategic technology management. World Patent Information 25: 233–42. [Google Scholar] [CrossRef]
  13. Fellbaum, Christiane. 2010. Wordnet. In Theory and Applications of Ontology: Computer Applications. Berlin/Heidelberg: Springer, pp. 231–43. [Google Scholar]
  14. Fu, Katherine, Jonathan Cagan, Kenneth Kotovsky, and Kristin Wood. 2013. Discovering structure in design databases through functional and surface based mapping. Journal of mechanical Design 135: 031006. [Google Scholar] [CrossRef]
  15. Furukawa, Takao, Kaoru Mori, Kazuma Arino, Kazuhiro Hayashi, and Nobuyuki Shirakawa. 2015. Identifying the evolutionary process of emerging technologies: A chronological network analysis of world wide web conference sessions. Technological Forecasting and Social Change 91: 280–94. [Google Scholar] [CrossRef]
  16. Georgiev, Georgi V., and Danko D. Georgiev. 2018. Enhancing user creativity: Semantic measures for idea generation. Knowledge-Based Systems 151: 1–15. [Google Scholar] [CrossRef]
  17. Georgiev, Georgi V., Naofumi Sumitani, and Toshiharu Taura. 2017. Methodology for creating new scenes through the use of thematic relations for innovative designs. International Journal of Design Creativity and Innovation 5: 78–94. [Google Scholar] [CrossRef]
  18. Geschka, Horst. 1983. Creativity techniques in product planning and development: A view from west germany. R&D Management 13: 169–83. [Google Scholar]
  19. Geum, Youngjung, and Yongtae Park. 2016. How to generate creative ideas for innovation: A hybrid approach of wordnet and morphological analysis. Technological Forecasting and Social Change 111: 176–87. [Google Scholar] [CrossRef]
  20. Goel, Ashok K. 1997. Design, analogy, and creativity. IEEE Expert 12: 62–70. [Google Scholar] [CrossRef]
  21. Goel, Ashok K., Swaroop Vattam, Bryan Wiltgen, and Michael Helms. 2012. Cognitive, collaborative, conceptual and creative—Four characteristics of the next generation of knowledge-based cad systems: A study in biologically inspired design. Computer-Aided Design 44: 879–900. [Google Scholar] [CrossRef]
  22. Gorti, Sreenivasa R., Ashok Gupta, Gerard Jounghyun Kim, Ram D. Sriram, and Albert Wong. 1998. An object-oriented representation for product and design processes. Computer-Aided Design 30: 489–501. [Google Scholar] [CrossRef]
  23. Goucher-Lambert, Kosa, and Jonathan Cagan. 2019. Crowdsourcing inspiration: Using crowd generated inspirational stimuli to support designer ideation. Design Studies 61: 1–29. [Google Scholar] [CrossRef]
  24. Hamzah, Hairul Hisham, Saiful Arifin Shafiee, Aya Abdalla, and Bhavik Anil Patel. 2018. 3d printable conductive materials for the fabrication of electrochemical sensors: A mini review. Electrochemistry Communications 96: 27–31. [Google Scholar] [CrossRef]
  25. Han, Ji, Hannah Forbes, Feng Shi, Jia Hao, and Dirk Schaefer. 2020. A data-driven approach for creative concept generation and evaluation. Paper presented at the Design Society: DESIGN Conference, Cavtat, Croatia, October 26–29; Cambridge: Cambridge University Press, vol. 1, pp. 167–76. [Google Scholar]
  26. Han, Ji, Feng Shi, Liuqing Chen, and Peter RN Childs. 2018a. The combinator—A computer-based tool for creative idea generation based on a simulation approach. Design Science 4. [Google Scholar] [CrossRef]
  27. Han, Ji, Feng Shi, Liuqing Chen, and Peter RN Childs. 2018b. A computational tool for creative idea generation based on analogical reasoning and ontology. AI EDAM 32: 462–77. [Google Scholar] [CrossRef]
  28. Han, Ji, Min Hua, Dongmyung Park, Pan Wang, and PRN Childs. 2020. Computational conceptual distances in combinational creativity. Paper presented at the Design Society: DESIGN Conference, Cavtat, Croatia, October 26–29; Cambridge: Cambridge University Press, vol. 1, pp. 177–86. [Google Scholar]
  29. Han, Ji, Min Hua, Feng Shi, and Peter R. N. Childs. 2019. A further exploration of the three driven approaches to combinational creativity. Paper presented at the Design Society: International Conference on Engineering Design, Delft, The Netherlands, August 5–8; Cambridge: Cambridge University Press, vol. 1, pp. 2735–744. [Google Scholar]
  30. Hao, Jia, Yan Yan, Lin Gong, Guoxin Wang, and Jianjun Lin. 2014. Knowledge map-based method for domain knowledge browsing. Decision Support Systems 61: 106–14. [Google Scholar] [CrossRef]
  31. He, Yuejun, Bradley Camburn, Haowen Liu, Jianxi Luo, Maria Yang, and Kristin Wood. 2019. Mining and representing the concept space of existing ideas for directed ideation. Journal of Mechanical Design 141: 121101. [Google Scholar] [CrossRef]
  32. Hey, Jonathan, Julie Linsey, Alice M. Agogino, and Kristin L. Wood. 2008. Analogies and metaphors in creative design. International Journal of Engineering Education 24: 283. [Google Scholar]
  33. Howard, Thomas J., Stephen J. Culley, and Elies Dekoninck. 2008. Describing the creative design process by the integration of engineering design and cognitive psychology literature. Design Studies 29: 160–80. [Google Scholar] [CrossRef]
  34. Hu, Jie, Jin Ma, Jin-Feng Feng, and Ying-Hong Peng. 2017. Research on new creative conceptual design system using adapted case-based reasoning technique. AI EDAM 31: 16–29. [Google Scholar] [CrossRef]
  35. Ivanov, Alex, and Dianne Cyr. 2014. Satisfaction with outcome and process from web-based meetings for idea generation and selection: The roles of instrumentality, enjoyment, and interface design. Telematics and Informatics 31: 543–58. [Google Scholar] [CrossRef]
  36. Johnson, Nancy R. 1992. Metaphor and design. Studies in Art Education 33: 144–53. [Google Scholar] [CrossRef]
  37. Joulin, Armand, Edouard Grave, Piotr Bojanowski, Matthijs Douze, Hérve Jégou, and Tomas Mikolov. 2016. Fasttext. zip: Compressing text classification models. arXiv arXiv:1612.03651. [Google Scholar]
  38. Kim, Hongbin, and Kwangsoo Kim. 2012. Causality-based function network for identifying technological analogy. Expert Systems with Applications 39: 10607–19. [Google Scholar] [CrossRef]
  39. Kwon, Heeyeul, Yongtae Park, and Youngjung Geum. 2018. Toward data-driven idea generation: Application of wikipedia to morphological analysis. Technological Forecasting and Social Change 132: 56–80. [Google Scholar] [CrossRef]
  40. Li, Xin, Qianqian Xie, Tugrul Daim, and Lucheng Huang. 2019. Forecasting technology trends using text mining of the gaps between science and technology: The case of perovskite solar cell technology. Technological Forecasting and Social Change 146: 432–49. [Google Scholar] [CrossRef]
  41. Linsey, Julie, Arthur B. Markman, and Kristin Lee Wood. 2012. Design by analogy: A study of the wordtree method for problem re-representation. Journal of Mechanical Design 134: 041009-1-12. [Google Scholar] [CrossRef]
  42. Liu, Qiyu, Kai Wang, Yan Li, and Ying Liu. 2020. Data-driven concept network for inspiring designers’ idea generation. Journal of Computing and Information Science in Engineering 20: 031004. [Google Scholar] [CrossRef]
  43. Liu, Ying-Chieh, Amaresh Chakrabarti, and Thomas Bligh. 2003. Towards an ‘ideal’approach for concept generation. Design Studies 24: 341–55. [Google Scholar] [CrossRef]
  44. Luo, Jianxi, Serhad Sarica, and Kristin L. Wood. 2021. Guiding data-driven design ideation by knowledge distance. Knowledge-Based Systems 218: 106873. [Google Scholar] [CrossRef]
  45. McCaffrey, Tony, and Lee Spector. 2018. An approach to human–machine collaboration in innovation. AI EDAM 32: 1–15. [Google Scholar] [CrossRef]
  46. Munoz, David, and Conrad S. Tucker. 2016. Modeling the semantic structure of textually derived learning content and its impact on recipients’ response states. Journal of Mechanical Design 138: 042001. [Google Scholar] [CrossRef]
  47. Osborn, Alex F. 1953. Applied Imagination. New York: Scribner. [Google Scholar]
  48. Pellissier Tanon, Thomas, Gerhard Weikum, and Fabian Suchanek. 2020. Yago 4: A reason-able knowledge base. Paper presented at European Semantic Web Conference, Crete, Greece, May 31–June 4; Berlin/Heidelberg: Springer, pp. 583–96. [Google Scholar]
  49. Rezgui, Yacine, Stefan Boddy, Matthew Wetherill, and Grahame Cooper. 2011. Past, present and future of information and knowledge sharing in the construction industry: Towards semantic service-based e-construction? Computer-Aided Design 43: 502–15. [Google Scholar] [CrossRef]
  50. Runco, Mark A., and Garrett J. Jaeger. 2012. The standard definition of creativity. Creativity Research Journal 24: 92–96. [Google Scholar] [CrossRef]
  51. Sarica, Serhad, and Jianxi Luo. 2021. Design knowledge representation with technology semantic network. Proceedings of the Design Society 1: 1043–52. [Google Scholar] [CrossRef]
  52. Sarica, Serhad, Jianxi Luo, and Kristin L. Wood. 2020. Technet: Technology semantic network based on patent data. Expert Systems with Applications 142: 112995. [Google Scholar] [CrossRef]
  53. Sarica, Serhad, Binyang Song, En Low, and Jianxi Luo. 2019. Engineering knowledge graph for keyword discovery in patent search. Paper presented at the Design Society: International Conference on Engineering Design, Delft, The Netherlands, August 5–8; Cambridge: Cambridge University Press, vol. 1, pp. 2249–58. [Google Scholar]
  54. Sarica, Serhad, Binyang Song, Jianxi Luo, and Kristin Wood. 2019. Technology knowledge graph for design exploration: Application to designing the future of flying cars. Paper presented at International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, Anaheim, CA, USA, August 18–21; New York: American Society of Mechanica Engineers, vol. 59179, p. V001T02A028. [Google Scholar]
  55. Sarica, Serhad, Binyang Song, Jianxi Luo, and Kristin L. Wood. 2021. Idea generation with technology semantic network. AI EDAM 35: 1–19. [Google Scholar] [CrossRef]
  56. Shi, Feng, Liuqing Chen, Ji Han, and Peter Childs. 2017. A data-driven text mining and semantic network analysis for design information retrieval. Journal of Mechanical Design 139: 111402. [Google Scholar] [CrossRef]
  57. Shibata, Naoki, Yuya Kajikawa, Yoshiyuki Takeda, and Katsumori Matsushima. 2008. Detecting emerging research fronts based on topological measures in citation networks of scientific publications. Technovation 28: 758–75. [Google Scholar] [CrossRef]
  58. Siddharth, L., and Amaresh Chakrabarti. 2018. Evaluating the impact of idea-inspire 4.0 on analogical transfer of concepts. Ai Edam 32: 431–48. [Google Scholar] [CrossRef]
  59. Sosa, Ricardo, Kristin L. Wood, and Rajesh E. Mohan. 2014. Identifying opportunities for the design of innovative reconfigurable robotics. Paper presented at International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, Buffalo, NY, USA, August 17–20; New York: American Society of Mechanica Engineers, vol. 46407, p. V007T07A010. [Google Scholar]
  60. Sowa, John F. 2014. Principles of Semantic Networks: Explorations in the Representation of Knowledge. Burlington: Morgan Kaufmann. [Google Scholar]
  61. Speer, Robyn, Joshua Chin, and Catherine Havasi. 2017. Conceptnet 5.5: An open multilingual graph of general knowledge. Paper presented at Thirty-first AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, February 4–9. [Google Scholar]
  62. Suchanek, Fabian M., Gjergji Kasneci, and Gerhard Weikum. 2007. Yago: A core of semantic knowledge. Paper presented at 16th International Conference on World Wide Web, Perth, Australia, April 3–7; pp. 697–706. [Google Scholar]
  63. Taura, Toshiharu, Eiko Yamamoto, Mohd Yusof Nor Fasiha, Masanori Goka, Futoshi Mukai, Yukari Nagai, and Hideyuki Nakashima. 2012. Constructive simulation of creative concept generation process in design: A research method for difficult-to-observe design-thinking processes. Journal of Engineering Design 23: 297–321. [Google Scholar] [CrossRef]
  64. The Design Council. 2017. What Is the Framework for Innovation? Design Council’s Evolved Double Diamond. London: The Design Council. [Google Scholar]
  65. VanGundy, Arthur B. 1988. Techniques of Structured Problem Solving. Berlin/Heidelberg: Springer. [Google Scholar]
  66. Vattam, Swaroop, Bryan Wiltgen, Michael Helms, Ashok K. Goel, and Jeannette Yen. 2011. Dane: Fostering creativity in and through biologically inspired design. In Design Creativity 2010. Berlin/Heidelberg: Springer, pp. 115–22. [Google Scholar]
  67. Vrandečić, Denny, and Markus Krötzsch. 2014. Wikidata: A free collaborative knowledgebase. Communications of the ACM 57: 78–85. [Google Scholar] [CrossRef]
  68. Yamamoto, Eiko, Masanori Goka, Nor Fasiha Mohd Yusof, Toshiharu Taura, and Yukari Nagai. 2009. Virtual modeling of concept generation process for understanding and enhancing the nature of design creativity. Paper presented at DS 58-2: Proceedings of ICED 09, the 17th International Conference on Engineering Design, Palo Alto, CA, USA, August 24–27. [Google Scholar]
  69. Yuan, Soe-Tsyr Daphne, and Pei-Kang Hsieh. 2015. Using association reasoning tool to achieve semantic reframing of service design insight discovery. Design Studies 40: 143–75. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.