Effect of Distance on Open Innovation: Differences among Institutions According to Patent Citation and Reference

The main topic of this paper is the effect of the distance between technology and the market on open innovation (OI). We set up two research questions: Is there any relation between the distance between technology and the market, and open innovation? If there is, how does this relation differ among Fortune 500 companies, non-Fortune 500 companies, laboratories, universities, and start-ups? First, this study measured the distance between technology and the market of a patent by the size of its lists of references and citations. Second, the OI network among patent application subjects was described based on patent similarity. Third and most importantly, regression analyses were used to answer the research questions. The first result was that the distance and OI differed among Fortune 500 firms, non-Fortune 500 firms, laboratories, universities, and start-ups; thus, there is a relation between the distance between technology and the market, and open innovation. The second result was that the distance between technology and the market moderated the open innovation effect in Fortune 500 companies and laboratories.


Introduction
The "information age" and the "knowledge economy" have brought with them a shift in key economic assets, from resources to labor and technology, and from the natural environment to knowledge [1]. With this, capitalism is now being redefined as a complex and evolving system [2]. As the knowledge-based society accelerates, more and more knowledge and technology are being applied for as patents. However, the likelihood that a patent is realized as a successfully commercialized product varies significantly among subjects of patent application. In other words, the gap between technology and the market differs for patents applied for by organizations such as Fortune 500 and non-Fortune 500 companies, universities, laboratories, and start-ups. In addition, these gaps affect open innovation (OI) performance differently for each subject. Facebook, which began as a small start-up at a university and was a non-Fortune 500 company, became a Fortune 500 company within 10 years of its foundation. In most instances, students and researchers create their companies right after they finish university, with some succeeding in the long run [3].
The distance between technology and the market may therefore differ among the five agents. In addition, open innovation is expected to be related to this distance. However, the relation between open innovation and the distance between technology and the market has not yet been sufficiently developed, mostly because a way to measure the distance had not been elucidated. Reference [4] (p. 14) analyzed the challenges in intellectual property by exploring the legal aspects of emerging technologies.
Similarly, one study by Ejermo [5] uses patent counts to examine the effects of environmental policies on technological innovation in renewable energy.
Aside from being used as an indicator of technological importance, patent citations can serve as a measure of the value of innovations. An empirical analysis of one particular innovation, computed tomography scanners, showed that patent citations can be significant indicators of the value of innovations [6]. Trajtenberg, Henderson, and Jaffe [7] stated that "importance" and "generality" are measures of the basicness of an innovation, while the degree of technological self-reliance and faithfulness to the innovational paths comprise the basicness of research. It is also through the analysis of patent citations that people can measure spillover, knowledge diffusion, and knowledge obsolescence [8]. In addition, patent citations can serve as evidence of the level of the geographic localization of knowledge spillovers [9].
In addition, modeling the flow of patent citations can reflect knowledge diffusion across institutional and geographic boundaries, such as that from universities and federal laboratories; moreover, patent citations reveal patterns of how knowledge flows from one country to another [10,11]. Through patent citation analysis, the effects of universities and federal laboratories on commercial innovation can be measured [12,13].
To analyze technological closeness in terms of the closeness of two IPC classes, both the main and supplementary classifications of patents have to be examined [5]. "Spillovers" can be inferred from the main classification of a patent to the patent's supplementary class [14]. In addition, patent diversity, which is measured by classes or subclasses, can be used as a predictor of regional innovation [5]. Technological diversity, measured by patent data such as "IPC co-occurrence" in class or subclass, was identified as a trait of persistent innovators because of its contributions to product diversification and sales growth [15]. Here, an "IPC co-occurrence" is when multiple IPC codes belonging to different technology fields exist in a single patent. Patents can also be used to measure corporate market value; Chen and Chang [16] found that patent citations and relative patent position (RPP) were positively associated with corporate market value, whereas the Herfindahl-Hirschman index of patents (HHI) was negatively associated with this value.
A company has a significantly higher intention to pursue among R&D collaborators, with smaller companies having higher numbers of patent applications than larger ones [17]. Technical collaboration in invention activities by co-inventors and co-assignees can be measured through the co-invention index ($= \frac{1}{j}\sum_{i}^{j}\sum_{k} ic_{k}$) and the co-assignee index ($= \frac{1}{j}\sum_{i}^{j}\sum_{k} ac_{k}$) [18]. The HHI of patents can be used to evaluate the concentration of patents among patent classes as well as the concentration level of companies' technological capability [19]. In the US pharmaceutical industry, the HHI values of patents and RPP in a relevant technological field have positive, nonlinear, and monotonic influences on corporate performance, while the influence of patent citations is nonlinearly U-shaped [20]. By analyzing citation networks and visualizing patent statistics, the position of applicants within citation networks can help explain applicant behavior in the marketplace, such as cooperation or patent infringement trials [21].
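The concentration measure mentioned above can be illustrated with a small calculation. The sketch below assumes the standard Herfindahl-Hirschman form, the sum of squared class shares, which may differ in detail from the variant used in [19]; the function name `patent_hhi` and the sample class codes are hypothetical.

```python
from collections import Counter


def patent_hhi(patent_classes):
    """Return the sum of squared class shares for a list of patent classes.

    Assumption: one class label per patent; an empty list yields 0.0.
    """
    counts = Counter(patent_classes)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum((n / total) ** 2 for n in counts.values())


# Hypothetical portfolio: three patents in one class, one in another.
toy_classes = ["G06F", "G06F", "G06F", "H04L"]
hhi = patent_hhi(toy_classes)  # shares 0.75 and 0.25
```

A portfolio concentrated in a single class gives a value of 1, and the value falls as patents spread evenly over more classes.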
One study defines the ratio of joint patent application cases among all patents of a company as the breadth of OI, and the average number of patent applicants per patent of a company as the depth of OI; in this way, the concept of collaboration through joint patent application is materialized as OI [22]. Yun et al. [22] use two concepts, the breadth of OI, which means the ratio of all people who gave positive answers to the survey of OI, and the depth of OI, which means the percentage of all people who gave positive answers about OI with high intensity, to develop the width and depth of OI and to connect these concepts with joint patent application [23]. To measure the synergy in an innovative system, such as OI and triple helix, it is necessary to consider diverse aspects of patents, such as citations and joint patent applications [24]. To identify the kind of innovations necessary to secure our future, the patent characteristics, such as citation, applicant, and patent network, that should be connected with each technology patent or business model patent must be identified [25]. In addition, the process in which OI circulates through the complex adaptive system, and the evolutionary change dynamics, can be concretely analyzed through the time series change of a company, that is, the change of citations or of the patent network [26]. Most of all, patent forward citations have become an established measure for the technological and economic value of patents in the empirical innovation literature [27].
Next, we review only the key papers and books on open innovation that are required to develop the research questions of this paper, because there is currently a broad awareness of open innovation and its relevance to corporate R&D [28]. Open innovation is the use of purposive inflows and outflows of knowledge to accelerate internal innovation, and expand the markets for external use of innovation, respectively [29]. Thus, open innovation does not deter but motivates the active production and use of intellectual property [30]. Open innovation involves two inbound processes, sourcing and acquiring, and two outbound processes, revealing and selling [31]. Open innovation therefore requires a creative interpretation and adaptation of the value propositions or business models in each situation, as open innovation coincides with the new business model in the third or fourth industrial revolution [28,32]. Regarding the open innovation paradigm, the necessity of allowing ideas both to flow out of the corporation in order to find better sites for their monetization, and to flow into the corporation as new offerings and new business models, requires different ways of managing the open innovation processes, based on teamwork or task forces, and on different roles, which are more or less proactive in special industries such as telecommunication [32,33]. According to recent research, constraints on the application of firm resources decrease the innovative performance of open innovation, but external knowledge increases it [34].

Research Method
This study measured the distance between technology and the market of a patent for a subject using the size of its list of references and citations [21,23,27]. As can be seen in Figure 1, the A type, with many citations and references, was defined as the group with the shortest distance between technology and the market; the B type, with many citations and a small number of references, was the group with a slightly short distance between those two areas; the C type, with a small number of citations and many references, was the group with a slightly far distance between the two areas; and the D type, with small numbers of citations and references, was the group with the farthest distance.
The distance between technology and the market used in this study was summarized for each patent application subject and divided into four types, as shown in Figure 1. The division criterion was the median value. The patents to be analyzed were classified into those of Fortune 500 companies, non-Fortune 500 companies, universities, laboratories, and start-ups (individual patent applicants), and the medians of their citations and references were calculated for each of the five subjects. The A type was defined as having values the same as or higher than the medians of both the citations and the references (the distance between technology and the market for these was closer than it was for the others). The B type had values the same as or higher than the median of the citations and less than the median of the references (the distance was slightly closer than it was for the others, except for the A type). The C type had values less than the median of the citations and the same as or higher than the median of the references (the distance was slightly farther than it was for the others, except for the D type). The D type had values less than the medians of both the citations and the references (the distance was farther than it was for the others). Because, according to previous studies, market performance depends on the references and citations, we can estimate the distance between technology and the market according to the structure of the references and citations [18,20]. While many references presumably indicate a patent that is closer to markets and based on diverse technologies, many citations indicate the frequent use of the patent in a certain market. Thus, the B type is closer to the market than is the C type.
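The four-type rule described above can be sketched as follows. This is a minimal illustration, assuming the medians are taken over whatever sample is passed in; the function name, field names, and toy records are hypothetical and are not data from the study.

```python
from statistics import median


def classify_patents(patents):
    """Label each patent A, B, C, or D by comparing its citation and
    reference counts with the medians of the sample."""
    cit_median = median(p["citations"] for p in patents)
    ref_median = median(p["references"] for p in patents)
    labels = {}
    for p in patents:
        many_citations = p["citations"] >= cit_median
        many_references = p["references"] >= ref_median
        if many_citations and many_references:
            labels[p["id"]] = "A"  # closest to the market
        elif many_citations:
            labels[p["id"]] = "B"  # slightly close
        elif many_references:
            labels[p["id"]] = "C"  # slightly far
        else:
            labels[p["id"]] = "D"  # farthest from the market
    return labels


# Hypothetical toy records chosen for illustration only.
toy_patents = [
    {"id": "P1", "citations": 5, "references": 30},
    {"id": "P2", "citations": 4, "references": 2},
    {"id": "P3", "citations": 0, "references": 25},
    {"id": "P4", "citations": 0, "references": 1},
    {"id": "P5", "citations": 1, "references": 14},
]
patent_types = classify_patents(toy_patents)
```

Values equal to a median count as "many", which matches the "same as or higher than" wording of the rule.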
Second, the level of OI was measured using the A type, that is, patents with many references and citations, considering that such patents are externally and frequently used. The breadth of OI was measured by the ratio of A type patents for each subject, that is, the percentage of A type patents among all patents (A, B, C, and D types) applied for by the target organization (e.g., universities, laboratories, etc.). The depth of OI was measured by the number of A type patents of the organization. The ratio of the A type was thus used to measure the width of OI, which meant how diverse the technical activities of the organization connected to the outside were, and the number of A type patents was used to determine the strength of OI activities.
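The two OI measures can be computed directly from the type labels of an organization's patents. The sketch below is illustrative only; the function name `oi_breadth_depth` and the sample labels are hypothetical.

```python
from collections import Counter


def oi_breadth_depth(type_labels):
    """Return (breadth, depth) of OI for one organization.

    depth   = number of A type patents
    breadth = share of A type patents among all of its patents
    """
    counts = Counter(type_labels)
    total = sum(counts.values())
    depth = counts["A"]
    breadth = depth / total if total else 0.0
    return breadth, depth


# Hypothetical organization with eight patents, three of them A type.
toy_labels = ["A", "A", "B", "C", "D", "D", "A", "B"]
breadth, depth = oi_breadth_depth(toy_labels)
```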
Third, based on previous studies, the performance of an organization was measured according to the number of sub-IPCs. Many previous studies have used how technically creative and diverse an organization's patent applications are as a measurement index of its performance [5,14].
Fourth, based on the above three methods, the impact of OI on company performance and the moderating effect of the distance between technology and the market on OI were determined through statistical analysis [28,29,30].
In addition, to display the OI network in the stage before statistical analysis, the OI network of patent application subjects was described based on patent similarity, using 7-digit sub-IPCs and a cosine value of 0.8 or higher. This study briefly estimated the visual characteristics of the OI network through the similarity network of sub-IPC, a dependent variable. By matching the locations of the five subjects in the OI network according to the similarity, the locations of statistically similar subjects in the network were analyzed through the visual network analysis results. Furthermore, we analyzed the locations and characteristics of the five subjects, in particular looking at statistically similar subjects in modules and components in the network.
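A similarity network of this kind can be sketched as follows. The text does not state how the vectors were built, so this example assumes that each organization is represented by its patent counts per 7-digit sub-IPC code; the organization names, codes, and function names are hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two sparse count vectors stored as dicts."""
    keys = set(u) | set(v)
    dot = sum(u.get(k, 0) * v.get(k, 0) for k in keys)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_edges(profiles, threshold=0.8):
    """Keep one edge per pair whose cosine similarity reaches the threshold."""
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        sim = cosine(profiles[a], profiles[b])
        if sim >= threshold:
            edges.append((a, b, round(sim, 3)))
    return edges


# Hypothetical sub-IPC profiles (code -> number of patents).
toy_profiles = {
    "Org1": {"G06F017": 8, "H04L029": 2},
    "Org2": {"G06F017": 6, "H04L029": 3},
    "Org3": {"A61K031": 9, "G06F017": 1},
}
edges = similarity_edges(toy_profiles)
```

Only Org1 and Org2 are linked here, because their profiles point in nearly the same direction, while Org3 shares just one minor code with them.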

Research Framework
Through regression analysis, this study compared the five groups by setting universities as the baseline category and entering the other groups as dummy variables, and by analyzing the differences between the analysis models of similar groups, as shown in Figure 2.
With this, we formulated the following hypotheses:
Hypothesis 1. In general, open innovation affects the innovation performance of an organization in the shape of a U-curve (A) [20,23,28,31,35].
Hypothesis 2. In general, organization performance will increase when the distance between technology and the market is short (B) [4,5,7].
Hypothesis 3. In general, the distance between technology and the market will control the impact of open innovation on the innovation performance of an organization (C) [5,8].
Hypothesis 4. The five groups have different impacts on the performance of open innovation [10,11].
According to previous studies, while OI affects performance in the shape of an inverted U-curve, citations have an impact on company performance that shows a U-curve [20,23]. Therefore, it is reasonable that OI affects citation performance in the shape of a U-curve.
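The hypotheses above imply a regression with a curved OI term, a distance term, an interaction term for the moderating effect, and group dummies. The sketch below shows one way such regressors could be assembled for a single organization. It is an assumption-laden illustration, not the authors' specification: it assumes OI enters as the product of depth and breadth, uses a hypothetical numeric `closeness` score for the distance, and all names are invented for the example.

```python
GROUPS = ["university", "fortune500", "non_fortune500", "laboratory", "start_up"]
BASELINE = "university"  # reference category, so it receives no dummy


def design_row(od, ob, closeness, group):
    """Build the regressors for one organization.

    od, ob    : depth and breadth of OI
    closeness : hypothetical score, larger = technology closer to market
    group     : one of GROUPS
    """
    if group not in GROUPS:
        raise ValueError(f"unknown group: {group}")
    oi = od * ob
    row = {
        "intercept": 1.0,
        "oi": oi,                          # linear OI effect
        "oi_squared": oi ** 2,             # allows a curved OI effect
        "closeness": closeness,            # direct effect of distance
        "oi_x_closeness": oi * closeness,  # moderating effect
    }
    for g in GROUPS:
        if g != BASELINE:
            row[f"is_{g}"] = 1.0 if group == g else 0.0
    return row


# Hypothetical laboratory with depth 12 and breadth 0.25.
row = design_row(od=12, ob=0.25, closeness=3.0, group="laboratory")
baseline_row = design_row(od=12, ob=0.25, closeness=3.0, group="university")
```

With universities as the baseline, each dummy coefficient would be read as the difference between that group and universities.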

Analysis of Descriptive Statistics
This study targeted 79,685 patents applied for in 2007, as summarized in Table 1. This number excludes duplicate records among the 95,546 patents applied for in 2007. The number of patent application cases was the number of target patents in the database on 6 November 2016. In addition, the distributions of references and citations were not normal, as shown in Appendix A. Therefore, instead of averages, median values, that is, 14 for references and 1 for citations, were used. For the D type, the percentages of universities and laboratories were the highest, but that of start-ups was the lowest; these differences were significant in the descriptive statistics. In the A type, the percentages of universities and laboratories were low, but that of start-ups was the highest; these differences were also significant. In addition, in the B and C types, the five subjects all showed a significant difference in terms of descriptive statistics.
In terms of descriptive statistics, the number of patents differed for each subject. For example, the number of patents applied for by non-Fortune 500 companies was 51,789, that by start-ups was 3317, and that by universities was 997. Thus, to analyze the difference in OI and in the distance between technology and the market among the five subjects, we selected the top 100 organizations by patents applied for in each subject and conducted a precise statistical analysis, including an additional regression analysis.
Looking at the cases of the four types in Table 2, there is a big difference among the Fortune 500 companies, between Apple and Google on the one hand and IBM, Samsung, and Nokia on the other. Apple and Google, which promote OI-based corporate strategies, have a relatively high ratio of the A type. It is easy to think that each subject has a certain trend, but laboratories and universities have a relatively high ratio of C and D types. This study selected 100 organizations for each subject, as shown in Appendix B, to clarify, through regression analysis, the relation among OI, the distance between technology and the market, and company performance. The correlation analysis, detailed in Table 3, shows that OI, the distance between technology and the market, and the moderating variables to be clarified through regression analysis have significant relations with diverse aspects beyond company innovation performance.

Preliminary Network Analysis of the OI Network and Distance between Technology and the Market
First, the four groups in the sub-IPC similarity network are described as shown in Figure 3. The A group, which has high references and high citations, is displayed relatively at the center of the sub-IPC similarity network. Thus, it is reasonable to set the A group as the measurement index for the width and depth of OI. The similarity network thus shows a strong relation between the distance between technology and the market, and open innovation.
Second, the distribution of Fortune 500 and non-Fortune 500 companies in the sub-IPC network is displayed in Figure 4. It can be seen that the location structure of the two organization groups is highly different. We can presume that this difference leads to a different impact on OI performance. This means that Fortune 500 firms and non-Fortune 500 firms will differ in open innovation effects and distance.
Third, when describing universities, start-ups, and laboratories in the sub-IPC network, the start-ups and universities are weakly connected with each other in location structure, and form an independent network, as shown in Figure 5. In addition, laboratories do not have a clear network. Thus, it can be estimated that the relations among OI, the distance between technology and the market, and company performance are significantly different among universities, start-ups, and laboratories. As the network structure of open innovation among laboratories, universities, and start-ups is totally different in appearance, we can evaluate that these show differences in performance according to the situations of open innovation and distance.
Sustainability 2017, 9, 1478

Statistical Analysis and Hypothesis Test
According to the entire moderating effect regression analysis, shown in Table 4, this analysis model is statistically significant. That is, the analysis model shown in Figure 2 is statistically significant according to the analysis of the 500 institutions (i.e., 100 for each group).
In addition, OI has a statistically significant impact on company performance, as shown in Model 1. The values of "OD*OB" and "(OD*OB)^2" are statistically significant, and it can be presumed that OI affects company performance in the shape of a U-curve, as the former is positive and the latter is negative. In other words, Hypothesis 1 is accepted.
In Model 2, the distance between technology and the market has a positive impact on company performance. That is, the closer the distance is, the higher the company performance is. Thus, Hypothesis 2 is accepted.
In Model 3, the moderating effect is statistically significant. That is, the closer the distance is, the higher the impact of OI is on company performance. Therefore, Hypothesis 3 is accepted.
Finally, in Models 1, 2, and 3, in which Fortune 500 and non-Fortune 500 companies, laboratories, and start-ups were entered as dummies with universities as the baseline, the results for universities were found to differ from those for Fortune 500 and non-Fortune 500 companies. Thus, we can presume that, for each subject, the impact of OI and the distance between technology and the market on company performance is different. In other words, Hypothesis 4 is accepted.
Meanwhile, the differences that exist among the five subjects need to be additionally analyzed. According to Appendix C, in the case of Fortune 500 companies and laboratories, the top 100 organizations show a statistically significant impact of OI on performance and a statistically significant moderating effect of the distance between technology and the market on OI. However, these effects are not statistically significant in the case of non-Fortune 500 companies, universities, and start-ups. In other words, the OI and the distance between technology and the market, which were measured using references and citations for the five subjects, have different impacts on company performance. In addition, the distance had a different moderating effect on the OI performance of companies.

Discussion
First, the answer to the first research question, "Is there any relation between the distance between technology and the market, and open innovation?", is yes, because Hypotheses 2 and 3 were accepted in addition to Hypothesis 1. Thus, the closer the distance between technology and the market, the higher the open innovation and firm performance. The core result of this study is that the closer the distance between technology and the market is, the higher the company performance is. In addition, the distance has a positive moderating effect on the open innovation (OI) effect on company performance. In the case of patents with a small number of references and citations, that is, when there is a long distance between technology and the market, it cannot be confirmed that company performance increases, even when the technological novelty or scientific value is high. A patent with a close distance between technology and the market has many references and citations. To improve company performance, patents with many references and citations need to be applied for. That is, the development of a patent with many references and citations implies a short distance between technology and the market.
The answer to the second research question, "If there is, what differences are there in this relation among Fortune 500 firms, non-Fortune 500 companies, laboratories, universities, and start-ups?", is that such differences exist. However, the detailed differences among the five agents require additional research.
The second core result of this study is that it confirms the possibility of objectively measuring the distance between technology and the market using patents. The differences among individual companies are shown in the structure and percentages of references and citations of their patents. Thus, it is confirmed that the ratio of the four technology-market types, which depend on references and citations, and the total number of patents can be significant parameters for measuring the distance.
In addition, this study shows that it is possible to compare and analyze the distances between technology and the market by converting the distance types into measured values through weights, which are then applied in statistical analysis. As the knowledge-based economy accelerates and the OI paradigm continues to extend, the number of patents is increasing in all industries and nations. Under these conditions, it is valuable in both theory and practice to use patents to measure and analyze the distance between technology and the market for companies, universities, and national research institutes, and to formulate strategies accordingly.

Limits and Additional Research Goals
First, to use patent references and citations to measure and analyze the distance between technology and the market, concrete research is required targeting companies in the same industry or cluster with time series data or at a specific time [36]. A further study that uses patents to measure the distance between technology and the market, and that compares it with the market sales of companies, is needed to establish the method of analyzing the distance proposed in this study.
Second, additional research is needed to precisely measure the OI of companies using the number of patents with many references and citations, and their ratios in groups [37]. An in-depth analysis of the reference and citation network of a specific business model patent, which is the basis of a corporate business model, or of a technical patent that is variously used or licensed, which is the basis of a corporate business, could reveal the real OI value of a patent with many references and citations.
Third, the concrete differences among the five agents in distance, open innovation, and firm performance require additional research.

Figure 1. Four types of relation between technology and the market for patents, considering references and citations.

Figure 3. Distribution of four groups in the sub-IPC network.

Figure 4. Fortune 500 and non-Fortune 500 companies in the sub-IPC network.

Figure 5. Laboratories, start-ups, and universities in the sub-IPC network.

Table 1. Overview of all patents to analyze in 2007.

Table 2. Examples per type.

Table 3. Analysis of correlation.

Table 4. Entire moderating effect regression analysis.

Table A2. Analysis result of 100 Fortune 500 companies.

Table A4. Analysis result of 100 organizations from among laboratories.

Table A5. Analysis result of 100 organizations from among universities.

Table
Analysis result of 100 companies from among start-ups.