Next Article in Journal
Advances in Information Security and Privacy
Next Article in Special Issue
Study on the Technology Trend Screening Framework Using Unsupervised Learning
Previous Article in Journal
Preparation of Al2O3 Multichannel Cylindrical-Tube-Type Microfiltration Membrane with Surface Modification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Technology Commercialization Activation Model Using Imagification of Variables

1
Institute of Engineering Research, Korea University, Seoul 02841, Korea
2
Department of Big Data and Statistics, Cheongju University, Cheongju 28503, Korea
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2022, 12(16), 7994; https://doi.org/10.3390/app12167994
Submission received: 25 July 2022 / Revised: 3 August 2022 / Accepted: 9 August 2022 / Published: 10 August 2022

Abstract

:
Various institutions such as universities and corporations strive to commercialize technologies produced through R&D investment. The ideal way to commercialize technology is to transfer it, recognizing the value of the developed technology. Technology transfer is the transfer of technology from R&D entities, such as universities, research institutes, and companies, to others, with the advantage of spreading research results and maximizing cost efficiency. In other words, if enough technology is transferred, it can be commercialized. Although many institutions have various support measures to assist in transferring technology, there is no substitution for quantitative, objective methods. To solve this problem, this paper proposes a technology transfer prediction model based on the information found in patents. However, it is not realistic to include the information from all patents in the quantitative, objective method, so patterns related to technology transfer must be identified to select the appropriate patents that can be used in the predictive model. In addition, a method is needed to address the insufficient training data for the model. Training data are limited because some technology transfer information is not disclosed, and there is little technology transferred in new technology fields. The technology transfer prediction model proposed in this paper searches for hidden patterns related to technology transfer by imaging the patent information, which can also be applied to image analysis models. Furthermore, augmenting the data can solve the problem of the lack of learning data for technology transfer. To examine whether the proposed model can be used in real industries, we collected patents related to artificial intelligence technology registered in the United States and conducted experiments. The experimental results show that the models trained by imaging patent information performed excellently. Moreover, it was shown that the data augmentation technique can be used when there are insufficient data for technology transfer.

1. Introduction

Technology commercialization is the act of creating economic benefit by directly or indirectly transferring technology developed through R&D investment and using it for business activities [1,2,3,4]. In other words, technology commercialization connects R&D investment with added value creation, and it is an important management strategy carried out to spread research results and strengthen national and corporate competitiveness. Therefore, many countries strive to commercialize technology [4]. They also support promoting technology commercialization to address major factors that could lead to market, system, or coordination failure [4,5,6]. However, despite national support, many difficulties persist, such as the valley of death and the Darwinian sea, preventing most technologies from being commercialized [2]. Moreover, it is challenging for an R&D institution to commercialize its research results because of the researchers’ perception of technology commercialization and problems with performance management.
Technology transfer is intended to increase the success rate of technology commercialization by spreading the R&D results to consumers and industries [7,8,9]. Forms of technology transfer include licensing, mergers and acquisitions (M&A), and the transfer of intellectual property rights [10]. Of these, the transfer of intellectual property rights, that is, the transfer of patents to others, is an important form of technology transfer that promotes commercialization [11,12]. Transferring patents has the advantage of spreading research results and maximizing cost efficiency because the value of already-developed R&D results is recognized and transferred to others. Because of these advantages, many countries and institutions prefer to commercialize technology by transferring it rather than connecting R&D directly to commercialization. In other words, technology transfer has several advantages, such as the prevention of overlapping research, reduction of the R&D cost, and the possibility of professional commercialization, and it is the ideal way to commercialize technology [13]. A significant amount of technology must be transferred for commercialization, so some governments establish and operate support systems to increase the technology transfer success rate. Nevertheless, the success rate is still not high because of existing alternative technologies and market uncertainty. In addition, much time and money are consumed analyzing the value of patents to acquire, and the analysts’ judgments are subjective [14]. Solving this problem requires a data-driven, quantitative, objective technology transfer analysis model [15,16].
Existing quantitative technology transfer analysis models use various patent data information [17,18,19,20,21,22]. However, in reality, quantitative models cannot use all the information in patents. Therefore, it is necessary to search effectively for patterns related to technology transfer in limited patent information and reflect them in the model. In addition, technology transfer is not open to the public for reasons such as security, and there are industries where technology is less often transferred. In such cases, there are insufficient training data to build a technology transfer prediction model. If the training data are insufficient, the model is less likely to be accurate. Therefore, constructing a technology transfer analysis model requires incorporating patent information effectively and addressing the lack of learning data. In this paper, we propose a predictive model that can effectively reflect patterns related to technology transfer by imaging the patent information and addressing the lack of training data. The proposed model uses both numerical and textual information from the patents and uses the DeepInsight [23] algorithm to image the patent information. Imaging the patent information makes it possible to search for characteristics related to technology transfer and to apply a specialized model for image analysis. Moreover, data augmentation is proposed when there are insufficient data for model training. To examine whether the model proposed in this paper can be used in actual industrial fields, we collected patents related to artificial intelligence (AI) technology registered with the United States Patent and Trademark Office (USPTO) to conduct experiments.
This paper is structured as follows. First, in Section 2, related theories and previous studies are reviewed. Section 3 describes the proposed model in detail. Section 4 shows the experimental results from applying the proposed model with actual data. Section 5 discusses the proposed model and experimental results. Finally, Section 6 discusses the limitations of the proposed model, conclusions, and future research tasks.

2. Related Work

2.1. Technology Commercialization

Technology commercialization is the creation of economic benefits by directly or indirectly using R&D output for business [1,2,3]. Because technology commercialization has advantages, such as spreading research results and improving corporate and national competitiveness, many countries operate support measures [4,5,6]. However, despite national support, various difficulties have prevented technology from being sufficiently commercialized. Various studies have proposed methods to address these difficulties and commercialize technology [24,25,26,27,28,29]. Chen analyzed the performance of venture companies using a regression model and identified factors affecting technology commercialization [24]. Jung et al. used a decision-tree model to analyze the success and failure factors of public technology commercialization [25]. Kim and Cho analyzed the major factors contributing to the success of commercializing technology transferred from government-funded research institutes to small and medium enterprises using logistic regression and a decision-tree model [26]. Chen, Jung et al., and Kim and Cho analyzed the success and failure factors of technology commercialization using a quantitative model, but they focused only on specific institutions. Furthermore, they had little data to analyze. Kirchberger and Pohl analyzed previous studies on technology commercialization and searched for the success factors based on the type of institution [27]. Anokhin et al. analyzed the framework for commercializing misfit technology and suggested alternatives for the various cases [28]. Carayannis et al. compared and analyzed the characteristics of the technology commercialization process of US and Russian universities [29]. Studies analyzing the framework of technology commercialization suggested alternatives to the process, from R&D to commercialization. In reality, performing everything from R&D to technology commercialization in one institution is costly and highly likely to fail. Therefore, it is necessary to revitalize technology commercialization via technology transfer using research results that have already been developed.

2.2. Technology Transfer Using Patent Analysis

Technology transfer is intended to increase the success rate of technology commercialization by disseminating the research results to consumers and industries [7,8,9]. There are various types of technology transfer, but the transfer of a patent is a common form. Patent-based technology transfer can maximize the cost efficiency and dissemination of research results by transferring R&D results that have already been developed. A technology transfer study using patent analysis was previously conducted [17,18,19,20,21,22]. Patent analysis is primarily divided into qualitative and quantitative techniques [30,31]. In the qualitative technique, the patents are analyzed by experts, whose subjectivity can be seen in the results, and it requires a significant amount of time and money [14]. Solving these problems requires technology transfer research using quantitative, objective techniques [15,16].
Such techniques use various information from patent data [17,18,19,20,21,22]. Trappey et al. extracted quantitative indicators through principal component analysis and the Kaiser-Meier-Olkien test to evaluate the quality of technology-transfer patents. In addition, they constructed a patent-quality evaluation model with a neural network [17]. Park proposed a model to analyze technology transferability using patent citation information [18]. Park et al. analyzed the potential functions of patents through text-mining techniques and proposed a model to examine the possibility of technology transfer in various industries [19]. Choi et al. used patents to analyze factors affecting technology transfer [20]. Furthermore, technology transfer characteristics were analyzed by applying social network analysis (SNA) to each institution. These prior studies searched for technology transfer fields and influence factors based on patent analysis. In fact, R&D institutions or technology consumers must focus on things such as marketing by searching for patents with high transfer potential. Therefore, a quantitative model is needed to identify potential technology transfer patents in advance. Jun et al. applied text-mining techniques to structure patents and predicted the possibility of technology transfer through SNA [21]. Lee et al. applied textual information and quantitative indicators to an ensemble model to improve the prediction of technology transfer patents [22]. Previous studies for predicting technology transfer patents have built quantitative models using various patent information, but the patent information used to build the actual model was limited. Furthermore, due to the characteristics of technology transfer, the amount of patent training data was insufficient. This makes it difficult to capture small technology-transfer-related variations in the data, and predictive performance may be poor in certain industries due to the lack of training data. The model proposed in this paper converts patent information into images and effectively searches for information related to technology transfer. Moreover, data augmentation allows it to be used when the training data are sparse.

2.3. DeepInsight: Imagification of Variables

Analysis techniques such as statistics and machine learning are usually applied to tabular or similar data composed of numbers because it can be challenging to understand the information contained in non-image data without such techniques. When it is difficult to capture minor variations in such tabular data, they are often converted into images for analysis [23,32]. This conversion into image data allows the inherent information to be more effectively identified using a model specialized for image analysis, such as a convolutional neural network (CNN). Due to these advantages, non-image data are converted into image data in various fields such as genetic analysis and defect measurement [23,32,33,34,35,36].
DeepInsight, proposed by Sharma et al., is an algorithm developed to capture small variations in non-image data and easily discover differences between data [23]. DeepInsight constructs an image by placing similar elements or features together and dissimilar elements farther apart. To transform image data, DeepInsight first defines the location of features using a dimension reduction technique such as kernel principal component analysis or t-distributed stochastic neighbor embedding (tSNE). This study uses tSNE to reduce the dimensions when DeepInsight is used. Equation (1) is the tSNE cost function [37].
C = K L ( P | | Q )   = i j p i j log p i j q i j ,
where p is the probability that the j -th neighbor is selected when given the i -th object existing in a high-dimensional space. Moreover, q is the probability that the j -th neighboring object will be selected, given the i -th object embedded in the low-dimensional space. Next, DeepInsight uses the convex hull algorithm on the location information of the features to find the smallest rectangle containing all objects, rotate them, and map them to pixels. DeepInsight converts various data such as text and genetic analysis into image data. DeepInsight was used in this study with patent information to discover hidden information related to technology transfer.

3. Proposed Model

In this study, we propose a technology commercialization activation model based on technology transfer prediction. The proposed model uses both the numerical and textual information in patents. Moreover, we used DeepInsight to image patent information to discover hidden patterns related to technology transfer and to improve the predictive model performance. Data augmentation was applied so that the model can be used in industries where technology transfer information is not disclosed and technology transfer is rare. The flow of proposed model is shown in Figure 1.

3.1. Data Description and Preprocessing

In this study, we propose a technology commercialization activation model based on technology transfer prediction. To examine whether the proposed model can actually be used in the industrial field, patents were collected from the Database, and experiments were carried out. A total of 15,193 patents related to AI technology and registered with the USPTO were collected. The patents were filed between 1989 and 2016. Among these, 4137 cases of technology transfer occurred. Technology-transferred patents are cases transferred to another person. The transferred patent was checked on keywert, a paid patent information site [38]. Table 1 summarizes the characteristics of the experimental data.
The technology transfer prediction model proposed in this paper was built by learning patent data. Therefore, both numerical and textual information from the patents were used. The numerical information used to build the model includes the indicators shown in Table 2.
As shown in Table 2, indicators that measure various values were extracted and used to build the model. app_to_regi is the number of days elapsed from filing a patent application to registration. In general, applicants request priority examination to grant rights quickly for patents with high utility value. Thus, when the app_to_regi value is low, the utility value is high. all_claim_num is the number of claims, indicating the scope of the rights. Therefore, all_claim_num can be used as a measure of rights. applicant_num represents the number of applicants. Because it indicates whether multiple applicants are involved in the corresponding patent application, it can be used to measure the utility value. inventor_num represents the number of inventors who participated in a patent, so it can measure the sustainability of the development. current_owner_num represents the number of current patent holders, so it is a measure of the influence of the patent in the market. IPC_num is the number of technical field codes associated with the corresponding patent. A higher value means that the patent can be extended to various technologies. b_citation_num and f_citation_num are the numbers of backward and forward citations, respectively. In the backward citation, another patent cited the patent, and in the forward citation, the patent was cited by another patent. The citation degree of a patent can be used to measure the impact of a technology. fam_nation_num and fam_doc_num indicate the number of countries and patents included in family applications. Because a family application is intended to exercise rights in various countries, these indicators can measure the market influence. alone_app_yn indicates whether a patent application is a sole application. A sole patent is less encumbered than a patent in which technology transfer and an overseas application are filed jointly, making it a measure of the utility value. stand_patent_yn indicates whether it is a standard patent. A standard patent plays a key role in the relevant technical field, so it is possible to measure the technological influence. Finally, lit_yn indicates whether or not patent litigation has been filed. Patents that have been litigated are likely relatively important technologies, as they have caused disputes in their field. Therefore, it is possible to measure the utility value based on whether a lawsuit has been filed. lit_yn identified through keywert [38].
In addition to the quantitative information mentioned above, a patent contains detailed information about the developed technology in text form, such as the abstract and title. Patents have different index values depending on the elementary technology field to which they belong. For this reason, the elementary technology must be identified for each patent and reflected in the model training. Elementary technology can be identified through the text information and clustering [39]. In this paper, the abstract section of the patent was used to identify the elementary technology. However, the corpus size is minimal because the text information extracted from the patent data is limited to a specific technology. To derive effective clustering results, patents must be embedded based on a sufficiently large corpus. Therefore, this study used bidirectional encoder representations from transformers (BERT) pre-trained with a large corpus. Table 3 shows the characteristics of the BERT model used for patent embedding [40].
As shown in Table 3, the pre-trained BERT learns various huge text datasets and embeds a set of documents to be analyzed as real numbers in a 384-dimensional space. In this study, 15,193 patents were embedded in 384 dimensions for clustering. In addition, by applying K-means clustering to the embedded patents, multiple clusters were formed, and the elementary technology was identified. When clustering was applied, the optimal number of clusters was selected by comparing values derived from various measures. Table 4 summarizes the measures for deriving the optimal number of clusters [41,42,43,44].
Clusters define and label the elementary technology using high-frequency words. The elementary technology label assigned to each patent is converted into a dummy variable, merged with the quantitative information, and used for the technology commercialization activation model learning.

3.2. Technology Commercialization Activation Model

Patents contain detailed, diverse information about technologies that have been developed, but they require much analysis time and know-how to use them to commercialize technology. In addition, when quantitative analysis is used for technology commercialization, the performance is poor due to limitations in learnable variables and an insufficient number of technology transfers. The performance can be expected to improve if hidden patterns related to technology transfer are searched for effectively within the range of available patent information and reflected in the predictive model construction. Moreover, the lack of technology transfer learning data must also be addressed. The technology commercialization activation model proposed in this paper explores technology transfer patterns inherent in the data by imaging the preprocessed variables and reflecting them to improve the predictive model performance. The imaged data are augmented to solve the lack of learning data.
Converting non-image data into image data makes it possible to search for potential information that has not yet been identified. It can also be used with models that analyze image data, such as CNNs. Therefore, non-image data is imaged and used for analysis in various fields such as gene and text analysis. In this paper, DeepInsight, which converts non-image data into image data, was applied to the patent information. Figure 2 shows the schematic of converting the patent information into image data.
As shown in Figure 2, DeepInsight was applied to the preprocessed patent information, converting it into image data. The non-image data from the patent was normalized using Equation (2) and used with DeepInsight.
x x m i n x m a x x m i n ,
where x m i n is the minimum value of the data and x m a x is the maximum value. Normalized values were projected into a two-dimensional space via tSNE, which uses a convex hull algorithm for projected spatial information to find the smallest rectangle that encloses all the data, rotate it, and map it to a pixel. The number of converted images is equal to the number of patents. Table 5 summarizes the parameters used for data image transformation.
The comparison models are logistic regression (LR), K-nearest neighbor (KNN), decision tree (DT), random forest (RF), AdaBoost (AB), and a CNN. The comparison models learn non-image and image data, respectively, and compare the technology transfer prediction performance. Additionally, data augmentation was performed to prepare for situations with only a small amount of technology transfer learning data, and the learning and prediction results are also presented. Data augmentation uses the synthetic minority oversampling technique (SMOTE) and random oversampling (RS), which are primarily used in label imbalance problems. Furthermore, accuracy, precision, recall, and the F1-score were used to evaluate the model prediction performance in this paper.

4. Experimental Results

Various models were used to evaluate the technology transfer prediction performance of the technology commercialization activation model proposed in this paper, and the results were compared. To evaluate the technology transfer prediction performance of the proposed technology commercialization activation model, actual registered patents were collected, and experiments were conducted. A total of 15,193 patents related to AI technology were collected. Of these, 12,154 were used as the training data, and the remaining 3039 were used for the test data. Table 6 summarizes the characteristics of the training and test datasets.
Table 6 shows there were 8845 (72.77%) non-transferred and 3309 (27.23%) transferred patents in the training dataset. In addition, the test dataset contained 2211 (72.75%) non-transferred and 828 (27.25%) transferred patents. The image data had a size of 120 × 120, consisting of three channels (RGB).
Text information was extracted from the patent abstract, and the documents were embedded in 384 dimensions using the pre-trained BERT. Next, the elementary technology was identified by applying K-means clustering to the embedded patent. The optimal K value was then selected by comparing the values derived for each measure, as shown in Table 7.
The areas colored in yellow in Table 7 show the optimal values derived for each measure. For CH and DB, it was found that K values of two and three, respectively, were appropriate. For SS and SSE, a K value of five was found to be appropriate. Based on these results, the collected patent data were grouped into five elementary technology clusters in this study. Table 8 defines the high-frequency words and elementary technology by cluster.
As shown in Table 8, the basic technology labels identified in patents were converted into dummy variables and merged with preprocessed quantitative information to be used with DeepInsight. Figure 3 shows an example of a converted patent information image.
Figure 3 shows the difference in the patent information image based on whether or not the technology was transferred. The technology transfer prediction performance was compared by training the models on the derived patent information images using the parameters shown in Table 9.
We trained the models shown in Table 9 using non-image and image data and compared the technology transfer prediction performance. Table 10 summarizes their prediction performance by data type.
The values colored in yellow in Table 10 indicate the best performance for each indicator. In general, the predictive performance of the models trained with image data was better than that of the models trained with non-image data. RF performed best, and its accuracy and precision improved when trained with image data. In addition, all measurement indices for KNN were better when it was trained with image data than when it was trained with non-image data. CNN, which is specialized in image data analysis, had an accuracy of 0.75, the second-highest prediction performance after RF, and its recall and F1-score were the best of all the models.
A significant proportion of the previously learned dataset was labeled as transferred technology, at a ratio of 7:3. In general, technology transfer in industry is infrequent. Therefore, it is necessary to prepare for cases where the labels are imbalanced. In this study, the image data were augmented using the oversampling techniques SMOTE and RS, and the training and prediction performance of the models were compared. Table 11 summarizes the data augmented by oversampling.
As shown in Table 11, data augmentation adjusted the ratio of the technology transfer label from 7:3 to 5:5. In other words, a total of 12,154 training data elements were augmented to 17,690. Table 12 summarizes the prediction performance of each model based on the data augmentation technique.
Table 12 shows that RF outperformed the other models. The accuracy of the models was lower than when trained on the data before data augmentation. However, the F1-scores were higher in most models than when trained with the original training dataset. The CNN prediction performance was evaluated after learning with data augmented by the commonly used Keras ImageDataGenerator [45]. Table 13 summarizes the parameters used in ImageDataGenerator and the prediction performance of CNNs trained on the augmented data.
Table 13 shows that the CNN trained using the augmented image data had better accuracy than the original model. In addition, the precision, recall, and F1-scores were higher than those of the other models trained on the augmented data. Thus, when sufficient training data cannot be obtained or the technology transfer label is highly imbalanced, training a prediction model using data augmentation may be a better method.

5. Discussion

In this study, we used DeepInsight to image variables, searched for patterns related to technology transfer in limited patent information, and reflected them in building a predictive model. In addition, a data augmentation technique that can be used when there are insufficient training data was applied to the model. Experiments were conducted by collecting patents registered with the USPTO. The results showed that the models trained by converting patent information into images generally outperformed those trained on non-image data. Among them, the RF model accuracy and CNN recall and F1-score were the highest values. In the data augmentation experiment based on oversampling, the RS method performed better than SMOTE, and the CNN accuracy when the model was trained through ImageDataGenerator was better than that of the original model. Unlike the oversampling technique, which generates new data, CNN is considered to have improved prediction performance because it uses data augmented by converting existing data by such actions as rotation and horizontal flip.
In this study, only patents registered in the USPTO were used for modeling. A patent is a territorial system in which rights are granted only in the country in which it is filed. Therefore, the characteristics of patents may differ from country to country. It will be necessary to collect and model patents from various countries and to compare and analyze the results.

6. Conclusions

Technology commercialization is an important management strategy to strengthen the competitiveness of countries and companies, which various countries are trying to revitalize. Despite their efforts, various challenges make technology commercialization difficult to revitalize. Among the various forms of technology commercialization, technology transfer, in which patents are transferred to others, is the ideal commercialization model. Many institutions strive to promote technology commercialization through technology transfer. However, analyzing the target patents for technology transfer consumes substantial time and money, and the existing quantitative models have poor prediction performance because of limitations in patent information and a lack of training data. In this study, patent information was converted into image data using DeepInsight, and patterns related to technology transfer were explored. In addition, a data augmentation technique that can be used when the data for technology transfer learning are insufficient was applied to the model. The proposed model was tested with an actual patent and showed that it can be used in industry. The model proposed in this paper can be used for predicting technology transfer related to AI. Using the proposed model, it is believed that it will be possible to identify patents with high potential for technology transfer in the AI field in advance and establish various technology commercialization strategies.
The model proposed in this paper can help commercialize technology by predicting technology transfers in advance. However, the proposed model has the following limitations. First, the model was built focusing on only one type of technology transfer, namely, transferring patents. In actuality, technology is transferred in various ways, such as licensing and M&A, but all these methods were not reflected in the model. Next, some of the original image information, such as drawings in the patents, was not used. Patents include images such as drawings to explain the technology developed and to secure rights. The original images from these patents should be combined with the existing quantitative information. In future research, more effective quantitative technology commercialization revitalization methods should be studied by addressing the above-mentioned limitations.

Author Contributions

Y.K. and S.P. conceived and designed the experiments; Y.K. analyzed the data to illustrate the validity of this study; Y.K. and J.K. wrote the paper and performed all of the research steps. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2022R1I1A1A01062652).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the author [contact: [email protected]].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mitchell, W.; Singh, K. Survival Of Businesses Using Collaborative Relationships To Commercialize Complex Goods. Strateg. Manag. J. 1996, 17, 169–195. [Google Scholar] [CrossRef]
  2. Branscomb, L.M.; Auerswald, P.E.; Evans, D.L.; Bond, P.J.; Bement, A.L. Between Invention and Innovation an Analysis of Funding for Early-Stage Technology Development; United States Department of Commerce: Washington, DC, USA, 2002.
  3. Markman, G.D.; Siegel, D.S.; Wright, M. Research and Technology Commercialization. J. Manag. Stud. 2008, 45, 1401–1423. [Google Scholar] [CrossRef]
  4. Hwang, I. Analyzing Key Determinants of National R&D Technology Commercialization and Its Implications. 2021. Available online: https://www.kistep.re.kr/boardDownload.es?bid=0031&list_no=42465&seq=1 (accessed on 7 July 2022).
  5. Aram, J.D.; Lynn, L.H.; Reddy, N.M. Institutional Relationships and Technology Commercialization: Limitations of Market Based Policy. Res. Policy 1992, 21, 409–421. [Google Scholar] [CrossRef]
  6. Hwang, H.-R.; Kim, K.-K.; Jeong, H.-K. A Study on the Technology Commercialization Policy for Technology-Based SMEs: Case on Daedeok Innopolis. Asia-Pac. J. Bus. Ventur. Entrep. 2013, 8, 39–52. [Google Scholar] [CrossRef]
  7. Bozeman, B. Technology Transfer and Public Policy: A Review of Research and Theory. Res. Policy 2000, 29, 627–655. [Google Scholar] [CrossRef]
  8. Wahab, S.A.; Rose, R.C.; Osman, S.I.W. Defining the Concepts of Technology and Technology Transfer: A Literature Analysis. Int. Bus. Res. 2011, 5, 61–71. [Google Scholar] [CrossRef]
  9. About Technology Transfers. Available online: https://eng.kitech.re.kr/support/page1-1.php (accessed on 7 July 2022).
  10. Rogers, E.M.; Takegami, S.; Yin, J. Lessons Learned about Technology Transfer. Technovation 2001, 21, 253–261. [Google Scholar] [CrossRef]
  11. Arora, A.; Gambardella, A. Ideas for Rent: An Overview of Markets for Technology. Ind. Corp. Chang. 2010, 19, 775–803. [Google Scholar] [CrossRef]
  12. Koo, Y.; Cho, K. The Relationship between Patents, Technology Transfer and Desorptive Capacity in Korean Universities. Sustainability 2021, 13, 5253. [Google Scholar] [CrossRef]
  13. Lee, Y.-J. Strategies for the Successful Technology Transfer from Public Research Institutes in Korea. J. Technol. Innov. 2008, 16, 141–163. [Google Scholar]
  14. Kaya, H.; Yazgan, M.E. Probability Forecasts of Macroaggregates in the Turkish Economy. Emerg. Mark. Finance Trade 2014, 50, 214–229. [Google Scholar] [CrossRef]
  15. Baek, D.H.; Sul, W.; Hong, K.P.; Kim, H. A Technology Valuation Model to Support Technology Transfer Negotiations. R D Manag. 2007, 37, 123–138. [Google Scholar] [CrossRef]
  16. Kim, M.-S.; Lee, C.-H.; Choi, J.-H.; Jang, Y.-J.; Lee, J.-H.; Lee, J.; Sung, T.-E.; Choi, J.-H.; Jang, Y.-J.; Lee, J.-H.; et al. A Study on Intelligent Technology Valuation System: Introduction of KIBO Patent Appraisal System II. Sustainability 2021, 13, 12666. [Google Scholar] [CrossRef]
  17. Trappey, A.J.C.; Trappey, C.V.; Wu, C.Y.; Lin, C.W. A Patent Quality Analysis for Innovative Technology and Product Development. Adv. Eng. Inform. 2012, 26, 26–34. [Google Scholar] [CrossRef]
  18. Park, Y.; Lee, S.; Lee, S. Patent Analysis for Promoting Technology Transfer in Multi-Technology Industries: The Korean Aerospace Industry Case. J. Technol. Transf. 2012, 37, 355–374. [Google Scholar] [CrossRef]
  19. Park, H.; Yoon, J.; Kim, K. Using Function-Based Patent Analysis to Identify Potential Application Areas of Technology for Technology Transfer. Expert Syst. Appl. 2013, 40, 5260–5265. [Google Scholar] [CrossRef]
  20. Choi, J.; Jang, D.; Jun, S.; Park, S. A Predictive Model of Technology Transfer Using Patent Analysis. Sustainability 2015, 7, 16175–16195. [Google Scholar] [CrossRef] [Green Version]
  21. Jun, S.; Park, S.; Jang, D. A Technology Valuation Model Using Quantitative Patent Analysis: A Case Study of Technology Transfer in Big Data Marketing. Emerg. Mark. Finance Trade 2015, 51, 963–974. [Google Scholar] [CrossRef]
  22. Lee, J.; Kang, J.H.; Jun, S.; Lim, H.; Jang, D.; Park, S. Ensemble Modeling for Sustainable Technology Transfer. Sustainability 2018, 10, 2278. [Google Scholar] [CrossRef] [Green Version]
  23. Sharma, A.; Vans, E.; Shigemizu, D.; Boroevich, K.A.; Tsunoda, T. DeepInsight: A Methodology to Transform a Non-Image Data to an Image for Convolution Neural Network Architecture. Sci. Rep. 2019, 9, 11399. [Google Scholar] [CrossRef] [Green Version]
  24. Chen, C.J. Technology Commercialization, Incubator and Venture Capital, and New Venture Performance. J. Bus. Res. 2009, 62, 93–103. [Google Scholar] [CrossRef]
  25. Jung, M.; Lee, Y.-b.; Lee, H. Classifying and Prioritizing the Success and Failure Factors of Technology Commercialization of Public R&D in South Korea: Using Classification Tree Analysis. J. Technol. Transf. 2015, 40, 877–898. [Google Scholar] [CrossRef]
  26. Effects, K.; Lee, M.-K.; Lee, S.; Kim, M.; Kim, J.-K.; Cho, K.-T. Effects of Technology Commercialization Proactiveness on Commercialization Success: The Case of ETRI in Korea. Sustainability 2022, 14, 7056. [Google Scholar] [CrossRef]
  27. Kirchberger, M.A.; Pohl, L. Technology Commercialization: A Literature Review of Success Factors and Antecedents across Different Contexts. J. Technol. Transf. 2016, 41, 1077–1112. [Google Scholar] [CrossRef]
  28. Anokhin, S.; Wincent, J.; Frishammar, J. A Conceptual Framework for Misfit Technology Commercialization. Technol. Forecast. Soc. Chang. 2011, 78, 1060–1071. [Google Scholar] [CrossRef]
  29. Carayannis, E.G.; Cherepovitsyn, A.Y.; Ilinova, A.A. Technology Commercialization in Entrepreneurial Universities: The US and Russian Experience. J. Technol. Transf. 2015, 41, 1135–1147. [Google Scholar] [CrossRef]
  30. Zhang, L.; Li, L.; Tao, L. Patent Mining. ACM Sigkdd Explor. Newsl. 2015, 16, 1–19. [Google Scholar] [CrossRef]
  31. Park, S.; Jun, S. Patent Analysis Using Bayesian Data Analysis and Network Modeling. Appl. Sci. 2022, 12, 1423. [Google Scholar] [CrossRef]
  32. Zhu, Y.; Brettin, T.; Xia, F.; Partin, A.; Shukla, M.; Yoo, H.; Evrard, Y.A.; Doroshow, J.H.; Stevens, R.L. Converting Tabular Data into Images for Deep Learning with Convolutional Neural Networks. Sci. Rep. 2021, 11, 11325. [Google Scholar] [CrossRef]
  33. Furuya, S.; Sanaee, A.; Georgescu, S.; Townsend, J.; Rasmussen, B.; Chow, P.; Snelling, D.; Goto, M. Imagification Technology and Deep Learning Accelerating Defect Detection in Non-Destructive Testing for Wind Turbine Blades. Fujitsu Sci. Tech. J. 2019, 55, 23–29. [Google Scholar]
  34. Naz, M.; Shah, J.H.; Khan, M.A.; Sharif, M.; Raza, M.; Damaševičius, R. From ECG Signals to Images: A Transformation Based Approach for Deep Learning. PeerJ Comput. Sci. 2021, 7, e386. [Google Scholar] [CrossRef] [PubMed]
  35. Rahim, M.A.; Hassan, H.M. A Deep Learning Based Traffic Crash Severity Prediction Framework. Accid. Anal. Prev. 2021, 154, 106090. [Google Scholar] [CrossRef] [PubMed]
  36. Andresini, G.; Appice, A.; de Rose, L.; Malerba, D. GAN Augmentation to Deal with Imbalance in Imaging-Based Intrusion Detection. Future Gener. Comput. Syst. 2021, 123, 108–127. [Google Scholar] [CrossRef]
  37. van der Maaten, L.; Hinton, G. Visualizing Data Using T-SNE. JMLR 2008, 9, 2579–2605. [Google Scholar]
  38. Keywert. Available online: https://www.keywert.com/ (accessed on 7 July 2022).
  39. Sharma, A. A Survey On Different Text Clustering Techniques For Patent Analysis. IJERT 2012, 1, IJERTV1IS9098. [Google Scholar] [CrossRef]
  40. Sentence-Transformers/Pretrained_models.Md at Master UKPLab/Sentence-Transformers GitHub. Available online: https://github.com/UKPLab/sentence-transformers/blob/master/docs/pretrained_models.md (accessed on 7 July 2022).
  41. Sklearn.Metrics.Calinski_harabasz_score—Scikit-Learn 1.1.1 Documentation. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.calinski_harabasz_score.html (accessed on 7 July 2022).
  42. Sklearn.Metrics.Davies_bouldin_score—Scikit-Learn 1.1.1 Documentation. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.davies_bouldin_score.html (accessed on 7 July 2022).
  43. Sklearn.Metrics.Silhouette_score—Scikit-Learn 1.1.1 Documentation. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.silhouette_score.html (accessed on 7 July 2022).
  44. Sklearn.Cluster.KMeans—Scikit-Learn 1.1.1 Documentation. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html (accessed on 7 July 2022).
  45. Image Preprocessing—Keras Documentation. Available online: https://keras.io/ko/preprocessing/image/ (accessed on 7 July 2022).
Figure 1. Proposed model.
Figure 1. Proposed model.
Applsci 12 07994 g001
Figure 2. Imagification process using DeepInsight.
Figure 2. Imagification process using DeepInsight.
Applsci 12 07994 g002
Figure 3. Samples of patent information images: (a) transferred patent’s image; (b) non-transferred patent’s images.
Figure 3. Samples of patent information images: (a) transferred patent’s image; (b) non-transferred patent’s images.
Applsci 12 07994 g003
Table 1. Experimental data.
Table 1. Experimental data.
Technological FieldDBApplication
Period
StatusNumber of Patents
(Number Transferred)
Artificial Intelligence (AI)USPTO1989–2016Registered15,193 (4137)
Table 2. Numerical information.
Table 2. Numerical information.
IndicatorDescriptionMeasurable Value
Information
app_to_regi (days)Time required from application to registrationUtility value
all_claim_numNumber of claimsRights
applicant_numNumber of applicantsUtility
inventor_numNumber of inventorsSustainable development
current_owner_numNumber of current rights ownersMarket impact
IPC_numNumber of IPC codesTechnology scalability
b_citation_numNumber of backward citationsTechnology impact
f_citation_numNumber of forward citationsTechnology impact
fam_nation_numNumber of family nationsMarket impact
fam_doc_numNumber of family patentsMarket impact
alone_app_ynSole application status (dummy)Utility value
stand_patent_ynStandard patent status (dummy)Technology impact
lit_ynLitigation status (dummy)Utility value
Table 3. Information about the pre-trained BERT.
Table 3. Information about the pre-trained BERT.
ModelDescriptionNumber of
Dimensions
Size
all-MINIlm-L12-v2Trained on a large, diverse dataset of over one billion training pairs384120 MB
Table 4. Measures for deriving the optimal number of clusters.
Table 4. Measures for deriving the optimal number of clusters.
MeasureDescription
CH
(Calinski-Harabasz index)
Calculated as the ratio of the sum of dispersion between clusters and the dispersion within clusters for all clusters. The higher the value, the better the performance [41].
DB
(Davies-Bouldin index)
Calculated as the average similarity between each cluster and its most similar cluster, where the similarity is the ratio of the distance within the cluster to the distance between clusters. The lower the value, the better the performance [42].
SS
(Silhouette score)
A measure of how similar an object is to its own cluster compared to other clusters. The higher the value, the better the performance [43].
SSE
(Sum of squared errors)
The sum of the squared differences between each observation and its group’s mean. SSE is used with elbow plots to find the optimal number of clusters [44].
Table 5. Imagification parameters.
Table 5. Imagification parameters.
ParameterValue
Dimensionality reduction methodtSNE
Pixel size120 × 120
Channel3 (RGB)
Table 6. Characteristics of the training and test datasets.
Table 6. Characteristics of the training and test datasets.
DataNumber of Patents
(Size: Height × Width × Channel)
Number of Labels
(0:Non-Transferred,
1:Transferred)
Non-image dataTraining data12,1540:8845
1:3309
Test data30390:2211
1:828
Image dataTraining data12,154 (120 × 120 × 3)0:8845
1:3309
Test data3039 (120 × 120 × 3)0:2211
1:828
Table 7. Result of optimal number of clusters.
Table 7. Result of optimal number of clusters.
KMeasure
CHDBSSSSE
2782.671 4.315 0.047 183,495.098
3676.904 3.042 0.058 177,159.764
4635.130 3.258 0.063 171,663.809
5580.094 3.149 0.064 167,377.781
6516.671 3.587 0.044 164,899.573
7473.446 3.640 0.043 162,545.410
8438.377 3.609 0.045 160,513.242
9409.199 3.638 0.046 158,727.356
Table 8. Elementary technologies.
Table 8. Elementary technologies.
ClusterTop-Frequency WordsElementary Technology
0Signal, Sound, SpeechAuditory intelligence
1Image, Detect, MotionVisual intelligence
2Language, Translate, GenerateLanguage intelligence
3Device, Control, CircuitAI semiconductor
4Assist, Person, DigitIntelligent agent
Table 9. Parameters for comparison models.
Table 9. Parameters for comparison models.
ModelParameters
Logistic regression (LR)L2 Penalty
K-Nearest neighbor (KNN)K = 3, Distance = Minkowski
Decision tree (DT)Criterion = Gini
Random forest (RF)Criterion = Gini, # of features = sqrt
AdaBoost (AB)Criterion = Gini, # of features = sqrt
Convolutional neural network (CNN)Layers = 3, activation function = relu/softmax,
Optimizer = Adam, Epochs = 50
Table 10. Prediction performance by data type.
Table 10. Prediction performance by data type.
Data TypeMeasureModel
LRKNNDTRFABCNN
Non-image dataAccuracy0.730.660.660.760.74-
Precision0.630.530.570.70.75
Recall0.520.520.580.610.54
F1-score0.470.520.570.620.5
Image dataAccuracy0.730.70.660.770.740.75
Precision0.630.610.580.730.750.68
Recall0.520.590.580.610.540.62
F1-score0.470.590.580.620.50.63
Table 11. Parameters of comparison models.
Table 11. Parameters of comparison models.
Training Data LabelOriginal Proportion in Dataset (%)Proportion after Data Augmentation (%)
0: Non-transferred8845 (72.77%)8845 (50%)
1: Transferred3309 (27.23%)8845 (50%)
Table 12. Comparison of prediction performance by data augmentation technique.
Table 12. Comparison of prediction performance by data augmentation technique.
TechniqueMeasureModel
LRKNNDTRFAB
SMOTEAccuracy0.610.630.650.730.67
Precision0.580.590.580.650.59
Recall0.60.60.590.630.59
F1-score0.570.580.590.630.59
RSAccuracy0.620.630.660.750.66
Precision0.580.590.570.680.6
Recall0.60.60.570.630.61
F1-score0.580.580.570.640.6
Table 13. Parameters of ImageDataGenerator and CNN prediction performance.
Table 13. Parameters of ImageDataGenerator and CNN prediction performance.
Image Data Generator ParametersCNN
MeasureValue
Horizontal flip = TrueAccuracy0.76
Vertical flip = TruePrecision0.7
Rotation range = 0.45Recall0.6
Zoom range = 0.2F1-score0.61
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, Y.; Park, S.; Kang, J. Technology Commercialization Activation Model Using Imagification of Variables. Appl. Sci. 2022, 12, 7994. https://doi.org/10.3390/app12167994

AMA Style

Kim Y, Park S, Kang J. Technology Commercialization Activation Model Using Imagification of Variables. Applied Sciences. 2022; 12(16):7994. https://doi.org/10.3390/app12167994

Chicago/Turabian Style

Kim, Youngho, Sangsung Park, and Jiho Kang. 2022. "Technology Commercialization Activation Model Using Imagification of Variables" Applied Sciences 12, no. 16: 7994. https://doi.org/10.3390/app12167994

APA Style

Kim, Y., Park, S., & Kang, J. (2022). Technology Commercialization Activation Model Using Imagification of Variables. Applied Sciences, 12(16), 7994. https://doi.org/10.3390/app12167994

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop