EconPapers    
Economics at your fingertips  
 

Integrated Survival Model for Predicting Patent Litigation Hazard

Youngho Kim, Sangsung Park, Junseok Lee, Dongsik Jang and Jiho Kang
Additional contact information
Youngho Kim: Department of Industrial Management Engineering, Korea University, Seoul 02841, Korea
Sangsung Park: Department of Big Data and Statistics, Cheongju University, Chungbuk 28503, Korea
Junseok Lee: MICUBE Solution, Seoul 06719, Korea
Dongsik Jang: Department of Industrial Management Engineering, Korea University, Seoul 02841, Korea
Jiho Kang: Machine Learning Big Data Institute, Korea University, Seoul 02841, Korea

Sustainability, 2021, vol. 13, issue 4, 1-15

Abstract: Patent litigation occurs when a company’s product or service violates the scope of another company’s patent rights. When they occur, companies suffer a disruption to the sales of their products and services, thus hindering the sustainability of their business activities. For this reason, companies have established and analyzed wide-ranging strategies to prevent patent litigation. Of those, statistical and machine learning-based quantitative methods using patent big data have several advantages, such as a reduced cost and objective results. Existing quantitative methods analyze patent information and litigation based on the time of data collection. However, the values of patents and their litigation hazards change over time. In addition, the existing methods do not take into account censored data; that is, patents that may result in litigation after the data is collected. In this paper, to solve this problem we propose an integrated survival model that considers censored data and predicts patent litigation hazards over time. The proposed model is a non-parametric survival analysis method based on a random survival forest. It uses pre-trained word2vec and clustering to effectively reflect the technology fields as well as the quantitative information of the patent. The word2vec is a technique for natural language processing and enables the use of patent text information. In order to examine the practicality of the integrated survival model, an experiment is conducted with patent big data related to sensor semiconductors based on AI technology applicable to robotics. In the experiment, it was found that the litigation hazard occurred 150 months after the patent application and increase rapidly from 200 months. Furthermore, the proposed model showed better predictive performance than other survival analysis models. The proposed model could be used by potential defendants to protect their patents.

Keywords: patent litigation; survival analysis; patent big data; text mining; random survival forest (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
https://www.mdpi.com/2071-1050/13/4/1763/pdf (application/pdf)
https://www.mdpi.com/2071-1050/13/4/1763/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:13:y:2021:i:4:p:1763-:d:494763

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:13:y:2021:i:4:p:1763-:d:494763