

Patent Technology Trends of Oral Health: Application of Text Mining
J Dent Hyg Sci 2024;24:9-21
Published online March 31, 2024;
© 2024 Korean Society of Dental Hygiene Science.

Hee-Kyeong Bak1, Yong-Hwan Kim2, and Han-Na Kim1,†

1Department of Dental Hygiene, College of Health and Medical Sciences, Cheongju University, Cheongju 28503, 2Department of Library and Information Science, College of Humanities and Social Sciences, Cheongju University, Cheongju 28503, Korea
Correspondence to: Han-Na Kim,
Department of Dental Hygiene, College of Health and Medical Sciences, Cheongju University, 298 Daeseong-ro, Cheongwon-gu, Cheongju 28503, Korea
Tel: +82-43-229-8373, Fax: +82-43-229-8969, E-mail:
Received December 19, 2023; Revised January 11, 2024; Accepted January 18, 2024.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Background: The purpose of this study was to utilize text network analysis and topic modeling to identify interconnected relationships among keywords present in patent information related to oral health, and subsequently extract latent topics and visualize them. By examining key keywords and specific subjects, this study sought to comprehend the technological trends in oral health-related innovations. Furthermore, it aims to serve as foundational material, suggesting directions for technological advancement in dentistry and dental hygiene.
Methods: The data utilized in this study consisted of information registered over a 20-year period until July 31st, 2023, obtained from the patent information retrieval service, KIPRIS. A total of 6,865 patent titles related to keywords, such as “dentistry,” “teeth,” and “oral health,” were collected through the searches. The research tools included a custom-designed program coded specifically for the research objectives based on Python 3.10. This program was used for keyword frequency analysis, semantic network analysis, and implementation of Latent Dirichlet Allocation for topic modeling.
Results: Upon analyzing the centrality of connections among the top 50 frequently occurring words, “method,” “tooth,” and “manufacturing” displayed the highest centrality, while “active ingredient” had the lowest. Regarding topic modeling outcomes, the “implant” topic constituted the largest share at 22.0%, while topics concerning “devices and materials for oral health” and “toothbrushes and oral care” exhibited the lowest proportions at 5.5% each.
Conclusion: Technologies concerning methods and implants are continually being researched in patents related to oral health, while there is comparatively less technological development in devices and materials for oral health. This study is expected to be a valuable resource for uncovering potential themes from a large volume of patent titles and suggesting research directions.
Keywords : Big data, Data mining, Oral health, Patent, Topic modeling

Introduction


With economic growth, advances in medical technology, and improvements in education, South Korea has seen increasing public interest in healthcare information. The desire for better health has escalated with the rapidly aging population, and both individuals and society are actively interested in enhancing health and fitness and prolonging lifespan. This growing interest extends to oral health, which plays a role in improving quality of life1). As consumer interest in oral health continues to rise, competition in the premium toothbrush market within the oral care product industry has intensified, and various functional toothpaste products have been introduced amid this competition2). Interest in oral cleansers has also been increasing, not only for the management of oral diseases and the oral environment but also for functions such as bad breath elimination and teeth whitening. Effective oral cleansers targeting periodontal disease and oral environmental management have been consistently researched. In South Korea, active research focuses on natural substances that induce changes in the oral environment and examines their antibacterial effects on periodontal diseases and oral bacteria3). Interest in dental treatments extends beyond oral health and includes an increased focus on aesthetic dental care. Owing to the recent westernization of diets, demand for and interest in orthodontics have grown4). Moreover, with an increasing proportion of adults seeking orthodontic treatment and rising income levels, there is a heightened desire for aesthetic procedures. Consequently, orthodontic devices and clinical research continue to develop in line with these trends. Transparent aligners and lingual orthodontic devices have been widely adopted as key tools in aesthetic orthodontic treatments.
Companies that provide transparent aligners are developing various methods and auxiliary devices for tooth movement5). In dental prosthodontics, numerous aesthetic materials and restorative products have been introduced, notably zirconia, which has garnered significant attention owing to its excellent strength, wear resistance, and high biocompatibility6). As the demand for esthetic restorative materials increases, computer-aided design/computer-aided manufacturing (CAD/CAM) methods have been introduced. In dentistry, the utilization of three-dimensional (3D) printing technology is steadily increasing in various areas such as diverse tooth models, temporary teeth, transparent aligners, and implants, leading to continuous advancements in associated technologies7,8).

Although various studies have accompanied the development of oral health-related technologies, comprehensive research that encompasses all oral health-related technologies is difficult to find. There are diverse methods for predicting promising technologies using technological trend analyses. Traditional methods such as the Delphi technique, analytic hierarchy process, scenario technique, expert panels, and trend extrapolation exist but are limited by the subjective opinions of relevant technology experts. Consequently, data-based methods utilizing research papers and patent information have been predominantly employed9). Patent data, for example, can be utilized as a metric for measuring trends and achievements in technological research and development. A patent is a textual document that contains specific technical and scientific information about an invention and details the elements intended for legal protection. Moreover, patent data are quantifiable, objective data on the latest technology and are globally standardized according to specific formats, which makes them applicable for various purposes. Sawng et al.10) and Jo et al.11) have previously conducted research utilizing patent information as analytical data for technological trends12). In this study, we performed network analysis and topic modeling using text analysis programs to comprehensively analyze patent data. Network analysis breaks down the words constituting sentences to extract meaningful concepts and represents the relationships among words in a network format. It allows us to identify keywords and understand the connections between words, thereby enabling us to infer the context of sentences within a text13). Kim et al.14) also applied network analysis methods to analyze keywords related to dental hygiene. Topic modeling is a technique used to estimate latent and meaningful topics within a collection of unstructured text data.
Among topic-modeling algorithms, Latent Dirichlet Allocation (LDA) is a method that derives a specified number of topics by considering the probability distribution of terms related to the topics15). Kim et al.16) and Lee17) conducted patent analyses by applying topic-modeling techniques to uncover hidden themes within a large volume of documents.


Although previous studies have examined text analysis using patent data, there remains a shortage of research in the healthcare field. This study aimed to utilize text-mining techniques to examine the interrelationships among keywords in patents related to oral health and extract latent topics for visualization. Through this process, the study sought to derive significant keywords, analyze core and specific technologies within different topics, and comprehend technological trends. Additionally, it advocates the active utilization of text analysis techniques in dental hygiene and aims to utilize patent-derived analysis as fundamental data to promote advancements in oral health-related technologies.

Materials and Methods

This study progressed sequentially, as depicted in Fig. 1, and involved data collection, data preprocessing, keyword extraction, frequency analysis, network analysis and visualization, and topic modeling.

Fig. 1. Research methodology.

1.Patent data collection

As of July 31, 2023, a total of 11,710 patent documents related to oral health were collected from KIPRIS, the domestic patent database service platform. The scope of data collection was not limited by time and included patents that were either publicly disclosed or registered; search queries were constructed with keywords related to “dentistry,” “teeth,” and “oral health.” Keyword selection was based on the synonyms provided by KIPRIS’s search term expansion function. After the patent data were gathered, invention titles and abstracts were reviewed, and titles associated with pet-related content were excluded. Additionally, Korean invention titles that contained the terms “teeth,” “tooth,” or “dental” but were not semantically related were excluded. The final research dataset comprised 6,865 invention titles in Korean, spanning March 2003 to July 2023.

2.Data preprocessing and keyword extraction

We used Python version 3.10 (Python Software Foundation, Santa Clara, CA, USA) as the text analysis tool. For the Korean text, we employed the Hannanum morphological analyzer from the Korean Natural Language Processing in Python (KoNLPy) package. To extract meaningful keywords, we preprocessed the tokenized words. Korean spelling variants, such as “seuk-ru, im-peul-laen-teu, im-peul-lam-teu, beu-rae-kit, pik-seu-chyu, pik-seu-chyu-eo, pik-seu-chwo, yu-nit, yu-ni-teu, eo-bu-teu-meon-teu, and eo-byu-teu-meon-teu,” were standardized to a single word with the same meaning: screw, implant, bracket, fixture, unit, and abutment. Preprocessing also involved removing punctuation, numbers, and stop words, and retaining only words consisting of more than one syllable for the analysis.
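
The normalization and filtering steps described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' program: the tokens are assumed to have already been produced by a morphological analyzer (the study used Hannanum), the romanized spellings stand in for the Korean originals, a length check stands in for the one-syllable filter, and `VARIANTS`, `STOP_WORDS`, and `clean_tokens` are hypothetical names.

```python
import re

# Hypothetical variant map: romanized spellings from the text -> standard form.
VARIANTS = {
    "seuk-ru": "screw",
    "im-peul-laen-teu": "implant",
    "im-peul-lam-teu": "implant",
    "beu-rae-kit": "bracket",
    "pik-seu-chyu": "fixture",
    "pik-seu-chyu-eo": "fixture",
    "pik-seu-chwo": "fixture",
    "yu-nit": "unit",
    "yu-ni-teu": "unit",
    "eo-bu-teu-meon-teu": "abutment",
    "eo-byu-teu-meon-teu": "abutment",
}

# Illustrative stop words only; the study's actual list is not given.
STOP_WORDS = {"and", "the", "of"}


def clean_tokens(tokens):
    """Unify spelling variants, then drop punctuation, numbers, stop words,
    and single-character tokens (a stand-in for the one-syllable filter)."""
    cleaned = []
    for raw in tokens:
        token = raw.lower().strip()
        token = VARIANTS.get(token, token)    # unify spelling variants
        token = re.sub(r"[^a-z]", "", token)  # remove digits and punctuation
        if len(token) <= 1 or token in STOP_WORDS:
            continue
        cleaned.append(token)
    return cleaned


title_tokens = ["im-peul-lam-teu", "pik-seu-chwo", "and", "3", "method", ",", "a"]
print(clean_tokens(title_tokens))  # ['implant', 'fixture', 'method']
```

Mapping variants before stripping punctuation matters here, because the hyphenated spellings would no longer match the map once the hyphens are removed.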

3.Keyword frequency analysis

Using the Pandas module and the Counter class in Python, we conducted a frequency analysis of the keywords (37,155) extracted from the patent invention titles. We then extracted the 50 keywords with the highest occurrence frequencies.
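
A frequency count of this kind might look like the sketch below. It uses only `collections.Counter` from the standard library; the function name `top_keywords` and the three toy titles are hypothetical and are not taken from the study's data.

```python
from collections import Counter


def top_keywords(tokenized_titles, n=50):
    """Count keyword occurrences over all titles and return the n most frequent."""
    counts = Counter()
    for tokens in tokenized_titles:
        counts.update(tokens)
    return counts.most_common(n)


# Toy input: each inner list is one preprocessed invention title.
titles = [
    ["implant", "manufacture", "method"],
    ["orthodontic", "apparatus", "method"],
    ["toothbrush", "method"],
]
print(top_keywords(titles, n=1))  # [('method', 3)]
```

The resulting list of (keyword, count) pairs could then be loaded into a pandas DataFrame for tabulation, which is presumably where the Pandas module comes in.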

4.Network analysis and network visualization

We conducted a network analysis to understand the connections between the keywords. We compared the degree centrality indices of the top 50 keywords by frequency. Degree centrality is calculated from the number of links directly connected to a specific node; a higher value implies that the node is directly connected to many other nodes and is therefore more central to the network18,19). The analysis was performed using a Python network analysis library. From the total keywords, the 50 most frequently appearing keywords were used to derive a 2D adjacency matrix (50×50) indicating the co-occurrence relationships among these keywords. A portion of the top 10 results is presented in tabular form. For visual inspection of the degree of connectivity between nodes, we visualized the keyword network using Gephi 0.10.1 (Gephi Consortium, Paris, France). The nodes were positioned based on their degree centrality values. Nodes ranked from 1 to 10 were centrally placed and colored red. The nodes ranked 11∼20 were orange, 21∼30 were yellow, 31∼40 were green, and 41∼50 were blue.
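
The co-occurrence matrix and degree centrality described above can be sketched in plain Python as follows. The assumptions here are that two keywords co-occur when they appear in the same title (counted once per title) and that degree centrality is normalized by n − 1, the usual convention. That normalization is consistent with Table 1, where values such as 0.979 correspond to 48/49 for a 50-node network. The function names and toy titles are hypothetical.

```python
from itertools import combinations


def cooccurrence_matrix(tokenized_titles, keywords):
    """Build a symmetric keyword-by-keyword matrix of title-level co-occurrence counts."""
    index = {word: i for i, word in enumerate(keywords)}
    n = len(keywords)
    matrix = [[0] * n for _ in range(n)]
    for tokens in tokenized_titles:
        # Each keyword counts at most once per title.
        present = sorted({index[t] for t in tokens if t in index})
        for i, j in combinations(present, 2):
            matrix[i][j] += 1
            matrix[j][i] += 1
    return matrix


def degree_centrality(matrix):
    """Fraction of the other nodes that each node is directly linked to."""
    n = len(matrix)
    return [
        sum(1 for j, weight in enumerate(row) if j != i and weight > 0) / (n - 1)
        for i, row in enumerate(matrix)
    ]


keywords = ["method", "tooth", "implant", "toothbrush"]
titles = [["method", "tooth", "implant"], ["method", "toothbrush"]]
matrix = cooccurrence_matrix(titles, keywords)
centrality = degree_centrality(matrix)
print(dict(zip(keywords, centrality)))
```

In this toy network "method" is linked to all three other keywords and so has a centrality of 1, while "toothbrush" is linked only to "method" and has a centrality of 1/3. The cell values of the matrix (co-occurrence counts) are what would drive link thickness in a visualization.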

5.Topic modeling

To extract meaningful topics from the patent data, we performed topic modeling based on the connection rules between keywords. We used the “gensim.models.LdaMulticore” module from the Gensim library, version 4.3.0 in Python, as our research tool. To determine the optimal number of topics, we conducted topic optimization experiments based on topic coherence values. We varied the number of topics from 1 to 20, measured the coherence value for each number of topics, and selected the number with the highest coherence value as the appropriate number of topics for our study. The repetition count for topic sampling was set to 500 times20). We then used the LDA method to categorize the collected data into topics. For the parameter settings, we designated the number of topics (num_topics) as eight. The chunk size (chunksize), which represents the number of documents processed in each training chunk, was set to 2,000. The total number of training passes (passes) was set to 20, and the per-document iteration count (iterations) was set to 1,500, following a previous study21). The results of the topic modeling were visualized using the pyLDAvis library, which displays the trained model. The keywords associated with topics were generated by setting the lambda (λ) value to 0.6, following a previous study22). Lambda values range from 0 to 1 and act as a hyperparameter that determines the diversity of word selection23). Furthermore, the lambda value represents the weight that indicates the degree of relevance between each topic and keyword. Setting the weight to 1 ranks terms by how frequently they appear in each topic, whereas setting the weight to 0 prioritizes words that show significant differences among topics24,25). The topic modeling results were interpreted by two researchers (dental hygienists) under the guidance of a topic-modeling expert. The criteria for topic selection were centered on five main keywords. 
During the topic-labeling process, topics were structured to include at least one of the five keywords.
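
The training settings reported above could be passed to Gensim roughly as in the following sketch. It is not the authors' code: the coherence measure (`"c_v"`) and `random_state=0` are assumptions that the text does not state, and the helper names are hypothetical. The Gensim imports are placed inside the functions so that the selection helper can be used without Gensim installed.

```python
def best_topic_count(coherence_by_k):
    """Return the number of topics with the highest coherence score."""
    return max(coherence_by_k, key=coherence_by_k.get)


def train_lda(texts, num_topics=8):
    """Train LDA with the settings reported in the text (requires gensim)."""
    from gensim.corpora import Dictionary
    from gensim.models import LdaMulticore

    dictionary = Dictionary(texts)
    corpus = [dictionary.doc2bow(tokens) for tokens in texts]
    model = LdaMulticore(
        corpus=corpus,
        id2word=dictionary,
        num_topics=num_topics,
        chunksize=2000,    # documents per training chunk
        passes=20,         # full passes over the corpus
        iterations=1500,   # per-document inference iterations
        random_state=0,    # assumption: fixed seed for repeatability
    )
    return model, corpus, dictionary


def coherence_for(texts, num_topics):
    """Coherence of a model trained with the given number of topics."""
    from gensim.models import CoherenceModel

    model, _, dictionary = train_lda(texts, num_topics)
    scorer = CoherenceModel(
        model=model, texts=texts, dictionary=dictionary,
        coherence="c_v",   # assumption: the text does not name the measure
    )
    return scorer.get_coherence()


# Toy scores, not the study's values: pick the best-scoring topic count.
print(best_topic_count({2: 0.41, 3: 0.52, 4: 0.47}))  # 3
```

In practice one would call `coherence_for(texts, k)` for each candidate `k`, collect the scores in a dictionary, and pass that dictionary to `best_topic_count`.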


Results

1.Keyword frequency analysis and network analysis

Table 1 presents the top 50 words by frequency of occurrence in invention titles, along with their respective degree centrality values. “Method” had both the highest frequency and the highest centrality value, followed in frequency by “Apparatus” and “Tooth.” Words with a centrality of 0.9 or higher, excluding words with a centrality of 1, were “Apparatus,” “Treatment,” “System,” “Inclusion,” “Possibility,” and “Oral.” The words ranked 11th to 15th by centrality were “Implant,” “Orthodontic,” “3D” (listed as “3dimension” in Table 1), “Processing,” and “Usage.” Among the top 50 keywords, those with the lowest centrality were “Extracts,” “Zirconia,” and “Active ingredient,” which ranked 48th, 49th, and 50th, respectively (Table 1).

Table 1. Frequency and Degree Centrality of the Top 50 Keywords

Columns: Word, Frequency rank, Frequency, Degree centrality rank, Degree centrality
Method 1 2,347 1 1
Apparatus 2 1,677 5 0.979
Tooth 3 1,502 2 1
Oral 4 1,491 10 0.918
Manufacture 5 1,001 3 1
Composition 6 949 17 0.775
Implant 7 939 11 0.897
Orthodontic 8 736 12 0.897
System 9 626 7 0.938
Use 10 577 4 1
Inclusion 11 564 8 0.938
Toothbrush 12 326 43 0.51
Dentition 13 254 34 0.632
Instrument 14 245 20 0.734
For treating 15 238 27 0.693
Prevention 16 223 45 0.489
Treatment 17 202 6 0.979
Extracts 18 194 48 0.387
Recording 19 193 24 0.714
Prosthetic 20 189 28 0.673
Picture 21 188 25 0.714
Containing 22 182 44 0.51
Bracket 23 172 47 0.408
Guide 24 171 18 0.775
Disease 25 169 39 0.591
Procedure 26 166 35 0.632
Processing 27 161 14 0.836
Imaging 28 147 38 0.612
Management 29 145 21 0.734
Possibility 30 140 9 0.938
Production 31 131 19 0.755
Data 32 130 22 0.734
Artificial 33 130 23 0.734
3dimension 34 128 13 0.877
Medium 35 128 29 0.673
Scanner 36 123 40 0.571
Image 37 121 36 0.632
Provide 38 120 16 0.795
Active ingredient 39 118 50 0.367
Generate 40 117 26 0.714
3D 41 114 30 0.673
Scan 42 113 31 0.653
Computer 43 113 32 0.653
Abutment 44 113 41 0.551
Toothpaste 45 112 46 0.448
Usage 46 105 15 0.836
Zirconia 47 98 49 0.387
Digital 48 95 37 0.632
Purpose 49 94 42 0.53
Medical 50 94 33 0.653

2.Visualization of network analysis results

Fig. 2 displays the network visualization of the 2D adjacency matrix (50×50) of co-occurrence relationships; thicker links between nodes represent a higher frequency of co-occurrence. For instance, the links from “Method” to “Manufacture” (866), “Apparatus” (680), and “Tooth” (593) appear thicker. “Oral” is connected to “Composition” (451). “Implant” shows multiple connections, with “Method” (260), “Tooth” (260), and “Apparatus” (137). “Orthodontic” exhibits close associations with “Method” (228), “Apparatus” (266), “Tooth” (228), and “Dentition” (220) (Fig. 2).

Fig. 2. Network connectivity of the top 50 keywords (node colors are based on degree centrality rank: the top 10 are in red, rankings 11∼20 in orange, 21∼30 in yellow, 31∼40 in green, and 41∼50 in blue). Links between nodes appear thicker as the frequency of co-occurrence increases.

3.Topic modeling

To determine the optimal number of topics, topic coherence values were computed for topic counts ranging from 1 to 20. The coherence value was highest for eight topics, at 0.807 (rounded to three decimal places) (Fig. 3).

Fig. 3. Coherence scores by number of topics (the dashed lines indicate the highest coherence scores for the number of topics).

Table 2 displays the top ten core keywords for each of the eight topics, together with the label assigned to each topic. Each topic was analyzed based on its primary keywords to understand its subject. Topic 4, labeled “Implant,” had the highest weight. The lowest-weighted topics, Topic 1 and Topic 3, were labeled “Instrument and Materials for Oral Health” and “Toothbrush and Oral Health Care,” respectively.

Table 2. Results of the Topic Modeling

No. of topic
Topic label (Keywords 1∼10)
1 Instrument and Materials for Oral Health
(Instrument, Ingredient, Care, Hygiene, Improvement, Washing, Sintering, Structure, Impression Material, Tissue)
2 Orthodontics
(Method, Apparatus, Orthodontic, Dentition, Recording, Picture, Data, Processing, Tooth, Image)
3 Toothbrush and Oral Health Care
(Toothbrush, Fixing, Able, Removing, Electric, Unit chair, Gum, Driver, Toothbrush hair)
4 Implant
(Manufacture, Implant, Method, Use, Guide, Abutment, Prosthetic, Tooth, 3D, Fixture)
5 Oral Composition for Prevention and Treatment
(Composition, Oral, Inclusion, For treating, Prevention, Disease, Containing, Extracts, Active ingredient, Toothpaste)
6 Dental Treatment Aid Apparatus
(Apparatus, Measurement, Mask, Auxiliary, Root canal, Dental, Occlusion, Material, Medical, Cleaning)
7 Apparatus or Method Based on Artificial Intelligence
(Tooth, Orthodontic, Artificial, Model, Apparatus, Whitening, Purpose, Derived, Transparent, Attachment)
8 Oral Care System or Service
(Management, System, Provide, Service, Method, Oral, Zirconia, Predict, Graft material, Health)

The lambda (λ) value, determining the diversity of word selection, was set to 0.6.
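
The effect of λ on keyword ranking can be illustrated with the relevance score defined for LDAvis (on which pyLDAvis is based): relevance = λ·log p(w|t) + (1 − λ)·log(p(w|t)/p(w)). The sketch below applies that formula to made-up probabilities; the numbers and function names are hypothetical and do not come from the study's model.

```python
import math


def relevance(p_word_given_topic, p_word, lam=0.6):
    """LDAvis relevance: blend of within-topic probability and lift."""
    lift = p_word_given_topic / p_word
    return lam * math.log(p_word_given_topic) + (1 - lam) * math.log(lift)


def rank_terms(topic_probs, corpus_probs, lam=0.6):
    """Order the terms of one topic from most to least relevant."""
    scores = {
        word: relevance(p, corpus_probs[word], lam)
        for word, p in topic_probs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Made-up probabilities for a single topic and for the whole corpus.
topic_probs = {"method": 0.20, "implant": 0.15, "fixture": 0.05}
corpus_probs = {"method": 0.25, "implant": 0.04, "fixture": 0.006}

print(rank_terms(topic_probs, corpus_probs, lam=1.0))  # ['method', 'implant', 'fixture']
print(rank_terms(topic_probs, corpus_probs, lam=0.0))  # ['fixture', 'implant', 'method']
print(rank_terms(topic_probs, corpus_probs, lam=0.6))  # ['implant', 'fixture', 'method']
```

With λ = 1 the generic but frequent word "method" ranks first; with λ = 0 the rare but topic-specific "fixture" ranks first; an intermediate value such as 0.6 balances the two.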

Fig. 4 presents a visualization of the topic model using pyLDAvis with the number of topics set to eight. It displays the Intertopic Distance Map (IDM) and the 30 core keywords. In the IDM, topics 6 and 7 intersect within one quadrant, while in another quadrant, topics 2, 4, and 8 are situated closely and overlap with each other. In contrast, topics 1, 3, and 5 appear relatively distant from the other groups and do not overlap with any other topics. The weight of each topic was as follows: Topic 1 (5.5%), Topic 2 (19.6%), Topic 3 (5.5%), Topic 4 (22.0%), Topic 5 (17.4%), Topic 6 (8.9%), Topic 7 (10.5%), and Topic 8 (10.5%). The topic with the highest weight, “Implant,” is represented by the largest circle, while the lowest-weighted topics, “Instrument and Materials for Oral Health” and “Toothbrush and Oral Health Care,” are represented by the smallest circles. The core keywords across the entire set of topics are “Manufacture,” “Composition,” “Implant,” “Oral,” and “Method,” with the most frequent keyword being “Method” (Fig. 4).

Fig. 4. Intertopic distance map.

Discussion


This study was conducted to understand technological trends through keyword network analysis and topic modeling based on domestic patents related to oral health. In terms of occurrence frequency, words such as “Method,” “Apparatus,” “Tooth,” “Oral,” and “Manufacture” showed higher proportions. This could be attributed to the data collection, which focused primarily on patents related to oral health and therefore reflects these prominent terms. In previous research26) that analyzed patents using network analysis methods, words such as “Method,” “Apparatus,” and “System” were not treated as stop words and emerged as key keywords. Considering that specific technical domains might influence the interpretation of the results, these words were not excluded from the present analysis. Additionally, studies on medical procedures and patents note that medical procedures involving human life are excluded from patent rights to prevent restrictions on such procedures. Instead, according to patent office practices, the functional and systemic operating methods of medical devices themselves, rather than medical procedures or treatments, are recognized as patentable27). This can be interpreted as one of the reasons why words related to “method” and “apparatus” accounted for a significant proportion in the frequency analysis results of this study.

To identify keywords with relatively higher importance in relation to their frequency of occurrence, we analyzed their degree centrality. Some keywords can be considered important because they have a relatively high degree centrality despite a low frequency of occurrence. For instance, the keyword “Use,” ranking 10th in frequency, had the highest degree centrality of 1, while “Treatment,” ranking 17th, showed the next highest value of 0.979. Therefore, relying solely on the frequency of occurrence to derive key terms should be avoided. Furthermore, when comparing the rank differences between frequency of occurrence and degree centrality, “3D” ranked 34th in frequency with 128 occurrences but secured a relatively higher position at 13th with a degree centrality of 0.877. The importance of “3D” is also evident in the network visualization, where it is connected with nodes such as “Data,” “Scanner,” and “Scan.” In the collected patent data, titles such as “Method for 3D scan data processing for dental prosthesis manufacture” and “Apparatus and method for restoring 3D oral scan data using computerized tomography images” were identified, which confirmed the relationships between these keywords within the sentences. Digital dental technology based on 3D printing, one of the core technologies of the Fourth Industrial Revolution in dentistry, is advancing. The introduction of 3D scanners into dentistry has significantly affected CAD/CAM technology in prosthetic manufacturing systems, and digital impression techniques using intraoral or extraoral scanners are commonly used in clinical settings28).
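
The rank comparison made in this paragraph can be reproduced with a few lines of Python. The rank pairs below are copied from Table 1 for a handful of keywords; the function name `rank_gaps` is hypothetical, and a positive gap means the keyword ranks higher in degree centrality than in frequency.

```python
# (frequency rank, degree centrality rank) for selected keywords in Table 1.
ranks = {
    "Use": (10, 4),
    "Treatment": (17, 6),
    "3dimension": (34, 13),
    "Possibility": (30, 9),
    "Toothbrush": (12, 43),
    "Extracts": (18, 48),
}


def rank_gaps(rank_pairs):
    """Sort keywords by how many places they gain when ranked by centrality."""
    gaps = {
        word: freq_rank - cent_rank
        for word, (freq_rank, cent_rank) in rank_pairs.items()
    }
    return sorted(gaps.items(), key=lambda item: item[1], reverse=True)


for word, gap in rank_gaps(ranks):
    print(f"{word}: {gap:+d}")
```

Keywords such as “3dimension” and “Possibility” gain 21 places each, whereas “Toothbrush” and “Extracts” drop about 30 places, which illustrates why frequency alone can be a misleading indicator of a keyword's role in the network.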

We conducted topic modeling and keyword analysis to extract potential themes from the text. Determining the number of topics before executing the model is crucial in this regard29). There is no statistical solution for determining the appropriate number of topics, as it depends heavily on the interpretability, validity, and research questions that influence the topics generated through topic modeling. The determination of the number of topics is subjective and is decided by researchers based on what they believe would yield the most meaningful results through topic modeling.

2.Key results and comparison

To minimize subjectivity, topic coherence values were used as in a previous study30). In this study, a function measuring topic coherence values for each number of topics from 2 to 20 was implemented in Python, following the methodology of a previous study to determine the appropriate number of topics30). In topic modeling analysis, topics with low relevance to the research subject are sometimes excluded based on the researchers’ judgment. In related studies, it has been observed that researchers reviewed the words included in the derived topics to eliminate those with low relevance to the research topic from the analysis. This exclusion of low-relevance topics by the research team has been acknowledged and considered in the topic inference method25,31).

Topic 1 comprises words like “Instrument” and “Ingredient,” followed by “Care,” “Hygiene,” and “Improvement.” Interpreting this as a comprehensive concept encompassing oral health, it was named “Instrument and Materials for Oral Health.”

Topic 2 mainly consists of words such as “Orthodontic” and “Dentition,” and this topic was named “Orthodontics.” The analysis also confirmed the association of words like “Recording,” “Picture,” and “Data” with “Orthodontic.” Orthodontics involves clinical assessments based on quantitative analyses of the human skeletal structure and dental alignment. In clinical practice, there are ongoing developments in orthodontic analysis systems in South Korea, such as WebCeph software (AssembleCircle Corp., Seongnam, Korea), using imaging data, indicating a significant attempt to employ artificial intelligence (AI) technology and digital image recognition and processing in the field of dental alignment32).

Topic 3 is focused on words like “Toothbrush,” “Removing,” and “Electric,” and was thus designated “Toothbrush and Oral Health Care.” Choi et al.33) analyzed 512 patents from 2005 to 2014 to examine the patent trends related to toothbrushes. They suggested that various shapes, materials, functions, and socially oriented aspects of toothbrushes have been consistently researched and patented. Moreover, they anticipated the continued production of electric toothbrushes.

Topic 4 is inferred as “Implant” based on words such as “Manufacture,” “Implant,” “Method,” “Use,” and “Guide.” Unlike in the past, dental implants are now considered and proposed as the foremost treatment when teeth are lost, demonstrating a high long-term success rate and reliability. Studies continue to focus on the materials and surface forms of implants for osseointegration, aiming to address the drawbacks or complications associated with them34). In the field of surgery, diagnostic models that replicate the structure or form of surgical sites are being developed to aid in surgical decision-making. Templates and surgical guides that utilize 3D printing technology are being developed to enhance the accuracy and safety of surgeries8). Recently, the majority of the domestic implant companies in Korea have expanded the distribution of advanced software and 3D printers. This has enabled practitioners to directly develop surgical guides for implants. The use of implant surgical guides has become widespread, facilitating their diverse production using various materials and methods. Several researchers are actively studying this trend35,36).

Topic 5 was inferred based on the words “Composition,” “Oral,” “For treating,” “Prevention,” and “Extracts,” and was named “Oral Composition for Prevention and Treatment.” According to the study of Hwang et al.37), when analyzing patents related to toothpaste on KIPRIS, composition accounted for 35% of the patents, representing the most developed field. Various extracts have been used in patents related to preventing and treating periodontal diseases. Moreover, studies are being conducted on the effects of extracts, such as their antibacterial effects, on oral health37).

Topic 6 appears to be associated with “Dental Treatment Aid Apparatus,” based on the words “Apparatus,” “Measurement,” and “Root Canal.” Through research on devices for occlusal force measurement38), wireless handpieces39) for root canal treatment, and devices for root canal irrigation40), it is evident that devices and instruments related to measurement and root canals are under investigation. Devices for measuring root canal length, vertical height, and bone density were identified in the collected patent inventories.

Topic 7 highlights the development of devices for artificial tooth processing and orthodontic systems utilizing AI, with prominent keywords including “Tooth,” “Orthodontic,” “Artificial,” “Model,” and “Apparatus.” Therefore, it was named “Apparatus or Method Based on Artificial Intelligence.” The future of AI foresees its extensive adoption in various healthcare sectors, particularly in the diagnostic imaging field, and it is anticipated to be widely applied in dentistry. Research suggests that AI will enhance diagnostic efficiency by automating repetitive tasks, thereby reducing costs and alleviating the burden on healthcare systems in an aging society. This efficiency is expected to extend to dental inventory management, patient scheduling, automatic generation of electronic medical records, and surgical feedback, eventually evolving into an integrated dental operation and management system that utilizes AI41). In a study by Choi et al.42), it was highlighted that clinical trials of AI-assisted software aiding physicians’ diagnoses in South Korea increased significantly from six cases in 2018 to 17 cases in 2019. Moreover, the range of conditions targeted by AI technology has expanded beyond prostate and breast cancer to include diverse conditions, such as lung diseases, vertebral compression fractures, and dental conditions42). The potential for the advancement of patented technology related to “Apparatus or Method Based on Artificial Intelligence” is considerably high and requires continuous attention.

Topic 8 was named “Oral Care System or Service,” based on major keywords such as “Management,” “System,” “Provide,” “Service,” and “Method.” With the growing interest in oral health, governments have been continuously implementing oral health promotion programs targeting vulnerable populations, such as students, seniors, and people with disabilities, by establishing oral health clinics and regional clinics for dental care43,44). With the evolving trends of the times, the approach to providing oral health services has transformed. The widespread adoption of smartphones has led to increased interest in applications aimed at improving the quality of life of people with disabilities. For instance, an Android-based “15 Minutes of Daily Oral Exercise” app was developed to enhance oral motor skills for individuals with cerebral palsy45). Moreover, studies analyzing oral health-related apps have highlighted their significance as crucial educational tools for acquiring oral health knowledge and proper maintenance methods46). This development indicates the advancement of technology related to oral care services associated with the theme of Topic 8.


The technologies associated with oral health have advanced in various aspects. However, while there have been studies on specific technological trends in dentistry and dental hygiene, there is a lack of comprehensive research analyzing patented technologies. Therefore, a significant contribution of this study lies in deriving the potential topics related to oral technology over the past 20 years and segmenting them into particular topics. This not only aids in setting future directions for detailed research and formulating patent strategies but also provides valuable data. Despite the methodological limitations, the value of this study lies in deriving results based on a vast amount of patent data in the dental hygiene field using Python and proposing a method to extract keywords representing topics. We hope that the methodology and visualization tools used in this study will find applications in various research fields within dental hygiene. Moreover, the results from network analysis and topic modeling can be utilized to guide technological development in the field.


In this study, text mining techniques were used for network analysis, and in topic modeling, objectivity was pursued through text refinement and the use of stop-word lists. Although the number of topics was determined objectively using topic coherence values, inferring and naming the topics unavoidably involved qualitative interpretation by the researchers and may therefore reflect subjective judgment. Methods for maintaining objectivity in topic modeling need to be explored, and further research is required on selecting topic labels that represent topics semantically. Additionally, applying time-series analysis to topic modeling to examine patent trends by year would help predict competitiveness and identify promising technologies as the field changes over time.
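As an illustration of how coherence values can guide the choice of the number of topics, the following minimal sketch scores candidate topic sets with the UMass coherence measure and keeps the candidate with the highest mean score. The study does not state which coherence measure it used, so UMass is an assumption here. The titles, the candidate topic word lists, and the names `umass_coherence`, `mean_coherence`, and `candidate_models` are hypothetical; in practice the word lists would come from LDA models fitted at each candidate number of topics.

```python
import math

def umass_coherence(topic_words, documents):
    """UMass coherence of one topic: sum over ordered word pairs of
    log((co-document frequency + 1) / document frequency of the earlier word)."""
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*words):
        # Number of documents that contain all of the given words.
        return sum(1 for d in doc_sets if all(w in d for w in words))

    score = 0.0
    for m in range(1, len(topic_words)):
        for l in range(m):
            denom = doc_freq(topic_words[l])
            if denom == 0:
                continue  # skip words that never occur in the corpus
            co = doc_freq(topic_words[m], topic_words[l])
            score += math.log((co + 1) / denom)
    return score

def mean_coherence(topics, documents):
    """Average UMass coherence over all topics of one candidate model."""
    return sum(umass_coherence(t, documents) for t in topics) / len(topics)

# Hypothetical tokenized patent titles (not data from the study).
titles = [
    ["dental", "implant", "fixture"],
    ["implant", "abutment", "fixture"],
    ["toothbrush", "bristle", "handle"],
    ["electric", "toothbrush", "handle"],
]

# Hypothetical top words per topic for two candidate numbers of topics.
candidate_models = {
    2: [["implant", "fixture"], ["toothbrush", "handle"]],
    3: [["implant", "toothbrush"], ["fixture", "handle"], ["dental", "bristle"]],
}

scores = {k: mean_coherence(topics, titles) for k, topics in candidate_models.items()}
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

A score of this kind makes the choice of the number of topics reproducible, but it does not label the topics; naming them remains an interpretive step, as noted above.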


This study conducted text analysis on 11,710 patent records related to oral health and dentistry, registered over the 20-year period up to July 31st, 2023 and obtained from KIPRIS, using a program developed in Python. Of these, 6,865 patent titles highly relevant to the field were selected. The analysis determined the relationships among keywords in patent titles by analyzing centrality for the top 50 words by frequency and applied topic modeling using LDA to infer topics.

The analysis of the connectivity between adjacent words revealed that the centrality index was highest for “Method” and lowest for “Active ingredient.” In network visualization, “Method” exhibited thicker links when connected to “Manufacture,” “Tooth,” and “Apparatus,” while “Oral” was closely linked to “Composition.”
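The adjacent-word network described above can be sketched as follows. This is a minimal illustration and not the authors' program: it assumes degree centrality as the centrality index (the text does not specify which index was used), treats the edge count as the link thickness, and runs on hypothetical tokenized titles. The function names `build_network` and `degree_centrality` are likewise hypothetical.

```python
from collections import Counter

def build_network(titles, top_n=50):
    """Count links between adjacent words, keeping only the top_n most frequent words."""
    freq = Counter(word for tokens in titles for word in tokens)
    keep = {word for word, _ in freq.most_common(top_n)}
    edges = Counter()
    for tokens in titles:
        for a, b in zip(tokens, tokens[1:]):  # each pair of neighboring words
            if a in keep and b in keep and a != b:
                edges[frozenset((a, b))] += 1  # undirected edge; count acts as weight
    return edges

def degree_centrality(edges):
    """Fraction of the other nodes in the network that each word is linked to."""
    neighbors = {}
    for pair in edges:
        a, b = tuple(pair)
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    n = len(neighbors)
    if n < 2:
        return {word: 0.0 for word in neighbors}
    return {word: len(nb) / (n - 1) for word, nb in neighbors.items()}

# Hypothetical tokenized patent titles (not data from the study).
titles = [
    ["method", "manufacture", "implant"],
    ["method", "manufacture", "tooth"],
    ["apparatus", "method", "tooth"],
    ["oral", "composition"],
    ["oral", "composition", "method"],
]

edges = build_network(titles, top_n=50)
centrality = degree_centrality(edges)
for word, value in sorted(centrality.items(), key=lambda kv: -kv[1]):
    print(f"{word}: {value:.2f}")
```

In a network visualization, a higher edge count corresponds to a thicker link between two words, and a word connected to many different neighbors receives a higher degree centrality.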

In the topic modeling derived from keywords, “Implant” had the highest topic weight, followed by “Orthodontics,” “Oral composition for prevention and treatment,” “Apparatus or Method Based on Artificial Intelligence,” “Oral Care System or Service,” and “Dental Treatment Aid Apparatus.” Conversely, “Instrument and Materials for Oral Health” and “Toothbrush and Oral Health Care” had the lowest weights. These results suggest that, in terms of patent technology trends, clinical and research activity has focused primarily on “Implant” and “Orthodontics.”

This study aimed to comprehensively understand and categorize technological trends related to oral health and utilize the frequency and relationships of the analyzed keywords to grasp technological advancements. It is anticipated that this research will serve as fundamental material for technological developments associated with patents in dentistry and dental hygiene.



Conflict of interest

No potential conflict of interest relevant to this article was reported.

Ethical approval

Not applicable.

Author contributions

Conceptualization: Hee-Kyeong Bak. Data acquisition: Hee-Kyeong Bak. Formal analysis: Hee-Kyeong Bak. Supervision: Han-Na Kim. Writing-original draft: Hee-Kyeong Bak and Han-Na Kim. Writing-review & editing: Hee-Kyeong Bak, Yong-Hwan Kim, and Han-Na Kim.



Data availability

Raw data are available from the corresponding author upon reasonable request.

  1. Lee KH, Lee SB, Jung EY, Jo EB, Jung ES: Factors influencing awareness of dental health insurance among adults. J Korean Soc Dent Hyg 18: 771-783, 2018.
  2. Moon KH, Kim JM: Analysis of preference convergence by analyzing search words for oralcare products: using the Google trend. J Korea Converg Soc 10: 59-64, 2019.
  3. Park HK, Lee MK, Jeon ES, Yu SB, Kim HJ: Effects of mouth rinsing with foam vitamins and its intake on reduction in oral microorganisms. J Korean Soc Dent Hyg 19: 387-397, 2019.
  4. Seong HJ, Jeong JH, Lee SY, et al.: Influence of clinical characteristics and restriction factors on cooperation for orthodontic treatment in adolescent orthodontic patients. J Dent Hyg Sci 16: 84-92, 2016.
  5. Kang SG, Qu D, Kwon SY: The treatment of lip protrusion which demands intrusion and bodily movement of upper anterior tooth with antero-posterior lingual retractor combined with clear-aligner. Clin J Korean Assoc Orthod 9: 189-201, 2019.
  6. Lee MH, Kim JS, Park EC, Kim HJ: Prosthetic treatment in esthetic area with monolithic zirconia using coloring liquid: a case report. J Korean Acad Prosthodont 60: 293-300, 2022.
  7. Bae TS, Lee MH, Shin JW: Dental applications of ceramic materials for aesthetic restoration and their future prospects. Korean J Dent Mater 49: 77-85, 2022.
  8. Lee SH: Prospect for 3D printing technology in medical, dental, and pediatric dental field. J Korean Acad Pediatr Dent 43: 93-108, 2016.
  9. Song KT, Bong KH, Park JM: A study on the process of identifying emerging technology using patent data: a joint approach of factor analysis and text mining methods. J Intell Prop 17: 169-204, 2022.
  10. Sawng YW, Choi JW, Joung SK, Lim SY: National comparative study on the technology ecosystem of the smart surgical medical system: focused on the patent data analysis. J Inf Technol Appl Manag 27: 125-145, 2020.
  11. Jo GC, Yoon SJ, Bae JW, Kim BJ: Using US patent analysis to monitor the technological trend in the field of gastrointestinal microbiome: implications on Korean medicine research and development. J Korean Med 44: 38-55, 2022.
  12. Oh SH, Choi HY, Yoon JH: Monitoring augmented reality technology using topic modeling of patents. J Korean Inst Ind Eng 43: 213-228, 2017.
  13. Park CS: Using text network analysis for analyzing academic papers in nursing. Perspect Nurs Sci 16: 12-24, 2019.
  14. Kim BR, Ahn E, Hwang SJ, Jeong SJ, Kim SM, Han JH: Analysis of dental hygienist job recognition using text mining. J Dent Hyg Sci 21: 70-78, 2021.
  15. Rashed M, Piorkowski J, McCulloh I. Evaluation of extremist cohesion in a darknet forum using ERGM and LDA. Paper presented at: 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining; 2019 Aug 27-30; Vancouver, Canada. New York: Association for Computing Machinery, 2019. p. 899-902.
  16. Kim GJ, Yoon DH, Hwang JH, Sun DJ: Discovering the emerging technologies through patent topic modeling and growth curve model. J Korean Inst Intell Syst 27: 357-363, 2017.
  17. Lee JW: Sports virtual reality technology monitoring through patent big data analysis: LDA algorithm-based topic modeling. Korean J Converg Sci 11: 185-202, 2022.
  18. Park H, Bae S, Pak SI: Properties of a social network topology of livestock movements to slaughterhouse in Korea. J Vet Clin 33: 278-285, 2016.
  19. Han JH, Hyun YG, Chae UR, Lee GH, Lee JY: A study on the healthcare technology trends through patent data analysis. J Digit Converg 18: 179-187, 2020.
  20. Suh YR, Koh KS, Lee JW: An analysis of the change in media's reports and attitudes about face masks during the COVID-19 pandemic in South Korea: a study using Big Data latent dirichlet allocation (LDA) topic modelling. J Korea Inst Inf Commun Eng 25: 731-740, 2021.
  21. Kang HA, Lim HS: A study on search query topics and types using topic modeling and principal components analysis. KIPS Trans Softw Data Eng 10: 223-234, 2021.
  22. Sievert C, Shirley K. LDAvis: a method for visualizing and interpreting topics. Paper presented at: Workshop on Interactive Language Learning, Visualization, and Interfaces; 2014 Jun 27; Baltimore, USA. Stroudsburg: Association for Computational Linguistics, 2014. p. 63-70.
  23. Lee JH: A study on document summarization using class-based neural topic modeling. J Korean Inst Intell Syst 32: 494-499, 2022.
  24. Kim EH, Suh YH: A method of calculating topic keywords for topic labeling. J Korea Soc Digit Ind Inf Manag 16: 25-36, 2020.
  25. Lee HS, Lee GM, Cho JH: What words have the press used to describe children's smartphone use?: a semantic network analysis and topic modeling of the press article between 2011 and 2021. Korean J Commun Inf 114: 204-246, 2022.
  26. Min KB, Park HJ: A study on the patent trend of 'Smart Farm' in domestic through network analysis. J Korea Inst Inf Electron Commun Technol 15: 413-422, 2022.
  27. Lee BM: A study on the medical practice and patent protection system. Inha Law Res Inst 22: 1-40, 2019.
  28. Park JS, Lim YJ, Lee JW, Kim BJ: A review on the accuracy assessment methods of 3-dimensional digital dental models. J Dent Rehabil Appl Sci 35: 55-63, 2019.
  29. Yun EK, Kim JO, Byun HM, Lee GG: Topic modeling and keyword network analysis of news articles related to nurses before and after "the Thanks to You Challenge" during the COVID-19 pandemic. J Korean Acad Nurs 51: 442-453, 2021.
  30. Kim J, Na H, Park KH: Topic modeling of profit adjustment research trend in Korean accounting. J Digit Converg 19: 125-139, 2021.
  31. Ham SK, Jung SG, Kim EY: Analysis of media coverage on the issue of "Comfort Women" by daily newspapers in South Korea from 2003 to 2020: a big data study using topic modeling. Korean J Commun Inf 111: 181-215, 2022.
  32. Chang MS: Clinical utilization of web based and artificial intelligence driven orthodontic analysis program "WebCeph.". J Korean Dent Assoc 60: 164-175, 2022.
  33. Choi JS, Lee YH, Cho HJ, et al.: Patent trend analysis of the toothbrush. Korean J Oral Maxillofac Pathol 40: 709-720, 2016.
  34. Kim YT: Past, present and future of the dental implant topography. J Korean Dent Assoc 59: 226-231, 2021.
  35. Kim HD: Consideration of computer-guided implant surgery. J Korean Acad Esthet Dent 28: 4-17, 2019.
  36. Lee JY, Yoon JY, Oh N: The use of surgical guide stent for implant placement. J Korean Acad Prosthodont 52: 366-375, 2014.
  37. Hwang DG, Na SR, Cho HJ, et al.: Toothpaste development trend analysis - composition. Korean J Oral Maxillofac Pathol 40: 697-709, 2016.
  38. Im JH, Lee W, Kim MJ, Lim YJ, Kwon HB: Factors that affect the bite force measurement. J Dent Rehabil Appl Sci 32: 1-7, 2016.
  39. Lee BK, Lee Y, Park SH, Cho KM, Kim JW: Comparison vibration characteristics of several wireless endodontic handpieces. J Dent Rehabil Appl Sci 38: 81-89, 2022.
  40. Sung G, Sung J, Lee MH: Development and performance test of a micro bubble irrigation system for root canal cleaning of tooth. J Korean Soc Vis 14: 40-45, 2016.
  41. Hwang JJ, Heo MS: Future perspectives of artificial intelligence. J Korean Dent Assoc 60: 290-298, 2022.
  42. Choi HC, Kim CM, Park SC: Literature analysis of deep learning based dental imaging readings. Health Serv Manag Rev 14: 15-28, 2020.
  43. Ju OJ, Jang YJ: Awareness of teachers in a region on school dental clinics and preventive programs. J Dent Hyg Sci 15: 18-23, 2015.
  44. Lee HI: Recent progress and future directions of public dental health service for oral health promotion of disabled. J Converg Consil 4: 115-126, 2021.
  45. Lee SB, Jeong PY, Lee MH: 'Daily oral motor exercise for 15 minutes' for cerebral palsy: a case report for the mobile application development. AAC Res Pract 3: 81-90, 2015.
  46. Jung JY, Kim SH: Analysis of oral health-related smartphone applications. J Korean Soc Dent Hyg 19: 493-502, 2019.

March 2024, 24 (1)