An AI-Driven Methodology for Patent Evaluation in the IoT Sector:

Assessing Relevance and Future Impact

Lelio Campanile

2 a

, Renato Zona

1 b

, Antonio Perfetti

and Franco Rosatelli

Department of Engineering, Universit

a degli Studi della Campania, Via Roma 29, Aversa (CE), Italy

Department of Mathematics and Physics, Universit

a degli Studi della Campania, Caserta, Italy

Fondazione Ricerca e Imprenditorialit

a, Napoli, Italy

{lelio.campanile, renato.zona}@unicampania.it, perfetti@fondazioneri.it, rosatelli@fondazioneri.it

Keywords:

Patent Evaluation, Patent Classiﬁcation, Machine Learning, LLM.

Abstract:

The rapid expansion of the Internet of Things has led to a surge in patent ﬁlings, creating challenges in

evaluating their relevance and potential impact. Traditional patent assessment methods, relying on manual

review and keyword-based searches, are increasingly inadequate for analyzing the complexity of emerging IoT

technologies. In this paper, we propose an AI-driven methodology for patent evaluation that leverages Large

Language Models and machine learning techniques to assess patent relevance and estimate future impact.

Our framework integrates advanced Natural Language Processing techniques with structured patent metadata

to establish a systematic approach to patent analysis. The methodology consists of three key components:

(1) feature extraction from patent text using LLM embeddings and conventional NLP methods, (2) relevance

classiﬁcation and clustering to identify emerging technological trends, and (3) an initial formulation of impact

estimation based on semantic similarity and citation patterns. While this study focuses primarily on deﬁning

the methodology, we include a minimal validation on a sample dataset to illustrate its feasibility and potential.

The proposed approach lays the groundwork for a scalable, automated patent evaluation system, with future

research directions aimed at reﬁning impact prediction models and expanding empirical validation.

1 INTRODUCTION

The proliferation of the Internet of Things (IoT) has

led to a marked increase in patent ﬁlings, underscor-

ing the rapid growth of innovation win this domain.

As IoT technologies continue to expand across vari-

ous sectors, such as smart cities, healthcare, industrial

automation and transportation systems, the global

number of patents granted worldwide has seen a sig-

niﬁcant increase, that poses signiﬁcant challenges for

researchers, policymaker, and industry stakeholders

who seek to assess their relevance, technological im-

pact, and potential future inﬂuence. The ability to

efﬁciently assess and evaluate patents is becoming

crucial, as it supports innovation, assures fair intel-

lectually property practices, and informs investment

strategies. However, traditional workﬂows for patents

evaluation and management shows theirs limitations

in handle large volumes of documents in special way

while capturing the semantic inherent within patents.

https://orcid.org/0000-0003-4021-4137

https://orcid.org/0000-0001-6718-9387

The evaluation of patents has traditionally relied

on manual review by domain experts and keyword-

based retrieval systems. This approach limits the scal-

ability, rendering it ineffective for large-scale analy-

sis. Manual evaluations and classiﬁcations, are prone

to inefﬁciencies and inconsistencies due to their re-

liance on human judgment. Furthermore, keyword-

based retrieval systems frequently fail to capture the

subtleties and complexities of patent text, particu-

larly within interdisciplinary domains such as IoT,

where the interconnections between concepts can be

highly context-dependent. Traditional impact assess-

ments rely heavily on citation-based metrics, which

may not provide a comprehensive understanding of a

patent’s potential inﬂuence or innovation value. This

limitation renders it challenging to accurately evalu-

ate the pertinence, applicability, and overall impact of

patents within rapidly evolving ﬁelds.

The integration of Artiﬁcial Intelligence (AI) and

Natural Language Processing (NLP) techniques into

patent analysis has activated advancements in auto-

mated and scalable evaluation methodologies. Recent

breakthroughs in Large Language Model (LLM), in-

Campanile, L., Zona, R., Perfetti, A. and Rosatelli, F.

An AI-Driven Methodology for Patent Evaluation in the IoT Sector: Assessing Relevance and Future Impact.

DOI: 10.5220/0013519700003944

Paper published under CC license (CC BY-NC-ND 4.0)

In Proceedings of the 10th International Conference on Internet of Things, Big Data and Security (IoTBDS 2025), pages 501-508

ISBN: 978-989-758-750-4; ISSN: 2184-4976

501

cluding GPT, and domain-speciﬁc embeddings, have

demonstrated superior performance in text compre-

hension, semantic similarity analysis, and automated

classiﬁcation tasks. These LLMs facilitate the ex-

traction of high-dimensional, context-aware represen-

tations of patent texts. The application of LLMs

to patent analysis is a novel research are with unre-

solved challenges. Some of these challenges include

data sparsity, which hinders the development of ro-

bust models; domain adaptation, which necessitates

the customization of models for speciﬁc domains; and

explainability of AI-driven decisions, which raises

concerns regarding the interpretability and trustwor-

thiness of results.

In this paper, we present a novel AI-driven

methodology for patent evaluation that harnesses the

power of state-of-the-art LLMs and machine learning

techniques to assess the relevance and impact of IoT

patents. Our approach is structured around four key

components.

We employ LLM-based embeddings and tradi-

tional NLP methods for feature extraction, encoding

the semantic meaning of patents. This stage produces

unstructured data, which is then transformed into a

structured one using a structured data output frame-

work for LLM. The next step involves applying rele-

vance classiﬁcation and clustering to identify the un-

derlying technological patterns within the IoT patent

landscape. Finally, our framework estimates the pre-

liminary impact of each patent through a combination

of citation analysis, semantic similarity, and time-

series forecasting techniques. Unlike previous ap-

proaches that rely solely on keyword retrieval or cita-

tion counts, our methodology aims to provide a com-

prehensive, data-driven framework for understanding

the signiﬁcance of patents and saving in a structured

way.

The primary contribution of this research is in

its methodological framework, which joins conven-

tional patent evaluation workﬂows with AI-driven

techniques. By incorporating LLMs and deep learn-

ing models, we created a novel pipeline that not only

enhances the accuracy of patent analysis but also ad-

dresses the limitations of existing approaches. Fur-

thermore, our work presents an initial validation on a

sample dataset, demonstrating the feasibility and po-

tential of the proposed approach. Although this val-

idation is limited in scope, it serves as a foundation

for future research aimed at reﬁning impact prediction

models and expanding empirical validation to larger

datasets.

The remainder of this paper is organized as fol-

low: Section 2 provides an overview of context and

background on previous work on patent analysis. Sec-

tion 3 describes the proposed methodology. Section

4 presents a minimal validation study, demonstrat-

ing the applicability of our method to a selected IoT

patent set. Section 5 discusses the implications of our

ﬁndings, comparing our approach with existing tech-

niques and discussing limitations. Finally, Section 6

concludes the paper and highlights future research di-

rections.

2 RELATED WORKS

Historically, the patent system was established to en-

courage and regulate technical progress and innova-

tion (Frumkin, 1947). However, increasing global

patent applications and rapid technological develop-

ment pose signiﬁcant challenges to patent ofﬁces and

practitioners (Krestel et al., 2021).

Automating these processes not only improves the

efﬁciency of IP management, but also allows the NLP

community to explore a promising and still under-

researched ﬁeld. Previous research has examined the

early stages of the use of intelligent, automated meth-

ods for patent analysis (Abbas et al., 2014), the in-

troduction of Deep Learning (DL) techniques, which

have simpliﬁed some patent-related tasks (Krestel

et al., 2021), and speciﬁc applications, such as patent

retrieval (Shalaby and Zadrozny, 2019).

Here, we focus on two main areas: patent analy-

sis and automatic patent text generation. While anal-

ysis is aimed at understanding and extracting rele-

vant information from this vast knowledge base (Ab-

bas et al., 2014), generative activities aim at the auto-

mated creation of patent content. Several studies have

explored the use of Machine Learning (ML) and NLP

in the patent domain, which is characterized by highly

technical and legal texts (Krestel et al., 2021). In ad-

dition, recent LLMs have demonstrated extraordinary

capabilities across a wide range of tasks in general

domains (Min et al., 2023), suggesting their potential

use in the management and editing of patent literature,

an essential resource for documenting technological

evolution. However, their use in patent-related activi-

ties still remains under-explored, due to the complex-

ity of the texts and the ﬁeld itself. The work of (Ab-

bas et al., 2014) represents some of the ﬁrst research

devoted to patent analysis, adopting text mining and

visualization methods, which opened up new perspec-

tives for study. DL has recently gained attention for

knowledge management and patents. (Krestel et al.,

2021) identiﬁed eight main task categories for DL-

based methods, including decision support, patent

classiﬁcation and retrieval, patent evaluation, predic-

tion of emerging technologies, automatic patent text

AI4EIoT 2025 - Special Session on Artiﬁcial Intelligence for Emerging IoT Systems: Open Challenges and Novel Perspectives

502

generation, litigation analysis and the use of computer

vision techniques. Some studies have focused on spe-

ciﬁc aspects of patent analysis. For example,(Shalaby

and Zadrozny, 2019) examined the problem of patent

retrieval, i.e. the search for relevant documents, an ac-

tivity that, although it may seem similar to traditional

searching, has well-deﬁned constraints and purposes.

Patent literature is an important source of knowledge

on technological advances and is known for its poten-

tial to inspire innovations, especially when technolo-

gies from different ﬁelds are combined.

Data science techniques must address these difﬁ-

culties in order to effectively extract useful informa-

tion for design and innovation (Jiang et al., 2022).

The application of data science to patents not only

supports theoretical and methodological design anal-

ysis, but also provides tools for innovation strategies.

In particular, in the early stages of technology de-

velopment, NLP can support idea generation, predic-

tion of industry trends and the matching of problems

and solutions (Just, 2024). While much research fo-

cuses on analyzing existing patent literature, there is

a growing interest in the generative use of language

models. NLP techniques can be exploited to make

patent language more accessible, generate abstracts

or even create new patent texts from structured in-

put (Casola and Lavelli, 2022). The recent devel-

opment in the ﬁelds of LLM are reported in (Chang

et al., 2024). The ﬁeld of patents also connects to

other areas of intellectual property, such as copyright,

trademarks and industrial designs, as well as interdis-

ciplinary areas such as knowledge management and

the assessment of the economic value of innovations

(Aristodemou and Tietze, 2018). Moreover, the legal

language of patents shares many features with other

legal texts, so much so that most practitioners in the

ﬁeld have a strong legal background. For this reason,

NLP techniques developed for law in general could

be applied to patent analysis in the future (Katz et al.,

2023).

Although LLMs offer signiﬁcant potential for

knowledge extraction and textual analysis, their ap-

plication in patenting is still limited. Previous studies

have employed word embeddings such as Word2Vec

(Mikolov et al., 2013) and DL models such as LSTM,

but transformer-based models have opened up new

possibilities (Vaswani et al., 2017). Early attempts

to apply LLMs to patents were based on relatively

small models, such as GPT-2 (Radford, 2018), while

more advanced models have not yet been explored in

depth in this area. A signiﬁcant obstacle is the lack

of reference datasets and established metrics to as-

sess the performance of LLMs in patent analysis. Al-

though patent ofﬁces release raw documents, publicly

available datasets for speciﬁc patent analysis tasks are

still limited. Current searches focus mainly on sum-

mary sections of patents, but detailed descriptions and

claims are much more relevant for automated patent

analysis and generation.

3 PROPOSED METHODOLOGY

Patent evaluation and classiﬁcation is an activity for

both businesses and regulators, as it allows them

to determine the technological, economic and legal

value of an invention. Patent analysis can be con-

ducted through traditional methodologies entrusted to

experts in the ﬁeld, as well as through the use of auto-

mated tools based on machine learning and artiﬁcial

intelligence techniques. The analysis conducted by

human experts is based on a set of established crite-

ria that include novelty, inventive step and industrial

applicability of the patent. These parameters are ex-

amined within the existing patent and scientiﬁc liter-

ature to assess whether the invention meets the legal

requirements for patentability. In addition, the analy-

sis considers the clarity of the claims, scope of pro-

tection and potential market coverage. Patent ana-

lysts, often with a technical and legal background,

conduct extensive searches of specialized databases

and compare new applications with prior patents to

identify any overlaps or contrasts with the state of the

art. In parallel, recent years have seen an increas-

ing development of IT tools for patent classiﬁcation

and evaluation. These tools are based on NLP and

ML techniques, which make it possible to extract rel-

evant information from patent documents. For exam-

ple, the use of text mining algorithms makes it possi-

ble to identify semantic relationships between patents

and automatically classify inventions into technology

categories deﬁned by international standards, such as

the International Patent Classiﬁcation (IPC) and the

Cooperative Patent Classiﬁcation (CPC). Another in-

novative approach involves DL models and text em-

bedding techniques to improve the ability to identify

similarities between existing patents and new appli-

cations, facilitating the evaluation process. Further-

more, LLMs are showing signiﬁcant potential in gen-

erating patent abstracts and translating technical lan-

guage into a more accessible form. Despite these ad-

vances, the application of AI techniques to the patent

ﬁeld still presents some challenges. One of the main

difﬁculties concerns the length and complexity of

patent texts, which often exceed the processing ca-

pacity of traditional language models. Furthermore,

patent language is characterized by highly speciﬁc

terminology and complex syntactic structures, mak-

An AI-Driven Methodology for Patent Evaluation in the IoT Sector: Assessing Relevance and Future Impact

503

ing automated interpretation difﬁcult without a train-

ing phase on speciﬁc corpora. To address these chal-

lenges, scholars are exploring several strategies, in-

cluding increasing the context window in language

models and adopting Retrieval Augmented Genera-

tion (RAG) techniques, which combine information

retrieval from structured sources with text generation.

These developments could improve the ability to clas-

sify and evaluate patents more accurately and efﬁ-

ciently. In conclusion, patent evaluation and classi-

ﬁcation today relies on both the expertise of human

analysts and the potential offered by automated tools

based on artiﬁcial intelligence. While the former pro-

vide in-depth and contextualized interpretation, the

latter offer a large-scale analysis capability, reducing

the time and cost of the process. The combination

of these two methodologies appears to be the most

promising way to further improve the management

and analysis of patent literature.

3.1 The Data Model

The proposal is an advanced patent analysis tool

based on information extraction technology and se-

mantic, functional and graphical analysis of patent

texts and related search reports. It leverages AI al-

gorithms, trained on a large patent domain, to pro-

vide an in-depth analysis of the individual patent, its

relative positioning and reference context. The refer-

ence patent domain consists of the worldwide dataset

of existing patents. The advantage is the free avail-

ability of the worldwide database of existing patents

as it is in the public domain. The reference patent

domain databases used as a ﬁrst approximation are

WIPO, ESPO, ESPACENET, GOOGLE PATENT.

Thanks to the advanced analysis of the proposal,

it is possible to:

• Make strategic decisions based on concrete data;

• Identify emerging technology trends;

• Deﬁne patent policies and technology improve-

ment drivers;

• Discover new customers, suppliers, partners or

competitors;

• Identify new markets.

The proposal differs from traditional tools in that it

operates on unstructured information databases, mak-

ing consultation and analysis more efﬁcient, reducing

time and associated costs. The proposal for patent

documents is a comparative test prior to patent docu-

ment submission or parallel to the patent process for

patents already ﬁled but still reserved. This tool acts

as a complement to the prior art analysis, supporting

the investment decision-making process and the start

of the formal patenting process. The output of the

proposal is a radar diagram, which assigns a score on

a scale of 1 to 5 for each of the seven patent indexes,

already trained on the patent reference database:

• Feasibility, literally represents the feasibility of

the patent from a realization point of view, taking

into account complexity, availability of materials,

the need for ad hoc structures for relitigation, and

the maturity of the technological environment ;

• Multipurpose intended as the ability of the in-

vention to have applications outside the originally

chosen ﬁeld ;

• Obsolescence intended as how far it is from the

end of its useful life. This parameter does not co-

incide with the time span of the legal concession;

• Geo Context indicates how attractive a market is

in the world market;

• Protection represents how protected the invention

is;

• Tech Interest, represents how the subject of inven-

tion is of global interest;

• Tech Resonance, represents how much redun-

dancy there is in the world panorama of inventions

with respect to the subject matter.

These indexes assess both the intrinsic value of the

patent and how it compares with the external context:

• Technological state of the art ;

• Territorial extension level ;

• Geographical distribution of comparable patent

families ;

• Population of similar inventions.

A Global Score summarizes the overall patent value

based on the above indexes. The proposal allows:

• Attribution of ‘weighted’ index scales (arithmetic

or logarithmic);

• Categorization and ranking of patents contained

in a portfolio;

• Comparison with predeﬁned minimum thresholds

or derived from repetitive analysis;

The proposal allows to compare patents within

your portfolio or with previously analyzed patents.

In addition, it allows you to generate a customized

Global Score, based on speciﬁc weightings, to assess

the overall patent value.

The proposal represents an innovative and repeat-

able system for patent evaluation, in pre-grant stage

and in the whole patent life cycle, providing strategic

AI4EIoT 2025 - Special Session on Artiﬁcial Intelligence for Emerging IoT Systems: Open Challenges and Novel Perspectives

504

support for investment decisions, technological im-

provement and competitive positioning. Thanks to its

advanced technology, it is able to offer an in-depth

and predictive view of the value and impact of patents

in an increasingly dynamic and competitive market

environment.

3.2 Feature Extraction and NLP

Techniques

Our methodology is founded on conventional NLP

techniques but extends them by utilizing LLMs, par-

ticularly LLaMA 70B, to obtain and examine patent

data. Although traditional methods are effective for

preliminary text analysis, LLMs provide the capacity

to understand deeper semantic connections and con-

textual subtleties frequently ignored by simpler mod-

els.

A signiﬁcant advancement in our methodology is

the generation of structured data from LLMs gener-

ated content. Patent documents are intrinsically un-

structured, rendering methodical processing challeng-

ing. We employ advanced structured output genera-

tion algorithms to transform free-text patent descrip-

tions into a standardized format that corresponds with

the main criteria outlined in previous section. Tech-

niques such as prompt engineering, instruction tweak-

ing, and RAG are utilized producing structured repre-

sentations of patents. Recent scholarly research has

proven the efﬁcacy of these strategies in retrieving

domain-speciﬁc knowledge.

To guarantee high-quality structured data, we uti-

lize schema-based extraction, directing LLM to pro-

duce data in speciﬁed types, such as JSON or XML.

This facilitates easy integration with subsequent ma-

chine learning models. Furthermore, few-shot learn-

ing enhances the dependability of structured extrac-

tion by supplying instances of accurately formatted

data during inference, improving consistency and di-

minishing ambiguity.

Another signiﬁcant feature is constraint-based ex-

traction, wherein the LLM-generated content is ver-

iﬁed against established rules or knowledge bases.

Post-processing approaches, including regex valida-

tion, ontology-based ﬁltering, and entity linking, are

utilized to enhance structured data and rectify errors.

These procedures guarantee that extracted ﬁelds, in-

cluding assignees, citations, patent classiﬁcation, and

important claims, comply with domain-speciﬁc spec-

iﬁcations.

Following the extraction process, the structured

data is consolidated into a centralized dataset de-

signed for ML applications. This dataset supports

patent comparison and review, facilitating more rigor-

ous study. The organized dataset has ﬁelds like tech-

nological categorization, novelty indicators, existing

relationships, impact scores, and key entities such as

assignees, inventors, citations, and jurisdictional cov-

erage.

We primarily utilize zero-shot and few-shot learn-

ing paradigms, while also investigating ﬁne-tuning as

a supplementary upgrade. Fine-tuning LLaMA 70B

on specialized corpora, such historical patent records

and annotated datasets, may enhance the model’s pre-

cision and recall. Nevertheless, considering computa-

tional limitations and ethical concerns related to pro-

prietary data, ﬁne-tuning is not the central emphasis

of this study. We utilize adaptive prompting tactics

and context-aware embeddings to enhance model ef-

ﬁciency without necessitating signiﬁcant retraining.

We assess various approaches for the practical

implementation of structured data extraction from

LLMs. Among the most promising solutions are

LangChain, Hugging Face’s Transformers, and LLa-

maIndex, each providing comprehensive APIs for

dealing with various LLMs and producing structured

outputs.

Our methodology provides a scalable and auto-

mated framework for obtaining signiﬁcant insights

from patents by integrating traditional NLP pipelines

with sophisticated NLP capabilities.

3.3 Relevance Classiﬁcation and Impact

Estimation

Patent relevance classiﬁcation is essential for assess-

ing the importance of a patent within the IoT do-

main. We plan to utilize a combination of supervised

machine learning models and unsupervised clustering

techniques to evaluate and classify patents according

to their technological and economic worth. We will

apply random forest classiﬁers, Support Vector Ma-

chines (SVM) , and DL models for relevance classi-

ﬁcation to assess the signiﬁcance of a patent based

on extracted data. These models are trained on his-

torical patent datasets with annotated relevance rat-

ings, ensuring strong performance in patent catego-

rization. Furthermore, we employ hierarchical clus-

tering to discern clusters of patents that display anal-

ogous technological attributes, thereby offering in-

sights into nascent trends in IoT technologies.

In addition to classiﬁcation, estimating the im-

pact of a patent is crucial for forecasting its long-term

effects. We combine several predictive indicators,

such as citation trajectory analysis, semantic similar-

ity to high-impact patents, and time-series forecasting

of technological uptake. Citation-based methodolo-

gies analyze the historical citation trends of patents to

An AI-Driven Methodology for Patent Evaluation in the IoT Sector: Assessing Relevance and Future Impact

505

Figure 1: Flowchart of the proposal.

evaluate their anticipated future impact. Furthermore,

embedding-based semantic similarity metrics enable

the assessment of the alignment of a speciﬁc patent

with historically signiﬁcant patents.

Our methodology integrates classiﬁcation and ef-

fect estimation, offering a thorough evaluation frame-

work that connects traditional and AI-driven patent

examination. Utilizing ML and insights from LLM,

we develop a system that efﬁciently categorizes

patents and forecasts their signiﬁcance in inﬂuencing

future technical progress.

4 PRELIMINARY VALIDATION

We evaluate the feasibility of our proposed method-

ology through a preliminary validation utiliz-

ing a representative case study based on patent

WO2020193804A1, widely described in (Di Gennaro

et al., 2024; Minutolo et al., 2020; Gennaro et al.,

2022), which relates to an IoT-related invention. This

validation demonstrates the efﬁcacy of our structured

extraction pipeline and classiﬁcation models in as-

sessing patent relevance and estimating potential im-

pact.

WO2020193804A1, Transducer, is selected as a

case study due to its considerable relevance in the

IoT domain. The patent outlines a new method for

distributed data management in IoT networks, posi-

tioning it as a suitable candidate to assess the efﬁcacy

Figure 2: Index scoring of the evaluated patent.

of our approach in extracting structured insights and

forecasting its technological implications.

The results in terms of patent indexes are reported,

see Figure 2.

The structured data extraction module is utilized

on Transducer, producing a machine-readable rep-

resentation that includes essential attributes such as

technological classiﬁcation, key claims, prior art ref-

erences, and citation trajectories. Our classiﬁcation

model predicts the relevance score of a patent by an-

alyzing its textual content and assessing its semantic

similarity to high-impact patents within our dataset.

The evaluation of classiﬁcation performance is

AI4EIoT 2025 - Special Session on Artiﬁcial Intelligence for Emerging IoT Systems: Open Challenges and Novel Perspectives

506

conducted through three primary metrics: best ﬁt

similarity to established high-impact patents within

an embedding space, semantic overlap with glsiot-

speciﬁc patent clusters identiﬁed via hierarchical

clustering, and citation-based inﬂuence indicators ob-

tained from historical patent citation trends.

We compare the structured outputs produced by

LLaMA 70B, with manually annotated patent data to

validate our methodology. Factors considered include

the accuracy of extracted entities, such as assignees,

inventors, and technological components; the com-

pleteness of structured data ﬁelds in comparison to

human-extracted insights; and the consistency of rele-

vance classiﬁcation against expert-labeled patent cat-

egories.

This case study illustrates our approach, yet it has

certain limitations. The validation relies on a singular

patent, necessitating expansion to a broader dataset

for enhanced generalization. Estimating impact based

on citations necessitates longitudinal data, which may

not be completely accessible for recent patents. Fur-

thermore, although LLMs offer structured insights,

their dependability in patent-speciﬁc contexts neces-

sitates additional benchmarking against evaluations

by domain experts.

This preliminary validation indicates that our

methodology effectively extracts structured informa-

tion from IoT-related patents, establishing a founda-

tion for evaluating relevance and impact. Further re-

search is necessary to enhance the classiﬁcation mod-

els, broaden the dataset, and strengthen the reliability

of impact predictions. Future research will concen-

trate on expanding the validation scale and incorpo-

rating supplementary benchmark datasets to enhance

the evaluation comprehensiveness.

5 DISCUSSION

The proposed AI-driven methodology for patent eval-

uation presents notable improvements in the anal-

ysis of intellectual property. The integration of

LLM with ML techniques signiﬁes a signiﬁcant shift

from traditional methods, providing enhanced accu-

racy, scalability, and automation. This method im-

proves the assessment of patents through more ac-

curate relevance classiﬁcation and impact forecast-

ing. LLM embeddings enhance the comprehension

of patent texts by effectively capturing semantic re-

lationships that keyword-based methods frequently

miss. This advancement facilitates more efﬁcient

clustering of patents, thereby assisting in the iden-

tiﬁcation of emerging technological trends. The in-

tegration of citation analysis and semantic similarity

measures enhances impact prediction, offering a com-

prehensive perspective on a patent’s signiﬁcance over

time.

This AI-driven evaluation framework provides ad-

vantages to stakeholders, including patent ofﬁces, re-

search institutions, and corporations engaged in IoT

innovation. Patent examiners can utilize the proposed

method to enhance the efﬁciency of the patent ap-

proval process by more effectively identifying rel-

evant prior art. Research institutions may employ

the methodology to monitor technological advance-

ments and evaluate the potential impact of speciﬁc

patents on future research trajectories. Businesses

can employ the system to inform decisions regarding

patent investments and strategic collaborations, uti-

lizing predicted impact metrics. AI-based evaluation

serves as a transformative instrument in the manage-

ment of intellectual property.

While this approach offers certain advantages, it

is essential to recognize several limitations. The efﬁ-

cacy of LLM is contingent upon the quality and di-

versity of the training data utilized. LLMs trained

on general corpora exhibit robust performance across

various NLP tasks; however, ﬁne-tuning on domain-

speciﬁc patent texts may be necessary for optimal out-

comes. Future research should investigate the cre-

ation of specialized LLMs that are speciﬁcally trained

on patent data to improve accuracy and relevance.

The preliminary validation study is constrained in

scope due to its reliance on a limited dataset of IoT

patents. A comprehensive evaluation utilizing larger

and more diverse patent datasets is essential to vali-

date the generalization of the methodology. The se-

lection of impact prediction metrics must be reﬁned

to ensure robustness across various technological do-

mains.

A signiﬁcant challenge in AI-driven patent eval-

uation is the interpretability of the results. LLMs

offer robust semantic representations; however, their

decision-making processes frequently lack trans-

parency. Enhancing the usability and trustworthi-

ness of relevance classiﬁcation and impact prediction

can be achieved through the development of explain-

able AI techniques that elucidate the reasoning behind

these processes. The comprehensibility and valida-

tion of AI-generated evaluations by human experts is

a critical focus for future development.

6 CONCLUSIONS AND FUTURE

WORKS

This study introduces an AI-based approach for as-

sessing IoT-related patents, including sophisticated

An AI-Driven Methodology for Patent Evaluation in the IoT Sector: Assessing Relevance and Future Impact

507

NLP techniques and ML models. Our methodology

offers a scalable and automated solution for patent

analysis through the integration of LLM-based struc-

tured data extraction, relevance categorization, and

impact calculation. The ﬁndings from our initial vali-

dation demonstrate that the methodology successfully

extracts signiﬁcant insights from patents, presenting a

viable alternative to conventional review techniques.

The suggested methodology improves patent anal-

ysis by utilizing advanced LLMs to extract structured

information, categorizing patents according to their

technological relevance, and assessing their potential

impact through predictive modeling. Our research il-

lustrates that the integration of NLP approaches with

machine learning allows for a more thorough evalua-

tion of patents, aiding in the detection of signiﬁcant

advances in the IoT sector.

Notwithstanding the encouraging outcomes, nu-

merous obstacles persist. The initial validation was

performed on a restricted dataset, and subsequent re-

search should aim to broaden the methodology to

encompass a wider and more varied collection of

patents. Furthermore, enhancing the effect estima-

tion model through advanced time-series forecasting

and the integration of external elements like market

adoption patterns could improve predictive accuracy.

A vital topic for enhancement is the augmentation

of the interpretability of AI-driven patent evaluation,

guaranteeing that the methodology yields practical in-

sights for academics, enterprises, and policymakers.

The next steps will focus on augmenting the

dataset, reﬁning domain-speciﬁc LLM models for

patent analysis, and assessing the methodology across

several technological areas outside IoT. Additionally,

incorporating expert feedback into the review process

could enhance the trustworthiness of automated as-

sessments. This research addresses these problems,

enhancing AI-assisted intellectual property analysis

and facilitating more efﬁcient, data-driven decision-

making in patent review.

REFERENCES

Abbas, A., Zhang, L., and Khan, S. U. (2014). A literature

review on the state-of-the-art in patent analysis.

Aristodemou, L. and Tietze, F. (2018). The state-of-the-art

on intellectual property analytics (ipa): A literature

review on artiﬁcial intelligence, machine learning and

deep learning methods for analysing intellectual prop-

erty (ip) data.

Casola, S. and Lavelli, A. (2022). Summarization, simpliﬁ-

cation, and generation: The case of patents.

Chang, Y., Wang, X., Wang, J., Wu, Y., Yang, L., Zhu,

K., Chen, H., Yi, X., Wang, C., Wang, Y., Ye, W.,

Zhang, Y., Chang, Y., Yu, P. S., Yang, Q., and Xie,

X. (2024). A survey on evaluation of large language

models. ACM Transactions on Intelligent Systems and

Technology, 15.

Di Gennaro, L., de Cristofaro, M., Loreto, G., Minutolo, V.,

Olivares, L., Zona, R., and Frunzio, G. (2024). In-

situ load testing of an ancient masonry structure using

ﬁbre optics. Structures, 70:107567.

Frumkin, M. (1947). Early history of patents for innovation.

Transactions of the Newcomen Society, 26.

Gennaro, L. D., Damiano, E., Cristofaro, M. D., Netti,

N., Olivares, L., Zona, R., Iavazzo, L., Coscetta, A.,

Mirabile, M., Giarrusso, G., D’Ettore, A., and Min-

utolo, V. (2022). An innovative geotechnical and

structural monitoring system based on the use of nsht.

Smart Materials and Structures, 31.

Jiang, S., Sarica, S., Song, B., Hu, J., and Luo, J. (2022).

Patent data for engineering design: A critical review

and future directions. Journal of Computing and In-

formation Science in Engineering, 22.

Just, J. (2024). Natural language processing for innovation

search – reviewing an emerging non-human innova-

tion intermediary. Technovation, 129.

Katz, D. M., Hartung, D., Gerlach, L., Jana, A., and Bom-

marito, M. J. (2023). Natural language processing in

the legal domain. SSRN Electronic Journal.

Krestel, R., Chikkamath, R., Hewel, C., and Risch, J.

(2021). A survey on deep learning for patent analy-

sis. World Patent Information, 65.

Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013).

Efﬁcient estimation of word representations in vector

space. In 1st International Conference on Learning

Representations, ICLR 2013 - Workshop Track Pro-

ceedings.

Min, B., Ross, H., Sulem, E., Veyseh, A. P. B., Nguyen,

T. H., Sainz, O., Agirre, E., Heintz, I., and Roth, D.

(2023). Recent advances in natural language process-

ing via large pre-trained language models: A survey.

ACM Computing Surveys, 56.

Minutolo, V., Cerri, E., Coscetta, A., Damiano, E., Cristo-

faro, M. D., Gennaro, L. D., Esposito, L., Ferla,

P., Mirabile, M., Olivares, L., and Zona, R. (2020).

Nsht: New smart hybrid transducer for structural and

geotechnical applications. Applied Sciences (Switzer-

land), 10.

Radford, A. (2018). Improving language understanding

by generative pre-training. Homology, Homotopy and

Applications, 9.

Shalaby, W. and Zadrozny, W. (2019). Patent retrieval: a

literature review.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones,

L., Gomez, A. N., Łukasz Kaiser, and Polosukhin,

I. (2017). Attention is all you need. In Advances

in Neural Information Processing Systems, volume

2017-December.

AI4EIoT 2025 - Special Session on Artiﬁcial Intelligence for Emerging IoT Systems: Open Challenges and Novel Perspectives

508