The rapid growth of research in perovskite solar cells necessitates efficient knowledge management. Existing methods like literature reviews and databases struggle to capture the complex relationships within this field. This paper introduces Perovskite-LLM, a system designed to address these challenges.
Perovskite-LLM integrates three key components: a domain-specific knowledge graph (Perovskite-KG), two complementary datasets (Perovskite-Chat and Perovskite-Reasoning), and two specialized large language models (Perovskite-Chat-LLM and Perovskite-Reasoning-LLM). Perovskite-KG organizes domain knowledge, while Perovskite-Chat and Perovskite-Reasoning provide high-quality data for instruction tuning and scientific reasoning. The specialized LLMs are trained on these datasets to achieve superior performance in knowledge retrieval and scientific reasoning tasks, offering researchers valuable tools for efficient literature review, experimental design, and problem-solving.
This paper is crucial for perovskite solar cell researchers because it provides a comprehensive knowledge-enhanced system that significantly improves knowledge retrieval and scientific reasoning. It offers effective tools for literature reviews, experimental design, and problem-solving, thereby accelerating research and innovation in this rapidly developing field. The novel multi-agent framework for generating high-quality instruction-tuning data is also a valuable contribution, potentially influencing methodology in other domains. The work opens new avenues for integrating LLMs with materials science, promising further advances in knowledge management and automated discovery.
🔼 This figure illustrates the three-stage pipeline for constructing the Perovskite Knowledge Graph (Perovskite-KG). Stage 1 involves filtering documents based on a predefined schema to select relevant publications. In Stage 2, a large language model extracts key knowledge entities and relationships from the filtered documents. Finally, Stage 3 organizes this extracted information into a structured knowledge graph stored in a graph database. The schema integrates three ontologies: fabrication, parameters, and performance, to capture the complex relationships within perovskite solar cell research.
(a) The pipeline of Perovskite-KG construction.
| Category | Rationale |
|---|---|
| Device Structure | Fundamental aspects focusing on high-efficiency (>25% PCE) device architecture and fabrication processes (Q1-Q3) |
| Perf. Enhancement | Analysis of problem-solving approaches and strategic choices in high-performance devices (Q4-Q5) |
| Metrics | Key performance indicators (VOC, FF, JSC) and their optimization methods (Q6-Q9) |
| Stability | Critical stability aspects addressing main degradation pathways: moisture, thermal, and light stability (Q10-Q12) |
| Defect & Recom. | Fundamental mechanisms affecting device efficiency through defect passivation and recombination control (Q13-Q14) |
| Interface | Interface engineering and charge transport optimization (Q15-Q17) |
| Materials | Comprehensive analysis of functional materials and their characteristics in different device components (Q18-Q21) |
🔼 This table categorizes 21 research questions related to perovskite solar cells into seven key areas: Device Structure and Fabrication, Performance Enhancement Strategies, Performance Metrics Improvement, Stability Improvements, Defect and Recombination Management, Interface and Extraction Layer Enhancements, and Materials Used in Perovskite Solar Cells. Each category includes a brief description of its technical focus, providing a structured overview of the research questions.
Table 1: Classification of Research Questions in Perovskite Solar Cell Studies
A hypothetical ‘Perovskite LLM Intro’ section would likely begin by establishing the context of perovskite solar cell research, highlighting its rapid advancements and significant potential as a next-generation photovoltaic technology. It would then introduce the challenges inherent in this field, such as the vast and rapidly growing body of research making knowledge management and efficient information retrieval difficult. The section would then motivate the need for a knowledge-enhanced large language model (LLM) specifically tailored to perovskite solar cell research, emphasizing how such a system could significantly improve the efficiency of scientific discovery and innovation by facilitating literature review, experimental design, and complex problem-solving. The introduction would briefly describe the key components of the proposed Perovskite LLM system, potentially including a domain-specific knowledge graph, curated datasets, and specialized LLMs for different reasoning tasks. Finally, the introduction should clearly state the system’s overall goal: to provide researchers with effective tools to accelerate progress in the field of perovskite solar cell research.
Constructing a robust knowledge graph (KG) for perovskite solar cell research involves a multi-step process. Data acquisition is crucial, typically involving the extraction of relevant information from a large corpus of research papers and patents. Information extraction techniques, such as Named Entity Recognition (NER) and Relation Extraction, are used to identify key entities (materials, processes, properties) and their relationships. The extracted information is then carefully curated and cleaned, addressing issues like inconsistencies and ambiguities. This stage often includes manual verification to ensure accuracy. Finally, the curated data is organized into a structured format, usually a graph database, with nodes representing entities and edges representing relationships between them. The design of the KG schema itself is a key decision; a well-designed schema facilitates efficient knowledge representation and retrieval, enabling effective querying and reasoning. The process demands significant computational resources and expertise in both materials science and knowledge graph technologies. Evaluation of the KG’s quality and completeness is essential to ensure its usefulness in downstream tasks like literature review and scientific discovery.
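As a minimal illustration of the final organization step described above, extracted (subject, relation, object) triples can be grouped into a node/edge adjacency structure before loading into a graph database. The entity and relation names below are hypothetical examples, not taken from Perovskite-KG itself:

```python
from dataclasses import dataclass

# Hypothetical output of the information-extraction stage; in practice these
# triples would come from NER and relation-extraction models run over papers.
@dataclass(frozen=True)
class Triple:
    subject: str
    relation: str
    obj: str

def build_graph(triples):
    """Organize extracted triples into an adjacency map: entity -> [(relation, entity)]."""
    graph = {}
    for t in triples:
        graph.setdefault(t.subject, []).append((t.relation, t.obj))
    return graph

triples = [
    Triple("MAPbI3", "deposited_by", "spin coating"),
    Triple("MAPbI3", "annealed_at", "120°C"),
    Triple("spin coating", "uses_solvent", "DMF"),
]
kg = build_graph(triples)
# Querying the graph: all facts recorded about one material entity.
print(kg["MAPbI3"])
```

A production system would use a real graph database rather than an in-memory dict, but the node/edge organization is the same.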
A hypothetical ‘LLM Experiments’ section in a research paper would likely detail the empirical evaluation of large language models (LLMs) applied to a specific task, such as question answering or scientific reasoning within the perovskite solar cell domain. The section should begin by clearly defining the base LLMs used (e.g., GPT-3.5-Turbo, LLaMA), and the specific modifications or fine-tuning techniques implemented to adapt them for the given task. Key metrics for evaluating the LLMs’ performance must be explicitly stated (e.g., accuracy, F1-score, perplexity) to allow for a robust comparison against baselines and other state-of-the-art models. A thorough description of the datasets employed, including their size, composition, and any preprocessing steps, is also crucial for reproducibility. The experimental setup, including hardware and software configurations, should be documented. Finally, a comprehensive analysis of the results, including statistical significance tests and error analysis, would be necessary to draw meaningful conclusions and justify any claims of improved performance.
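To make one of these metrics concrete: Rouge-L scores a candidate answer against a reference by the length of their longest common subsequence (LCS). A minimal self-contained sketch of the standard F-score formulation, independent of any particular evaluation library:

```python
def lcs_len(a, b):
    # Classic dynamic-programming longest-common-subsequence length.
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
    return dp[len(a)][len(b)]

def rouge_l(reference, candidate):
    """Rouge-L F-score over whitespace tokens."""
    ref, cand = reference.split(), candidate.split()
    lcs = lcs_len(ref, cand)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(cand), lcs / len(ref)
    return 2 * precision * recall / (precision + recall)

# LCS here is "the device high stability" (4 of 5 tokens), so F = 0.8.
print(rouge_l("the device shows high stability", "the device has high stability"))
```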
Future research in perovskite solar cells should prioritize enhancing long-term stability under diverse environmental conditions. This involves exploring new materials with improved thermal and moisture resistance, as well as advanced encapsulation techniques. Developing scalable and cost-effective manufacturing processes is crucial for widespread adoption. This requires innovation in solution processing methods, including the exploration of new precursor materials and deposition techniques. Improving the understanding of the fundamental mechanisms governing perovskite degradation is critical for guiding materials design and optimization. Advanced characterization techniques are needed to reveal the underlying causes of failure. Developing advanced large language models specifically designed for perovskite materials research can accelerate the discovery and optimization process. This includes integrating domain-specific knowledge graphs and utilizing multi-agent frameworks to solve complex scientific problems.
Analyzing a research paper’s ‘System Limits’ section requires a nuanced understanding of the technology’s capabilities and constraints. A thoughtful analysis should move beyond a simple list of limitations. Instead, it should explore the interconnectedness of these limits, examining how one limitation might exacerbate another, and exploring the underlying causes. For instance, a system’s reliance on a specific dataset might limit its generalizability, while computational constraints could restrict the complexity of models. Identifying the trade-offs inherent in design choices is crucial. For example, achieving high accuracy might necessitate sacrificing speed or efficiency. It is vital to assess the severity of each limitation within the context of the system’s intended application. A limitation that is insignificant in a controlled environment could be devastating in a real-world scenario. Furthermore, the analysis should consider potential mitigation strategies. Are there known methods to overcome these limitations? If not, what are the avenues for future research to enhance the system? This comprehensive and in-depth examination of the system’s ‘System Limits’ ultimately guides the direction of future development and informs the responsible deployment of the technology.
🔼 This figure illustrates the multi-agent framework employed to generate the instruction tuning dataset for Perovskite-LLM. Three specialized agents collaborate: an Information Extraction Agent processes raw data from various sources; a Quality Validation Agent ensures data accuracy and relevance; and a Document Summarization Agent condenses and structures the information into high-quality question-answer pairs for model training. The process integrates expert guidance to ensure both high reliability and alignment with materials science principles. The generated dataset, Perovskite-Chat, will support fine-tuning of the large language models.
(b) The multi-agent framework for perovskite instruction tuning dataset generation.
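The agent chain above can be sketched as three composed stages. This is only an illustrative skeleton: the agent names follow the figure, but each function body is a stand-in for an LLM call with the corresponding prompt, not the paper's actual prompts:

```python
def information_extraction_agent(paper_text):
    # Would prompt an LLM to pull QA-relevant facts; here: naive sentence split.
    return [s.strip() for s in paper_text.split(".") if s.strip()]

def quality_validation_agent(facts):
    # Would ask an LLM to verify each fact against the source; here: length filter.
    return [f for f in facts if len(f.split()) >= 3]

def document_summarization_agent(facts):
    # Would condense validated facts into a QA pair; here: simple formatting.
    return {"question": "What does the paper report?", "answer": " ".join(facts)}

paper = "The target device reached 25% PCE. Yes. It kept 95% efficiency after 1,000 hours."
qa_pair = document_summarization_agent(
    quality_validation_agent(information_extraction_agent(paper))
)
print(qa_pair["answer"])
```

The key design point is that validation sits between extraction and summarization, so low-quality fragments (the "Yes" above) never reach the training data.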
🔼 This figure illustrates two components: the Perovskite-KG construction pipeline and the multi-agent framework used to generate the instruction-tuning dataset. The Perovskite-KG pipeline consists of three stages: document filtering (using an expert-defined schema and an LLM), knowledge extraction (using LLMs to extract relevant information from documents), and knowledge graph organization. The multi-agent framework comprises three agents (the Information Extraction Agent, the Quality Validation Agent, and the Document Summarization Agent) that generate high-quality question-answer pairs and reasoning problems. These agents work synergistically to ensure data accuracy and consistency, ultimately building the datasets that train the LLMs.
Figure 1: The pipeline of Perovskite-KG construction and multi-agent framework.
🔼 A pie chart illustrating the distribution of question categories within the instruction tuning dataset for Perovskite-Chat. The chart visually represents the percentage of questions belonging to seven different research categories relevant to perovskite solar cell research. These categories include device structure, performance enhancement, metrics, stability, defects and recombination, interfaces, and materials. The specific percentages for each category are displayed within the chart segments.
(a) The distribution of question categories in the instruction tuning dataset.
🔼 This word cloud visualizes the most frequent terms in the Perovskite-Chat instruction tuning dataset. The larger the word, the more frequently it appears in the dataset. The prominence of terms like ‘perovskite solar’ and ‘solar cell’ reflects the dataset’s focus. Other frequent terms, such as ‘device structure,’ ‘configuration,’ and ‘stability,’ highlight the key technical areas emphasized in the dataset.
(b) The word cloud of the instruction tuning dataset.
More on tables
| Model | PPL | Rouge-L | LLM-Judge |
|---|---|---|---|
| GPT-3.5-Turbo | - | 11.24 | 1.24 |
| GPT-4o-Mini | - | 11.90 | 1.34 |
| GPT-4o | - | 11.36 | 1.41 |
| LLaMA-3.1-8B | 6.77 | 13.18 | 1.28 |
| LLaMA-3.1-70B | 4.98 | 17.38 | 1.80 |
| Qwen-2.5-7B | 6.23 | 11.22 | 1.39 |
| Qwen-2.5-72B | 5.12 | 10.17 | 1.31 |
| Perovskite-Chat-LLM | 2.97 | 41.25 | 2.97 |
🔼 This table presents a comparison of the performance of various large language models (LLMs) on the Perovskite QA dataset. The models are evaluated across three key metrics: perplexity (PPL), Rouge-L score, and LLM-Judge score. Lower perplexity and higher Rouge-L and LLM-Judge scores indicate better performance. The table includes baseline LLMs like GPT-3.5-Turbo and LLaMA-3.1-8B, allowing for a direct comparison with the Perovskite-Chat-LLM, a specialized LLM trained on a perovskite-specific dataset. This comparison highlights the effectiveness of the specialized LLM in handling perovskite-related questions.
Table 2: Performance of Perovskite-Chat-LLM on Perovskite QA
| Model | Easy | Hard | All |
|---|---|---|---|
| GPT-3.5-Turbo | 86.63 | 49.29 | 77.15 |
| GPT-4o-Mini | 89.79 | 61.79 | 82.68 |
| GPT-4o | 91.37 | 65.00 | 84.68 |
| LLaMA-3.1-8B | 100.00 | 0.00 | 74.61 |
| LLaMA-3.1-70B | 93.44 | 66.43 | 86.58 |
| Qwen-2.5-7B | 92.22 | 55.36 | 82.86 |
| Qwen-2.5-72B | 93.07 | 64.29 | 85.77 |
| Perovskite-LLM | 95.50 | 62.86 | 87.22 |
🔼 This table presents the performance of the Perovskite-Chat-LLM model on a multiple-choice question (MCQ) dataset related to perovskite solar cells. The model’s performance is compared against the LLaMA-3.1-8B baseline model. The difficulty of the questions is categorized as either ‘Easy’ or ‘Hard’ based on the LLaMA-3.1-8B baseline model’s ability to answer them correctly. The table shows the accuracy of each model on easy questions, hard questions, and all questions combined, offering a comprehensive evaluation of the Perovskite-Chat-LLM model’s capabilities.
Table 3: Performance of Perovskite-Chat-LLM on Perovskite MCQ. The LLaMA-3.1-8B baseline model’s performance defines Easy/Hard question categories.
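The Easy/Hard split described in the caption reduces to partitioning questions by whether the baseline model answers them correctly. A minimal sketch with hypothetical question IDs and baseline results:

```python
def split_by_baseline(questions, baseline_correct):
    """Label a question Easy if the baseline answered it correctly, else Hard."""
    easy = [q for q in questions if baseline_correct[q]]
    hard = [q for q in questions if not baseline_correct[q]]
    return easy, hard

# Hypothetical per-question correctness of the LLaMA-3.1-8B baseline.
questions = ["Q1", "Q2", "Q3"]
baseline_correct = {"Q1": True, "Q2": False, "Q3": True}
easy, hard = split_by_baseline(questions, baseline_correct)
print(easy, hard)
```

This explains the 100.00/0.00 row for LLaMA-3.1-8B in Table 3: by construction, the baseline scores perfectly on Easy questions and zero on Hard ones.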
| Model | # ex | GPQA | Minerva | Avg |
|---|---|---|---|---|
| *API Models* | | | | |
| o1 | - | 77.30 | - | - |
| o1-preview | - | 73.30 | 47.10 | 60.20 |
| o1-mini | - | 60.00 | - | - |
| Deepseek-R1 | - | 71.50 | - | - |
| *32B* | | | | |
| Qwen2.5-32B-Instruct | - | 48.00 | 41.20 | 44.60 |
| QwQ-32B-preview | - | 65.10 | 39.00 | 52.05 |
| LIMO-32B* | 0.8K | 66.70 | 44.90 | 55.80 |
| S1-32B* | 1K | 59.60 | 47.79 | 53.69 |
| *7B* | | | | |
| R1-Qwen2.5-7B* | 800K | 44.49 | 25.25 | 34.87 |
| R1-LLaMA3-8B* | 800K | 19.19 | 30.51 | 24.85 |
| OpenThinker-7B* | 114K | 42.90 | 41.10 | 42.00 |
| Perovskite-Reasoning-LLM | 2K | 43.95 | 44.49 | 44.22 |
🔼 This table presents a performance comparison of the Perovskite-Reasoning-LLM model against other existing models on two established benchmarks for evaluating reasoning capabilities: GPQA and Minerva. The table highlights the number of training examples used for each model and shows the achieved scores on both benchmarks. The asterisk (*) indicates results obtained through the authors’ own evaluation.
Table 4: We evaluate the performance of Perovskite-Reasoning-LLM on the GPQA and Minerva benchmarks. * indicates the results are from our evaluation. # ex = number of examples used for fine-tuning.
| Ontology | Sub-Category | Data Type | Description | Example |
|---|---|---|---|---|
| Fabrication | Coating Parameter | Float | The specifics of the coating method used in the material deposition process. | 5000 rpm, 100 µl |
| Fabrication | Method | String | Different fabrication techniques, involving variations in material deposition. | spin coating |
| Fabrication | Annealing Parameter | Float | The heating conditions applied to the perovskite, which are essential for crystallization and stability. | 120°C, 10 min |
| Parameters | Solvent | String | The liquid medium used to dissolve precursors, helping to form a uniform perovskite layer. | Dimethylformamide (DMF) |
| Parameters | Device Structure | String | The architecture of the device (e.g., layer order, material interfaces). | ITO/SAM/perovskite/C60/BCP/Cu |
| Parameters | Additive | String | Any additional materials or chemicals. | potassium ions |
| Performance | Thermal Stability | String | The material’s ability to withstand heat without degrading. | >98% of initial efficiency of >24% after 1,500 hours of continuous maximum power point tracking |
| Performance | Light Stability | String | How resistant the material is to prolonged exposure to light. | >92% of initial performance for 1,200 hours under the damp-heat test (85°C and 85% relative humidity) |
| Performance | Moisture Stability | String | The material’s resilience against humidity or water exposure. | Initial PCE of control, target-1 and target-2 devices is 21.73%, 24.42% and 24.11%, respectively; degraded to 78% of initial PCE after 1,500 hours at 55±5°C |
| Performance | Fill Factor Value | Float | A measure of the device’s maximum power output. | 0.88 |
| Performance | Open-Circuit Voltage Value | Float | The maximum voltage the device can produce under open-circuit conditions. | 1.2 V |
| Performance | Short-Circuit Current Value | Float | The current density when the circuit is closed. | 25 mA/cm² |
| Performance | Power Conversion Efficiency Value | Float | The efficiency with which the device converts sunlight into electricity. | 25% |
🔼 This table presents the schema used to build the Perovskite-KG knowledge graph. It details the ontology categories (Fabrication, Parameters, Performance), sub-categories within each ontology, the data type for each sub-category, and an example to clarify each entry. The schema organizes knowledge about perovskite materials, their synthesis, device components, and performance metrics into a structured format for improved knowledge retrieval and reasoning.
Table 5: Schema in Perovskite-KG.
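One way to see how the schema constrains extraction is to represent a single Table 5 entry in code. The field names below mirror the table's columns; the class itself is an illustrative sketch, not part of the released system:

```python
from dataclasses import dataclass

@dataclass
class SchemaEntry:
    ontology: str       # one of: Fabrication, Parameters, Performance
    sub_category: str
    data_type: str      # Float or String
    description: str
    example: str

# Populated from the Annealing Parameter row of Table 5.
entry = SchemaEntry(
    ontology="Fabrication",
    sub_category="Annealing Parameter",
    data_type="Float",
    description="Heating conditions applied to the perovskite.",
    example="120°C, 10 min",
)
print(entry.ontology, entry.example)
```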
🔼 This table lists the prompts used by the Information Extraction Agent, a key component of the multi-agent framework for generating the instruction tuning dataset. Each prompt is a question designed to extract specific information about perovskite solar cells from research papers. The prompts are categorized by seven key research areas related to perovskite solar cells, including Device Structure and Fabrication, Performance Enhancement Strategies, and Materials Used in Perovskite Solar Cells. Each category contains multiple questions focusing on different aspects of perovskite solar cell research.
Table 6: Prompts for Information Extraction Agent.
🔼 This table details the prompts used by the Verification Agent, a crucial component in the Perovskite-KG construction pipeline. The agent’s role is to ensure the accuracy of information extracted from research papers. The prompt instructs the agent to compare extracted data against the original text, identifying and correcting any discrepancies while preserving the original meaning and details such as numerical values and material names. The expected output is a JSON object containing the corrected information and an explanation of any modifications made.
Table 7: Prompts for Verification Agent.
🔼 This table details the prompts used by the Organization Agent within a multi-agent framework designed for constructing a knowledge graph related to perovskite solar cells. The agent’s task is to synthesize verified information from multiple sources into coherent, topic-focused responses, ensuring that complex technical information is presented logically and accessibly. The prompt instructs the agent to organize information provided as paragraphs that answer a specific question, returning a JSON object containing a single ‘answer’ field with the synthesized, continuous response.
Table 8: Prompts for Organization Agent.
🔼 This table presents the prompts used for evaluating the quality of the large language model (LLM)’s responses. It outlines the criteria for assessment, including accuracy, completeness, relevance, and clarity, each rated on a scale of 1 to 5. The prompts also request an overall score and summary evaluation, all formatted as a JSON object for structured reporting.
Table 9: Prompts for LLM-Judge.
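Since the LLM-Judge is asked for a structured JSON report over four criteria (accuracy, completeness, relevance, clarity, each rated 1-5), the evaluation side reduces to parsing that JSON and aggregating the scores. The field names and sample response below are assumptions for illustration, not the paper's exact prompt schema:

```python
import json

# Stand-in for a judge model's JSON response; real responses come from an LLM.
judge_response = json.dumps({
    "accuracy": 4, "completeness": 3, "relevance": 5, "clarity": 4,
    "summary": "Mostly correct; misses one stability detail.",
})

def overall_score(response_text):
    """Average the four criterion scores from a judge's JSON report."""
    scores = json.loads(response_text)
    criteria = ["accuracy", "completeness", "relevance", "clarity"]
    return sum(scores[c] for c in criteria) / len(criteria)

print(overall_score(judge_response))  # → 4.0
```

Requesting JSON rather than free text makes judge outputs machine-checkable, which matters when scoring thousands of responses as in Table 2.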
🔼 This table systematically categorizes key research questions in the field of high-performance perovskite solar cells. It groups questions into seven major categories: Device Structure and Fabrication, Performance Enhancement Strategies, Performance Metrics Improvement, Stability Improvements, Defect and Recombination Management, Interface and Extraction Layer Enhancements, and Materials Used in Perovskite Solar Cells. Each category contains several specific research questions focusing on different aspects of perovskite solar cell technology, providing a comprehensive overview of the most important research areas and challenges.
Table 10: Systematic Classification of Research Questions in High-Performance Perovskite Solar Cell Studies
🔼 This table lists the full names and abbreviated versions of seven research categories related to perovskite solar cells. It also shows the number of research papers included in each category, illustrating the relative focus of research within the perovskite solar cell field.
Table 11: Correspondence between abbreviated and full names of research categories in perovskite solar cells
🔼 This table details the hyperparameters used during the training process of two large language models (LLMs): Perovskite-Chat-LLM and Perovskite-Reasoning-LLM. It lists the specific values used for key parameters such as learning rate, batch size, number of epochs, optimizer, learning rate scheduler, and warmup steps. These hyperparameters are crucial in influencing the performance and efficiency of the training process for each LLM. The table allows for comparison of the training strategies employed for the two LLMs.
Table 12: Training Hyperparameters for Perovskite-Chat-LLM and Perovskite-Reasoning-LLM