🔬 DiffMat2D: 基于扩散模型的二维材料生成与优化

Diffusion Model for 2D Material Generation with HER Optimization

利用条件扩散模型从通用晶体数据库中学习材料结构特征，通过智能优化手段，设计并生成具备高HER催化活性、高稳定性和实验可合成性的新型二维材料。

快速开始 • 模型架构 • 实验结果 • 创新点

📋 目录

项目背景
核心特性
项目结构
模型架构
算法原理
快速开始
训练指南
材料生成
结果展示
创新点
与Baseline对比
API参考
常见问题

🎯 项目背景

研究动机

在清洁能源领域，氢气演化反应（HER） 是电解水制氢的核心步骤。高效的HER催化剂可以显著降低制氢成本，推动氢能经济发展。二维材料因其独特的表面原子排布和可调节的电子结构，成为HER催化剂的理想候选材料。

然而，传统的材料筛选方法面临以下挑战：

挑战	描述
🔍 搜索空间巨大	可能的二维材料组合超过10^6种
⏱️ 实验周期长	单个材料合成与测试需要数周至数月
💰 成本高昂	DFT计算和实验验证成本高
🎲 成功率低	随机筛选的命中率极低

解决方案

本项目提出了一种基于条件扩散模型的智能材料生成框架：

┌─────────────────────────────────────────────────────────────────┐
│                        DiffMat2D Framework                       │
├─────────────────────────────────────────────────────────────────┤
│                                                                  │
│   ┌──────────────┐     ┌──────────────┐     ┌──────────────┐    │
│   │   晶体数据库   │ ──▶ │  扩散模型学习  │ ──▶ │  目标导向生成  │    │
│   │  (MP/C2DB)   │     │   结构特征    │     │   新型材料    │    │
│   └──────────────┘     └──────────────┘     └──────────────┘    │
│                                                                  │
│   ┌──────────────────────────────────────────────────────────┐  │
│   │              Multi-Task Optimization                      │  │
│   │   HER活性 (ΔG_H→0) + 稳定性 + 可合成性                      │  │
│   └──────────────────────────────────────────────────────────┘  │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

✨ 核心特性

🧠 智能生成条件扩散模型学习材料结构分布基于GNN的等变消息传递支持目标导向的可控生成	⚡ 多目标优化 HER催化活性（ΔG_H → 0 eV）热力学/动力学稳定性实验可合成性预测
🔄 端到端流程从晶体图直接生成CIF文件自动结构验证与优化集成性能评估模块	📊 完整可视化训练过程实时监控生成材料性能分析与Baseline对比评估

📁 项目结构

project/
│
├── 📂 models/                          # 核心模型
│   ├── diffusion_model.py              # 条件扩散模型 (GNN-based)
│   ├── structure_generator.py          # 晶体结构生成器
│   └── optimization.py                 # 多任务优化模块
│
├── 📂 dataset/                         # 数据处理
│   └── material_dataset.py             # 材料数据集与图转换
│
├── 📂 utils/                           # 工具函数
│   ├── geo_utils.py                    # HER活性/稳定性计算
│   └── vis.py                          # 结果可视化
│
├── 📂 data/                            # 数据存储
│   ├── raw/                            # 原始CIF文件 (500个)
│   └── 2d_materials/                   # 筛选后的二维材料 (81个)
│
├── 📂 results/                         # 输出结果
│   ├── loss_curve.png                  # 训练损失曲线
│   ├── her_performance.png             # HER性能分布
│   ├── stability_curve.png             # 稳定性评估
│   ├── generated_structures.png        # 生成结构可视化
│   ├── comparison_table.png            # 与Baseline对比
│   └── generated/                      # 生成的CIF文件 (10个)
│
├── 📂 checkpoints/                     # 模型权重
│   ├── best_model.pt                   # 最优模型
│   ├── checkpoint_epoch_*.pt           # 训练检查点
│   └── final_model.pt                  # 最终模型
│
├── 🐍 train.py                         # 训练脚本
├── 🐍 test.py                          # 测试与生成脚本
├── 📄 requirements.txt                 # 依赖列表
└── 📄 README.md                        # 项目说明

🏗 模型架构

1. 整体框架

┌─────────────────────────────────────────────────────────────────────────┐
│                      DiffMat2D: End-to-End Pipeline                      │
├─────────────────────────────────────────────────────────────────────────┤
│                                                                          │
│  ╔═══════════════╗    ╔═══════════════════╗    ╔═══════════════════╗    │
│  ║   Input       ║    ║   Conditional     ║    ║    Structure      ║    │
│  ║   Crystal     ║───▶║   Diffusion       ║───▶║    Generator      ║    │
│  ║   Graph       ║    ║   Model           ║    ║    + Validation   ║    │
│  ╚═══════════════╝    ╚═══════════════════╝    ╚═══════════════════╝    │
│         │                      │                         │               │
│         │              ┌───────┴───────┐                 │               │
│         │              │   Condition   │                 │               │
│         │              │   Embedding   │                 │               │
│         │              └───────────────┘                 │               │
│         │                      ▲                         │               │
│         │              ┌───────┴───────┐                 ▼               │
│         │              │  Target Props │         ┌─────────────┐        │
│         │              │  (ΔG_H, Stab, │         │   Output    │        │
│         │              │   Synth)      │         │   CIF File  │        │
│         │              └───────────────┘         └─────────────┘        │
│         │                                                │               │
│         └────────────────────┬───────────────────────────┘               │
│                              ▼                                           │
│                   ┌─────────────────────┐                                │
│                   │  Multi-Task Loss    │                                │
│                   │  L = L_noise + λ₁L_HER + λ₂L_stab + λ₃L_synth       │
│                   └─────────────────────┘                                │
│                                                                          │
└─────────────────────────────────────────────────────────────────────────┘

2. 条件扩散模型详解

┌─────────────────────────────────────────────────────────────────────────┐
│                    Conditional Crystal Diffusion Model                   │
├─────────────────────────────────────────────────────────────────────────┤
│                                                                          │
│  Input:  Crystal Graph G = (V, E)                                        │
│          V: atoms with types and coordinates                             │
│          E: interatomic bonds                                            │
│                                                                          │
│  ┌─────────────────────────────────────────────────────────────────┐    │
│  │                    Embedding Layer                               │    │
│  │  ┌──────────────┐  ┌──────────────┐  ┌────────────────────┐     │    │
│  │  │ Atom Type    │  │ Time Step    │  │ Condition Vector   │     │    │
│  │  │ Embedding    │  │ Embedding    │  │ [ΔG_H, stab, synth]│     │    │
│  │  │ (54 types)   │  │ (Sinusoidal) │  │ MLP Projection     │     │    │
│  │  └──────┬───────┘  └──────┬───────┘  └─────────┬──────────┘     │    │
│  │         │                 │                    │                │    │
│  │         └────────────┬────┴────────────────────┘                │    │
│  │                      ▼                                          │    │
│  │              ┌───────────────┐                                  │    │
│  │              │  Concatenate  │ → h₀ ∈ ℝ^(hidden_dim)            │    │
│  │              └───────────────┘                                  │    │
│  └──────────────────────┬──────────────────────────────────────────┘    │
│                         ▼                                                │
│  ┌─────────────────────────────────────────────────────────────────┐    │
│  │              Equivariant Graph Convolution Blocks × 4            │    │
│  │                                                                  │    │
│  │    ┌─────────────────────────────────────────────────────┐      │    │
│  │    │  Block i:                                            │      │    │
│  │    │                                                      │      │    │
│  │    │  1. Message Passing (3 layers):                     │      │    │
│  │    │     m_ij = MLP([h_i || h_j || e_ij || d_ij])        │      │    │
│  │    │     h_i' = h_i + Σ_j m_ij                           │      │    │
│  │    │                                                      │      │    │
│  │    │  2. Time Conditioning:                              │      │    │
│  │    │     h_i'' = h_i' + t_embed                          │      │    │
│  │    │                                                      │      │    │
│  │    │  3. Coordinate Update (SE(3) Equivariant):          │      │    │
│  │    │     Δx_i = Σ_j (x_i - x_j) · φ(m_ij)                │      │    │
│  │    │     x_i' = x_i + Δx_i                               │      │    │
│  │    │                                                      │      │    │
│  │    └─────────────────────────────────────────────────────┘      │    │
│  │                                                                  │    │
│  └──────────────────────┬──────────────────────────────────────────┘    │
│                         ▼                                                │
│  ┌─────────────────────────────────────────────────────────────────┐    │
│  │                     Output Heads                                 │    │
│  │  ┌────────────────────┐       ┌────────────────────┐            │    │
│  │  │  Atom Type Head    │       │  Coordinate Head   │            │    │
│  │  │  MLP → Softmax     │       │  MLP → ℝ³          │            │    │
│  │  │  ε_type ∈ ℝ^54     │       │  ε_coord ∈ ℝ³      │            │    │
│  │  └────────────────────┘       └────────────────────┘            │    │
│  └─────────────────────────────────────────────────────────────────┘    │
│                                                                          │
└─────────────────────────────────────────────────────────────────────────┘

3. 多任务优化模块

┌─────────────────────────────────────────────────────────────────────────┐
│                       Multi-Task Optimizer                               │
├─────────────────────────────────────────────────────────────────────────┤
│                                                                          │
│  ┌───────────────────────────────────────────────────────────────────┐  │
│  │                        Loss Components                             │  │
│  │                                                                    │  │
│  │  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐    │  │
│  │  │  HER Activity   │  │   Stability     │  │ Synthesizability│    │  │
│  │  │     Loss        │  │     Loss        │  │     Loss        │    │  │
│  │  │                 │  │                 │  │                 │    │  │
│  │  │ L_HER = |ΔG_H|² │  │ L_stab =        │  │ L_synth =       │    │  │
│  │  │                 │  │ max(E_f, 0) +   │  │ BCE(pred, 1) +  │    │  │
│  │  │ Sabatier        │  │ kinetic_penalty │  │ complexity_reg  │    │  │
│  │  │ Principle       │  │                 │  │                 │    │  │
│  │  └────────┬────────┘  └────────┬────────┘  └────────┬────────┘    │  │
│  │           │                    │                    │              │  │
│  │           │     w₁=0.3         │     w₂=0.4        │    w₃=0.3   │  │
│  │           └──────────┬─────────┴─────────┬─────────┘              │  │
│  │                      ▼                   ▼                        │  │
│  │           ┌──────────────────────────────────────────┐            │  │
│  │           │  L_total = w₁·L_HER + w₂·L_stab          │            │  │
│  │           │           + w₃·L_synth                   │            │  │
│  │           │                                          │            │  │
│  │           │  + Dynamic Weight Adjustment:            │            │  │
│  │           │    w_i(t) = w_i · exp(-L_i / τ)          │            │  │
│  │           └──────────────────────────────────────────┘            │  │
│  │                                                                    │  │
│  └───────────────────────────────────────────────────────────────────┘  │
│                                                                          │
└─────────────────────────────────────────────────────────────────────────┘

📐 算法原理

扩散过程

扩散模型通过逐步添加噪声将数据分布转换为高斯分布，然后学习逆过程进行生成。

前向过程（加噪）

给定初始晶体结构 $x_0$，前向过程定义为：

$$q(x_t|x_{t-1}) = \mathcal{N}(x_t; \sqrt{1-\beta_t}x_{t-1}, \beta_t \mathbf{I})$$

经过 $T$ 步后：

$$q(x_T|x_0) = \mathcal{N}(x_T; \sqrt{\bar{\alpha}_T}x_0, (1-\bar{\alpha}_T)\mathbf{I})$$

其中 $\bar{\alpha}t = \prod{s=1}^{t}(1-\beta_s)$

逆向过程（去噪）

模型学习参数化的逆向过程：

$$p_\theta(x_{t-1}|x_t, c) = \mathcal{N}(x_{t-1}; \mu_\theta(x_t, t, c), \sigma_t^2\mathbf{I})$$

其中 $c = [\Delta G_H, \text{stability}, \text{synthesizability}]$ 是条件向量。

训练目标

噪声预测损失：

$$\mathcal{L}_{\text{noise}} = \mathbb{E}_{t, x_0, \epsilon}\left[|\epsilon - \epsilon_\theta(x_t, t, c)|^2\right]$$

HER活性损失（Sabatier原理）：

$$\mathcal{L}_{\text{HER}} = |\Delta G_H|^2$$

最优HER催化剂要求 $\Delta G_H \approx 0$ eV

稳定性损失：

$$\mathcal{L}_{\text{stab}} = \max(E_f, 0) + \lambda_k \cdot \mathbb{1}[\text{unstable phonon}]$$

可合成性损失：

$$\mathcal{L}_{\text{synth}} = -\log p(\text{synthesizable}|x) + \gamma \cdot \text{complexity}(x)$$

总损失：

$$\mathcal{L}_{\text{total}} = \mathcal{L}_{\text{noise}} + \lambda_1\mathcal{L}_{\text{HER}} + \lambda_2\mathcal{L}_{\text{stab}} + \lambda_3\mathcal{L}_{\text{synth}}$$

条件生成

在采样过程中，通过classifier-free guidance增强条件控制：

$$\tilde{\epsilon}_\theta(x_t, t, c) = (1+w) \cdot \epsilon_\theta(x_t, t, c) - w \cdot \epsilon_\theta(x_t, t, \emptyset)$$

其中 $w$ 是guidance权重。

🚀 快速开始

环境要求

依赖	版本要求	说明
Python	≥ 3.10	推荐3.10或3.11
PyTorch	≥ 2.0	支持CUDA加速
PyTorch Geometric	≥ 2.3	图神经网络
Pymatgen	≥ 2024.1	材料结构处理
ASE	≥ 3.22	原子模拟环境
CUDA	≥ 11.8	可选，GPU加速

安装步骤

# 1. 克隆仓库
git clone https://github.com/your-repo/DiffMat2D.git
cd DiffMat2D

# 2. 创建虚拟环境
conda create -n diffmat2d python=3.10
conda activate diffmat2d

# 3. 安装PyTorch（根据CUDA版本选择）
# CUDA 11.8
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
# CPU only
pip install torch torchvision torchaudio

# 4. 安装其他依赖
pip install -r requirements.txt

# 5. 安装PyTorch Geometric
pip install torch-geometric

# 6. 验证安装
python -c "import torch; import torch_geometric; print('Installation successful!')"

数据准备

获取Materials Project API密钥
- 访问 https://materialsproject.org/
- 注册账户并获取API密钥

配置环境变量

# 创建.env文件
echo "MP_API_KEY=your_api_key_here" > .env

下载数据（可选，已包含预处理数据）

python -c "from dataset.material_dataset import MaterialDataset; d = MaterialDataset('data/2d_materials'); print(f'Loaded {len(d)} materials')"

🎓 训练指南

基础训练

# 使用默认参数训练
python train.py

# 指定训练参数
python train.py --epochs 100 --batch_size 16 --lr 1e-4

GPU训练（推荐）

# 单GPU训练
python train.py --device cuda --batch_size 32 --epochs 100

# 指定GPU
CUDA_VISIBLE_DEVICES=0 python train.py --device cuda

从检查点恢复

python train.py --resume checkpoints/checkpoint_epoch_50.pt

完整参数说明

参数	类型	默认值	说明
`--epochs`	int	100	训练轮数
`--batch_size`	int	16	批大小
`--lr`	float	1e-4	学习率
`--hidden_dim`	int	128	隐藏层维度
`--num_blocks`	int	4	扩散块数量
`--num_timesteps`	int	500	扩散时间步
`--device`	str	"cpu"	训练设备 (cpu/cuda)
`--save_interval`	int	10	检查点保存间隔
`--resume`	str	None	恢复训练的检查点路径
`--weight_her`	float	0.3	HER损失权重
`--weight_stab`	float	0.4	稳定性损失权重
`--weight_synth`	float	0.3	可合成性损失权重

训练过程监控

训练过程中会自动保存：

checkpoints/checkpoint_epoch_*.pt - 定期检查点
checkpoints/best_model.pt - 验证集最优模型
checkpoints/final_model.pt - 最终模型
results/loss_curve.png - 损失曲线

🧪 材料生成

基础生成

# 生成10个材料
python test.py --num_samples 10

# 使用特定模型
python test.py --model_path checkpoints/best_model.pt --num_samples 20

目标导向生成

# 指定目标属性
python test.py \
    --num_samples 20 \
    --target_delta_g 0.0 \
    --target_stability 0.9 \
    --target_synth 0.85 \
    --guidance_scale 2.0

生成参数说明

参数	类型	默认值	说明
`--model_path`	str	None	模型权重路径
`--num_samples`	int	10	生成材料数量
`--target_delta_g`	float	0.0	目标ΔG_H值 (eV)
`--target_stability`	float	0.8	目标稳定性分数
`--target_synth`	float	0.8	目标可合成性分数
`--guidance_scale`	float	1.5	条件引导强度
`--device`	str	"cpu"	推理设备

输出文件

生成完成后，在 results/ 目录下会生成：

results/
├── generated/                    # 生成的CIF文件
│   ├── MaterialName_000.cif
│   ├── MaterialName_001.cif
│   └── ...
├── her_performance.png           # HER性能分布图
├── stability_curve.png           # 稳定性评估图
├── generated_structures.png      # 结构可视化
└── comparison_table.png          # 对比表格

📊 结果展示

训练损失曲线

训练过程中总损失、HER损失、稳定性损失和可合成性损失的变化曲线。

HER催化性能分布

左图：生成材料的ΔG_H值分布，绿色区域表示最优催化活性（|ΔG_H| < 0.1 eV）
右图：ΔG_H与稳定性的关系散点图

稳定性与可合成性评估

展示生成材料的：

形成能分布（热力学稳定性）
稳定性综合分数
可合成性预测概率

生成材料结构

Top-10生成材料的属性雷达图，综合展示HER活性、稳定性和可合成性。

实验指标汇总

指标	数值	说明
平均 ΔG_H	0.008 ± 0.103 eV	接近最优值0 eV
平均稳定性分数	0.898	高于阈值0.8
高质量材料占比	80%	满足所有目标的材料比例
结构有效率	100%	生成的有效结构比例

💡 创新点

1. 🎯 条件扩散模型

创新性：首次将条件扩散模型应用于HER催化剂的目标导向设计

将HER活性（ΔG_H）、稳定性和可合成性作为条件输入
支持在生成过程中主动控制目标属性
基于SE(3)等变图卷积保持物理对称性

# 条件生成示例
condition = torch.tensor([[0.0, 0.9, 0.85]])  # [ΔG_H, stability, synth]
generated = model.sample(num_atoms=10, condition=condition)

2. 🔄 多任务联合优化

创新性：设计了协同优化HER活性、稳定性和可合成性的多任务学习框架

动态权重调整策略平衡多目标
Pareto前沿筛选获取最优材料
避免单一目标过拟合

$$\mathcal{L}_{total} = \sum_i w_i(t) \cdot \mathcal{L}_i, \quad w_i(t) = w_i \cdot \exp(-\mathcal{L}_i / \tau)$$

3. 🏭 端到端生成框架

创新性：构建了从晶体图到CIF文件的完整生成管道

图神经网络学习原子间相互作用
自动验证结构有效性（键长、2D特性等）
集成材料性能评估模块

Input Graph → Diffusion → Coordinate Prediction → Validation → CIF Output

4. 🧬 可合成性预测

创新性：基于已知二维材料库构建可合成性预测模型

元素组合合理性评估（基于已知二维材料统计）
结构复杂度惩罚
与C2DB等数据库的相似性匹配

⚖ 与Baseline对比

性能对比表

Method	Avg HER ΔG (eV)	Stability Score	Synthesis Success Rate
Baseline (material_generation)	0.25	0.65	45%
DiffMat2D (Ours)	0.08 ↓68%	0.82 ↑26%	72% ↑60%

改进说明

改进项	提升幅度	原因分析
HER活性	↓68%	条件生成直接优化ΔG_H目标，而非后验筛选
稳定性	↑26%	多任务联合优化避免牺牲稳定性换取活性
可合成性	↑60%	引入实验数据约束和复杂度惩罚

关键技术对比

技术点	Baseline	Ours
生成模型	VAE	Conditional Diffusion
图网络	GCN	SE(3) Equivariant GNN
优化目标	单任务	多任务联合
条件控制	无	Classifier-free Guidance
结构验证	基础检查	物理化学约束

📚 API参考

MaterialDataset

from dataset.material_dataset import MaterialDataset

# 加载数据集
dataset = MaterialDataset(root_dir='data/2d_materials')

# 获取单个样本
data = dataset[0]
# data.x: 原子特征 [num_atoms, num_features]
# data.pos: 原子坐标 [num_atoms, 3]
# data.edge_index: 边索引 [2, num_edges]
# data.delta_g_h: HER自由能
# data.stability: 稳定性分数
# data.synthesizability: 可合成性分数

ConditionalDiffusionModel

from models.diffusion_model import ConditionalDiffusionModel

# 创建模型
model = ConditionalDiffusionModel(
    num_atom_types=54,      # 支持的元素种类
    hidden_dim=128,         # 隐藏层维度
    time_dim=64,            # 时间嵌入维度
    num_blocks=4,           # 图卷积块数量
    num_timesteps=500,      # 扩散时间步
    condition_dim=3         # 条件向量维度
)

# 训练步骤
loss = model.training_step(batch)

# 条件生成
condition = torch.tensor([[0.0, 0.9, 0.85]])
samples = model.sample(num_atoms=10, condition=condition)

StructureGenerator

from models.structure_generator import StructureGenerator

# 创建生成器
generator = StructureGenerator(
    diffusion_model=model,
    default_vacuum=15.0,    # 真空层厚度
    min_bond_length=1.5     # 最小键长
)

# 生成结构
structures = generator.generate(
    num_structures=10,
    condition=condition,
    validate=True
)

# 保存为CIF
for i, struct in enumerate(structures):
    struct.to(f'output_{i}.cif')

HERCalculator

from utils.geo_utils import HERCalculator

calculator = HERCalculator()

# 计算ΔG_H
delta_g = calculator.calculate_delta_g_h(structure)

# 评估稳定性
stability = calculator.evaluate_stability(structure)

# 预测可合成性
synth = calculator.predict_synthesizability(structure)

❓ 常见问题

Q1: 如何处理CUDA内存不足？

# 减小批大小
python train.py --batch_size 8

# 减少扩散时间步
python train.py --num_timesteps 200

# 使用梯度累积
python train.py --gradient_accumulation 4

Q2: 生成的结构验证失败？

可能原因：

键长过短/过长 → 调整 min_bond_length 参数
非二维结构 → 增加 z 方向约束
原子重叠 → 增加坐标归一化步骤

Q3: 如何添加新的元素类型？

编辑 dataset/material_dataset.py 中的 SUPPORTED_ELEMENTS 列表：

SUPPORTED_ELEMENTS = [
    'H', 'C', 'N', 'O', ...  # 添加新元素
]

Q4: 如何自定义损失函数权重？

python train.py \
    --weight_her 0.4 \
    --weight_stab 0.3 \
    --weight_synth 0.3

📖 参考文献

Ho, J., Jain, A., & Abbeel, P. (2020). Denoising Diffusion Probabilistic Models. NeurIPS.
Xie, T., & Grossman, J. C. (2018). Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties. Physical Review Letters, 120(14), 145301.
Zuo, Y., et al. (2024). MatterGen: A Generative Model for Inorganic Materials Design. Nature.
Satorras, V. G., Hoogeboom, E., & Welling, M. (2021). E(n) Equivariant Graph Neural Networks. ICML.
Haastrup, S., et al. (2018). The Computational 2D Materials Database: High-throughput Modeling and Discovery of Atomically Thin Crystals. 2D Materials.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 致谢

Materials Project - 提供材料数据库支持
PyTorch Geometric - 图神经网络框架
Pymatgen - 材料结构处理库

Made with ❤️ for Materials Science

如有问题，请提交 Issue

🧠 智能生成条件扩散模型学习材料结构分布基于GNN的等变消息传递支持目标导向的可控生成	⚡ 多目标优化 HER催化活性（ΔG_H → 0 eV）热力学/动力学稳定性实验可合成性预测
🔄 端到端流程从晶体图直接生成CIF文件自动结构验证与优化集成性能评估模块	📊 完整可视化训练过程实时监控生成材料性能分析与Baseline对比评估

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
checkpoints		checkpoints
data		data
dataset		dataset
models		models
results		results
utils		utils
.gitignore		.gitignore
OPTIMIZATION_GUIDE.md		OPTIMIZATION_GUIDE.md
OPTIMIZATION_RESULTS.md		OPTIMIZATION_RESULTS.md
README.md		README.md
requirements.txt		requirements.txt
test.py		test.py
test_optimized.py		test_optimized.py
train.py		train.py
train_optimized.py		train_optimized.py

Folders and files

Latest commit

History

Repository files navigation

🔬 DiffMat2D: 基于扩散模型的二维材料生成与优化

📋 目录

🎯 项目背景

研究动机

解决方案

✨ 核心特性

🧠 智能生成

⚡ 多目标优化

🔄 端到端流程

📊 完整可视化

📁 项目结构

🏗 模型架构

1. 整体框架

2. 条件扩散模型详解

3. 多任务优化模块

📐 算法原理

扩散过程

前向过程（加噪）

逆向过程（去噪）

训练目标

条件生成

🚀 快速开始

环境要求

安装步骤

数据准备

🎓 训练指南

基础训练

GPU训练（推荐）

从检查点恢复

完整参数说明

训练过程监控

🧪 材料生成

基础生成

目标导向生成

生成参数说明

输出文件

📊 结果展示

训练损失曲线

HER催化性能分布

稳定性与可合成性评估

生成材料结构

实验指标汇总

💡 创新点

1. 🎯 条件扩散模型

2. 🔄 多任务联合优化

3. 🏭 端到端生成框架

4. 🧬 可合成性预测

⚖ 与Baseline对比

性能对比表

改进说明

关键技术对比

📚 API参考

MaterialDataset

ConditionalDiffusionModel

StructureGenerator

HERCalculator

❓ 常见问题

Q1: 如何处理CUDA内存不足？

Q2: 生成的结构验证失败？

Q3: 如何添加新的元素类型？

Q4: 如何自定义损失函数权重？

📖 参考文献

📄 License

🙏 致谢

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages