一、基本上来看大概由5个评价参数:

  1. Validity(有效性)
  2. Uniqueness(独一无二性)
  3. Novelty(新颖性)
  4. KL divergence(KL散度)
  5. FCD(Fréchet ChemNet 距离)

KL-divergence 和 FCD是为了测量生成的分子和训练集、测试集的分子之间的相似性;

validity、FCD、KL-散度 三者的得分高度一致,它们三者的得分与“ novelty ”呈负相关;

二、使用的基准模型为:

  1. Random sampler: baseline model;
  2. SMILES LSTM: Long-Short-Term Memory DNN for SMILES strings;
  3. Graph MCTS: Graph-based Monte Carlo Tree Search;
  4. AAE: Adversarial AutoEncoder;
  5. ORGAN: Objective-Reinforced Generative Adversarial Network;
  6. VAE: Variational AutoEncoder;
  7. FASMIFRA: Fast Assembly of SMILES Fragments (proposed method);
  8. Negative control: FASMIFRA without extended bond typing (any fragment can be connected to any other fragment)

1、Molecular De Novo Design through Deep Reinforcement Learning

code:https://github.com/MarcusOlivecrona/REINVENT
paper:https://arxiv.org/abs/1704.07555

Again the consequence of removing the actives from the Prior was a threefold reduction in the probability of generating a test set active: the difference between the two Priors is directly mirrored by their corresponding Agents. Apart from generating a higher fraction of structures that are predicted to be active, both Agents also generate a significantly higher fraction of valid SMILES (Table 4). Sequences that are not valid SMILES receive a score of -1, which means that the scoring function naturally encourages valid SMILES.

2、Masked graph modeling for molecule generation(2021)【nature communications】

paper:Masked graph modeling for molecule generation | Nature Communications
code:GitHub - nyu-dl/dl4chem-mgm

In this work, we evaluate our approach on two popular molecular graph datasets, QM9 、 ChEMBL, using a set of five distribution-learning metrics introduced in the GuacaMol benchmark: the validity, uniqueness, novelty, KL-divergence (KLD) and Fréchet ChemNet Distance (FCD) scores.

(1)、在QM9数据集上使用“  masked graph model  ”得出的结果的基准指标之间的Spearman相关系数。

Table 1. Spearman’s correlation coefficient(spearman相关系数) between benchmark metrics for results using the masked graph model on the QM9 dataset.

(2)、在QM9数据集上使用LSTM、Transformer Small和Transformer Regular的结果的基准指标之间的Spearman相关系数。

Table 2.  Spearman’s correlation coefficient between benchmark metrics for results using LSTM, Transformer Small and Transformer Regular on the QM9 dataset.

(3)、QM9上的分布结果。CharacterVAE、GrammarVAE、GraphVAE和MolGAN结果取自Cao和Kipf。

Table 3. Distributional results on QM9. CharacterVAE, GrammarVAE, GraphVAE and MolGAN results are taken from Cao and Kipf.

(4)、ChEMBL上的分布结果。LSTM, Graph MCTS, AAE, ORGAN and VAE结果取自Brown等人

Table 4. Distributional results on ChEMBL. LSTM, Graph MCTS, AAE, ORGAN and VAE (with a bidirectional GRU as encoder and autoregressive GRU as decoder) results are taken from Brown et al..

支撑材料:

Supplementary Table 1: Effect of varying masking rate and graph initialization on the benchmark results for our masked graph model on QM9 and ChEMBL.

3、Generative Adversarial Networks for De Novo Molecular Design

paper:https://onlinelibrary.wiley.com/doi/abs/10.1002/minf.202100045

Table 1 summarizes the results of the baseline models and SMILES-MaskGAN for the distribution-learning benchmarks. The baseline models listed in the Table 1 are available at www.github.com/BenevolentAI/guacamol_baselines . The random sampler samples 10,000 molecules from the ChEMBL training dataset with replacement and computes the scores.
The novelty score is zero because it only samples the molecules present in the training data.
Based on the KL divergence score and FCD, the results demonstrate that the SMILES LSTM model reproduces a distribution similar to that of the ChEMBL training dataset. However,
in terms of validity, uniqueness, and novelty, it scored lower than the Graph Monte Carlo Tree Search (Graph MCTS) and the proposed SMILES-MaskGAN.

Although the Graph MCTS model[49] exhibits high scores for validity, uniqueness, and novelty, it fails to reproduce the molecular distribution of the training data. The adversarial autoencoder (AAE) model[50] exhibited good scores for validity, uniqueness, and novelty. However, the distribution of the generated molecules was not close to that of the training data. The variational autoencoder (VAE) model[16] does not have the best score for all benchmarks; however, it yields relatively acceptable scores compared with the other models. In addition, ORGAN,[24] which combines the reinforcement learning method with the GAN architecture, obtains the lowest scores on all benchmarks. Although it has relatively acceptable scores for uniqueness and novelty, it only has a slight similarity with the distribution of training data and does not produce more than half of the valid molecules.

4、Molecular generation by Fast Assembly of (Deep)SMILES fragments【2021】

paper:Molecular generation by Fast Assembly of (Deep)SMILES fragments | Journal of Cheminformatics | Full Text

code:GitHub - UnixJunkie/FASMIFRA: Molecular Generation by Fast Assembly of SMILES Fragments

Random sampler: baseline model;
SMILES LSTM: Long-Short-Term Memory DNN for SMILES strings;
Graph MCTS: Graph-based Monte Carlo Tree Search;
AAE: Adversarial AutoEncoder;
ORGAN: Objective-Reinforced Generative Adversarial Network;
VAE: Variational AutoEncoder;
FASMIFRA: Fast Assembly of SMILES Fragments (proposed method);
Negative control: FASMIFRA without extended bond typing (any fragment can be connected to any other fragment)

生成的SMILES以及对应的图像的评价指标【2】相关推荐

  1. 【pytorch】torch.meshgrid()==>常用于生成二维网格,比如图像的坐标点

    np.meshgrid()函数常用于生成二维网格,比如图像的坐标点. x1 ,y1 = torch.meshgrid(x,y) 输入参数: 参数是两个,第一个参数我们假设是x,第二个参数假设就是y   ...

  2. 深度图像修复的回顾和改进:使用生成对抗网络基于Patch的图像修复

    点击上方"AI公园",关注公众号,选择加"星标"或"置顶" 作者:Chu-Tak Li 编译:ronghuaiyang 导读 相比于之前,在 ...

  3. 生成多频外差的光栅图像【Matlab】

    背景介绍 在matlab中生成多频外差的光栅图像,其中3种频率的选择参考如下文献: Liu S , Feng W , Zhang Q , et al. Three-dimensional shape ...

  4. 文本自动生成研究进展与趋势之图像到文本的生成

    图像到文本的生成 1 国际研究现状 图像到文本的生成技术是指根据给定的图像生成描述该图像内容的自然语言文本,例如新闻图像附带的标题.医学图像附属的说明.儿童教育中常见的看图说话.以及用户在微博等互联网 ...

  5. 图像相似性评价指标(SSIM、MSE、PSNR)简单介绍及计算方法

    图像相似性评价指标 图像相似性评价指标 SSIM(结构相似性) MSE(均方误差) PSNR(峰值信噪比 ) 使用python进行计算 图像相似性评价指标 对于图像生成质量的通用性评价指标主要有SSI ...

  6. 图像的评价指标之PSNR——峰值信噪比

    图像的评价指标之PSNR--峰值信噪比 文章目录: 以及Python的实现 参考: https://blog.csdn.net/szfhy/article/details/49615833 https ...

  7. 图像的评价指标之SSMI——结构相似性

    图像的评价指标之SSMI--结构相似性 文章目录: https://blog.csdn.net/chaipp0607/article/details/70158835 https://zhuanlan ...

  8. 图像相似性评价指标SSIM/PSNR

    图像相似性评价指标SSIM/PSNR 1.结构相似性指标SSIM 参考自维基百科SSIM 1.1介绍 结构相似性指标(英文:structural similarity index,SSIM index ...

  9. delphi 生成 超大量xml_用OpenCV4实现图像的超分别率

    用OpenCV4实现图像的超分别率 本实验原文链接:· f="https://arxiv.org/pdf/1807.06779.pdf">https://arxiv.org/ ...

  10. 图像生成技术发展趋势_如何管理图像和视频中的颜色:最新趋势和最佳做法

    图像生成技术发展趋势 During the last world football cup, few people knew that only the flags of Argentina and ...

最新文章

  1. studio2008 无法显示该网页
  2. 我在交大的数学建模经验!
  3. 腾讯云Ubuntu挂载硬盘空间
  4. 大道至简:软件工程实践者的思想——第七、八章感想
  5. ASP.NET 使用 System.Web.Script.Serialization 解析 JSON (转)
  6. (chap4 Http状态码) 5XX
  7. Python和Java结合的项目实战_[项目实战] Python高级教程项目实战篇 Python和Java结合的项目实战 视频教程 [...
  8. setnx是原子操作吗_Redis面试七连问,你能扛得住吗?
  9. java+tomcat(apr,native)
  10. 使用Asp.Net MVC开发兼职文章系统
  11. 基于算法的建模--小结
  12. 容器技术学习系列文章总目录
  13. 【SICP练习】150 练习4.6
  14. 【Uniapp】Uniapp 实现 App 版本自动升级
  15. python交通流预测算法_使用KNN方法进行的短时交通流预测和结果分析
  16. OpenSees二次开发实例01
  17. IDEA 运行时出现 too long 异常
  18. 两招让你成为牛X的T型人才
  19. 使用pe系统 修复引导
  20. 【动态规划DP】跳木桩

热门文章

  1. 大数据,云计算 核心体系
  2. 新版本Google在安装外部扩展插件的时候会提示“程序包”无效的解决方法
  3. 不一样的类小草图形写法
  4. 8255芯片+8254芯片实现硬延时跑马灯
  5. vue用mand-mobile ui做交易所移动版实战示例
  6. 疫情防控加速数字化,亚洲普惠金融迎来大发展
  7. Mathematica/偏导数/最小二乘法(线性回归)
  8. 华为S5700交换机登录认证配置学习笔记
  9. 动态规划算法解Travelling Salesman Problem(TSP)问题
  10. 订餐系统实验报告java_订餐系统的java实训报告.ppt