Deep Learning - MuQYY's Blog - Page 2

Why the VAE loss is usually framed as maximizing the ELBO

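For a data point $x_i$ and an approximate posterior $q_\phi(z|x_i)$, we have $$ KL(q_\phi(z|x_i) \,\|\, p_\theta(z|x_i)) = -ELBO + \log p_\theta(x_i) $$ and therefore $$ KL(q_\phi(z|x_i) \,\|\, p_\theta(z|x_i)) + ELBO = \log p_\theta(x_i) $$ The log evidence $\log p_\theta(x_i)$ does not depend on $q_\phi$, so with respect to $q_\phi$ it is a constant. Maximizing the ELBO is therefore equivalent to minimizing the KL divergence between $q_\phi(z|x_i)$ and the true posterior $p_\theta(z|x_i)$.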
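
The identity can be checked numerically on a toy model. The sketch below assumes a discrete latent $z$ with three states; the tables `prior`, `likelihood`, and `q` are made-up numbers chosen for illustration and do not come from the post.

```python
import math

# Hypothetical toy model: z has three states, x_i is one fixed observation.
prior = [0.5, 0.3, 0.2]          # p(z)
likelihood = [0.10, 0.40, 0.70]  # p(x_i | z) evaluated at the observed x_i
q = [0.2, 0.3, 0.5]              # an arbitrary variational distribution q(z | x_i)


def log_evidence(prior, likelihood):
    """log p(x_i) = log sum_z p(z) p(x_i | z)."""
    return math.log(sum(p * l for p, l in zip(prior, likelihood)))


def elbo(q, prior, likelihood):
    """ELBO = E_q[log p(x_i, z) - log q(z | x_i)]."""
    return sum(qz * (math.log(p * l) - math.log(qz))
               for qz, p, l in zip(q, prior, likelihood))


def kl_to_posterior(q, prior, likelihood):
    """KL(q(z | x_i) || p(z | x_i)), using the exact posterior from Bayes' rule."""
    evidence = sum(p * l for p, l in zip(prior, likelihood))
    posterior = [p * l / evidence for p, l in zip(prior, likelihood)]
    return sum(qz * math.log(qz / pz) for qz, pz in zip(q, posterior))


# KL + ELBO should match log p(x_i) up to floating-point error.
gap = log_evidence(prior, likelihood) - (
    elbo(q, prior, likelihood) + kl_to_posterior(q, prior, likelihood))
print(abs(gap) < 1e-12)
```

Replacing `q` with the exact posterior drives the KL term to zero, at which point the ELBO equals $\log p_\theta(x_i)$; any other `q` gives a strictly smaller ELBO.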

MuQYY, 2 years ago

Deep Learning Note 40 Transformer

    import math
    import torch
    import pandas as pd
    from torch import nn
    from d2l import torch as d2l

    # Position-wise feed-forward network (in practice just a two-layer fully connected network)
    class PositionWiseFFN(nn.Module):
        de...
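
The excerpt above is cut off, so the following is not the post's code. It is a minimal dependency-free sketch of what "position-wise" means: the same two-layer network is applied to each position of the sequence independently. The helper names (`linear`, `relu`, `random_matrix`) and the sizes `d_model` and `d_hidden` are assumptions made for illustration.

```python
import random


def linear(x, weight, bias):
    """y = W x + b for a single vector x; weight is a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def position_wise_ffn(sequence, w1, b1, w2, b2):
    """Apply the same two-layer network to every position independently."""
    return [linear(relu(linear(x, w1, b1)), w2, b2) for x in sequence]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d_model, d_hidden = 4, 8  # hypothetical sizes
w1, b1 = random_matrix(d_hidden, d_model, rng), [0.0] * d_hidden
w2, b2 = random_matrix(d_model, d_hidden, rng), [0.0] * d_model

sequence = random_matrix(3, d_model, rng)  # 3 positions, each a d_model vector
output = position_wise_ffn(sequence, w1, b1, w2, b2)
print(len(output), len(output[0]))
```

Because the weights are shared and no position looks at another, feeding a single position through the network gives the same vector as the corresponding row of the full output.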

Deep Learning Note 30 Implementing a Recurrent Neural Network (RNN) from Scratch

    import math
    import torch
    from torch import nn
    from torch.nn import functional as F
    from d2l import torch as d2l

    batch_size, num_steps = 32, 35
    train_iter, vocab = d2l.load_data_tim...

Paper Notes ① High-Resolution Image Synthesis with Latent Diffusion Models

Basic information. Title: High-Resolution Image Synthesis with Latent Diffusion Models; Venue: CVPR 2022; Type: Research Article. Basic content. Research background: Image synthesis is computer vision's...

Deep Learning Note 39 Multi-Head Attention

    import math
    import torch
    from torch import nn
    from d2l import torch as d2l

    # Scaled dot-product attention
    class DotProductAttention(nn.Module):
        def __init__(self, dropout, **kwargs):
            super(DotPr...
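
The excerpt stops before the forward pass, so the following is an independent sketch rather than the post's implementation. It computes scaled dot-product attention, softmax(QKᵀ/√d)V, on plain Python lists, with no batching, masking, or dropout; the function names and the example vectors are assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for softmax(Q K^T / sqrt(d)) V."""
    d = len(queries[0])
    outputs, all_weights = [], []
    for q in queries:
        # One score per key, scaled by sqrt(d) to keep magnitudes stable.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


# Hypothetical example: one query, two key-value pairs.
queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0], [20.0]]
outputs, weights = dot_product_attention(queries, keys, values)
print(outputs, weights)
```

The query aligns with the first key, so the first value receives the larger weight. Multi-head attention runs several such attention computations in parallel on learned projections and concatenates the results.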

Deep Learning Note 29 Natural Language Statistics and Reading Long Sequence Data

1. Natural language statistics

    import random
    import torch
    from d2l import torch as d2l

    tokens = d2l.tokenize(d2l.read_time_machine())
    # Because each text line is not necessarily a sentence or a paragraph, all of the...
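
As a rough illustration of the kind of statistics involved, the sketch below counts unigram and bigram frequencies with the standard library. It assumes the tokenised lines are flattened into one token list; the three short `lines` are a made-up stand-in for the real corpus, and the variable names are hypothetical.

```python
from collections import Counter

# Hypothetical tokenised lines standing in for the real corpus.
lines = [
    ["the", "time", "machine"],
    ["the", "time", "traveller"],
    ["the", "machine"],
]

# Flatten the lines into a single token sequence.
corpus = [token for line in lines for token in line]

# Unigram frequencies.
unigrams = Counter(corpus)

# Bigram frequencies: pair each token with its successor.
bigrams = Counter(zip(corpus[:-1], corpus[1:]))

print(unigrams.most_common(2))
print(bigrams.most_common(1))
```

Flattening means some bigrams span a line boundary, which is acceptable when lines are not sentence units in the first place.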

Paper Notes ② Adding Conditional Control to Text-to-Image Diffusion Models

Basic information. Title: Adding Conditional Control to Text-to-Image Diffusion Models; Venue: ICCV 2023; Type: Research Article. Basic content. Research background: Text-to-image diffusion models...

Deep Learning Note 38 Seq2Seq with Attention

    import torch
    import torch.nn as nn
    from d2l import torch as d2l

    class AttentionDecoder(d2l.Decoder):
        """Basic interface for decoders with an attention mechanism"""
        def __init__...

Paper Notes ③ T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models

Basic information. Title: T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models; Venue: AAAI; Type: Research Article. Basic...

Deep Learning Note 37 Attention Scores

    import math
    import torch
    from torch import nn
    from d2l import torch as d2l

    # Masked softmax operation
    def masked_softmax(X, valid_lens):
        """Mask elements on the last axis to perform sof...
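
The function body is truncated in the excerpt, so the following is only a sketch of the idea on a single list of scores, not the post's tensor version. It assumes the common trick of overwriting masked positions with a large negative number before the softmax; the default `mask_value` and the example scores are assumptions.

```python
import math


def masked_softmax(scores, valid_len, mask_value=-1e6):
    """Softmax over `scores`, ignoring every entry at index >= valid_len."""
    # Overwrite masked positions so their exponentials underflow to 0.
    masked = [s if i < valid_len else mask_value
              for i, s in enumerate(scores)]
    m = max(masked)
    exps = [math.exp(s - m) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical example: four scores, only the first two are valid.
weights = masked_softmax([1.0, 2.0, 3.0, 4.0], valid_len=2)
print(weights)
```

The masked positions receive zero weight and the remaining weights still sum to one, which is what lets attention ignore padding tokens.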