深度学习-MuQYY的博客

更新

浏览

Flow Matching的数学原理

这篇文章介绍了Flow Matching作为一种生成模型训练方法，将其视为扩散模型的更通用形式。其核心思想是将数据视为在流场中运动的粒子，通过学习一个与时间相关的向量场来引导粒子从先验分布移动...

MuQYY8个月前

0357

Transformer详解

Transformer 模型详解 1. Transformer 概览 2017 年，Google 在论文 Attention is All You Need 中提出了 Transformer 模型。Transformer 使用了 Self-Attention（自注意力）机制，取代了在 NL...

MuQYY2年前

0900

具有ID信息的文本图像对数据集制作

1. 图像下载（Image Downloading）: 首先，列出了一个名人名单，这些名单可以从VoxCeleb和VGGFace等公开的名人面部数据集中获取。根据名单，使用搜索引擎爬取数据，大约为每个名字下载100张图...

MuQYY2年前

0900

论文笔记④DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

文献基本信息文献名称: DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation 期刊杂志: CVPR 2023 研究类型类型: Research Article 文献基本内容研究背...

MuQYY2年前

01360

论文笔记⑥Imagic: Text-Based Real Image Editing with Diffusion Models

文献基本信息文献名称: Imagic: Text-Based Real Image Editing with Diffusion Models 期刊杂志: CVPR 2023 研究类型类型: Research Article 文献基本内容研究背景: 大规模文本到图像模型展...

MuQYY2年前

01540

论文笔记⑤SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations

文献基本信息文献名称: SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations 研究类型类型: 研究文章文献基本内容研究背景: 生成模型可以从随机噪声中创建...

MuQYY2年前

01360

论文笔记③T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models

文献基本信息文献名称: T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models 期刊杂志: AAAI 研究类型类型: Research Article 文献基...

MuQYY2年前

01270