<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Deep Learning - Tag - mywebsite</title><link>https://steven-yl.github.io/mywebsite/tags/deep-learning/</link><description>Deep Learning - Tag - mywebsite</description><generator>Hugo -- gohugo.io</generator><language>zh-CN</language><managingEditor>steven@gmail.com (Steven)</managingEditor><webMaster>steven@gmail.com (Steven)</webMaster><copyright>This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.</copyright><lastBuildDate>Fri, 03 Apr 2026 00:00:00 +0800</lastBuildDate><atom:link href="https://steven-yl.github.io/mywebsite/tags/deep-learning/" rel="self" type="application/rss+xml"/><item><title>Common Normalization Methods in Deep Learning</title><link>https://steven-yl.github.io/mywebsite/norm/</link><pubDate>Wed, 01 Apr 2026 10:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/norm/</guid><description>Common normalization methods in deep learning</description></item><item><title>KL Divergence and the Generalized KL Loss in Discrete Flow Matching</title><link>https://steven-yl.github.io/mywebsite/kl_div/</link><pubDate>Wed, 25 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/kl_div/</guid><description>This article ties together several core concepts around KL divergence and gives an intuitive explanation of the generalized KL loss in discrete flow matching, with a PyTorch implementation example.</description></item><item><title>Loss Functions: A Systematic Overview</title><link>https://steven-yl.github.io/mywebsite/loss_type/</link><pubDate>Wed, 25 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/loss_type/</guid><description>These notes cover mainstream loss functions from a task-oriented perspective, including classic methods, modern variants, and practical combination strategies, for quick reference and selection.</description></item><item><title>PyTorch LR Curves</title><link>https://steven-yl.github.io/mywebsite/lr_function/</link><pubDate>Tue, 24 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/lr_function/</guid><description>Learning-rate curve plots</description></item><item><title>PyTorch Activation Functions</title><link>https://steven-yl.github.io/mywebsite/active_function/</link><pubDate>Tue, 24 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/active_function/</guid><description>This article summarizes activation functions such as Sigmoid, Tanh, ReLU, GELU, and Swish, with grouped plots and an overview plot.</description></item><item><title>PyTorch Distributed Training and Tooling Technical Guide</title><link>https://steven-yl.github.io/mywebsite/distributed_training_guide/</link><pubDate>Thu, 12 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/distributed_training_guide/</guid><description>From process-group initialization, DDP wrapping, data sharding, and collective communication to Lightning integration: a comprehensive guide to using PyTorch distributed training correctly in single-node multi-GPU and multi-node multi-GPU settings.</description></item><item><title>PyTorch Dataset System Technical Guide</title><link>https://steven-yl.github.io/mywebsite/pytorch_dataset_guide/</link><pubDate>Thu, 12 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/pytorch_dataset_guide/</guid><description>Covers map-style and IterableDataset, all built-in Dataset extensions, graph data and HF datasets, typical project extension patterns, the division of responsibility between padding and collate, and how datasets connect to DataLoader.</description></item><item><title>PyTorch Weight Initialization Methods</title><link>https://steven-yl.github.io/mywebsite/net_init/</link><pubDate>Thu, 12 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/net_init/</guid><description>A comprehensive comparison of deep learning weight initialization methods: principles, derivations, pros and cons, and applicable scenarios, with PyTorch code examples and initialization best practices for Transformer architectures.</description></item><item><title>PyTorch DataLoader Technical Deep Dive</title><link>https://steven-yl.github.io/mywebsite/dataloader_guide/</link><pubDate>Thu, 12 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/dataloader_guide/</guid><description>Explains DataLoader responsibilities along three threads: index flow, sample fetching, and batching, covering Sampler, collate_fn, num_workers, pin_memory, and the connection to Dataset.</description></item><item><title>PyTorch Model Training Technical Guide: Solvers, Parameter Configuration, and the Training Loop</title><link>https://steven-yl.github.io/mywebsite/training_solver_guide/</link><pubDate>Thu, 12 Mar 2026 00:00:00 +0800</pubDate><author><name>Steven</name><uri>https://github.com/steven-yl</uri></author><guid>https://steven-yl.github.io/mywebsite/training_solver_guide/</guid><description>From overview to chapters: a full walkthrough of Optimizer/SGD/Adam/AdamW, the LRScheduler family, param_groups, gradient accumulation and clipping, loss selection, and practical learning-rate and batch-size configuration.</description></item></channel></rss>