Skip to content

wmt/005-wmt-gradient-theory #2

@utterances-bot

Description

@utterances-bot

SameTime WMT 专题:梯度函数决定学习规律 | GrepCode

从 sin(0.76) 到 tanh(3.06) 的距离,不是函数好坏——是梯度方向的自洽性决定了一个模型能走多远。

https://www.grepcode.cn/wmt/005-wmt-gradient-theory.html

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions