René's URL Explorer Experiment

Title: 一文深层解决过拟合 · Issue #29 · aialgorithm/Blog · GitHub

Open Graph Title: 一文深层解决过拟合 · Issue #29 · aialgorithm/Blog

X Title: 一文深层解决过拟合 · Issue #29 · aialgorithm/Blog

Description: 一、过拟合的本质及现象过拟合是指模型只过分地匹配特定训练数据集，以至于对训练集外数据无良好地拟合及预测。其本质原因是模型从训练数据中学习到了一些统计噪声，即这部分信息仅是局部数据的统计规律，该信息没有代表性，在训练集上虽然效果很好，但未知的数据集（测试集）并不适用。 1.1 拟合效果的评估方式通常由训练误差及测试误差（泛化误差）评估模型的学习程度及泛化能力。欠拟合时训练误差和测试误差在均较高，随着训练时间及模型复杂度的增加而下降。在到达一个拟合最优的临界点之后，训...

Open Graph Description: 一、过拟合的本质及现象过拟合是指模型只过分地匹配特定训练数据集，以至于对训练集外数据无良好地拟合及预测。其本质原因是模型从训练数据中学习到了一些统计噪声，即这部分信息仅是局部数据的统计规律，该信息没有代表性，在训练集上虽然效果很好，但未知的数据集（测试集）并不适用。 1.1 拟合效果的评估方式通常由训练误差及测试误差（泛化误差）评估模型的学习程度及泛化能力。欠拟合时训练误差和测试误差在...

X Description: 一、过拟合的本质及现象过拟合是指模型只过分地匹配特定训练数据集，以至于对训练集外数据无良好地拟合及预测。其本质原因是模型从训练数据中学习到了一些统计噪声，即这部分信息仅是局部数据的统计规律，该信息没有代表性，在训练集上虽然效果很好，但未知的数据集（测试集）并不适用。 1.1 拟合效果的评估方式通常由训练误差及测试误差（泛化误差）评估模型的学习程度及泛化能力。欠拟合时训练误差和测试误差在...

Opengraph URL: https://github.com/aialgorithm/Blog/issues/29

X: @github

direct link

Domain: github.com

Hey, it has json ld scripts:

{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"一文深层解决过拟合","articleBody":"# 一、过拟合的本质及现象\r\n过拟合是指模型只过分地匹配特定训练数据集，以至于对训练集外数据无良好地拟合及预测。其本质原因是模型**从训练数据中学习到了一些统计噪声，即这部分信息仅是局部数据的统计规律**，该信息没有代表性，在训练集上虽然效果很好，但未知的数据集（测试集）并不适用。\r\n\r\n\r\n## 1.1 拟合效果的评估方式\r\n![image](https://img-blog.csdnimg.cn/img_convert/76773e515a9b61ec8ce18ddd9463e291.png)\r\n通常由训练误差及测试误差（泛化误差）评估模型的学习程度及泛化能力。\r\n\r\n**欠拟合**时训练误差和测试误差在均较高，随着训练时间及模型复杂度的增加而下降。在到达一个**拟合最优**的临界点之后，训练误差下降，测试误差上升，这个时候就进入了**过拟合**区域。它们的误差情况差异如下表所示：\r\n![image](https://img-blog.csdnimg.cn/img_convert/3c38576e40e120b476d35460953f06dc.png)\r\n\r\n## 1.2 拟合效果的深入分析\r\n\r\n对于拟合效果除了通过训练、测试的误差估计其泛化误差及判断拟合程度之外，我们往往还希望了解它为什么具有这样的泛化性能。统计学常用“偏差-方差分解”（bias-variance decomposition）来分析模型的泛化性能：泛化误差为偏差+方差+噪声之和。\r\n\r\n![image](https://img-blog.csdnimg.cn/img_convert/902b74af15408905e525f257d2af98f2.png)\r\n\r\n**噪声（ε）** 表达了在当前任务上任何学习算法所能达到的泛化误差的下界，即刻画了学习问题本身（客观存在）的难度。\r\n\r\n**偏差（Bias）** 是指用所有可能的训练数据集训练出的所有模型的输出值与真实值之间的差异，刻画了模型的拟合能力。偏差较小即模型预测准确度越高，表示模型拟合程度越高。\r\n![image](https://img-blog.csdnimg.cn/img_convert/d7292412890e915572f51e562bda3bc3.png)\r\n\r\n**方差（Variance）** 是指不同的训练数据集训练出的模型对同预测样本输出值之间的差异，刻画了训练数据扰动所造成的影响。方差较大即模型预测值越不稳定，表示模型(过)拟合程度越高，受训练集扰动影响越大。\r\n![image](https://img-blog.csdnimg.cn/img_convert/8e74a543f7c34a3da65a84dc4d54a738.png)\r\n如下用靶心图形象表示不同方差及偏差下模型预测的差异：\r\n\r\n偏差越小，模型预测值与目标值差异越小，预测值越准确；\r\n\r\n方差越小，不同的训练数据集训练出的模型对同预测样本预测值差异越小，预测值越集中；\r\n![image](https://img-blog.csdnimg.cn/img_convert/12fdec2c9426fe223ad69f60f98fbffe.png)\r\n\r\n**“偏差-方差分解”** 说明，模型拟合过程的泛化性能是由学习算法的能力、数据的充分性以及学习任务本身的难度所共同决定的。\r\n\r\n**当模型欠拟合时**：模型准确度不高（高偏差），受训练数据的扰动影响较小（低方差），其泛化误差大主要由高的偏差导致。\r\n\r\n**当模型过拟合时**：模型准确度较高（低偏差），模型容易学习到训练数据扰动的噪音（高方差），其泛化误差大由高的方差导致。\r\n\r\n![](https://img-blog.csdnimg.cn/img_convert/a2f46ff1e9028f0254f3451b4472c89b.png)\r\n实践中通常欠拟合不是问题，可以通过使用强特征及较复杂的模型提高学习的准确度。\r\n而**解决过拟合**，即如何减少泛化误差，提高泛化能力，通常才是优化模型效果的重点。\r\n\r\n# 二、如何解决过拟合\r\n\r\n## 2.1 解决思路\r\n上文说到学习统计噪声是过拟合的本质原因，而模型学习是以经验损失最小化，现实中学习的训练数据难免有统计噪音的。一个简单的思路，通过提高数据量数量或者质量解决统计噪音的影响：\r\n\r\n- 通过足够的数据量就可以有效区分哪些信息是片面的，然而现实情况数据通常都很有限的。\r\n\r\n- 通过提高数据的质量，可以结合先验知识加工特征以及对数据中噪声进行剔除（噪声如训练集有个“用户编号尾数是否为9”的特征下，偶然有正样本的占比很高的现象，而凭业务知识理解这个特征是没有意义的噪声，就可以考虑剔除）。但这样，一来过于依赖人工，人工智障？二来先验领域知识过多的引入，如果领域知识有误，不也是噪声。 \r\n\r\n当数据层面的优化有限，接下来登场主流的方法——正则化策略。\r\n\r\n在以（可能）增加经验损失为代价，以降低泛化误差为目的，解决过拟合，提高模型泛化能力的方法，统称为正则化策略。\r\n\r\n##  2.2 常见的正则化策略及原理\r\n\u003e本节尝试以不一样的角度去理解正则化策略，欢迎留言交流。\r\n\r\n正则化策略经常解读为对模型结构风险的惩罚，崇尚简单模型。并不尽然！如前文所讲学到统计噪声是过拟合的本质原因，所以模型复杂度容易引起过拟合（只是影响因素）。然而工程中，对于困难的任务需要足够复杂的模型，这种情况缩减模型复杂度不就和“减智商”一样？所以，通常足够复杂且有正则化的模型才是我们追求的。\r\n\r\n机器学习是从训练集经验损失最小化为学习目标，而学习的训练集里面不可避免有统计噪声。除了提高数据质量和数量方法，我们不也可以在模型学习的过程中，给一些指导性的先验假设(即根据一些已知的知识对参数的分布进行一定的假设)，帮助模型更好避开一些“噪声”的信息并关注到本质特征，更好地学习模型结构及参数。这些指导性的先验假设，也就是正则化策略，常见的正则化策略如下：\r\n\r\n### L2 正则化\r\nL2 参数正则化 (也称为岭回归、Tikhonov 正则) 通常被称为权重衰减 (weight decay)，是通过向⽬标函数添加⼀个正则项 Ω(θ) ，使权重更加接近原点，模型更为简单。从贝叶斯角度，L2的约束项可以视为模型参数引入先验的高斯分布（参见Bob Carpenter的 Lazy Sparse Stochastic Gradient Descent for Regularized  ）\r\n![](https://img-blog.csdnimg.cn/img_convert/b248dccacafefc4fb20e027f9bf0e21d.png)\r\n对带L2目标函数的模型参数更新权重，ϵ学习率：\r\n![](https://img-blog.csdnimg.cn/img_convert/8f8893bb99523adcfe06bf73c6de3e97.png)\r\n\r\n从上式可以看出，加⼊权重衰减后会导致学习规则的修改，即在每步执⾏梯度更新前先收缩权重 (乘以 1 − ϵα )，有权重衰减的效果。\r\n\r\n\r\n### L1 正则化\r\nL1 正则化（Lasso回归）是通过向⽬标函数添加⼀个参数惩罚项 Ω(θ)，为各个参数的绝对值之和。从贝叶斯角度，L1的约束项也可以视为模型参数引入拉普拉斯分布。\r\n\r\n![](https://img-blog.csdnimg.cn/img_convert/0b8670200d36e01ef223e785a92380e4.png)\r\n\r\n对带L1目标函数的模型参数更新权重（其中 sgn(x) 为符号函数，取参数的正负号）：\r\n![](https://img-blog.csdnimg.cn/img_convert/8d94fca457b3a56facd667dca0c013cb.png)\r\n可见，在-αsgn(w)项的作用下，  w各元素每步更新后的权重向量都会平稳地向0靠拢，w的部分元素容易为0，造成稀疏性。\r\n\r\n\r\n**对比L1，L2两者**，L2范式约束具有产生平滑解的效果，没有稀疏解的能力，即参数并不会出现很多零。假设我们的决策结果与两个特征有关，L2正则倾向于综合两者的影响（可以看作符合bagging的多释假设），给影响大的特征赋予高的权重；而L1正则倾向于选择影响较大的参数，而尽可能舍弃掉影响较小的那个（ 可以看作符合了“奥卡姆剃刀定律--如无必要勿增实体”的假设）。在实际应用中 L2正则表现往往会优于 L1正则，但 L1正则会压缩模型，降低计算量。\r\n![](https://img-blog.csdnimg.cn/img_convert/5c240e0a41d7c644b357a8de9be2a1fb.png)\r\n\r\n在Keras中，可以使用regularizers模块来在某个层上应用L1及L2正则化，如下代码：\r\n\r\n```\r\nfrom keras import regularizers\r\nmodel.add(Dense(64, input_dim=64,\r\n                kernel_regularizer=regularizers.l1_l2(l1=α1, l2=α2)  # α为超参数惩罚系数\r\n```\r\n\r\n### earlystop\r\nearlystop（早停法）可以限制模型最小化代价函数所需的训练迭代次数，如果迭代次数太少，算法容易欠拟合（方差较小，偏差较大），而迭代次数太多，算法容易过拟合（方差较大，偏差较小），早停法通过确定迭代次数解决这个问题。\r\n![](https://img-blog.csdnimg.cn/img_convert/7fd06be66d25d503383888504b2bcd79.png)\r\nearlystop可认为是将优化过程的参数空间限制在初始参数值 θ0 的小邻域内（Bishop  1995a  和Sjöberg and Ljung 1995 ），在这角度上相当于L2正则化的作用。\r\n\r\n在Keras中，可以使用callbacks函数实现早期停止，如下代码：\r\n\r\n```\r\nfrom keras.callbacks import EarlyStopping\r\n\r\ncallback =EarlyStopping(monitor='loss', patience=3)\r\nmodel = keras.models.Sequential([tf.keras.layers.Dense(10)])\r\nmodel.compile(keras.optimizers.SGD(), loss='mse')\r\nhistory = model.fit(np.arange(100).reshape(5, 20), np.zeros(5),\r\n                    epochs=10, batch_size=1, callbacks=[callback],\r\n                    verbose=0)\r\n\r\n```\r\n### 数据增强\r\n数据增强是提升算法性能、满足深度学习模型对大量数据的需求的重要工具。数据增强通过向训练数据添加转换或扰动来增加训练数据集。数据增强技术如水平或垂直翻转图像、裁剪、色彩变换、扩展和旋转（此外还有生成模型伪造的对抗样本），通常应用在视觉表象和图像分类中，通过数据增强有助于更准确的学习到输入数据所分布的流形（manifold)。\r\n\r\n在keras中，你可以使用ImageDataGenerator来实现上述的图像变换数据增强，如下代码：\r\n\r\n```\r\nfrom keras.preprocessing.image import ImageDataGenerator\r\ndatagen = ImageDataGenerator(horizontal_flip=True)\r\ndatagen.fit(train)\r\n```\r\n\r\n\r\n### 引入噪声\r\n与清洗数据的噪音相反，引入噪声也可以明显增加神经网络模型的鲁棒性（很像是以毒攻毒）。对于某些模型而言，向输入添加方差极小的噪声等价于对权重施加范数惩罚 (Bishop, 1995a,b)。常用有三种方式：\r\n\r\n- 在输入层引入噪声，可以视为是一种数据增强的方法。\r\n\r\n- 在模型权重引入噪声\r\n\r\n这项技术主要用于循环神经网络 (Jim et al., 1996; Graves, 2011)。向网络权重注入噪声，其代价函数等于无噪声注入的代价函数加上一个与噪声方差成正比的参数正则化项。\r\n\r\n- 在标签引入噪声\r\n\r\n原实际标签y可能多少含有噪声，当 y 是错误的，直接使用0或1作为标签，对最大化 log p(y | x)效果变差。另外，使用softmax 函数和最大似然目标，可能永远无法真正输出预测值为 0 或 1，因此它会继续学习越来越大的权重，使预测更极端。使用标签平滑的优势是能防止模型追求具体概率又不妨碍正确分类。如标签平滑 (label smoothing) 基于 k 个输出的softmax 函数，把明确分类 0 和 1 替换成 ϵ /(k−1) 和 1 − ϵ，对模型进行正则化。\r\n![](https://img-blog.csdnimg.cn/img_convert/1eb486e4797170cdb4899b0459148443.png)\r\n\r\n\r\n### 半监督学习\r\n半监督学习思想是在标记样本数量较少的情况下，通过在模型训练中直接引入无标记样本，以充分捕捉数据整体潜在分布，以改善如传统无监督学习过程盲目性、监督学习在训练样本不足导致的学习效果不佳的问题 。\r\n![](https://img-blog.csdnimg.cn/img_convert/71b3a0e13172d19dea62fd8a30dee35f.png)\r\n依据“流形假设——观察到的数据实际上是由一个低维流形映射到高维空间上的。由于数据内部特征的限制，一些高维中的数据会产生维度上的冗余，实际上只需要比较低的维度就能唯一地表示”，无标签数据相当于提供了一种正则化（regularization）,有助于更准确的学习到输入数据所分布的流形（manifold)，而这个低维流形就是数据的本质表示。\r\n\r\n\r\n### 多任务学习\r\n多任务学习(Caruana, 1993) 是通过合并几个任务中的样例（可以视为对参数施加的软约束）来提高泛化的一种方法，其引入一个先验假设：这些不同的任务中，能解释数据变化的因子是跨任务共享的。常见有两种方式：基于参数的共享及基于正则化的共享。\r\n![](https://img-blog.csdnimg.cn/img_convert/68edb20497ea1ec3ff0427165826e007.png)\r\n\r\n额外的训练样本以同样的方式将模型的参数推向泛化更好的方向，当模型的一部分在任务之间共享时，模型的这一部分更多地被约束为良好的值（假设共享是合理的），往往能更好地泛化。\r\n\r\n\r\n\r\n\r\n### bagging\r\nbagging是机器学习集成学习的一种。依据多释准则，结合了多个模型（符合经验观察的假设）的决策达到更好效果。具体如类似随机森林的思路，对原始的m个训练样本进行有放回随机采样，构建t组m个样本的数据集，然后分别用这t组数据集去训练t个的DNN，最后对t个DNN模型的输出用加权平均法或者投票法决定最终输出。\r\n![](https://img-blog.csdnimg.cn/img_convert/b7b8868679334350a82e7f4db59ba7b6.png)\r\n\r\nbagging 可以通过平滑效果降低了方差，并中和些噪声带来的误差，因此有更高的泛化能力。\r\n\r\n\r\n### Dropout\r\n\r\n\r\nDropout是正则化技术简单有趣且有效的方法，在神经网络很常用。其方法是：在每个迭代过程中，以一定概率p随机选择输入层或者隐藏层的（通常隐藏层）某些节点，并且删除其前向和后向连接（让这些节点暂时失效）。权重的更新不再依赖于有“逻辑关系”的隐藏层的神经元的共同作用，一定程度上避免了一些特征只有在特定特征下才有效果的情况，迫使网络学习更加鲁棒(指系统的健壮性)的特征，达到减小过拟合的效果。这也可以近似为机器学习中的集成bagging方法，通过bagging多样的的网络结构模型，达到更好的泛化效果。\r\n![](https://img-blog.csdnimg.cn/img_convert/4d1e4fa51b775fe3e57cc3859652c06f.png)\r\n相似的还有Drop Connect ，它和 Dropout 相似的地方在于它涉及在模型结构中引入稀疏性，不同之处在于它引入的是权重的稀疏性而不是层的输出向量的稀疏性。\r\n\r\n在Keras中，我们可以使用Dropout层实现dropout，代码如下：\r\n\r\n```\r\nfrom keras.layers.core import Dropout\r\n\r\nmodel = Sequential([\r\n Dense(output_dim=hidden1_num_units, input_dim=input_num_units, activation='relu'),\r\n Dropout(0.25)\r\n ])\r\n```\r\n\r\n(end)\r\n---\r\n\r\n文章首发公众号“算法进阶”，更多原创文章敬请关注","author":{"url":"https://github.com/aialgorithm","@type":"Person","name":"aialgorithm"},"datePublished":"2021-11-03T07:52:20.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/29/Blog/issues/29"}

route-pattern	/_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format)
route-controller	voltron_issues_fragments
route-action	issue_layout
fetch-nonce	v2:a465ca4b-e075-4581-01a7-eb3177c865b7
current-catalog-service-hash	81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114
request-id	A6C6:1BA21A:A7D52E:EED223:696A5641
html-safe-nonce	8a73a5a0b09b94979679d9b8f88d1c4e4835eb14d32cc0a36cb8c9b373f14997
visitor-payload	eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBNkM2OjFCQTIxQTpBN0Q1MkU6RUVEMjIzOjY5NkE1NjQxIiwidmlzaXRvcl9pZCI6IjUwMTU2NTk0MTcwMTMyMTI3MzciLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmac	75f740c47ef82d2c2cbadb9a7c394c3b40d3a91cc94e4cf0d82ef3d499e77a2e
hovercard-subject-tag	issue:1043176080
github-keyboard-shortcuts	repository,issues,copilot
google-site-verification	Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url	https://collector.github.com/github/collect
analytics-location	///voltron/issues_fragments/issue_layout
fb:app_id	1401488693436528
apple-itunes-app	app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/aialgorithm/Blog/29/issue_layout
twitter:image	https://opengraph.githubassets.com/3ffd5a24cc76a489ceec8bb630be2a35b6bd4e6dc4a473840d0a91d63b90ce1c/aialgorithm/Blog/issues/29
twitter:card	summary_large_image
og:image	https://opengraph.githubassets.com/3ffd5a24cc76a489ceec8bb630be2a35b6bd4e6dc4a473840d0a91d63b90ce1c/aialgorithm/Blog/issues/29
og:image:alt	一、过拟合的本质及现象过拟合是指模型只过分地匹配特定训练数据集，以至于对训练集外数据无良好地拟合及预测。其本质原因是模型从训练数据中学习到了一些统计噪声，即这部分信息仅是局部数据的统计规律，该信息没有代表性，在训练集上虽然效果很好，但未知的数据集（测试集）并不适用。 1.1 拟合效果的评估方式通常由训练误差及测试误差（泛化误差）评估模型的学习程度及泛化能力。欠拟合时训练误差和测试误差在...
og:image:width	1200
og:image:height	600
og:site_name	GitHub
og:type	object
og:author:username	aialgorithm
hostname	github.com
expected-hostname	github.com
None	3f871c8e07f0ae1886fa8dac284166d28b09ad5bada6476fc10b674e489788ef
turbo-cache-control	no-preview
go-import	github.com/aialgorithm/Blog git https://github.com/aialgorithm/Blog.git
octolytics-dimension-user_id	33707637
octolytics-dimension-user_login	aialgorithm
octolytics-dimension-repository_id	147093233
octolytics-dimension-repository_nwo	aialgorithm/Blog
octolytics-dimension-repository_public	true
octolytics-dimension-repository_is_fork	false
octolytics-dimension-repository_network_root_id	147093233
octolytics-dimension-repository_network_root_nwo	aialgorithm/Blog
turbo-body-classes	logged-out env-production page-responsive
disable-turbo	false
browser-stats-url	https://api.github.com/_private/browser/stats
browser-errors-url	https://api.github.com/_private/browser/errors
release	63c426b30d262aba269ef14c40e3c817b384cd61
ui-target	canary-2
theme-color	#1e2327
color-scheme	light dark

Links:

Skip to content	https://github.com/aialgorithm/Blog/issues/29#start-of-content
	https://github.com/
Sign in	https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Faialgorithm%2FBlog%2Fissues%2F29
GitHub CopilotWrite better code with AI	https://github.com/features/copilot
GitHub SparkBuild and deploy intelligent apps	https://github.com/features/spark
GitHub ModelsManage and compare prompts	https://github.com/features/models
MCP RegistryNewIntegrate external tools	https://github.com/mcp
ActionsAutomate any workflow	https://github.com/features/actions
CodespacesInstant dev environments	https://github.com/features/codespaces
IssuesPlan and track work	https://github.com/features/issues
Code ReviewManage code changes	https://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilities	https://github.com/security/advanced-security
Code securitySecure your code as you build	https://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they start	https://github.com/security/advanced-security/secret-protection
Why GitHub	https://github.com/why-github
Documentation	https://docs.github.com
Blog	https://github.blog
Changelog	https://github.blog/changelog
Marketplace	https://github.com/marketplace
View all features	https://github.com/features
Enterprises	https://github.com/enterprise
Small and medium teams	https://github.com/team
Startups	https://github.com/enterprise/startups
Nonprofits	https://github.com/solutions/industry/nonprofits
App Modernization	https://github.com/solutions/use-case/app-modernization
DevSecOps	https://github.com/solutions/use-case/devsecops
DevOps	https://github.com/solutions/use-case/devops
CI/CD	https://github.com/solutions/use-case/ci-cd
View all use cases	https://github.com/solutions/use-case
Healthcare	https://github.com/solutions/industry/healthcare
Financial services	https://github.com/solutions/industry/financial-services
Manufacturing	https://github.com/solutions/industry/manufacturing
Government	https://github.com/solutions/industry/government
View all industries	https://github.com/solutions/industry
View all solutions	https://github.com/solutions
AI	https://github.com/resources/articles?topic=ai
Software Development	https://github.com/resources/articles?topic=software-development
DevOps	https://github.com/resources/articles?topic=devops
Security	https://github.com/resources/articles?topic=security
View all topics	https://github.com/resources/articles
Customer stories	https://github.com/customer-stories
Events & webinars	https://github.com/resources/events
Ebooks & reports	https://github.com/resources/whitepapers
Business insights	https://github.com/solutions/executive-insights
GitHub Skills	https://skills.github.com
Documentation	https://docs.github.com
Customer support	https://support.github.com
Community forum	https://github.com/orgs/community/discussions
Trust center	https://github.com/trust-center
Partners	https://github.com/partners
GitHub SponsorsFund open source developers	https://github.com/sponsors
Security Lab	https://securitylab.github.com
Maintainer Community	https://maintainers.github.com
Accelerator	https://github.com/accelerator
Archive Program	https://archiveprogram.github.com
Topics	https://github.com/topics
Trending	https://github.com/trending
Collections	https://github.com/collections
Enterprise platformAI-powered developer platform	https://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security features	https://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI features	https://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 support	https://github.com/premium-support
Pricing	https://github.com/pricing
Search syntax tips	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentation	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in	https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Faialgorithm%2FBlog%2Fissues%2F29
Sign up	https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fissues_fragments%2Fissue_layout&source=header-repo&source_repo=aialgorithm%2FBlog
Reload	https://github.com/aialgorithm/Blog/issues/29
Reload	https://github.com/aialgorithm/Blog/issues/29
Reload	https://github.com/aialgorithm/Blog/issues/29
aialgorithm	https://github.com/aialgorithm
Blog	https://github.com/aialgorithm/Blog
Notifications	https://github.com/login?return_to=%2Faialgorithm%2FBlog
Fork 259	https://github.com/login?return_to=%2Faialgorithm%2FBlog
Star 942	https://github.com/login?return_to=%2Faialgorithm%2FBlog
Code	https://github.com/aialgorithm/Blog
Issues 66	https://github.com/aialgorithm/Blog/issues
Pull requests 0	https://github.com/aialgorithm/Blog/pulls
Actions	https://github.com/aialgorithm/Blog/actions
Projects 0	https://github.com/aialgorithm/Blog/projects
Security Uh oh! There was an error while loading. Please reload this page.	https://github.com/aialgorithm/Blog/security
Please reload this page	https://github.com/aialgorithm/Blog/issues/29
Insights	https://github.com/aialgorithm/Blog/pulse
Code	https://github.com/aialgorithm/Blog
Issues	https://github.com/aialgorithm/Blog/issues
Pull requests	https://github.com/aialgorithm/Blog/pulls
Actions	https://github.com/aialgorithm/Blog/actions
Projects	https://github.com/aialgorithm/Blog/projects
Security	https://github.com/aialgorithm/Blog/security
Insights	https://github.com/aialgorithm/Blog/pulse
New issue	https://github.com/login?return_to=https://github.com/aialgorithm/Blog/issues/29
New issue	https://github.com/login?return_to=https://github.com/aialgorithm/Blog/issues/29
一文深层解决过拟合	https://github.com/aialgorithm/Blog/issues/29#top
	https://github.com/aialgorithm
	https://github.com/aialgorithm
aialgorithm	https://github.com/aialgorithm
on Nov 3, 2021	https://github.com/aialgorithm/Blog/issues/29#issue-1043176080
	https://camo.githubusercontent.com/4dccb6b99569a0ff5a11edf4060cceebc9c55d4bd3621103f7c9b785b7a3b490/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f37363737336535313561396236316563386365313864646439343633653239312e706e67
	https://camo.githubusercontent.com/313343f5e8b19eee295265e4344514c38c5ca368fc1f9edde50697c31556b65a/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f33633338353736653430653132306234373664333534363039353366303664632e706e67
	https://camo.githubusercontent.com/9c26c046c164858016d7047f83e98ca5c8fa9bfef3b7e71ce0139fa4df8c9c04/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f39303262373461663135343038393035653532356632353764326166393866322e706e67
	https://camo.githubusercontent.com/182e1e58752257d7489296330f09758cf891b24df6543e8e0c4a124faaa5d922/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f64373239323431323839306539313535373266353165353632626461336263332e706e67
	https://camo.githubusercontent.com/466e99f4aeb3dd447564acb67d3c3ffcbbf70cfa4f024b0d64560158dbd1e011/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f38653734613534336637633334613364613635613834646334643534613733382e706e67
	https://camo.githubusercontent.com/69477c632c1bc96cceeffc962c31cef107dc387a88507867ecb460df44dd36cb/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f31326664656332633934323666653232336164363966363066393866626666652e706e67
	https://camo.githubusercontent.com/06fcc9da578953f63e2b4f19bde3c89539bdf6af0ccce34b6de9d254232cd7f0/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f61326634366666316539303238663032353466333435316234343732633839622e706e67
	https://camo.githubusercontent.com/e86788fb62e7136c2dfc2195b6d59de0677cde2b28548bee6ea2defa70b8000a/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f62323438646363616361666566633466623230653032376639626630653231642e706e67
	https://camo.githubusercontent.com/43b84bdb028fca0aecc8e2f8e3c8e90d20fd8b6470da0dd37980f30f98529a91/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f38663838393362623939353233616463666530366266373363366465336539372e706e67
	https://camo.githubusercontent.com/7f410bce365cad476bf4ab9b26fb433feb2635c244751ae3eb9319579f898b60/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f30623836373032303064333665303165663232336537383561393233383065342e706e67
	https://camo.githubusercontent.com/019a1bd1562d3fb618a14440db34faa3f1a97d34ccff42bc2ea868e5adf75197/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f38643934666361343537623361353666616364363637646361306330313363622e706e67
	https://camo.githubusercontent.com/0f4de596c0c7003d281001465fb1a82fd0eb20e1798f4c76bc57bac75c68cf32/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f35633234306530613431643763363434623335376138646539626532613166622e706e67
	https://camo.githubusercontent.com/2b7106ed666d3278c55d61d95105afc39c10bf0e2c938d60a39b67c180b8810f/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f37666430366265363664323564353033333833383838353034623262636437392e706e67
	https://camo.githubusercontent.com/e5d3cff57b88e4933195db31d1fd7a7632fd238ea9f524000eadb37ab870798d/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f31656234383665343739373137306364623438393962303435393134383434332e706e67
	https://camo.githubusercontent.com/adf6fb8f0967c18941f2613caea6ae8ed52427855b7dbdb93ace82c144f365fa/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f37316233613065313331373264313964656136326664386133306465653335662e706e67
	https://camo.githubusercontent.com/120a5248510c6cdbb79b4a3a305a18888d4e718e7aa21e64f9cc75af09064f0b/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f36386564623230343937656131656333666630343237313635383236653030372e706e67
	https://camo.githubusercontent.com/47cc9e2ddb2238713fdc7d12156e230c9d20be90780cae5376d165270a3e9244/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f62376238383638363739333334333530613832653766346462353962613762362e706e67
	https://camo.githubusercontent.com/0472004e47895e5c98cc9bf8d37cf52de5c3e6bd7fd4f344e24773ce357c56ad/68747470733a2f2f696d672d626c6f672e6373646e696d672e636e2f696d675f636f6e766572742f34643165346661353162373735666533653537636333383539363532633036662e706e67
	https://github.com
Terms	https://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacy	https://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Security	https://github.com/security
Status	https://www.githubstatus.com/
Community	https://github.community/
Docs	https://docs.github.com/
Contact	https://support.github.com?tags=dotcom-footer

Viewport: width=device-width

URLs of crawlers that visited me.