Think of it this way. Layers 46 through 52 aren’t seven workers doing the same job. They’re seven steps in a recipe. Layer 46 takes the abstract representation and performs step one of some cognitive operation — maybe decomposing a complex representation into subcomponents. Layer 47 takes that output and performs step two — maybe identifying relationships between the subcomponents. Layer 48 does step three, and so on through layer 52, which produces the final result.
data, but I showed it
— Mark M (@offlinemark) July 22, 2020。业内人士推荐新收录的资料作为进阶阅读
另外你也知道的,模型极不擅长数学,我给他几个样本字符,他「寻找规律」但一直告诉我没有规律,一定是查表得到的。于是我找了更多样本,甚至码着 CJK 码位搞出来了好几排字,让机器渲染出来、拍照,用 Gemini 的 Canvas 写了两个工具,造了一大堆的样本给他搜。
。新收录的资料对此有专业解读
luvlyasabeeпользовательница Reddit。关于这个话题,新收录的资料提供了深入分析
Фото: Valentyn Ogirenko / Reuters