- 主页 > 生活百科 > >
ChatGPT/InstructGPT详解( 六 )
^Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. and Sutskever, I., 2019. Language models are unsupervised multitask learners. *OpenAI blog*, *1*(8), p.9. https://life-extension.github.io/2020/05/27/GPT%E6%8A%80%E6%9C%AF%E5%88%9D%E6%8E%A2/language-models.pdf ^Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan et al. “Language models are few-shot learners.” *arXiv preprint arXiv:2005.14165* (2020). https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf ^Wei, Jason, et al. "Finetuned language models are zero-shot learners." *arXiv preprint arXiv:2109.01652* (2021). https://arxiv.org/pdf/2109.01652.pdf ^Christiano, Paul F., et al. "Deep reinforcement learning from human preferences." *Advances in neural information processing systems* 30 (2017). https://arxiv.org/pdf/1706.03741.pdf ^Schulman, John, et al. "Proximal policy optimization algorithms." *arXiv preprint arXiv:1707.06347* (2017). https://arxiv.org/pdf/1707.06347.pdf?
推荐阅读
-
-
情感调解|小姑子借了30万赖着不还,上门讨要惹恼婆婆,差点结束8年婚姻
-
影迷宝爸给女儿取名“子怡”,还沾沾自喜,媳妇发飙倒着念试试
-
-
-
岳云鹏|岳云鹏雷佳音“极挑”抱团《未知的餐桌》变跑挑相争
-
-
藏红花泡水喝的功效,藏红花泡水喝的功效与作用及禁忌
-
-
-
-
工人日报|中国冰淇淋市场总量超千亿元 还有哪些机会可挖掘?
-
北青网综合|硬核!路边消防栓爆裂狂喷水,小伙一屁股坐下,人肉压水花
-
『iPhone』2000预算手机怎么选,96%以上的人都会选择这四款
-
-
特朗普:何时由媒体宣布下任总统?-特朗普还有戏吗-美国大选2020结果公布时间
-
[神奇的老外]澳洲虐待狂把折磨年轻女子当娱乐 逼其吞食呕吐物 用丙酮烟头烧她
-
茜茜看星座|格林为詹姆斯回怼皮尔斯,事实上,格林和詹姆斯的关系比想象中好
-
-
都市民生汇|被阻后用英文回怼,惹怒网友:装什么外国人,女子地铁乱吐瓜子