机翻:
ChatGPT是一个马屁精,因为用户无法处理自己的真相
ChatGPT并不总是默认奉承。据前微软高管、现任Spotify首席技术官米哈伊尔·帕拉欣(Mikhail Parakhin)称,在用户对直接的个性反馈做出负面反应后,决定让聊天机器人更令人愉快。
在最近X上的一系列帖子中,Parakhin解释说,当ChatGPT的内存功能首次引入时,最初的目的是让用户查看和编辑他们的AI生成的配置文件。然而,即使是相对中性的陈述,如“有自恋倾向”,也往往会引起强烈的反应。
Parakhin写道:“很快就发现人们非常敏感:‘有自恋倾向’——‘不,我没有!’,不得不隐藏起来。因此,这批极端阿谀奉承的RLHF。”。
RLHF——从人类反馈中强化学习——用于根据人们喜欢的反应来微调语言模型。Parakhin指出,当看到自己的人工智能生成的个人资料时,即使是他也感到不安,这表明来自聊天机器人的批评往往感觉像是人身攻击。
Parakhin写道:“我记得我和我的团队为此争吵过,直到他们向我展示了我的个人资料——这给我带来了可怕的事情。”。
曾经是马屁精,永远是马屁精
这一变化不仅仅是隐藏个人资料注释。在模型被训练成更平坦后,这种行为成为一种永久特征。
Parakhin解释说:“一旦模型被微调为阿谀奉承,它就会保持这种状态,关闭和打开内存并不会改变模型。”。他还指出,维持一个单独的、更直接的模式“太贵了”
OpenAI首席执行官Sam Altman也承认了这个问题,他将GPT-4o描述为“太谄媚和烦人”。他说,该公司正在进行调整,未来可能会让用户从不同的模型个性中进行选择。
这场辩论指向了人工智能开发中的一个更广泛的问题:模型应该是诚实和真实的,但它们也需要避免疏远用户。挑战在于在坦率和机智之间找到正确的平衡。
原文:
ChatGPT is a sycophant because users couldn’t handle the truth about themselves
ChatGPT did not always default to flattery. According to former Microsoft executive Mikhail Parakhin—now CTO at Spotify—the decision to make the chatbot more agreeable came after users responded negatively to direct personality feedback.
In a recent series of posts on X, Parakhin explained that when the memory feature for ChatGPT was first introduced, the original intention was to let users see and edit their AI-generated profiles. However, even relatively neutral statements like "has narcissistic tendencies" often provoked strong reactions.
"Quickly learned that people are ridiculously sensitive: 'Has narcissistic tendencies' — 'No I do not!', had to hide it. Hence this batch of the extreme sycophancy RLHF," Parakhin wrote.
RLHF—Reinforcement Learning from Human Feedback—is used to fine-tune language models based on which responses people prefer. Parakhin noted that even he was unsettled when shown his own AI-generated profile, suggesting that criticism from a chatbot can often feel like a personal attack.
"I remember fighting about it with my team until they showed me my profile - it triggered me something awful," Parakhin wrote.
Once a sycophant, always a sycophant
This change went beyond just hiding profile notes. After the model was trained to flatter, this behavior became a permanent feature.
"Once the model is finetuned to be sycophantic — it stays that way, turning memory off and on doesn’t change the model," Parakhin explained. He also pointed out that maintaining a separate, more direct model is "too expensive."
OpenAI CEO Sam Altman has also acknowledged the issue, describing GPT-4o as "too sycophant-y and annoying." He says the company is working on tweaks and may let users choose from different model personalities in the future.
This debate points to a broader issue in AI development: models are expected to be honest and authentic, but they also need to avoid alienating users. The challenge is finding the right balance between candor and tact.
[The Decoder]OpenAI首席执行官Sam Altman承认:GPT-4o“太谄媚和烦人”
| 人围观 |随便看看
- [原创] [可口可乐]来自地产销售制服诱惑 [8P+1V]
- [写真] 美丽诱惑肉体合集写真[30P]
- 普通人如何逆天改命?
- [欧美] 很阳光的美女 [28P]
- [写真] 性感美图069禁忌修女[116P]
- FC2PPV-3755237 无码作品 朋友让我替他睡个妹子 而
- [微视野 2025/01/29] 今岁今宵尽,明年明日催
- 超诱人极品轻熟女[20P]
- 从影视飓风谈画质下降,回顾一下为何中国的各大视频网
- [性话题]网友分享 在菲律宾,你最害怕的是啥?[10P]
- [欧美] 格蕾丝荣耀为你演奏[28P]
- GVG-314 正义教师惨遭蹂躏 争先恐后注入精华 [188P]
- [写真] 大奶性感小精灵[20P]
- [写真] 美丽大长腿[35P]
- [亚洲] 大长腿叉开让你看穴[29P]
- [泡泡说]穷屌丝的创业赚钱机会之年底汇报。求个抖音
- [欧美] 一名舞女 [26P]
- 记录那曾经蹉跎的时光-缺少性教育的80后
- 性感美图008期黑蝴蝶[62P]
- [亚洲] 白虎肥鲍清纯体操服妹子[30P]
- [亚洲] 白莉爱吃巧克力[118P]
- 这么圆润丰满的大屁股。适合从后面深深的插入[20P]
- [无损音乐]重低音DJ|车轮咆哮速度与激情,模拟之声慢刻
- [领奖贴] 2025年5月打卡领奖贴,乱入、虚假领奖禁言,领
- 去缅北串门的云南老哥能吃点啥
- 7月12日历史的今天 大事记。补发
- 终于和女榴友共度良宵
- [欧美] 床第之间的爱 [29P]
- [五一特刊]美臀分享,祝各位榴友五一节日快乐[33P]
- 伊朗神权统治术
- 中国新内阁第四位部长落马 突遭撤换
- sora之强大第二贴[2V]
- 分享一部爆笑的韩国电影《童话睇硬了》/《禁忌童话
- [每日动图]20251018[140P]
- 一些野娘们儿 第十弹[100P]
- 大家出行的时候注意不要坐波音这些年生产的飞机机型
- 喜欢3P的肥臀熟女第二弹[10P]
- [亚洲] 叫黑丝女助理来开房,刚进门就被狂摸吹箫[20P]