OpenAI 的宮鬥核心人物 - Ilya Sutskever

以下是關於網路上一些資訊的筆記

科技發展的同時，是否人類也能兼顧並保護自己的未來？

這幾天 OpenAI 的內部宮鬥經過了一系列的反轉再反轉再反轉XD

目前 Sam Altman 應該是會回歸到 OpenAI，董事會會全部重組
初步重組人員包括

Bret Taylor - Salesforce的前聯合首席官
Larry Summers - 美國前財政部部長
Adam D’Angelo - Quora 創始人

Ilya Sutskever 的思想很大一部份受到他的老師 Geoffrey Hinton 深度學習之父的影響
在學生時期跟著老師與他的學生 Alex Krizhevsky 寫出論文 AlexNet 論文連結
接著老師帶著他們創建了 DNN research，之後加入 Google 巨頭研究 AI 長達三年
最終在 2015 年被 Elon Musk 跟 Gregg Brockman 拉去跟 Sam Altman 創建了一開始的 OpenAI （那時候還是非營利組織）
Google 發表論文 Attention Is All you need

大綱

The paper you provided is titled “Attention Is All You Need” by Ashish Vaswani and others, published in 2017. This influential paper introduces the Transformer model, a new approach to sequence transduction (like machine translation) that relies solely on attention mechanisms, discarding the need for recurrent or convolutional neural networks.

Key Points:

Introduction and Background:
- Recurrent neural networks (RNNs), long short-term memory (LSTM), and gated recurrent neural networks have been standard in sequence modeling tasks like language modeling and machine translation.
- Traditional models align computation with the positions of input and output sequences, which limits parallelization due to their inherently sequential nature.
- Attention mechanisms have become integral in modeling dependencies in sequences, but usually they’re used alongside recurrent networks.
The Transformer Model:
- The Transformer model is a novel architecture that completely eschews recurrence and instead relies entirely on an attention mechanism.
- It allows for more parallelization, achieving superior quality in tasks like machine translation, while requiring significantly less training time.
- The model architecture has an encoder-decoder structure, using self-attention and point-wise, fully connected layers for both parts.
- The encoder and decoder are made of stacks of N=6 identical layers.
Attention Mechanism:
- The paper introduces “Scaled Dot-Product Attention” and “Multi-Head Attention” as key components of the Transformer.
- Multi-head attention allows the model to jointly attend to information from different representation subspaces at different positions.
Training and Results:
- Training was conducted on standard WMT 2014 English-German and English-French translation tasks.
- The Transformer model achieved state-of-the-art results on these tasks, outperforming existing models, with significantly reduced training costs.
- The model also generalizes well to other tasks like English constituency parsing.
Conclusion:
- The Transformer presents a significant shift in sequence transduction models, emphasizing the power of attention mechanisms.
- It showcases faster training capabilities and sets new state-of-the-art results in translation tasks.
- The paper opens avenues for future work in applying attention-based models to a wider range of tasks and exploring more efficient ways to handle large inputs and outputs.

This paper marked a major milestone in the field of machine learning, particularly in natural language processing, influencing numerous subsequent works and advancements.

OpenAI 開始走向大模型的方向，Ilya Sutskever 也開始領導後面的 GPT1234 和 Dalle，但在 GPT-5 的時候開始針對 AI 的安全性問題開始出現分歧

最主要的因素來自於刻在底層的 Rewarding 機制所衍生出的 Rewarding hacking。
Rewarding Hacking 最主要在討論每次讓 AI 做出下一步的舉動，如果有往前達到目標或是在達到目標的路徑上，系統就會給予獎勵，讓 AI 可以提高總分當作目標，但是因為神經網路的機制會讓 AI 嘗試很多很多人類未曾想過的點，類似找尋系統中未被規範但又能提高獎勵的漏洞，例如說讓 AI 玩貪吃蛇，很多訓練情況應該都會是那條蛇長大到某個程度後就是繞著自己的尾巴旋轉，企圖維持在這個分數而不去往前，或是甚至找到暫停鈕或是退出遊戲之類的情況。現在看起來雖然無傷大雅，甚只有點搞笑，但試想著這種競爭擴展到全方位呢？假設與AI對弈圍棋而AI的設定是要贏過你，那除了在棋盤上用棋力戰勝人類，他似乎還有其他更有效地選擇，讓與他對弈的人類無法在繼續下棋？畢竟他會嘗試所有的路徑。

所以從事 AI 的人開始呼籲說讓 AI 能夠稍微慢一點發展
那 Ilya 這邊就開始了一門比較新的科學，Super Alignment，最終是要打造要和人類的價值觀與利益相對齊。

其實 Ilya 不是一個追求 power 的人，然後 Ilya 也不是喜歡參與這些戰隊，或者說是一些思潮運動的人，Ilya 他其實持有的觀點，我覺得跟他的老師是比較像的，但是他的最後的選擇，比他的老師會勇敢一點，當然跟他比較年輕有關係，他覺得我也保守，我也相信AI他可能最終會有很大的風險，但是我依然要在一個企業裡面，我依然要在組織裡面，而不是在外面只是提醒大家有這個風險，我依然要承擔這個責任去創造一門新的科學，這門科學叫做 alignment，對齊技術，來幫助未來的AI真的具有某種底層的編程的注入，來確保他們的安全性，這是一個完全新的領域，沒有 Ilya 和 OpenAI 的這群人之前，沒有人知道原來還有一個科學叫做對齊科學，我們要去關注的，所以他這種在業界去推動進步的同時，又去有勇氣去考慮別人考慮不到的事情，並且推動一個新的科學發展的這種精神，我覺得是要被人了解到的，而不只是認為說他是保守主義者，他是一個，甚至有人會覺得他是個白蓮花，我覺得這個是對他的一個誤讀。
by 陳芳波 MindOS 創始人

EA vs EACC

Effective Altruism - EA 有效利他主義

保守派，重視 AI 安全發展，為了人類種族為了以後的人類後代來提前的鋪墊

Effective Accelerationism - EACC 有效加速主義

認為技術與資本應該聯合無條件，實現加速科技創新，並且快速地推向市場來顛覆社會架構

It’s not say they are going to hate humans and want to harm them. but this is going too powerful. And I think a good analogy would be the way humans treat animals.
It’s not that we hate animals, I think humans love animals and have a lot of affection for them. But when the time comes to build a highway between two cities, we are not asking the animals for permission, we just do it because it’s important for us. And I think by default that’s the kind of relationship that’s going to be between us and AGIs which are truly autonomous and operating on their own behalf. The beliefs and desires of the first AGIs will be extremely important. And so it’s important to program them correctly.
I think if it is not done. Then nature would revolution or natural selection favors those systems that prioritize survival above us.

by Ilya Sutskever