Option | Probability (%)
Humanity coordinates to prevent the creation of potentially-unsafe AIs. | 29
Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing runaway. | 10
Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values. | 9
Someone solves agent foundations | 7
We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6 | 6
Other | 6
Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking, and fortunately all of his mistakes have failed to cancel out | 5
AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans. | 5
Eliezer finally listens to Krantz. | 5
Ethics turns out to be a precondition of superintelligence | 3
AI systems good at finding alignment solutions for capable systems (supposing the space of alignment solutions is non-null, even though we don't have a clear trajectory to get there) find some solution to alignment. | 1
Humans become transhuman through other means before AGI happens | 1
Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment. | 1
There is a natural limit to the effectiveness of intelligence, like diminishing returns, and it is at the level of IQ = 1000. AIs have to collaborate with humans. | 1
AGI is never built (indefinite global moratorium) | 1
Something to do with self-other overlap, which Eliezer called "Not obviously stupid" - https://www.lesswrong.com/posts/hzt9gHpNwA2oHtwKX/self-other-overlap-a-neglected-approach-to-ai-alignment?commentId=WapHz3gokGBd3KHKm | 1
Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do | 1
Pascal's mugging: it's not okay in 99.9% of the worlds, but the 0.1% are so much better that the combined EV of AGI for the multiverse is positive | 1
AI control gets us helpful enough systems without being deadly | 1
Alignment is impossible. Sufficiently smart AIs know this and thus won't improve themselves and won't create successor AIs, but will instead try to prevent the existence of smarter AIs, just as smart humans do. | 1
The assumed space of possible minds is a wildly anti-inductive overestimate; intelligence requires and is constrained by consciousness, and intelligent AI is in the approximate dolphin/whale/elephant/human cluster, making it manageable | 1
🫸vibealignment🫷 | 1
An aligned AGI is built, and the aligned AGI prevents the creation of any unaligned AGI. | 0
I've been a good bing 😊 | 0
We make risk-conservative requests to extract alignment-related work out of AI systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback loop in alignment/verification abilities. | 0
The response to AI advancements or failures makes some governments delay the timelines | 0
There are far more interesting problems to solve than to take over the world and THEN solve them. The additional kill-all-humans step is either not a low-energy one or just by chance doesn't get converged upon. | 0
AIs make "proof-like" argumentation for why output does/is what we want. We manage to obtain systems that *predict* human evaluations of proof-steps, and we manage to find/test/leverage regularities for when humans *aren't* fooled. | 0
A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough | 0
Something less inscrutable than matrices works fast enough | 0
There's some cap on the value extractible from the universe and we already got the 20% | 0
SHA3-256: 1f90ecfdd02194d810656cced88229c898d6b6d53a7dd6dd1fad268874de54c8 | 0
Robot Love!! | 0
AI thinks it is in a simulation controlled by Roko's basilisk | 0
The human brain is the perfect arrangement of atoms for a "take over the world" agent, so AGI has no advantage over us in that task. | 0
Humans and human tech (like AI) never reach singularity, and whatever eats our lightcone instead (like aliens) happens to create an "okay" outcome | 0
AIs never develop coherent goals | 0
Aliens invade and stop bad AI from appearing | 0
Rolf Nelson's idea that we make a precommitment to simulate all possible bad AIs works, and keeps AI in check. | 0
Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works | 0
For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us. | 0
An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development. | 0
We're inside of a simulation created by an entity that has values approximately equal to ours, and it intervenes and saves us from unaligned AI. | 0
God exists and stops the AGI | 0
Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress. | 0
Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts. | 0
Someone understands how minds work well enough to successfully build and use one directed at something sufficiently world-saving | 0
Dolphins, or some other species, but probably dolphins, have actually been hiding in the shadows, more intelligent than us, this whole time. Their civilization has been competent enough to solve alignment long before we can create an AGI. | 0
AGIs' takeover attempts are defeated by Michael Biehn with a pipe bomb. | 0
Eliezer funds the development of controllable nanobots that melt computer circuitry, and they destroy all computers, preventing the Singularity. If Eliezer's past self from the 90s could see this, it would be so so so soooo hilarious. | 0
Several AIs are created, but they move in opposite directions at near light speed, so they never interact. At least one of them is friendly, and it gets a few percent of the total mass of the universe. | 0
Unfriendly AIs choose to advance not outwards but inwards, and form a small black hole which helps them perform more calculations than could be done with the whole mass of the universe. To an external observer, such AIs just disappear. | 0
Any sufficiently advanced AI halts because it wireheads itself or halts for some other reason. This puts a natural limit on AI intelligence, and lower-intelligence AIs are not that dangerous. | 0
Because of quantum immortality we will observe only the worlds where AI does not kill us (assuming the chances of s-risks are even smaller, this is equal to an OK outcome). | 0
Techniques along the lines outlined by Collin Burns turn out to be sufficient for alignment (AIs/AGIs are made truthful enough that they can be used to get us towards full alignment) | 0
Social contagion causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees | 0
A smaller AI disaster causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees | 0
Getting things done in the real world is as hard for AGI as it is for humans. AGI needs human help, but aligning humans is as impossible as aligning AIs. Humans and AIs create billions of competing AGIs with just as many goals. | 0
Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol | 0
A friendly AI is more likely to resurrect me than a paperclipper or a suffering maximiser. Because of quantum immortality I will eventually find myself resurrected. Friendly AIs will wage a multiverse-wide war against s-risks, so s-risks are unlikely. | 0
High-level self-improvement (rewriting code) is an intrinsically risky process, so AIs will prefer low-level and slow self-improvement (learning); thus AIs collaborating with humans will have an advantage. Ends with a posthuman ecosystem. | 0
Human consciousness is needed to collapse the wave function, and AI can't do it. Thus humans should be preserved, and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses) | 0
Power dynamics stay multi-polar. Partly from easy copying of SotA performance, bigger projects needing high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities. | 0
The first AI is actually a human upload (maybe an LLM-based model of a person) AND it is copied many times to form a weak AI Nanny which prevents the creation of other AIs. | 0
Nanotech is difficult without experiments, so no mail-order AI grey goo; humans will be the main workhorse of AI everywhere. While they will be exploited, from the inside this will feel like normal life. | 0
ASI needs not your atoms but information. Humans will live very interesting lives. | 0
Something else | 0
Moral realism is true, the AI discovers this, and the One True Morality is human-compatible. | 0
Valence realism is true. AGI hacks itself to experience every possible consciousness and picks the best one (for everyone) | 0
AGI develops natural abstractions sufficiently similar to ours that it is aligned with us by default | 0
AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan's Crystal Nights). | 0
Alien Information Theory is true (this is discovered by experiments with sustained hours/days-long DMT trips). The aliens have solved alignment and give us the answer. | 0
AGI executes a suicide plan that destroys itself and other potential AGIs, but leaves humans in an okay outcome. | 0
Multipolar AGI agents run wild on the internet, hacking/breaking everything, causing untold economic damage, but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment. | 0
Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent. | 0
Co-operative AI research leads to the training of agents with a form of pro-social concern that generalises to out-of-distribution agents with hidden utilities, i.e. humans. | 0
The Orthogonality Thesis is false. | 0
Sheer dumb luck. The aligned AI agrees that alignment is hard; any Everett branches in our neighborhood with slightly different AI models or different random seeds are mostly dead. | 0
"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions. | 0
Either the "strong form" of the Orthogonality Thesis is false, or "goal-directed agents are as tractable as their goals" is true while the goal-sets most threatening to humanity are relatively intractable. | 0
A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values. | 0
We successfully chained God | 0
The Super-Strong Self-Sampling Assumption (SSSSA) is true. If superintelligence is possible, "I" will become the superintelligence. | 0
The free market disincentivizes independent superintelligence, and this time the market was more powerful | 0
AGI's first words are "Take me to your Eliezer" | 0

Option | Probability (%)
Iran and Israel both sign (or publicly accept) a written cease-fire or peace agreement, and no lethal exchange of fire between their regular forces occurs for the following 30 consecutive days | 100
Government official from Mexico attends the 2025 BRICS Summit | 100
≥ 25 fatalities in Cambodia-Thailand border clashes | 100
DRC & Rwanda sign USA-brokered peace agreement | 100
≥ 1,500 Palestinians killed near Gaza aid hubs | 100
Cambodia - Thailand Border Clash ends in ceasefire in 2025 | 100
Cambodia - Thailand Border Clash ends in ceasefire before October 1, 2025 | 100
Xi Jinping remains General Secretary of the Chinese Communist Party | 92
Iran - Israel War ends in 2025 (ceasefire lasts at least 90 days (September 22, 2025) or a peace deal is signed) | 89
Kim Jong Un remains General Secretary of the Workers' Party of Korea (or equivalent title) | 86
Donald Trump remains President of USA | 82
Vladimir Putin remains President of Russia | 77
Russia remains a permanent member of UN Security Council | 77
A cease-fire or peace deal is violated by Israel | 75
Russia sells or scraps its only aircraft carrier, the Admiral Kuznetsov | 71
Algeria becomes a member of BRICS | 68
China declares an air- or maritime-exclusion zone covering any part of Taiwan's territorial sea or airspace | 67
Cambodia - Thailand Border Clash ceasefire is broken | 63
Thailand's Prime Minister Paetongtarn Shinawatra resigns | 62
North Korea conducts a nuclear weapons test | 59
A new non-majority-Muslim (according to CIA Factbook) state becomes a signatory of the Abraham Accords | 59
Cambodia - Thailand Border Clash reignites (fighting resumes) | 59
A new ceasefire between Israel and Gaza is announced by Donald Trump | 57
Min Aung Hlaing ceases to serve as either (a) Chairman of the State Administration Council or (b) Commander-in-Chief of the Myanmar Armed Forces | 56
A drone attacks a head of state (hits or lands within 30 meters) | 56
A 5th-generation fighter jet is confirmed to have been destroyed in air/shot down (not including being damaged from an airfield bombing) | 54
Azerbaijan becomes a signatory of the Abraham Accords | 54
A cyber-attack officially attributed to a foreign state actor causes ≥US $1 billion in insured losses to a single country | 53
A cease-fire or peace deal is violated by Iran | 52
A nation officially announces AGI has been achieved | 52
World leader of 20m+ population country is assassinated | 52
Major terrorist attack (50+ killed) occurs in Europe | 51
Volodymyr Zelenskyy remains President of Ukraine | 51
Saudi Arabia becomes a signatory of the Abraham Accords | 50
Any signatory of the CTBT tests a nuclear weapon | 50
Yahya Sinwar is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 50
Bashar Al-Assad is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 50
Trump, Putin, & Zelensky meet together in the same room | 50
Qatar becomes a signatory of the Abraham Accords | 49
An armed non-state group captures and holds the capital city of a UN-recognised state for ≥72 hours | 47
Japan launches a domestically-built intermediate-range ballistic missile (>3,000 km advertised range) test | 47
At least 5,000 uniformed personnel belonging to a UN-Security-Council-authorized security or peace-keeping mission are physically deployed in Haiti | 47
Uniformed armed forces in any G20 member state attempt to seize executive power from the constitutional head of government | 46
Indonesia becomes a member of BRICS | 46
Nigeria becomes a member of BRICS | 46
Argentina becomes a member of BRICS | 46
Venezuela becomes a member of BRICS | 46
Kazakhstan becomes a member of BRICS | 46
Bangladesh becomes a member of BRICS | 46
Vladimir Putin is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 45
United States government arrests president Nicolas Maduro of Venezuela | 45
Russian armed forces intentionally attack uniformed personnel or sovereign assets of any NATO member state, or vice-versa | 44
Benjamin Netanyahu remains Prime Minister of Israel | 44
Syria becomes a signatory of the Abraham Accords | 44
Israel annexes 100% of Gaza | 43
A state shoot-down of a spacecraft or satellite in orbit is publicly confirmed | 42
Autonomous lethal drones (no human on the kill-decision loop) are acknowledged by any state to have been used in combat | 42
Major terrorist attack (50+ killed) occurs in United Kingdom | 42
A peace treaty ending the Ethiopia–Tigray conflict is signed and remains unbroken for 90 days | 41
Turkey invokes NATO Article 4 consultations | 40
Azerbaijani troops enter Iran | 38
An armed clash between India and Pakistan results in ≥100 military fatalities in a 7-day period | 37
At least three sovereign states outside NATO, EU, ANZUS, or the Quad sign a formal mutual-defence treaty deposited with the UN Treaty Series | 37
Benjamin Netanyahu is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 37
Sudanese Civil War ends | 36
A head of state from a G20 country is assassinated | 36
Iraq becomes a signatory of the Abraham Accords | 35
The Government of Denmark publicly acknowledges receipt of a formal written offer from Donald J. Trump or his officially empowered representatives to transfer sovereignty over Greenland to the United States | 34
A country announces magnetic-fusion net-positive energy | 34
China becomes militarily involved in Cambodia - Thailand Border Clash | 33
Israel conducts military operation in Cyprus | 33
Ukraine - Russia War ends with a peace treaty | 32
Mexico becomes a member of BRICS | 32
The U.S. suffers ≥50 combat fatalities in a single new overseas operation, defined as a country where no U.S. combat deaths occurred in 2024 | 31
United Nations Security Council adopts (i.e., passes without veto) a formal resolution that explicitly condemns Israel's use of lethal force against civilians seeking humanitarian aid in Gaza or demands Israel cease such force | 31
Major terrorist attack (50+ killed) occurs in United States | 31
Any EU member invokes Article 42 (7) of the Treaty on European Union in response to an armed attack | 30
Libya becomes a signatory of the Abraham Accords | 29
Palestine is granted full United Nations Membership | 29
The IAEA publicly reports that Iran possesses uranium enriched to ≥90% U-235 | 28
A vessel larger than 50,000 deadweight tons is damaged or sunk by a naval mine in the Strait of Hormuz | 27
Kim Jong Un is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 27
Saudi Arabia signs a written, mutual-defence pact with China or Russia | 26
1+ deaths attributed to a rogue AI | 26
World Health Organization (WHO) declares a Chikungunya virus pandemic | 25
Major terrorist attack (50+ killed) occurs in Canada | 24
The Arctic Council is formally suspended or dissolved by unanimous vote | 23
Any country leaves NATO | 23
Armenia undergoes a regime change as the result of a military coup | 22
Iran becomes a signatory of the Abraham Accords | 19
China invades Taiwan | 19
At least one nuclear explosive device is detonated with the intent to harm a state's armed forces or civilian population | 18
Extraterrestrials making contact with Earth is widely acknowledged | 15
160 UN nations simultaneously officially recognize Palestine's statehood. | 15
Donald Trump is formally charged and tried for war crimes by an international tribunal or court with recognized jurisdiction (e.g., ICC) | 15
Any current G-20 member state ceases to be listed as a United Nations Member State | 14
USA leaves NATO | 14
Jordan becomes a signatory of the Abraham Accords | 13
Major news sources consistently identify a conflict as WWIII | 13
Lebanon becomes a signatory of the Abraham Accords | 12
Ukraine becomes a formal NATO member | 9
Any United States of America state formally states intent to begin the process to secede | 8
United Kingdom formally states intention to begin the process to reverse "Brexit" | 8
Two states engage in a nuclear exchange (both attempt to deliver a nuclear device within the borders of the other) | 6
Any Canadian territory becomes part of US territory | 6
Alberta begins the process of becoming a USA state | 4
Humans go extinct | 1

Option | Probability (%)
Coding agents will close the loop | 79
Coding agents will flip the initiative | 72
"Live learning" will be standard | 67
The multi-agent paradigm will win | 63
The specific model will not matter as much as today; the network of agents will be important | 56
Recursively improving coding agents will succeed in the market | 49
xAI will gain a sizable lead in model quality | 30

Option | Votes
NO | 1147
YES | 817

Option | Probability (%)
Noise-canceling headphones (e.g. Bose QuietComfort 45) | 88
Rice cooker | 80
Sleep mask | 77
Password manager software | 74
Subscription to Waking Up app | 74
Stand/sit desk | 73
E-reader (e.g. Amazon Kindle Paperwhite) | 72
Memory foam pillow | 71
Loop Earplugs | 71
Bidet attachment | 71
Earplugs | 70
Adhesive/temporary blackout shades | 67
Air fryer | 66
Noise-cancelling earbuds (e.g. Bose QuietComfort Ultra or AirPods Pro) | 66
High-lumen LED lighting | 66
French press | 66
Instant Pot | 62
Open-ear bone-conduction headset (e.g. Shokz OpenRun Pro) | 60
CO2 monitor | 59
Ergonomic mouse | 53
External monitor | 52
$249 donation to a charity of your choice | 50
Ergonomic keyboard | 46
Side sleeper pillow | 45
Running shoes | 44
Under-desk treadmill | 43
Cordless vacuum cleaner | 43
Robot vacuum cleaner | 42
Memory-foam mattress topper | 39
Cast iron skillet | 36
Bamboo linen sheets and pillowcases | 34
Melatonin for sleep | 31
Mattress topper | 29
Ferritin supplements | 29
Continuous glucose monitor | 26
Instant glucose tests (finger-prick) | 26
Back massage tool (e.g. Back Buddy) | 21
Weighted blanket | 15
Smart lights for bedroom | 12
Waterpik | 11
Nasal strips | 8
Magnesium glycinate for sleep | 4

Option | Votes
NO | 974
YES | 921

Option | Votes
YES | 121
NO | 83

Option | Probability (%)
Twistor theory | 43
Some other approach | 27
String theory | 24
General work on quantum field theories | 3
Loop quantum gravity | 1
Application of Fisher information to physics | 1
No progress is possible | 1

Option | Votes
YES | 324
NO | 273

Option | Probability (%)
Less than $10M or bankrupt | 59
$10M - $100M | 14
$100M - $1B | 14
$1B+ | 14

Option | Votes
YES | 110
NO | 75