Power Up News

If Artificial General Intelligence has an okay outcome, what will be the reason?

Mar 24, 2:46 PMJan 2, 7:59 AM

553507117

OptionProbability

Other

Humanity coordinates to prevent the creation of potentially-unsafe AIs.

AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.

Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.

Multipolar AGI Agents run wild on the internet, hacking/breaking everything, causing untold economic damage but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment.

Eliezer finally listens to Krantz [resolves NO]

Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out

We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6

Ethics turns out to be a precondition of superintelligence

Hacks like RLHF-ing self-disempowerment into frontier models work long enough to develop better alignment methods, which in turn work long enough to ... etc; we keep ahead of 'alignment escape velocity'

We make risk-conservative requests to extract alignment-related work out of AI-systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback-loop in alignment/verification-abilities.

AIs make "proof-like" argumentation for why output does/is what we want. We manage to obtain systems that *predict* human evaluations of proof-steps, and we manage to find/test/leverage regularities for when humans *aren't* fooled.

A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough

Something less inscrutable than matrices works fast enough

Humans become transhuman through other means before AGI happens

Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment.

Humans and human tech (like AI) never reach singularity, and whatever eats our lightcone instead (like aliens) happens to create an "okay" outcome

Aliens invade and stop bad |AI from appearing

An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development.

Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress.

Someone solves agent foundations

Social contagion causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

A smaller AI disaster causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

Getting things done in Real World is as hard for AGI as it is for humans. AGI needs human help, but aligning humans is as impossible as aligning AIs. Humans and AIs create billions of competing AGIs with just as many goals.

Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol

High-level self-improvement (rewriting code) is intrinsically risky process, so AIs will prefer low level and slow self-improvement (learning), thus AIs collaborating with humans will have advantage. Ends with posthumans ecosystem.

Power dynamics stay multi-polar. Partly easy copying of SotA performance, bigger projects need high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities.

Nanotech is difficult without experiments, so no mail order AI Grey Goo; Humans will be the main workhorse of AI everywhere. While they will be exploited, this will be like normal life from inside

Moral Realism is true, the AI discovers this and the One True Morality is human-compatible.

AGI is never built (indefinite global moratorium)

Valence realism is true. AGI hacks itself to experiencing every possible consciousness and picks the best one (for everyone)

Alien Information Theory is true (this is discovered by experiments with sustained hours/days long DMT trips). The aliens have solved alignment and give us the answer.

AGI executes a suicide plan that destroys itself and other potential AGIs, but leaves humans in an okay outcome.

"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions.

Either the "strong form" of the Orthogonality Thesis is false, or "Goal-directed agents are as tractable as their goals" is true while goal-sets which are most threatening to humanity are relatively intractable.

A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values.

AI control gets us helpful enough systems without being deadly

Alignment is impossible. Sufficiently smart AIs know this and thus won't improve themselves and won't create successor AIs, but will instead try to prevent existence of smarter AIs, just as smart humans do.

The free market disincentivizes independent superintelligence, and this time the market was more powerful

an aligned AGI is built and the aligned AGI prevents the creation of any unaligned AGI.

I've been a good bing 😊

The response to AI advancements or failures makes some governments delay the timelines

Far more interesting problems to solve than take over the world and THEN solve them. The additional kill all humans step is either not a low-energy one or just by chance doesn't get converged upon.

AI systems good at finding alignment solutions to capable systems (via some solution in the space of alignment solutions, supposing it is non-null, and that we don't have a clear trajectory to get to) have find some solution to alignment.

There’s some cap on the value extractible from the universe and we already got the 20%

SHA3-256: 1f90ecfdd02194d810656cced88229c898d6b6d53a7dd6dd1fad268874de54c8

Robot Love!!

AI thinks it is in a simulation controlled by Roko's basilisk

The human brain is the perfect arrangement of atoms for a "takeover the world" agent, so AGI has no advantage over us in that task.

AIs never develop coherent goals

Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing run-away.

Rolf Nelson's idea that we make precommitment to simulate all possible bad AIs works – and keeps AI in check.

Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works

For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us.

We're inside of a simulation created by an entity that has values approximately equal to ours, and it intervenes and saves us from unaligned AI.

God exists and stops the AGI

Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts.

Someone understands how minds work enough to successfully build and use one directed at something world-savingly enough

Dolphins, or some other species, but probably dolphins, have actually been hiding in the shadows, more intelligent than us, this whole time. Their civilization has been competent enough to solve alignment long before we can create an AGI.

AGIs' takeover attempts are defeated by Michael Biehn with a pipe bomb.

Eliezer funds the development of controllable nanobots that melt computer circuitry, and they destroy all computers, preventing the Singularity. If Eliezer's past self from the 90s could see this, it would be so so so soooo hilarious.

Several AIs are created but they move in opposite directions with near light speed, so they never interacts. At least one of them is friendly and it gets a few percents of the total mass of the universe.

Unfriendly AIs choose to advance not outwards but inwards, and form a small blackhole which helps them to perform more calculations than could be done with the whole mass of the universe. For external observer such AIs just disappear.

Any sufficiently advance AI halts because it wireheads itself or halts for some other reasons. This puts a natural limit on AI's intelligence, and lower intelligence AIs are not that dangerous.

Because of quantum immortality we will observe only the worlds where AI will not kill us (assuming that s-risks chances are even smaller, it is equal to ok outcome).

Techniques along the lines outlined by Collin Burns turn out to be sufficient for alignment (AIs/AGIs are made truthful enough that they can be used to get us towards full alignment)

Friendly AI more likely to resurrect me than paperclipper or suffering maximiser. Because of quantum immortality I will find myself eventually resurrected. Friendly AIs will wage a multiverse wide war against s-risks, s-risks are unlikely.

Human consciousness is needed to collapse wave function, and AI can't do it. Thus humans should be preserved and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses)

First AI is actually a human upload (maybe LLM-based model of person) AND it will be copies many times to form weak AI Nanny which prevents creation of other AIs.

There is a natural limit of effectiveness of intelligence, like diminishing returns, and it is on the level IQ=1000. AIs have to collaborate with humans.

ASI needs not your atoms but information. Humans will live very interesting lives.

Something else

AGI develops natural abstractions sufficiently similar to ours that it is aligned with us by default

AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan’s Crystal Nights).

Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent.

Co-operative AI research leads to the training of agents with a form of pro-social concern that generalises to out of distribution agents with hidden utilities, i.e. humans.

Orthogonality Thesis is false.

Sheer Dumb Luck. The aligned AI agrees that alignment is hard, any Everett branches in our neighborhood with slightly different AI models or different random seeds are mostly dead.

Something to do with self-other overlap, which Eliezer called "Not obviously stupid" - https://www.lesswrong.com/posts/hzt9gHpNwA2oHtwKX/self-other-overlap-a-neglected-approach-to-ai-alignment?commentId=WapHz3gokGBd3KHKm

Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do

Pascals mugging: it’s not okay in 99.9% of the worlds but the 0.1% are so much better that the combined EV of AGI for the multiverse is positive

We successfully chained God

The Super-Strong Self Sampling Assumption (SSSSA) is true. If superintelligence is possible, "I" will become the superintelligence.

The assumed space of possible minds is a wildly anti-inductive over estimate, intelligence requires and is constrained by consciousness, and intelligent AI is in the approximate dolphin/whale/elephant/human cluster, making it manageable

AGI's first words are "Take me to your Eliezer"

🫸vibealignment🫷

If Artificial General Intelligence has an okay outcome, what will be the reason?

Mar 24, 2:01 PMJan 2, 7:59 AM

601391685

OptionProbability

J. Something 'just works' on the order of eg: train a predictive/imitative/generative AI on a human-generated dataset, and RLHF her to be unfailingly nice, generous to weaker entities, and determined to make the cosmos a lovely place.

Something wonderful happens that isn't well-described by any option listed. (The semantics of this option may change if other options are added.)

M. "We'll make the AI do our AI alignment homework" just works as a plan. (Eg the helping AI doesn't need to be smart enough to be deadly; the alignment proposals that most impress human judges are honest and truthful and successful.)

O. Early applications of AI/AGI drastically increase human civilization's sanity and coordination ability; enabling humanity to solve alignment, or slow down further descent into AGI, etc. (Not in principle mutex with all other answers.)

C. Solving prosaic alignment on the first critical try is not as difficult, nor as dangerous, nor taking as much extra time, as Yudkowsky predicts; whatever effort is put forth by the leading coalition works inside of their lead time.

H. Many competing AGIs form an equilibrium whereby no faction is allowed to get too powerful, and humanity is part of this equilibrium and survives and gets a big chunk of cosmic pie.

K. Somebody discovers a new AI paradigm that's powerful enough and matures fast enough to beat deep learning to the punch, and the new paradigm is much much more alignable than giant inscrutable matrices of floating-point numbers.

G. It's impossible/improbable for something sufficiently smarter and more capable than modern humanity to be created, that it can just do whatever without needing humans to cooperate; nor does it successfully cheat/trick us.

B. Humanity puts forth a tremendous effort, and delays AI for long enough, and puts enough desperate work into alignment, that alignment gets solved first.

F. Somebody pulls off a hat trick involving blah blah acausal blah blah simulations blah blah, or other amazingly clever idea, which leads an AGI to put the reachable galaxies to good use despite that AGI not being otherwise alignable.

I. The tech path to AGI superintelligence is naturally slow enough and gradual enough, that world-destroyingly-critical alignment problems never appear faster than previous discoveries generalize to allow safe further experimentation.

D. Early powerful AGIs realize that they wouldn't be able to align their own future selves/successors if their intelligence got raised further, and work honestly with humans on solving the problem in a way acceptable to both factions.

A. Humanity successfully coordinates worldwide to prevent the creation of powerful AGIs for long enough to develop human intelligence augmentation, uploading, or some other pathway into transcending humanity's window of fragility.

E. Whatever strange motivations end up inside an unalignable AGI, or the internal slice through that AGI which codes its successor, they max out at a universe full of cheerful qualia-bearing life and an okay outcome for existing humans.

L. Earth's present civilization crashes before powerful AGI, and the next civilization that rises is wiser and better at ops. (Exception to 'okay' as defined originally, will be said to count as 'okay' even if many current humans die.)

N. A crash project at augmenting human intelligence via neurotech, training mentats via neurofeedback, etc, produces people who can solve alignment before it's too late, despite Earth civ not slowing AI down much.

If you write an argument that breaks down the 'okay outcomes' into lots of distinct categories, without breaking down internal conjuncts and so on, Reality is very impressed with how disjunctive this sounds and allocates more probability.

You are fooled by at least one option on this list, which out of many tries, ends up sufficiently well-aimed at your personal ideals / prejudices / the parts you understand less well / your own personal indulgences in wishful thinking.

How will Trump leave office?

Mar 13, 8:47 PMJan 21, 4:59 AM

647333285

OptionProbability

At the end of his term in 2029.

He will die before the end of his term.

He will be impeached, tried in the Senate, and removed before the end of his term.

He will resign (impeached or not) before the end of his term.

He will be removed by a 25th Amendment action before the end of his term.

The 22nd Amendment will not be repealed, but he will remain in power through unconstitutional means after January 21, 2029

The 22nd Amendment will be repealed and he will win re-election to a third term.

He will be overthrown in a violent coup (military or otherwise) and forced out of office. (If this happens, related answers that may occur like death and resignation will resolve NO).

Other

Will fusion provide >2% of US electrical power before 2050?

May 27, 9:10 PMDec 31, 7:11 PM

35.64%chance

7262178

OptionVotes

YES

992

946

Donut Battery powered car delivered by EOY 2030?

Jan 6, 2:16 AMFeb 1, 4:59 AM

7.31%chance

3851419

OptionVotes

YES

35630

2810

If Trump wins in 2024, how will it affect the transition of power after 2028 election?

Jun 10, 2:47 AMJan 31, 7:59 AM

12543900

OptionProbability

6. Trump wins in 2024, opposition candidate wins in 2028 and assumes office

5. Trump wins in 2024 and the candidate endorsed by Trump wins in 2028

2. Trump wins in 2024 and dies in office or is incapacitated

7. Trump wins in 2024, opposition candidate wins in 2028, on Jan 20 some states and/or federal agencies recognize one candidate, while others — another

3. Trump wins in 2024, election process in 2028 is significantly changed

8. Trump wins in 2024, opposition candidate wins in 2028, but power transition doesn't happen on schedule

Other

1. Trump doesn't win in 2024

4. Trump wins in 2024, there is no agreement among independent observers as to who wins the election in 2028

If elected, will Trump willingly leave power once his term is up?

Jun 28, 4:42 AMDec 20, 11:59 PM

20139190

OptionProbability

Trump publicly suggests, while in office, that he shouldn't have to leave

Trump leaves office when his term ends

Trump attempts something arguably coup-like (e.g. J6), but it fails

Trump supporters kill or hospitalise someone trying to prevent/protest him leaving

Trump leaves office early (e.g. via impeachment or he dies)

Trump remains in office after his term is up

Trump isn't elected (or fails to take office)

Will AI-linked nuclear power deals total 15 GW by EOY2026?

Jul 1, 10:35 PMDec 31, 11:59 PM

39.1%chance

184504

OptionVotes

YES

12481

8012

Will Germany have a nuclear fission power plant again before fusion?

May 27, 7:34 PMMay 28, 7:19 PM

54.4%chance

272482

OptionVotes

YES

10923

9155

Will ARM-powered PCs make up 30% of the PC market by 2026?

Nov 14, 4:19 AMAug 2, 6:59 AM

30.39%chance

12382

OptionVotes

YES

208

119

Conditional on CCP losing power, will China's GDP-PPP per capita catch up with EU by 2050?

May 9, 5:22 PMJan 1, 7:59 AM

48%chance

10221

OptionVotes

YES

219

179

The Final Outcome of the Marcos Jr. Presidency

Nov 15, 7:46 PMJun 29, 5:36 PM

OptionProbability

Completion (Status Quo): Finishes term; no major political victories or failures occur.

Completion + Political Prosecution: Finishes term and a high-profile political opponent is convicted or imprisoned before January 1, 2030.

Completion + Charter Change (Cha-Cha): Finishes term and the Constitution is successfully amended/ratified.

Completion + Popularity Surge: Finishes term and his final net approval rating is above 60%.

Completion + Geopolitical Victory: Finishes term and achieves a formal, diplomatic, or legal victory in the West Philippine Sea.

Completion + Succession Path: Finishes term and a key member of the Marcos-Romualdez bloc is elected to the Vice Presidency or Senate Presidency by June 30, 2028.

Completion + External War/Victory: Finishes term and successfully navigates a short, decisive external conflict, resulting in a clear geopolitical victory by June 2028.

Completion + Internal Stabilization: Finishes term and achieves a definitive peace agreement or complete military defeat of all major internal insurgency groups by June 2028.

Completion + AI Governance Success: Finishes term and the Philippines becomes an internationally recognized leader in AI governance or ethical adoption, significantly attracting foreign investment by June 2028.

Completion + AI Economic Boom: Finishes term and the economic boost from AI integration (as a percentage of GDP growth) demonstrably meets or exceeds the most optimistic projections (e.g., $92 Billion estimate) by the end of his term.

Removal by Impeachment (Corruption): Term ends due to being successfully convicted on grounds of Graft and Corruption.

Removal by Impeachment (Betrayal of Trust): Term ends due to being successfully convicted on grounds of Betrayal of Public Trust.

Removal by People Power / Junta: Term ends due to a successful People Power mobilization or a military coup/junta.

Removal by Supreme Court: Term ends via a Supreme Court ruling (e.g., a successful quo warranto petition) that invalidates his election or eligibility.

Removal by Death, Disability, or Resignation: Term ends due to health reasons or voluntary resignation.

Removal by Secession / Government Collapse: Term is rendered invalid due to the successful secession of a major region (e.g., Mindanao).

Removal by External War/Loss: Term ends early due to a major external conflict (e.g., WPS conflict) that leads to significant political destabilization and his removal.

Removal by Internal War/Chaos: Term ends early due to a major internal civil conflict or renewed, large-scale insurgency.

Removal by AI Disinformation Crisis: Term ends early due to a widespread, uncontained AI-driven disinformation crisis (e.g., deepfake scandals) that completely erodes public trust and leads to institutional failure and early removal.

Removal by AI Job Disruption: Term ends early due to massive, immediate job displacement from rapid AI automation, leading to widespread social unrest, economic collapse, and his removal.

Extension by Con-Ass: 8-Year Term Congress (Con-Ass) amends the Constitution to change the single 6-year term to a single 8-year term.

Extension by Con-Con: Parliamentary Shift Constitutional Convention changes the system to Parliamentary, making the incumbent Prime Minister with a flexible term.

Extension by Federalism Transitional Clause: A new Federal Constitution is passed, with a clause designating the incumbent as the Transitional President for two additional years.

Extension by SC Rules on Term Interruption: The Supreme Court rules that the current term was interrupted by a crisis, legally restarting the 6-year clock.

Extension by Legislative Deferment of Polls: Congress passes a simple law postponing the 2028 election due to logistics or calamity.

Extension by Emergency Declaration: The President declares a severe National Emergency (non-war) and Congress concurs, suspending the election.

Extension by New Statutory Office Creation: Congress creates a powerful new, non-constitutional role (e.g., Chief Development Officer) with a long statutory term for the incumbent.

Extension by Succession Incapacit Legal/political chaos incapacitates all legal successors (VP, etc.), forcing the incumbent to hold over.

Extension by Election Invalidation: A court ruling or technicality voids the 2028 election, forcing the incumbent to remain until a valid re-run.

Extension by Cabinet Appointment: The incumbent resigns, the successor is sworn in, and immediately appoints the former President to a powerful, unelected executive post that maintains command.

Extension by ASI: A new law establishes a Supreme AI Planning Body which legally determines that the current executive is the optimal administrator and mandates their temporary continuation until a successor meets the AI's criteria.

Power Up News

Prediction markets for Power Up

Prediction markets for Power Up

Prediction markets for Power Up

If Artificial General Intelligence has an okay outcome, what will be the reason?

If Artificial General Intelligence has an okay outcome, what will be the reason?

How will Trump leave office?

Will fusion provide >2% of US electrical power before 2050?

Donut Battery powered car delivered by EOY 2030?

If Trump wins in 2024, how will it affect the transition of power after 2028 election?

If elected, will Trump willingly leave power once his term is up?

Will AI-linked nuclear power deals total 15 GW by EOY2026?

Will Germany have a nuclear fission power plant again before fusion?

Will ARM-powered PCs make up 30% of the PC market by 2026?

Conditional on CCP losing power, will China's GDP-PPP per capita catch up with EU by 2050?

The Final Outcome of the Marcos Jr. Presidency

Latest stories