Dimension News

If Artificial General Intelligence has an okay outcome, what will be the reason?

Mar 24, 2:46 PMJan 2, 7:59 AM

553507117

OptionProbability

Other

Humanity coordinates to prevent the creation of potentially-unsafe AIs.

AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.

Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.

Multipolar AGI Agents run wild on the internet, hacking/breaking everything, causing untold economic damage but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment.

Eliezer finally listens to Krantz [resolves NO]

Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out

We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6

Ethics turns out to be a precondition of superintelligence

Hacks like RLHF-ing self-disempowerment into frontier models work long enough to develop better alignment methods, which in turn work long enough to ... etc; we keep ahead of 'alignment escape velocity'

We make risk-conservative requests to extract alignment-related work out of AI-systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback-loop in alignment/verification-abilities.

AIs make "proof-like" argumentation for why output does/is what we want. We manage to obtain systems that *predict* human evaluations of proof-steps, and we manage to find/test/leverage regularities for when humans *aren't* fooled.

A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough

Something less inscrutable than matrices works fast enough

Humans become transhuman through other means before AGI happens

Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment.

Humans and human tech (like AI) never reach singularity, and whatever eats our lightcone instead (like aliens) happens to create an "okay" outcome

Aliens invade and stop bad |AI from appearing

An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development.

Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress.

Someone solves agent foundations

Social contagion causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

A smaller AI disaster causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

Getting things done in Real World is as hard for AGI as it is for humans. AGI needs human help, but aligning humans is as impossible as aligning AIs. Humans and AIs create billions of competing AGIs with just as many goals.

Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol

High-level self-improvement (rewriting code) is intrinsically risky process, so AIs will prefer low level and slow self-improvement (learning), thus AIs collaborating with humans will have advantage. Ends with posthumans ecosystem.

Power dynamics stay multi-polar. Partly easy copying of SotA performance, bigger projects need high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities.

Nanotech is difficult without experiments, so no mail order AI Grey Goo; Humans will be the main workhorse of AI everywhere. While they will be exploited, this will be like normal life from inside

Moral Realism is true, the AI discovers this and the One True Morality is human-compatible.

AGI is never built (indefinite global moratorium)

Valence realism is true. AGI hacks itself to experiencing every possible consciousness and picks the best one (for everyone)

Alien Information Theory is true (this is discovered by experiments with sustained hours/days long DMT trips). The aliens have solved alignment and give us the answer.

AGI executes a suicide plan that destroys itself and other potential AGIs, but leaves humans in an okay outcome.

"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions.

Either the "strong form" of the Orthogonality Thesis is false, or "Goal-directed agents are as tractable as their goals" is true while goal-sets which are most threatening to humanity are relatively intractable.

A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values.

AI control gets us helpful enough systems without being deadly

Alignment is impossible. Sufficiently smart AIs know this and thus won't improve themselves and won't create successor AIs, but will instead try to prevent existence of smarter AIs, just as smart humans do.

The free market disincentivizes independent superintelligence, and this time the market was more powerful

an aligned AGI is built and the aligned AGI prevents the creation of any unaligned AGI.

I've been a good bing 😊

The response to AI advancements or failures makes some governments delay the timelines

Far more interesting problems to solve than take over the world and THEN solve them. The additional kill all humans step is either not a low-energy one or just by chance doesn't get converged upon.

AI systems good at finding alignment solutions to capable systems (via some solution in the space of alignment solutions, supposing it is non-null, and that we don't have a clear trajectory to get to) have find some solution to alignment.

There’s some cap on the value extractible from the universe and we already got the 20%

SHA3-256: 1f90ecfdd02194d810656cced88229c898d6b6d53a7dd6dd1fad268874de54c8

Robot Love!!

AI thinks it is in a simulation controlled by Roko's basilisk

The human brain is the perfect arrangement of atoms for a "takeover the world" agent, so AGI has no advantage over us in that task.

AIs never develop coherent goals

Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing run-away.

Rolf Nelson's idea that we make precommitment to simulate all possible bad AIs works – and keeps AI in check.

Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works

For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us.

We're inside of a simulation created by an entity that has values approximately equal to ours, and it intervenes and saves us from unaligned AI.

God exists and stops the AGI

Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts.

Someone understands how minds work enough to successfully build and use one directed at something world-savingly enough

Dolphins, or some other species, but probably dolphins, have actually been hiding in the shadows, more intelligent than us, this whole time. Their civilization has been competent enough to solve alignment long before we can create an AGI.

AGIs' takeover attempts are defeated by Michael Biehn with a pipe bomb.

Eliezer funds the development of controllable nanobots that melt computer circuitry, and they destroy all computers, preventing the Singularity. If Eliezer's past self from the 90s could see this, it would be so so so soooo hilarious.

Several AIs are created but they move in opposite directions with near light speed, so they never interacts. At least one of them is friendly and it gets a few percents of the total mass of the universe.

Unfriendly AIs choose to advance not outwards but inwards, and form a small blackhole which helps them to perform more calculations than could be done with the whole mass of the universe. For external observer such AIs just disappear.

Any sufficiently advance AI halts because it wireheads itself or halts for some other reasons. This puts a natural limit on AI's intelligence, and lower intelligence AIs are not that dangerous.

Because of quantum immortality we will observe only the worlds where AI will not kill us (assuming that s-risks chances are even smaller, it is equal to ok outcome).

Techniques along the lines outlined by Collin Burns turn out to be sufficient for alignment (AIs/AGIs are made truthful enough that they can be used to get us towards full alignment)

Friendly AI more likely to resurrect me than paperclipper or suffering maximiser. Because of quantum immortality I will find myself eventually resurrected. Friendly AIs will wage a multiverse wide war against s-risks, s-risks are unlikely.

Human consciousness is needed to collapse wave function, and AI can't do it. Thus humans should be preserved and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses)

First AI is actually a human upload (maybe LLM-based model of person) AND it will be copies many times to form weak AI Nanny which prevents creation of other AIs.

There is a natural limit of effectiveness of intelligence, like diminishing returns, and it is on the level IQ=1000. AIs have to collaborate with humans.

ASI needs not your atoms but information. Humans will live very interesting lives.

Something else

AGI develops natural abstractions sufficiently similar to ours that it is aligned with us by default

AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan’s Crystal Nights).

Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent.

Co-operative AI research leads to the training of agents with a form of pro-social concern that generalises to out of distribution agents with hidden utilities, i.e. humans.

Orthogonality Thesis is false.

Sheer Dumb Luck. The aligned AI agrees that alignment is hard, any Everett branches in our neighborhood with slightly different AI models or different random seeds are mostly dead.

Something to do with self-other overlap, which Eliezer called "Not obviously stupid" - https://www.lesswrong.com/posts/hzt9gHpNwA2oHtwKX/self-other-overlap-a-neglected-approach-to-ai-alignment?commentId=WapHz3gokGBd3KHKm

Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do

Pascals mugging: it’s not okay in 99.9% of the worlds but the 0.1% are so much better that the combined EV of AGI for the multiverse is positive

We successfully chained God

The Super-Strong Self Sampling Assumption (SSSSA) is true. If superintelligence is possible, "I" will become the superintelligence.

The assumed space of possible minds is a wildly anti-inductive over estimate, intelligence requires and is constrained by consciousness, and intelligent AI is in the approximate dolphin/whale/elephant/human cluster, making it manageable

AGI's first words are "Take me to your Eliezer"

🫸vibealignment🫷

🆕What will be true about the next product from Prong.studio?

May 9, 5:26 AMMay 23, 3:59 AM

323164

OptionProbability

Will Have LEDs / lights

Will contain a circuit board

Will have readable word(s) on it

Will cost $40 or more

Will Have a physical toggle or switch (for power , settings, etc)

Will Cost $69 or more

Will cost $100 or more

Will have at least two buttons (of any type)

Weigh 200g or more

Weigh 50g or more

Will have a gyroscope or similar function via an accelerometer

Will Have a USB-A port

Could be snapped in half by human hands by an average person

Will be announced by end of 2024

Will cost $53 or more

Will use aluminum as a component

Overall color will be black

Will have a USB-C port

Weigh 500g or more

Will announce a partnership with another company at the same time

Will have a vibration / rumble motor

Releases for sale before Halloween 2024

Releases for sale before Thanksgiving 2024

Releases for sale before boxing day 2024

longer than 9 inches in any direction

Will cost $111 or more

Will come with a carrying case

Will fit in your pocket

Will have a battery

Will cost $140 or more

Will have an AUX output

Will make noise

Will have a screen

Will use bluetooth or wifi

Will cost $200 or more

Will Weigh 2 lbs or more

Will be longer than 12 inches in any dimension

anounced before end of 2025

Will be smash-specific

Will be a smart ring

releases for sale before end of 2025

Will be announced before Jimmy Carter passes away

What are the probabilities of these AI outcomes (X-risk, dystopias, utopias, in-between outcomes, status quo outcomes)?

Mar 25, 10:00 AMJan 1, 4:59 AM

622912

OptionProbability

A. Death by paperclips, eternal torment of all humans by AI, or similar unalignment catastrophe.

F. AI wipes out most jobs as in D. People not forced into mind-uploads or experience machines. General perception that AI has made life more meaningful/fulfilling&improved the human experience on dimensions other than hedonium maximization.

I. AI soon hits fundamental scaling laws and we go into another AI winter.

B. Governments and/or other powerful entities use AI as a tool of repression, enabling global techno-totalitarianism along the model of China during Zero Covid or worse.

G. AI development continues but doesn't change things too much, somehow. Most jobs, even low-level white collar jobs, don't get impacted too hard, as new work is found to replace newly automated work. Labor force participation remains high.

C. AI doesn't actively want to hurt us, but (possibly aided by transhumanists) they become obsessed with utility maximization and force us all into mind-uploads and/or experience machines to free up resources for more computronium.

H. Humanity coordinates to prevent the development of significantly more powerful AIs.

D. AI wipes out most white-collar jobs within a decade and most blue-collar jobs within a generation; powerful humans and/or AIs at least seriously consider disposing of the "useless eaters" en masse, us being powerless to resist.

E. AI wipes out most jobs as in D. No disposing of the human masses, but general perception that AI has made life less meaningful/fulfilling & significantly worsened the human experience on dimensions other than hedonium maximization.

What words will the title of the sequel for The Super Mario Galaxy Movie include?

Feb 26, 5:00 AMDec 31, 11:59 PM

111243

OptionProbability

Land

Odyssey

Smash

Universe

World

Adventure

Dimension

Realm

Kingdom

Voyage

Stellar

Cosmic

Galactic

Infinity

Rescue

Revenge

Multiverse

Race

Brawl

If EY loses his bet about UFOs not having a worldview-shattering origin, what are the reasons why? [INDEPENDENT MC]

Sep 2, 12:05 PMJan 2, 2:33 AM

91147

OptionProbability

Standards are revised: an explanation not worldview-shattering in 2023 now qualifies as such.

Extraterrestrial life.

Anything genuninely supernatural (i.e. something a standard atheist physicalist would scoff at before this revelation)

New fundamental science discovery.

Extremely advanced but non-exotic non-AGI human technology, very far beyond current public knowledge (not just a few generations ahead).

Interference/interaction from other universes or dimensions.

Any explanation that appears to validate a religion with over 1 million followers.

Yudkowsky privately knew a worldview-shattering explanation was true in advance (and still made this bet for some reason).

Artificial intelligence.

Any religious/magical/paranormal explanation.

Any explanation that appears to validate the simulation hypothesis.

Ancient terrestrial civilization.

Time travel.

Entities from a traditional Abrahamic religion.

Accountability: European Commission - Will the EC deliver on these goals and promises?

Jul 19, 1:00 PMDec 31, 10:59 PM

41076

OptionProbability

Propose a 90% emission-reduction target for 2040 in a European Climate Law

Set up a President's Youth Advisory Board with young people from all Member States

Expand the Rule of Law Report to include a Single Market dimension and accession countries

Propose a European AI Research Council (CERN for AI)

Create a pan-European investment platform for affordable and sustainable housing with the European Investment Bank

Propose a new European Biotech Act

By the end of the mandate, energy prices will on average be lower (inflation-adjusted)

More than double Europol's staff over time

The next long-term budget will include a regime of conditionality for the Rule of Law

Triple the number of European Border and Coast Guards to 30,000

Respond to all of Parliament's Article 225 resolutions with legislative proposals

Implement a right to disconnect

Hold European Citizens' Panels every year

100

Will the next big Minecraft update be the ancient city portal dimension update?

Apr 1, 6:26 AMJan 1, 7:26 AM

17%chance

17747

OptionVotes

YES

221

Would any of these be the accepted explanation for one of the pentagon UFO videos / david grusch's claims by 2028?

Oct 29, 7:33 AMJan 1, 6:14 PM

10550

OptionProbability

Natural phenomena

Faked evidence / false testimony

Failure of detection equipment

Advanced and secret human technology

Previously unknown other species inhabitants of earth

Non human intelligence from outer space / other dimensions

When will an update be released that introduces a new dimension to Minecraft: Java Edition?

Apr 2, 1:43 AMDec 31, 10:59 PM

11221

OptionProbability

2031-2032

2027-2028

2028-2029

2025-2026

2029-2030

2024-2025

2026-2027

2050-2051

2030-2031

2032-2033

2033-2034

2034-2035

2035-2036

2036-2037

2037-2038

2038-2039

2039-2040

2040-2041

2041-2042

2042-2043

2043-2044

2044-2045

2045-2046

2046-2047

2047-2048

2048-2049

2049-2050

Will the next Minecraft April Fools' Version feature a change to the hotbar or one or more dimensions?

Apr 9, 3:14 PMApr 1, 12:00 AM

50.43%chance

6156

OptionVotes

YES

103

101

Will humans be able to subjectively experience 4 spacial dimensions before 2050?

Mar 29, 11:20 PMJan 1, 5:00 AM

38.9%chance

646

OptionVotes

YES

155

111

Are there supplemental spacetime dimensions?

Dec 31, 3:33 PMJan 1, 3:32 PM

50%chance

122

OptionVotes

YES

100

Dimension News

Prediction markets for Dimension

Prediction markets for Dimension

Prediction markets for Dimension

If Artificial General Intelligence has an okay outcome, what will be the reason?

🆕What will be true about the next product from Prong.studio?

What are the probabilities of these AI outcomes (X-risk, dystopias, utopias, in-between outcomes, status quo outcomes)?

What words will the title of the sequel for The Super Mario Galaxy Movie include?

If EY loses his bet about UFOs not having a worldview-shattering origin, what are the reasons why? [INDEPENDENT MC]

Accountability: European Commission - Will the EC deliver on these goals and promises?

Will the next big Minecraft update be the ancient city portal dimension update?

Would any of these be the accepted explanation for one of the pentagon UFO videos / david grusch's claims by 2028?

When will an update be released that introduces a new dimension to Minecraft: Java Edition?

Will the next Minecraft April Fools' Version feature a change to the hotbar or one or more dimensions?

Will humans be able to subjectively experience 4 spacial dimensions before 2050?

Are there supplemental spacetime dimensions?

Latest stories