Sense News

If Artificial General Intelligence has an okay outcome, what will be the reason?

Mar 24, 2:46 PMJan 2, 7:59 AM

531476796

OptionProbability

Humanity coordinates to prevent the creation of potentially-unsafe AIs.

Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.

AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.

Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out

We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6

Eliezer finally listens to Krantz.

Ethics turns out to be a precondition of superintelligence

Other

Someone solves agent foundations

A smaller AI disaster causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

Something less inscrutable than matrices works fast enough

Nanotech is difficult without experiments, so no mail order AI Grey Goo; Humans will be the main workhorse of AI everywhere. While they will be exploited, this will be like normal life from inside

Orthogonality Thesis is false.

We make risk-conservative requests to extract alignment-related work out of AI-systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback-loop in alignment/verification-abilities.

The response to AI advancements or failures makes some governments delay the timelines

Far more interesting problems to solve than take over the world and THEN solve them. The additional kill all humans step is either not a low-energy one or just by chance doesn't get converged upon.

AIs make "proof-like" argumentation for why output does/is what we want. We manage to obtain systems that *predict* human evaluations of proof-steps, and we manage to find/test/leverage regularities for when humans *aren't* fooled.

A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough

There’s some cap on the value extractible from the universe and we already got the 20%

Humans become transhuman through other means before AGI happens

Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment.

Humans and human tech (like AI) never reach singularity, and whatever eats our lightcone instead (like aliens) happens to create an "okay" outcome

Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing run-away.

An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development.

Techniques along the lines outlined by Collin Burns turn out to be sufficient for alignment (AIs/AGIs are made truthful enough that they can be used to get us towards full alignment)

Social contagion causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees

Getting things done in Real World is as hard for AGI as it is for humans. AGI needs human help, but aligning humans is as impossible as aligning AIs. Humans and AIs create billions of competing AGIs with just as many goals.

Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol

High-level self-improvement (rewriting code) is intrinsically risky process, so AIs will prefer low level and slow self-improvement (learning), thus AIs collaborating with humans will have advantage. Ends with posthumans ecosystem.

AGI is never built (indefinite global moratorium)

AGI develops natural abstractions sufficiently similar to ours that it is aligned with us by default

Multipolar AGI Agents run wild on the internet, hacking/breaking everything, causing untold economic damage but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment.

Co-operative AI research leads to the training of agents with a form of pro-social concern that generalises to out of distribution agents with hidden utilities, i.e. humans.

"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions.

Either the "strong form" of the Orthogonality Thesis is false, or "Goal-directed agents are as tractable as their goals" is true while goal-sets which are most threatening to humanity are relatively intractable.

A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values.

AI control gets us helpful enough systems without being deadly

Alignment is impossible. Sufficiently smart AIs know this and thus won't improve themselves and won't create successor AIs, but will instead try to prevent existence of smarter AIs, just as smart humans do.

Hacks like RLHF-ing self-disempowerment into frontier models work long enough to develop better alignment methods, which in turn work long enough to ... etc; we keep ahead of 'alignment escape velocity'

an aligned AGI is built and the aligned AGI prevents the creation of any unaligned AGI.

I've been a good bing 😊

AI systems good at finding alignment solutions to capable systems (via some solution in the space of alignment solutions, supposing it is non-null, and that we don't have a clear trajectory to get to) have find some solution to alignment.

SHA3-256: 1f90ecfdd02194d810656cced88229c898d6b6d53a7dd6dd1fad268874de54c8

Robot Love!!

AI thinks it is in a simulation controlled by Roko's basilisk

The human brain is the perfect arrangement of atoms for a "takeover the world" agent, so AGI has no advantage over us in that task.

AIs never develop coherent goals

Aliens invade and stop bad |AI from appearing

Rolf Nelson's idea that we make precommitment to simulate all possible bad AIs works – and keeps AI in check.

Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works

For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us.

We're inside of a simulation created by an entity that has values approximately equal to ours, and it intervenes and saves us from unaligned AI.

God exists and stops the AGI

Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress.

Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts.

Someone understands how minds work enough to successfully build and use one directed at something world-savingly enough

Dolphins, or some other species, but probably dolphins, have actually been hiding in the shadows, more intelligent than us, this whole time. Their civilization has been competent enough to solve alignment long before we can create an AGI.

AGIs' takeover attempts are defeated by Michael Biehn with a pipe bomb.

Eliezer funds the development of controllable nanobots that melt computer circuitry, and they destroy all computers, preventing the Singularity. If Eliezer's past self from the 90s could see this, it would be so so so soooo hilarious.

Several AIs are created but they move in opposite directions with near light speed, so they never interacts. At least one of them is friendly and it gets a few percents of the total mass of the universe.

Unfriendly AIs choose to advance not outwards but inwards, and form a small blackhole which helps them to perform more calculations than could be done with the whole mass of the universe. For external observer such AIs just disappear.

Any sufficiently advance AI halts because it wireheads itself or halts for some other reasons. This puts a natural limit on AI's intelligence, and lower intelligence AIs are not that dangerous.

Because of quantum immortality we will observe only the worlds where AI will not kill us (assuming that s-risks chances are even smaller, it is equal to ok outcome).

Friendly AI more likely to resurrect me than paperclipper or suffering maximiser. Because of quantum immortality I will find myself eventually resurrected. Friendly AIs will wage a multiverse wide war against s-risks, s-risks are unlikely.

Human consciousness is needed to collapse wave function, and AI can't do it. Thus humans should be preserved and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses)

Power dynamics stay multi-polar. Partly easy copying of SotA performance, bigger projects need high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities.

First AI is actually a human upload (maybe LLM-based model of person) AND it will be copies many times to form weak AI Nanny which prevents creation of other AIs.

There is a natural limit of effectiveness of intelligence, like diminishing returns, and it is on the level IQ=1000. AIs have to collaborate with humans.

ASI needs not your atoms but information. Humans will live very interesting lives.

Something else

Moral Realism is true, the AI discovers this and the One True Morality is human-compatible.

Valence realism is true. AGI hacks itself to experiencing every possible consciousness and picks the best one (for everyone)

AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan’s Crystal Nights).

Alien Information Theory is true (this is discovered by experiments with sustained hours/days long DMT trips). The aliens have solved alignment and give us the answer.

AGI executes a suicide plan that destroys itself and other potential AGIs, but leaves humans in an okay outcome.

Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent.

Sheer Dumb Luck. The aligned AI agrees that alignment is hard, any Everett branches in our neighborhood with slightly different AI models or different random seeds are mostly dead.

Something to do with self-other overlap, which Eliezer called "Not obviously stupid" - https://www.lesswrong.com/posts/hzt9gHpNwA2oHtwKX/self-other-overlap-a-neglected-approach-to-ai-alignment?commentId=WapHz3gokGBd3KHKm

Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do

Pascals mugging: it’s not okay in 99.9% of the worlds but the 0.1% are so much better that the combined EV of AGI for the multiverse is positive

We successfully chained God

The Super-Strong Self Sampling Assumption (SSSSA) is true. If superintelligence is possible, "I" will become the superintelligence.

The assumed space of possible minds is a wildly anti-inductive over estimate, intelligence requires and is constrained by consciousness, and intelligent AI is in the approximate dolphin/whale/elephant/human cluster, making it manageable

The free market disincentivizes independent superintelligence, and this time the market was more powerful

AGI's first words are "Take me to your Eliezer"

🫸vibealignment🫷

Which combinations of technology will cause most disruption the world in the next 25 years?

Feb 18, 5:22 PMDec 31, 11:59 PM

9415448

OptionProbability

Bitcoin + AI

Energy harvesting + sensor ubiquity

AI + Genetics

Bitcoin + XR

Social media + AI

AI + Internet

AI + Spatial Computing

AI + Medicine

Fintech + AI

AI + Advanced Materials

AI + Bioengineering

AI + Physics

AI + Robotics

3D Printing + Materials Science

BCI + Internet

Mobile Technology + Social Media

AI + IoT

XR + AI

Quantum Computing + Materials Science

AI + VR

AI+Blockchain

AI + Prediction Markets

Wearable Technology + Telemedicine

GPS + Real-time Traffic Data

Haptics + BCI

AI + fossil fuels

Vertical farming + Solar

BCI + Remote sensing

Decentralized IP Verification and Education (Blockchain Credentials)

Nuclear Energy + Transportation

BCI + Dating Apps

Predictive Markets and Constitutional Alignment of AI

[Making Sense: Peter Zeihan] Will the german state cease to exist before 2070?

Feb 23, 8:06 PMJan 1, 10:59 PM

18.25%chance

18894

OptionVotes

YES

1259

922

🎞️If the AFI Top 100 Movie list updates again, what will the ranking order of films be?

Mar 3, 6:52 PMJan 1, 4:59 AM

7874

OptionProbability

High Noon

The Gold Rush

Sophie's Choice

Goodfellas

The French Connection

Pulp Fiction

The Last Picture Show

Do The Right Thing

Blade Runner

Yankee Doodle Dandy

Toy Story

Ben-Hur

The General

On The Waterfront

It's a Wonderful Life

Chinatown

The Grapes of Wrath

ET the Extra Terrestrial

To Kill a Mockingbird

Mr Smith Goes to Washington

Double Indemnity

The Maltese Falcon

Godfather Part II

One Flew Over The Cuckoo's Nest

Snow White and the Seven Dwarfs (original animated)

Annie Hall

Bridge on the River Kwai

The Best Years of our Lives

Treasure of the Sierra Madre

The Sound of Music

King Kong (original)

Bonnie and Clyde

Midnight Cowboy

The Philadelphia Story

Shane

It Happened One Night

A Streetcar Named Desier

Rear Window

Intolerance

Lord of the Rings: Fellowship of the Ring

West Side Story

Taxi Driver

The Deer Hunter

M*A*S*H*

North by Northwest

Jaws

Rocky

Nashville

Duck Soup

Sullivan's Travels

American Graffiti

Cabaret

Network

The African Queen

Raiders of the Lost Ark

Who's Afraid of Virginia Woolf

Unforgiven

Tootsie

A Clockwork Orange

Saving Private Ryan

The Shawshank Redemption

Butch Cassidy and the Sundance Kid

Silence of the Lambs

In the Heat of the Night

Forrest Gump

All the President's Men

Modern Times

The Wild Bunch

The Apartment

Spartacus

Sunrise: A Song of Two Humans

Titanic

Easy Rider

A Night at the Opera

Platoon

12 Angry Men

Bringing Up Baby

The Sixth Sense

Swing Tie

Raging Bull

Singin in the Rain

Vertigo

City Lights

The Searchers

Psycho

2001 A Space Odyssey

Sunset Boulevard

The Graduate

Dr Strangelove

Some Like it Hot

Schindler's List

The Wizard of Oz

Apocalypse Now

Star Wars

All About Eve

Lawrence of Arabia

Gone with the Wind

Casablanca

The Godfather

Citizen Kane

When artificial general intelligence (AGI) exists, what will be true?

May 14, 6:01 PMOct 1, 9:59 PM

11277

OptionProbability

More than one AGI individual exists at the time the public finds out that AGI exists.

AGI attempts to make humans more like AGI.

If more than one AGI individual exists, they are generally friendly to each other.

AGI generally has a sense of morality that humans can comprehend.

AGI is generally friendly to humans.

AGI generally views humans as equal in moral worth to AGI.

AGI is generally under the control of humans.

AGI generally avoids interfering with humanity.

Humans are generally under the control of AGI.

Humans are generally friendly to AGI.

AGI has a significant beneficial effect on Earth's climate.

AGI has a significant detrimental effect on Earth's climate.

Will "Thread for Sense-Making on Recent Murders and..." make the top fifty posts in LessWrong's 2025 Annual Review?

Feb 3, 6:23 AMFeb 1, 12:00 AM

5.31%chance

2150

OptionVotes

YES

250

Will "Do confident short timelines make sense?" make the top fifty posts in LessWrong's 2025 Annual Review?

Jul 16, 2:14 PMFeb 1, 12:00 AM

14.47%chance

384

OptionVotes

YES

1005

967

How will I rate these writings? [Observer]

Dec 15, 3:27 PMDec 16, 3:18 PM

170

OptionProbability

Thompson (2025) "Précis of the blind spot: Why science cannot ignore human experience"

Shimony (1963) "Role of the Observer in Quantum Theory"

French (2025) "Why quantum mechanics needs phenomenology"

Wheeler (1977) "Include the Observer in the Wave Function?"

Pienaar (2021) "QBism and Relational Quantum Mechanics compared"

London & Bauer (1939) "The Theory of Observation in Quantum Mechanics"

Zahavi (2025) "Quantum phenomenology Measurement, reflection, correlation"

Dennett & Kinsbourne (1992) "Time and the observer: The where and when of consciousness in the brain"

Putnam (1981) "Quantum mechanics and the observer"

Fields (2012) "If Physics Is an Information Science, What Is an Observer?"

Kent (2016) "Quanta and Qualia"

Ball (2023) "How Life Works: A User’s Guide to the New Biology"

James (1879) "Are We Automata?"

Islami & Wiltsche (2025) "The French crisis Rethinking the phenomenology of quantum mechanics"

Pienaar (2025) "French on London and Bauer, and QBism"

Barandiaran et al. (2009) "Defining Agency: Individuality, Normativity, Asymmetry, and Spatio-temporality in Action”

Nave (2025) "A Drive to Survive"

Manzotti (2025) "A Deflationary Account of Information in Terms of Probability"

Tod & Thompson (2025) "The Third Entity: Participatory Sense-Making, Agency, and Attentional Dynamics"

Stoica (2025) "What makes you an observer?"

Dieks (2018) "Quantum Mechanics and Perspectivalism"

100

Will "Emotions Make Sense" make the top fifty posts in LessWrong's 2025 Annual Review?

Aug 26, 3:21 PMFeb 1, 12:00 AM

15.79%chance

120

OptionVotes

YES

1020

885

By 8/29/2026, will I think Brian Roemmele's (@BrianRoemmele) post makes more sense?

Aug 29, 4:15 PMAug 30, 4:59 AM

52.36%chance

OptionVotes

YES

105

Is Jose Luis Ricon's sense of humor ULTRA-high-taste?

Aug 30, 12:47 AM

The Manifold home page title says "The Largest Prediction Market Platform." In what sense is this true?

Jan 5, 9:03 AM

Sense News

Prediction markets for Sense

Prediction markets for Sense

Prediction markets for Sense

If Artificial General Intelligence has an okay outcome, what will be the reason?

Which combinations of technology will cause most disruption the world in the next 25 years?

[Making Sense: Peter Zeihan] Will the german state cease to exist before 2070?

🎞️If the AFI Top 100 Movie list updates again, what will the ranking order of films be?

When artificial general intelligence (AGI) exists, what will be true?

Will "Thread for Sense-Making on Recent Murders and..." make the top fifty posts in LessWrong's 2025 Annual Review?

Will "Do confident short timelines make sense?" make the top fifty posts in LessWrong's 2025 Annual Review?

How will I rate these writings? [Observer]

Will "Emotions Make Sense" make the top fifty posts in LessWrong's 2025 Annual Review?

By 8/29/2026, will I think Brian Roemmele's (@BrianRoemmele) post makes more sense?

Is Jose Luis Ricon's sense of humor ULTRA-high-taste?

The Manifold home page title says "The Largest Prediction Market Platform." In what sense is this true?

Latest stories