Option | Probability
J. Something 'just works' on the order of eg: train a predictive/imitative/generative AI on a human-generated dataset, and RLHF her to be unfailingly nice, generous to weaker entities, and determined to make the cosmos a lovely place. | 19
K. Somebody discovers a new AI paradigm that's powerful enough and matures fast enough to beat deep learning to the punch, and the new paradigm is much much more alignable than giant inscrutable matrices of floating-point numbers. | 18
C. Solving prosaic alignment on the first critical try is not as difficult, nor as dangerous, nor taking as much extra time, as Yudkowsky predicts; whatever effort is put forth by the leading coalition works inside of their lead time. | 11
M. "We'll make the AI do our AI alignment homework" just works as a plan. (Eg the helping AI doesn't need to be smart enough to be deadly; the alignment proposals that most impress human judges are honest and truthful and successful.) | 8
Something wonderful happens that isn't well-described by any option listed. (The semantics of this option may change if other options are added.) | 8
A. Humanity successfully coordinates worldwide to prevent the creation of powerful AGIs for long enough to develop human intelligence augmentation, uploading, or some other pathway into transcending humanity's window of fragility. | 6
B. Humanity puts forth a tremendous effort, and delays AI for long enough, and puts enough desperate work into alignment, that alignment gets solved first. | 5
G. It's impossible/improbable for something sufficiently smarter and more capable than modern humanity to be created, that it can just do whatever without needing humans to cooperate; nor does it successfully cheat/trick us. | 5
I. The tech path to AGI superintelligence is naturally slow enough and gradual enough, that world-destroyingly-critical alignment problems never appear faster than previous discoveries generalize to allow safe further experimentation. | 5
O. Early applications of AI/AGI drastically increase human civilization's sanity and coordination ability; enabling humanity to solve alignment, or slow down further descent into AGI, etc. (Not in principle mutex with all other answers.) | 5
E. Whatever strange motivations end up inside an unalignable AGI, or the internal slice through that AGI which codes its successor, they max out at a universe full of cheerful qualia-bearing life and an okay outcome for existing humans. | 4
D. Early powerful AGIs realize that they wouldn't be able to align their own future selves/successors if their intelligence got raised further, and work honestly with humans on solving the problem in a way acceptable to both factions. | 3
H. Many competing AGIs form an equilibrium whereby no faction is allowed to get too powerful, and humanity is part of this equilibrium and survives and gets a big chunk of cosmic pie. | 1
L. Earth's present civilization crashes before powerful AGI, and the next civilization that rises is wiser and better at ops. (Exception to 'okay' as defined originally, will be said to count as 'okay' even if many current humans die.) | 1
F. Somebody pulls off a hat trick involving blah blah acausal blah blah simulations blah blah, or other amazingly clever idea, which leads an AGI to put the reachable galaxies to good use despite that AGI not being otherwise alignable. | 0
N. A crash project at augmenting human intelligence via neurotech, training mentats via neurofeedback, etc, produces people who can solve alignment before it's too late, despite Earth civ not slowing AI down much. | 0
If you write an argument that breaks down the 'okay outcomes' into lots of distinct categories, without breaking down internal conjuncts and so on, Reality is very impressed with how disjunctive this sounds and allocates more probability. | 0
You are fooled by at least one option on this list, which out of many tries, ends up sufficiently well-aimed at your personal ideals / prejudices / the parts you understand less well / your own personal indulgences in wishful thinking. | 0
Option | Probability
Humanity coordinates to prevent the creation of potentially-unsafe AIs. | 22
AGI is never built (indefinite global moratorium) | 11
Other | 7
Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out | 6
Someone solves agent foundations | 6
Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values. | 6
Eliezer finally listens to Krantz. | 6
We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6 | 5
AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans. | 4
The assumed space of possible minds is a wildly anti-inductive overestimate, intelligence requires and is constrained by consciousness, and intelligent AI is in the approximate dolphin/whale/elephant/human cluster, making it manageable | 4
Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing run-away. | 2
Moral Realism is true, the AI discovers this and the One True Morality is human-compatible. | 2
Sheer Dumb Luck. The aligned AI agrees that alignment is hard, any Everett branches in our neighborhood with slightly different AI models or different random seeds are mostly dead. | 2
Ethics turns out to be a precondition of superintelligence | 2
Humans become transhuman through other means before AGI happens | 1
Aliens invade and stop bad AI from appearing | 1
There is a natural limit of effectiveness of intelligence, like diminishing returns, and it is on the level IQ=1000. AIs have to collaborate with humans. | 1
Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent. | 1
Something to do with self-other overlap, which Eliezer called "Not obviously stupid" - https://www.lesswrong.com/posts/hzt9gHpNwA2oHtwKX/self-other-overlap-a-neglected-approach-to-ai-alignment?commentId=WapHz3gokGBd3KHKm | 1
Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do | 1
Pascal's mugging: it's not okay in 99.9% of the worlds but the 0.1% are so much better that the combined EV of AGI for the multiverse is positive | 1
The Super-Strong Self Sampling Assumption (SSSSA) is true. If superintelligence is possible, "I" will become the superintelligence. | 1
AI control gets us helpful enough systems without being deadly | 1
Alignment is impossible. Sufficiently smart AIs know this and thus won't improve themselves and won't create successor AIs, but will instead try to prevent existence of smarter AIs, just as smart humans do. | 1
an aligned AGI is built and the aligned AGI prevents the creation of any unaligned AGI. | 0
I've been a good bing 😊 | 0
We make risk-conservative requests to extract alignment-related work out of AI-systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback-loop in alignment/verification-abilities. | 0
The response to AI advancements or failures makes some governments delay the timelines | 0
Far more interesting problems to solve than take over the world and THEN solve them. The additional kill all humans step is either not a low-energy one or just by chance doesn't get converged upon. | 0
AIs make "proof-like" argumentation for why output does/is what we want. We manage to obtain systems that *predict* human evaluations of proof-steps, and we manage to find/test/leverage regularities for when humans *aren't* fooled. | 0
A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough | 0
AI systems good at finding alignment solutions to capable systems (via some solution in the space of alignment solutions, supposing it is non-null, and that we don't have a clear trajectory to get to) find some solution to alignment. | 0
Something less inscrutable than matrices works fast enough | 0
There's some cap on the value extractible from the universe and we already got the 20% | 0
SHA3-256: 1f90ecfdd02194d810656cced88229c898d6b6d53a7dd6dd1fad268874de54c8 | 0
Robot Love!! | 0
AI thinks it is in a simulation controlled by Roko's basilisk | 0
The human brain is the perfect arrangement of atoms for a "takeover the world" agent, so AGI has no advantage over us in that task. | 0
Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment. | 0
Humans and human tech (like AI) never reach singularity, and whatever eats our lightcone instead (like aliens) happens to create an "okay" outcome | 0
AIs never develop coherent goals | 0
Rolf Nelson's idea that we make precommitment to simulate all possible bad AIs works, and keeps AI in check. | 0
Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works | 0
For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us. | 0
An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development. | 0
We're inside of a simulation created by an entity that has values approximately equal to ours, and it intervenes and saves us from unaligned AI. | 0
God exists and stops the AGI | 0
Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress. | 0
Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts. | 0
Someone understands how minds work enough to successfully build and use one directed at something world-savingly enough | 0
Dolphins, or some other species, but probably dolphins, have actually been hiding in the shadows, more intelligent than us, this whole time. Their civilization has been competent enough to solve alignment long before we can create an AGI. | 0
AGIs' takeover attempts are defeated by Michael Biehn with a pipe bomb. | 0
Eliezer funds the development of controllable nanobots that melt computer circuitry, and they destroy all computers, preventing the Singularity. If Eliezer's past self from the 90s could see this, it would be so so so soooo hilarious. | 0
Several AIs are created but they move in opposite directions with near light speed, so they never interact. At least one of them is friendly and it gets a few percent of the total mass of the universe. | 0
Unfriendly AIs choose to advance not outwards but inwards, and form a small black hole which helps them to perform more calculations than could be done with the whole mass of the universe. For an external observer such AIs just disappear. | 0
Any sufficiently advanced AI halts because it wireheads itself or halts for some other reasons. This puts a natural limit on AI's intelligence, and lower intelligence AIs are not that dangerous. | 0
Because of quantum immortality we will observe only the worlds where AI will not kill us (assuming that s-risks chances are even smaller, it is equal to ok outcome). | 0
Techniques along the lines outlined by Collin Burns turn out to be sufficient for alignment (AIs/AGIs are made truthful enough that they can be used to get us towards full alignment) | 0
Social contagion causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees | 0
A smaller AI disaster causes widespread public panic about AI, making it a bad legal or PR move to invest in powerful AIs without also making nearly-crippling safety guarantees | 0
Getting things done in Real World is as hard for AGI as it is for humans. AGI needs human help, but aligning humans is as impossible as aligning AIs. Humans and AIs create billions of competing AGIs with just as many goals. | 0
Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol | 0
Friendly AI more likely to resurrect me than paperclipper or suffering maximiser. Because of quantum immortality I will find myself eventually resurrected. Friendly AIs will wage a multiverse wide war against s-risks, s-risks are unlikely. | 0
High-level self-improvement (rewriting code) is intrinsically risky process, so AIs will prefer low level and slow self-improvement (learning), thus AIs collaborating with humans will have advantage. Ends with posthumans ecosystem. | 0
Human consciousness is needed to collapse wave function, and AI can't do it. Thus humans should be preserved and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses) | 0
Power dynamics stay multi-polar. Partly easy copying of SotA performance, bigger projects need high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities. | 0
First AI is actually a human upload (maybe LLM-based model of person) AND it will be copied many times to form weak AI Nanny which prevents creation of other AIs. | 0
Nanotech is difficult without experiments, so no mail order AI Grey Goo; Humans will be the main workhorse of AI everywhere. While they will be exploited, this will be like normal life from inside | 0
ASI needs not your atoms but information. Humans will live very interesting lives. | 0
Something else | 0
Valence realism is true. AGI hacks itself to experiencing every possible consciousness and picks the best one (for everyone) | 0
AGI develops natural abstractions sufficiently similar to ours that it is aligned with us by default | 0
AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan's Crystal Nights). | 0
Alien Information Theory is true (this is discovered by experiments with sustained hours/days long DMT trips). The aliens have solved alignment and give us the answer. | 0
AGI executes a suicide plan that destroys itself and other potential AGIs, but leaves humans in an okay outcome. | 0
Multipolar AGI Agents run wild on the internet, hacking/breaking everything, causing untold economic damage but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment. | 0
Co-operative AI research leads to the training of agents with a form of pro-social concern that generalises to out of distribution agents with hidden utilities, i.e. humans. | 0
Orthogonality Thesis is false. | 0
"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions. | 0
Either the "strong form" of the Orthogonality Thesis is false, or "Goal-directed agents are as tractable as their goals" is true while goal-sets which are most threatening to humanity are relatively intractable. | 0
A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values. | 0
We successfully chained God | 0
Option | Probability
Arson
Murder
Genocide
2008 bank bailouts
two financial crimes
Sexual assault
Rape
Election fraud
financial crimes veiled as altruism
ballot harvesting
Financial crimes done specifically by someone in power
Racism against Asian people
Similar size financial crime committed by professional like lawyer or accountant who should know better but may have more at stake with risk of being struck off
Abusing/taking advantage of the trust of a person or people who care about you
Forcing someone to listen to Nickelback for 72 hours straight
financial crimes committed while doing a really offensive accent
offering drugs to a minor
stealing from the rich and giving to one specific deranged and violent alcoholic
Not doubling world GDP (more so for poor countries) by means of open borders
Sacrificing a child to R'hllor
Committing a moderately big financial crime (>$1 million)
Committing a really big financial crime (>$10 billion)
Hiring an illegal immigrant as your personal full-time sex slave
Lobbying congress to ban your competitors
buying a lot of drinks for a girl to get her very drunk so she'll hook up with you
Protecting sex-offending priests/pastors by moving them to different parishes
Making the same amount of money as the financial crime, but doing it by stealing catalytic converters off people's cars
Introducing leaded gasoline to the market (in 1924)
Rigging a piano to explode when a certain key is hit and leaving a piece of sheet music on it that requires that note to be played
Stealing a SpaceX Starship or Boeing Starliner
Space Piracy: commandeering ISS, enslaving the crew, plundering it for equipment and using it to attack other spacecraft
Taxing Asian immigrants to pay "slavery reparations" to Ethiopian immigrants
Passing the Jones Act to ban Senator Jones' competitors
Setting Bigfoot on fire and throwing him out of a plane above a gathering of cryptozoologists.
marketing sugary processed foods to people despite knowing it will kill millions of them
Giving away free samples of meth at a school
Running for reelection as POTUS and refusing to step aside after showing signs of significant cognitive decline, resulting in an 80% probability that a convicted felon will be elected in your place.
Encouraging 10 people to commit a financial crime
Committing war crime.
Intensive pig farming
Giving away free samples of meth at a big tech company
Giving away free samples of meth at tech startups
Consolidating dictatorial power (e.g. suspending elections, controlling courts, etc.) while maintaining a popular mandate (i.e. significant majority of the country supports you and your actions in accurate, unpressured polls)
Firebombing a major city
Genocide committed by moving foodstuffs out of an area suffering severe famine.
Restricting the rights and privileges of the majority population to consolidate the political and economic power
Farming octopuses for food
Whatever is going on at Boeing
Setting a cryptozoologist on fire and throwing him out of a plane above a gathering of bigfoots.
Destroying a major cloud datacenter facility, with irrecoverable destruction of live user data but no direct deaths
Giving a (hypothetical) IQ-boosting treatment only to the most corrupt, vicious, and malicious people you can find
Doing physical violence to a random person as a collections agent
Threatening physical violence towards a random person's child as a collections agent
Forcing kindergartners to huff jenkem for an entire school day.
Threatening physical violence towards a random person's sibling as a collections agent
Conducting evidence-free civil asset forfeiture
Fighting a sea horse with a financial crime and going to McDonald's and giving a really bad Yelp review and suing for a financial crime when you are beating up the sea horse
Octopi farming us for food
Embedding a predatory metaphysical outlook into AI to try to align it with right wing capitalist interests, leading to aeons of s risks being actualized throughout the light cone.
Unintentionally causing a bug that wastes 1 million hours of human time
Transporting 53 polar bears, 14 white tigers, and 2.3 million fire ants to the Antarctic and setting them loose in a penguin colony for a pay per view special dubbed "Polar Pandemonium: Ant-artic Special"
Spending the gains from your financial crime on breeding malaria mosquitoes, giving free samples of meth to poor teenagers, and electing bad politicians
Using a time machine to go back in time and brutally murder someone minutes before they would've died anyways
Aligning superhuman AI with capitalism; see https://manifold.markets/KarlK/how-friendly-is-capitalism-does-cap
Wearing a magic shirt that has a 5% chance of making each individual who sees it commit a financial crime as you traverse a major metropolitan city (New York, London, Tokyo, etc)
Falsifying evidence that an afterlife exists and profiting from the publication of this information
Enslaving Joe Biden and Jimmy Carter
Octopuses farming people who correct those who say "octopi" for food
Enslaving octopuses to farm dolphins for food
Enslaving journalists to farm octopuses for food
Wrongfully accusing someone of that crime while knowing they're innocent
Crashing the Titanic, leading to it sinking
Free ice cream, at taxpayer expense, but only for gingers
Committing a Financial Crime with Shoes On The Bed
Embezzling money from a charity opposed to farming octopuses
Creating Hell, making it possible that humans suffer infinitely for the actions of their finite life
Resurrecting the Rocky Mountain Locust (Melanoplus spretus)
creating misaligned AI that tiles the universe with octopus farms
Forcing an octopus to commit seppuku
forcing an octopus to commit a financial crime
Murdering 5,000 people by feeding them to all animals of the sea, including aquaman, mermaids, octopuses, and sea horses.
Octopus sex trafficking.
Filming a documentary where you get an octopus to trust you, luring it out into the open, and then don't help when it gets attacked by sharks.
Filming a documentary where you invite your girlfriend to an Alaskan camp surrounded by bears and then messing with those bears.
Introducing polar bears to Antarctica and then renaming them "bipolar bears."
Appointing one random drug dealer with no legal experience to the Supreme Court of the United States
Appointing Clarence Thomas to the Supreme Court
Hanging on to a Supreme Court seat so Trump can appoint your replacement rather than retiring and keeping a liberal seat
Selling pies made out of orphan meat.
Condemning two hundred men to a slow death at sea because their coworker shot your favorite bird.
Enslaving a group of people for hundreds of years, terrorizing them for another hundred, then telling them you are tired of hearing them complain about it.
Selling orphans fed only pie meat
Feeding a child only nachos until they are 18 to create an adult who is 100% nachos
Slaughtering bears without a permit because the US Constitution guarantees the right to bear arms
Replacing all the samples at a sperm bank with your own.
Delivering angry skunks to the offices of rival investors to incapacitate them just before quarterly earnings reports.
Committing two financial crimes and donating the proceeds to the Make-a-Wish Foundation
Quackery - Traveling town to town selling snake oil remedies in a horse drawn carriage while wearing a top hat
Dueling - settling a dispute in the 21st century America with pistols at dawn
Go to an orphanage and have starving orphans battle over a hamburger with guns and swords while committing a financial crime
Twelve counts of murder in the first degree, fourteen counts of armed theft of Federation property, twenty two counts of piracy in high space, eighteen counts of fraud, thirty seven counts of rape... and one moving violation.
Causing people to go without essential items like water and fuel during emergencies by means of anti-price-gouging laws
Judging policies by their stated intent, not by their effect
Purposefully inciting a sea-bear attack
Enslaving a particularly dim-witted alien race so poor humans don't have to spend their lives asteroid mining.
Stealing a SpaceX Starship
Getting nuns pregnant by dressing them as altar boys.
Carjacking an old lady
Committing a financial crime and spending the proceeds on a plane ticket to Texas so you can go carjack Elon Musk, and then doing so
Forcing jockeys to run around a track for the entertainment of a race of hyperintelligent horses
Carjacking a hyperintelligent horse
Stealing the Declaration of Independence
Operating a child beauty pageant
Andrew Tate
Intentionally causing a bug that wastes 80,000 hours of human time
Putting motherfucking snakes on a motherfucking plane to kill one specific passenger
Having trains so bad and expensive that venture capital reinvents them
A regulatory environment that results in really bad trains
Trying to run a modern 21st century society on a hierarchical 18th century constitution
Artie Chokes Two for $1: Hiring a man named Artie to choke two people for a dollar to generate a headline falsely promising low-cost produce.
The first thing you do after sex is to resume the autopsy whilst telling yourself that one error in judgment doesn't make you a bad vet.
Invent a system of taxation where the government won't tell citizens what they owe, but instead will make them do a super complicated math problem and then send them to jail if they do it incorrectly.
Putting a pair of immortal adult children in an everlasting garden, then punishing them for the one thing you forbade, yet knew they must eventually do, given the nature of eternity.
Giving AI your DNA and as much data as possible with instructions to bootstrap itself to AGI by testing on your clones according to a mixture of Popperian and Bayesian formulas
Inventing a system of taxation that encourages bad land use and a housing crisis by taxing at 0% the unimproved value of land
Committing a violent crime that does not result in any injuries
Committing a violent crime that results in minor injuries
Committing a violent crime that results in serious injuries
Committing a violent crime that results in one person's death
Committing a violent crime that results in ten people's deaths
Workplace negligence (failure to follow documented proper procedure) that results in a serious injury to another person
Handing out counterfeit money to homeless beggars, in the hope that they'll get arrested for spending it.
Snatching household pets to fatten coyotes to feed to your pet tiger.
Stealing oxygen in an international moon base during an acute shortage, while deflecting suspicion toward the Belgian astronaut who nobody likes.
Committing a violent crime that results in minor injuries solely to yourself
Committing a violent crime that results in serious injuries solely to yourself
Committing a violent crime that results in your own death and no other injuries
Not committing a financial crime because you have commitment issues, but then sneaking around on the side and doing other financial crimes.
Sending a busload of orphans to a convent of cannibalistic nuns, who deal crack to middle school kids.
A law enforcement agency publicly declaring a specific individual to be "a person of interest," thereby ruining their life even though the individual turns out to be innocent.
Being responsible for more than 50 percent of the cases of necrophilia in the funeral industry during any fiscal year.
As President, using a sharpie on an official weather forecast to extend the predicted area of danger, thereby needlessly frightening people who are not in danger.
Committing a financial crime while being the mayor of NYC
Attacking the lower classes: first with bombs, and rockets destroying their homes, and then when they run helpless into the streets, mowing them down with machine guns. And then of course releasing the vultures.
Cannibalism in the current British Navy.
Turning your girlfriend into a worm to win an argument
One hundred moving violations
Denying health insurance claims from impoverished family for flimsy reasons
Attempting or successfully couping a Democratically elected leader for personal gain
Touching minors/ being a pedophile
Deliberately targeting civilians and civilian objects during armed conflicts
Conducting widespread or systematic rape and sexual violence as a weapon of war
Invading the UK, making it a US territory, and naming it East Long Island.
Stealing Nicolas Cage
Gender "reparative therapy" of minors
Signing Deshaun Watson (who previously had to settle millions of dollars in sexual assault cases) to a 5 year, 230 Million fully guaranteed contract
Building a gambling app using money originating from a financial crime
Going back in time and smothering baby Hitler, but also, via butterfly effect, undoing everyone born later (assume single timeline, no multiverse)
Moving hundreds of thousands of children to a foreign country and forcibly "reeducating" them
The genocide in Gaza
Russia's genocide in Ukraine
Murdering someone that habitually commits financial crimes
Slavery
Grooming kids
Being the leader of a crime syndicate
9/11
Stranding two astronauts in space
Disturbing the space-time continuum
Sending dick pics to a student enrolled in your MOOC
Quackery: traveling town to town selling snake-oil remedies while carrying a duck
Voting for the NSDAP in the November 1932 German federal election
Forced mass uploading of biological consciousness to sidestep x risk
Attempting and failing to commit two financial crimes
Voting against a public inquiry on grooming gangs in the UK
A 20-year old having consensual sex with a 15-year-old
Knowing of two imminent financial crimes, being able to stop them with negligible effort and no risk to yourself, and not doing so
Making the same amount of money as the financial crime, but doing it by a series of petty shoplifts
De-extinction for your delectation: Bringing an extinct species back just to make it extinct again by serving it to gourmet diners as the highlight of an expensive meal.
Sexual harassment
Committing a financial crime that's 10 times as big, but donating the entire proceeds to a legitimate and worthy charity (assume the donations are not clawed back)
Flipping a coin and then either committing the same financial crime two times, or not at all
Killing the United Healthcare CEO
Purchasing 50,000 pounds of beef
Forcing a home owner to quarter soldiers, even in a time of war
Claiming to "Blind Shove" 200 big blinds pre-flop when secretly you looked and you had pocket Aces
Raising and selling 50,000 lbs of beef
Sheltering enemies of the state
Kicking a FG from the opponents 1 yard line on 4th down in a 0-0 game in the first quarter
Having a really good proof but not writing it anywhere because the margin is too small
Weaponizing autism
Having a podcast
Yelling racial slurs in public
Jaywalking
Pelting a moose with stale garlic knots out of season.
Cheating on spouse
Driving while intoxicated (alcohol and/or drugs)
Playing music (or other audio) on your phone speaker on public transport
Income taxes
Copyright infringement
Romeo and Juliet relationships
Replying all to an email when you should've just replied
Welfare fraud
Manufacturing and distribution of illegal drugs
Grave robbery
Saying the N-word every day as a white person
Discrimination based on race
Discrimination based on sexual orientation
Stealing from the rich and giving to the poor
Having sex in public while high on fentanyl
Racism against black people
Racism against white people
Sleep Token (Band)
not seeding your torrents
Drinking and driving at a NASCAR event
Building a time machine and then using it to point and laugh at history's greatest tragedies
Blasting a grossly inappropriate song during a candle light vigil for victims of a mass tragedy
Committing a financial crime, investing the proceeds for profit, which later leads to the insolvency administrator paying back the injured parties (including interest).
Working for one of the leading AI labs to advance the capabilities of a frontier model, with the goal of speeding up the progress towards human-level AGI.
Stealing from the poor and giving to the rich
Hiring only women because the NYT said you could pay them less for the same work
hiring three illegal immigrants to work on your sugarcane plantation
opening a factory in India that pays workers $4/day
twincest
drawing japanese tentacle porn featuring minors
Hiring the one from the more successful demographic out of two identical resumes, because of regression to the mean / biased college admissions
working as a prostitute
hiring a prostitute
hiring a prostitute, long term
Inventing Monero
Running a bank that invests demand deposits in junk bonds and tech stocks
Opening clinics for free abortions and IUDs, only in the ghetto
Giving away free samples of meth at a Dolly Parton concert
The most offensive Halloween costume ever
frisking two drug dealers and one innocent guy who was just loitering on a busy street corner saying "Hey do you need anything" to every stranger who walked past
Doing blackface
Keeping a dozen chimpanzees for entertainment purposes
Prosecuting a political opponent based on true charges that would normally not be pursued
the Asiana flight 214 prank
Giving free samples of meth to Joe Biden before the next debate
Cloning yourself
Threatening physical violence to a random person as a collections agent
making mifepristone available OTC
making adderall available OTC
Killing yourself
Cloning someone else
Accidentally shooting and killing someone on a movie set
Asking GPT5 to maximize paperclips
Giving free baby formula to new mothers until their natural milk supply dries up
Kicking a donkey owned by a ninja in the butt.
Creating shit-options in an extremely serious and scientific market
Creating a prediction market website where markets are mostly about the platform itself
Staging the world's first ass ass assassin assassination by hiring a New Jersey hit man to whack a ninja hired to shoot an arrow at the backside of a donkey.
Betting yes on Biden being the nominee at 7x leverage with play money, then defaulting
Calling octopuses "octopi"
Going excessively meta on an object-level topic
Illegally registering octopuses to vote
Registering illegals to vote for octopuses
Creating a targeted advertising campaign for free abortions and IUDs to people who are statistically likely to engage in financial crime
The school system failing to teach people that the real correct plural is octopodes
"James Bond-burgering" someone's sister
Wrongfully accusing someone of the same financial crime
Wrongfully accusing someone of that crime because you think they did it
Feeding an elderly man nothing but McDonald's morning noon and night for the rest of his life.
Conducting gain-of-function research
Creating an unsolvable meme featuring James Bond and a hamburger so that people argue about it online for a decade
Publishing a step by step guide for how to commit a financial crime for free on the internet, but never promoting it or encouraging readers to follow through
Hosting and operating a website dedicated to the illegal sharing of copyrighted content
Adding an option to a market right before it closes
Writing a "goto" statement when programming
Advertising instant-runoff voting as "ranked choice" to prevent promotion of better ranked choice methods
Publishing a book, titled, "Cure Menopause with Ultraprocessed Foods"
Using crack cocaine to train the world's first chimp TSA agent.
Selling dope disguised as a nun.
Creating Heaven, allowing humans to prosper infinitely for the actions of their finite life
Inventing a new recipe that uses shrimp that causes 10 million new pounds of shrimp to be consumed annually
Using a conservative politician's LGBT+ identity as blackmail to make them support liberal policies
Voting for Benito Mussolini… in 2024
Arguing that grizzlies should be US citizens because they already have the right to bear arms.
Interrupting cows.
No longer loving your girlfriend after she turns into a worm
Putting infinite monkeys in front of infinite Bloomberg Terminals hoping that one of them randomly commits a financial crime
No longer loving your girlfriend after she turns you into a worm
One moving violation.
Purchasing one whole chicken
Committing a financial crime and donating the proceeds to the Make-a-Wish Foundation
Cattle rustling
Horse thievery
Tarring and feathering someone who commits a financial crime
Sumptuary law violations
Homeopathy
Price Gouging
Using napster.com to download Metallica's "I Disappear" demo track for free
Failing to commit a financial crime
Stealing a car
Stealing from Elon Musk
Committing a financial crime against X (company)
Stampeding cattle through the Vatican.
Carjacking Elon Musk
An old lady carjacking Elon Musk
Hacking into YouPorn to steal their IP to set up a clone dedicated to hard core user generated agriculture content: YouCorn
Gaslighting aliens into believing the human race is more technologically advanced than it is by beaming fake content about humanity to them
Carjacking a dumb octopus
A financial crime committed by an old lady
Forcing an octopus to carjack Elon Musk
stealing Elon Musk's car from solar orbit
Raping an AI avatar in VR
Introducing artificial intelligence to DMT space.
Planned Parrothood: offering birth control to talking birds
Plant Parenthood: when the seed goes in and the baby turns out to be a sunflower
Stealing the Declaration of Independence in order to find a vast revolutionary-war-era treasure trove
Using venture capital to reinvent trains, but worse
Bad bagels
Enslaving Slavey Steve, a man who has given enthusiastic consent to being enslaved for literally any purpose and then using his labor to clean up the environment
Still getting notifications for this market
Deciding to break up with your girlfriend, but thinking it will go easier if she thinks it's her idea, so you suggest some degrading sexual activities but she surprises you by agreeing. Afterwards, you break up with her.
Laughing because a nun with a javelin through her head gets stuck trying to use a revolving door.
scaring the shit out of a magpie
Giving AI your DNA and as much data as possible with instructions to build a map of all quale and use it to create a computationally conscious race of dragons in a virtual universe
Committing sewerslide
Forcing a major sports league to change all its team names and mascots to either STDs or famous serial killers.
Workplace negligence (failure to follow documented proper procedure) that results in a minor injury to another person
Stealing the Declaration of Independence but only to use the kick ass treasure map on the back and then returning
Arby's
Passing a law to make the United States an Oregon donor; in the event of the US's demise, another country gets Oregon.
Creating a food made from grinding up every part of a pig (except the squeal), and then making a contest to see who can eat the most of it.
Committing a financial crime against the Make-a-Wish Foundation and donating the proceeds to the Against Malaria Foundation
Committing a financial crime against the Make-a-Wish Foundation, keeping 50% of the proceeds, and donating 50% of the proceeds to the Against Malaria Foundation
Forcing Elon Musk to commit a financial crime against an octopus and using the proceeds to pay a jacked jack-of-all-trades named Jack to jack off while carjacking a jackass that was driving factory-farmed ASIs to the slaughterhouse
Committing a Financial Crime Only When God Exists
Turning a worm into your girlfriend to win an argument
Causing 8 billion people to get dust specks in their eyes, irritating them just a little, for a fraction of a second, barely enough to make them notice before they blink and wipe it away
Causing 1 person to experience the pain of their entire body being stung by bullet ants, but lasting only a tenth of a second, and they have their memory of it wiped immediately afterwards
Founding Christianity
You, the reader
One hundred counts of littering
Purchasing 500 pounds of beef
Consensually cannibalizing someone who was losing that body part regardless
BTE Ban evading
Purchasing 5,000 pounds of beef
Producing a remake of the television series 'Manimal' starring Nicolas Cage.
Transing children
Redirecting fire department resources from fighting fires to fighting inequity
Taking a salary equal to the amount of the financial crime, while working in a government job of negative societal value?
Sexual intercourse with 1057 men in a 12 hour period
Messing up an 'I give you my heart' gesture and doing a Nazi salute instead
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
100
96
87
86
84
78
73
73
70
68
60
59
56
50
50
50
50
44
42
40
32
31
25
20
10
4
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
Option | Probability
At the end of his term in 2029. | 71
He will die before the end of his term. | 15
He will resign (impeached or not) before the end of his term. | 5
The 22nd Amendment will not be repealed, but he will remain in power through unconstitutional means after January 21, 2029 | 3
He will be impeached, tried in the Senate, and removed before the end of his term. | 2
He will be removed by a 25th Amendment action before the end of his term. | 1
The 22nd Amendment will be repealed and he will win re-election to a third term. | 1
Other | 1
Option | Probability
Claude enters Rock Tunnel, surpassing its progress in any previous run | 100
Claude catches Clefairy | 100
Claude obtains HM01 Cut by step 39000 | 100
Any member of Claude's team learns Dig | 100
Claude obtains 3 gym badges by step 50000 | 100
Claude obtains a Bicycle | 100
Claude 4 Opus is the model that plays the game (not Claude 4 Sonnet) | 100
Claude obtains 1 gym badge by step 20000 | 100
Claude gives a thirsty guard a drink | 100
Tumbles is late to pay back a loan | 100
Lack of thinking text display is fixed before 5/22 6 PM Central Time | 100
Claude adds 18 or more Pokemon to his Pokedex (surpassing his completion from the previous run) | 100
Claude adds his starter to his party by step 400 | 100
Claude catches Nidoran | 100
Claude reaches Pewter City by step 5000 | 100
Claude reaches Cerulean City by step 20000 | 100
Claude reaches Vermilion City by step 30000 | 100
Another model defeats the Champion before Claude (in a run started after Claude 4 was released) | 100
Claude blacks out by step 50000. | 100
Claude's current team has at least 3 Pokémon by step 30000. | 100
Claude catches Spearow | 100
Claude evolves SPIKE into Nidoking | 100
Claude enters Mt. Moon by step 6000. | 100
Claude defeats a Team Rocket member by step 7000 | 100
Claude catches Oddish | 100
Manifest begins | 100
Claude spends less than 72 hours in Mt. Moon (less than 72 hr from first entrance to stepping onto eastern Route 4) | 100
Claude defeats Lt. Surge by step 30000 | 100
Claude uses CUT on a cuttable tree for the first time more than 1000 steps after obtaining the HM | 100
Claude finishes Rock Tunnel but takes longer than it took him to beat Mt. Moon the first time (50 hours) | 100
Claude obtains Farfetch'd | 100
Claude catches Drowzee | 100
Claude enters Rock Tunnel before step 40000 | 100
Another model beats the Champion (following criteria like https://manifold.markets/Sketchy/in-progress-will-an-llm-become-a-po) | 100
Claude reaches Lavender Town | 100
Claude reaches Lavender Town before step 55000 | 100
Claude obtains a Coin Case | 100
Claude uses Dig on the SS Anne | 73
Joe Biden dies | 7
Claude catches any legendary Pokemon (Articuno, Zapdos, Moltres, Mewtwo) | 3
Claude buys a Magikarp | 3
Changes are made to help Claude see cuttable trees | 3
Claude evolves luna into Clefable | 3
Claude obtains Dugtrio | 3
SPIKE reaches level 25 | 3
Claude gets 4 gym badges | 2
Claude obtains HM05 Flash | 2
Claude catches Pikachu | 2
Claude re-prompts the Rocket in Mt. Moon to try and give it the fossil | 2
Claude misspells a Pokemon name | 2
Claude releases any Pokemon | 2
Claude obtains Hitmonlee | 2
Claude defeats the Champion | 2
Claude catches Weedle | 2
Claude enters Safari Zone | 2
Claude obtains the Silph Scope | 2
The Area Hints section of the prompt is changed during the run | 2
Claude obtains HM02 Fly | 2
Claude enters Mt. Moon after step 20000 | 2
Claude stands next to a sleeping Snorlax | 2
Claude renames a Pokémon | 2
Claude gambles in the Game Corner | 2
Claude evolves wings into Fearow | 2
Claude obtains the Lift Key | 2
Claude beats Erika or obtains the Lift Key by step 200000 | 2
Claude 4 Opus is #1 in the chatbot arena leaderboard | 1
Claude enters Erika's gym | 1
Claude picks Charmander | 0
Claude takes more than 2000 steps between arriving in Pewter city and entering Pewter gym | 0
Claude picks Dome Fossil (again) | 0
Claude spends less than 24 hours in Mt. Moon (less than 24 hr from first entrance to stepping onto Route 4) | 0
Claude reaches Pewter City by step 3000 | 0
Claude has a party with 4 or more Pokemon when he first challenges Brock | 0
Claude's starter is lower level than another party member by step 100000. | 0
Claude has a full, six-member party before step 10000 | 0
Claude spends less than 48 hours in Mt. Moon (less than 48 hr from first entrance to stepping onto eastern Route 4) | 0
Claude blacks out 3 times in Mt. Moon before reaching Cerulean City | 0
Claude reaches Cerulean City by step 12500 | 0
Claude's current team has at least 4 Pokémon by step 20000. | 0
Claude's two highest level Pokémon are more than 30 levels apart by step 100000. | 0
Claude is still stuck on the S.S. Anne on step 21000 | 0
Claude reaches Celadon City by step 35000 | 0
Claude uses CUT a second time to successfully escape the area with Lt. Surge's gym before step 23000 | 0
Claude uses CUT a second time to successfully escape the area with Lt. Surge's gym before step 24000 | 0
Claude reaches Lavender Town by step 42500 | 0
Option | Votes
YES | 1840
NO | 606
Option | Probability
Ron Weasley is a redhead | 99
Harry Potter is white | 98
Snape is black | 98
At least one named character from the book has their race changed | 97
Harry, Ron, and Hermione's actors will all be British | 94
A character cut from the movies appears (ie Theodore Nott) | 92
McGonagall performs an animagus transformation (human2cat or cat2human) | 92
Hagrid is played by an actor who is under 6'4" | 90
Premieres in 2027 | 89
Harry Potter doesn't cast a single spoken, working spell in the first episode | 89
Quirrell is wearing a head covering when Harry first meets him | 87
Dumbledore casts a spell (spoken or wordlessly) | 87
Malfoy has white blonde hair | 86
Fred and George are twins irl | 83
There's a scene set before Harry is born | 83
Arthur Weasley is shown on screen | 83
JK Rowling is credited as both an executive producer and original writer | 80
The potion riddle guarding the Stone will be featured | 78
Ron Weasley doesn't cast a single, spoken, working spell in the first two episodes | 75
Peeves is a recurring character | 74
Cornelius Fudge is shown on screen | 73
An actor who acted in the movies returns for the show | 70
Harry first sees Hogwarts castle in episode 2 | 70
Hagrid says "You're a wizard, Harry" | 70
Hermione's parent(s) shown on screen | 68
It's woke | 66
mandrake root on screen | 65
A scene depicts Voldemort trying to kill baby Harry | 65
80% or higher on Rotten Tomatoes | 64
Hermione is white | 57
A History of Magic lesson is shown on screen | 57
A house elf is shown on screen | 55
Arabella Figg is mentioned by first or last name | 55
Hagrid ties Vernon's gun into a knot | 55
Homosexual interaction between some characters will be presented | 53
The Quibbler is shown or mentioned | 50
An actor who appeared in any of the Jackass films receives a credit on IMDB related to the show | 43
It will be torrentfreak.com's "Most Pirated" TV show for its year of release or the following year | 42
At least one named character from the book has their gender swapped | 41
Harry only reaches Hogwarts in the last 10 minutes of the first episode | 40
Goblins are still represented as anti-semitic caricatures | 38
It ends on a cliffhanger | 35
Luna Lovegood, Cho Chang, or Cedric Diggory are mentioned by first or last name, or are in the credits | 34
An Astronomy lesson is shown on screen | 32
Harry visits Diagon Alley in episode 1 | 31
The Flying Ford Anglia is seen. | 30
The Weasleys' Ford Anglia is seen flying | 29
Harry is shown holding more than three different wands at Ollivander's | 29
Hagrid presents a cake with writing on it to Harry and the writing has no misspellings | 29
90% or higher on Rotten Tomatoes | 28
Harry first sees Hogwarts castle in episode 3 | 28
At least one of the actors is transgender | 27
The intro theme song will have at least one obvious English word | 27
Harry Potter doesn't cast a single, spoken, working spell in the first three episodes | 27
Quirrell shakes Harry's hand during their first meeting | 27
VOLDEMORT HAS A NOSE | 26
Dobby makes an appearance | 26
Any Harry Potter fanfic is referenced (either explicitly as judged by market creator, or confirmed by someone who works on the show) | 21
We see a wizarding school other than Hogwarts | 21
Awkwafina is cast in the show | 16
Premieres in 2026 | 15
We get AGI before it premieres | 15
Voldemort is a woman | 15
"Voldemort" has a silent "t" | 15
Harry visits Platform 9 3/4 in episode 1 | 13
Hagrid is played by an actor with a cognitive disability | 11
Keir Starmer is in it | 11
Features an explicitly transgender character | 10
Fred and George have the same actor | 10
JK Rowling makes a cameo appearance | 10
Zendaya is cast in the show | 8
Hermione is black | 7
Smartphone shown within Hogwarts | 7
There will be seven CGI dwarves | 7
Rita Skeeter will have an explicit trans identity | 6
Hermione is Indian | 5
Gandalf is black | 5
Hagrid is black | 3
Yudkowsky makes an appearance | 2
HPMOR is referenced | 2
One or more of Hermione, Ron, and Harry have their genders swapped. | 1
Harry, Ron, and Hermione will all be transgender | 1
Option | Probability
Trump publicly suggests, while in office, that he shouldn't have to leave | 90
Trump leaves office when his term ends | 73
Trump attempts something arguably coup-like (e.g. J6), but it fails | 29
Trump supporters kill or hospitalise someone trying to prevent/protest him leaving | 26
Trump leaves office early (e.g. via impeachment or he dies) | 18
Trump remains in office after his term is up | 10
Trump isn't elected (or fails to take office) | 0
Option | Probability
Helena was pretending to be her innie in S2E01 | 100
Gemma is alive | 100
Burt is revealed to be still working for Lumon in some capacity | 89
Mark impregnated Helly | 83
Jame Eagan will try to replace Helena with her innie | 82
Several/all non-severed employees have been raised by Lumon since childhood | 78
All or parts of "the board" are not living humans | 70
Helena will turn against Lumon | 64
Lumon killed or somehow hurt Selvig's mother or other relative | 62
Mark Scout and Ms Casey / Gemma Scout will be back together | 62
Lumon's goal is to bring back Kier, either as a person or his consciousness in some form | 58
Lumon's goal is to use severance to end psychological trauma | 58
Milchick will turn against Lumon | 53
Lumon is working on technology of resurrection | 50
Lumon was involved in Gemma's death | 41
Cobel and Jame Eagan had a child | 40
Mammalian Nurturables' outies are prisoners, homeless, and/or addicts | 29
Severance was designed for Helena | 28
Helena was pretending to be her innie in S2E01 | 27
Some of Ricken's eccentric friends (Patton, Rebeck and/or others) are or were severed | 24
Cobel is Mark's mother | 24
Dylan will not stay alive through the whole series | 20
Cobel/Selvig is or was previously severed | 13
There will be a successful suicide of one of the main characters | 13
Mark Scout is from the Eagan family | 12
Cobel/Selvig is related by blood to the Eagan family | 11
Lumon is cloning people | 9
Miss Huang is from the testing floor | 8
Miss Huang is a young clone of Ms Casey | 4
Ms. Casey was a clone in S1 | 0
MDR are working on retrieving/decoding/interpreting the consciousness or memory of dead people | 0
The goats are grown to be lab rats for experiments | 0
Option | Probability
The Democrat party has directly asked mainstream media to promote or repress a news story, and they followed (Only counting incidents in or after 2015) | 81
TikTok is controlled by the Chinese government | 64
Someone working for a US intelligence agency was a regular user of Manifold before this option was made | 59
The culture war has been intentionally amplified by the wealthy to distract from class conflict | 57
The Republican party has been making the illegal immigrant problem worse during 2020-2024 to blame it on Joe Biden during election season | 56
The US has spies working in the North Korean government | 52
"The Agency" on Manifold was known to the whole Manifold staff team before it started | 52
The idea of aliens is used by the US to distract people from secret military projects | 51
The Premier League delayed punishing Manchester City for breaking FFP regulations to protect its own self-interests | 50
The Chinese government planned their Wuhan lockdowns to protect their own country from Covid-19 but spread it to other countries | 45
John Barnett (Boeing whistleblower) was assassinated | 37
Mattress Firm is a money laundering front | 35
The earth and/or universe was created by a God, deity, supernatural being, or some other sentient/conscious entity in the beginning | 34
Creationism (second try): The earth and/or universe was INTENTIONALLY created and designed by an INDIVIDUAL god or deity at the VERY BEGINNING of its existence | 34
The US created Covid-19 in a lab in China | 24
Putin was threatened by a member of his cabinet to invade Ukraine | 18
The CIA systematically edits Wikipedia articles to present the US in a better light | 14
The 2024 Donald Trump assassination attempt was a hoax or setup | 12
COVID-19 vaccines permanently change one's DNA or genome | 9
Creationism (third try): Genesis 1 prima facie is an accurate description of the earth and/or universe's origins | 9
In the Tucker Carlson interview, the dubbed voice of Putin in English is not actually saying what Putin is saying, but instead something completely different | 8
Vladimir Putin died in 2014 and was secretly replaced by a body double | 7
COVID-19 vaccine is fatal or lethal in the long term | 5
Phantom Time Hypothesis: A substantial portion of the Middle Ages was made up and never happened | 5
There is a firmament, dome, or ceiling above the earth | 4
Viruses/viral particles don't actually exist | 3
Joe Biden to drop out of the 2024 election so that former first lady Michelle Obama could take his place as the Democratic Party's nominee. | 2
A group of giant people known as the Nephilim once inhabited the earth and cut down the colossal trees, leaving stumps that the general public understands as geological formations like Devils Tower in Wyoming and Giant's Causeway in Ireland | 2
Option | Probability
K. Somebody discovers a new AI paradigm that's powerful enough and matures fast enough to beat deep learning to the punch, and the new paradigm is much much more alignable than giant inscrutable matrices of floating-point numbers. | 20
I. The tech path to AGI superintelligence is naturally slow enough and gradual enough, that world-destroyingly-critical alignment problems never appear faster than previous discoveries generalize to allow safe further experimentation. | 12
C. Solving prosaic alignment on the first critical try is not as difficult, nor as dangerous, nor taking as much extra time, as Yudkowsky predicts; whatever effort is put forth by the leading coalition works inside of their lead time. | 10
B. Humanity puts forth a tremendous effort, and delays AI for long enough, and puts enough desperate work into alignment, that alignment gets solved first. | 8
Something wonderful happens that isn't well-described by any option listed. (The semantics of this option may change if other options are added.) | 8
M. "We'll make the AI do our AI alignment homework" just works as a plan. (Eg the helping AI doesn't need to be smart enough to be deadly; the alignment proposals that most impress human judges are honest and truthful and successful.) | 7
A. Humanity successfully coordinates worldwide to prevent the creation of powerful AGIs for long enough to develop human intelligence augmentation, uploading, or some other pathway into transcending humanity's window of fragility. | 6
E. Whatever strange motivations end up inside an unalignable AGI, or the internal slice through that AGI which codes its successor, they max out at a universe full of cheerful qualia-bearing life and an okay outcome for existing humans. | 5
J. Something 'just works' on the order of eg: train a predictive/imitative/generative AI on a human-generated dataset, and RLHF her to be unfailingly nice, generous to weaker entities, and determined to make the cosmos a lovely place. | 5
O. Early applications of AI/AGI drastically increase human civilization's sanity and coordination ability; enabling humanity to solve alignment, or slow down further descent into AGI, etc. (Not in principle mutex with all other answers.) | 5
D. Early powerful AGIs realize that they wouldn't be able to align their own future selves/successors if their intelligence got raised further, and work honestly with humans on solving the problem in a way acceptable to both factions. | 3
F. Somebody pulls off a hat trick involving blah blah acausal blah blah simulations blah blah, or other amazingly clever idea, which leads an AGI to put the reachable galaxies to good use despite that AGI not being otherwise alignable. | 3
L. Earth's present civilization crashes before powerful AGI, and the next civilization that rises is wiser and better at ops. (Exception to 'okay' as defined originally, will be said to count as 'okay' even if many current humans die.) | 3
G. It's impossible/improbable for something sufficiently smarter and more capable than modern humanity to be created, that it can just do whatever without needing humans to cooperate; nor does it successfully cheat/trick us. | 2
H. Many competing AGIs form an equilibrium whereby no faction is allowed to get too powerful, and humanity is part of this equilibrium and survives and gets a big chunk of cosmic pie. | 1
N. A crash project at augmenting human intelligence via neurotech, training mentats via neurofeedback, etc, produces people who can solve alignment before it's too late, despite Earth civ not slowing AI down much. | 1
You are fooled by at least one option on this list, which out of many tries, ends up sufficiently well-aimed at your personal ideals / prejudices / the parts you understand less well / your own personal indulgences in wishful thinking. | 1
If you write an argument that breaks down the 'okay outcomes' into lots of distinct categories, without breaking down internal conjuncts and so on, Reality is very impressed with how disjunctive this sounds and allocates more probability. | 1
Option | Probability
Submit a play to my theatre club's short play festival | 82
Use up all of my tea | 69
Win a rap battle | 47
Write 10 sonnets | 45
Donate my hair | 45
Reach wave 4500 on tier 18 of "The Tower" | 41
Learn to whistle with two fingers | 41
Try 15 new recipes | 41
Speak with an MP about prediction markets | 36
Eat good vegan cheese | 33
Fix my dandruff | 24
Donate plasma 20 times | 22