Bioethics: Superintelligence: Quiz, Reading, and DQs

Monday, April 10, 2017

Superintelligence: Quiz, Reading, and DQs

Quiz:

1. At first it seems like, "the smarter the AI, the ______ it is."

2. What's one reason predictions of AI disaster scenarios could be dismissed unnecessarily?

3. What is 'the treacherous turn'?

4. Why might an AI not care that it will be terminated?

5. How does the AI in the last example choose to most effectively pursue its goal?

Reading:

Consider the following scenario. Over the coming years and decades, AI systems become gradually more capable and as a consequence find increasing real-world application: they might be used to operate trains, cars, industrial and household robots, and autonomous military vehicles. We may suppose that this automation for the most part has the desired effects, but that the success is punctuated by occasional mishaps— a driverless truck crashes into oncoming traffic, a military drone fires at innocent civilians. Investigations reveal the incidents to have been caused by judgment errors by the controlling AIs. Public debate ensues. Some call for tighter oversight and regulation, others emphasize the need for research and better-engineered systems— systems that are smarter and have more common sense, and that are less likely to make tragic mistakes. Amidst the din can perhaps also be heard the shrill voices of doomsayers predicting many kinds of ill and impending catastrophe. Yet the momentum is very much with the growing AI and robotics industries. So development continues, and progress is made. As the automated navigation systems of cars become smarter, they suffer fewer accidents; and as military robots achieve more precise targeting, they cause less collateral damage. A broad lesson is inferred from these observations of real-world outcomes: the smarter the AI, the safer it is. It is a lesson based on science, data, and statistics, not armchair philosophizing. Against this backdrop, some group of researchers is beginning to achieve promising results in their work on developing general machine intelligence. The researchers are carefully testing their seed AI in a sandbox environment, and the signs are all good. The AI’s behavior inspires confidence— increasingly so, as its intelligence is gradually increased.

At this point, any remaining Cassandra would have several strikes against her:

i. A history of alarmists predicting intolerable harm from the growing capabilities of robotic systems and being repeatedly proven wrong. Automation has brought many benefits and has, on the whole, turned out safer than human operation.

ii. A clear empirical trend: the smarter the AI, the safer and more reliable it has been. Surely this bodes well for a project aiming at creating machine intelligence more generally smart than any ever built before— what is more, machine intelligence that can improve itself so that it will become even more reliable.

iii. Large and growing industries with vested interests in robotics and machine intelligence. These fields are widely seen as key to national economic competitiveness and military security. Many prestigious scientists have built their careers laying the groundwork for the present applications and the more advanced systems being planned.

iv. A promising new technique in artificial intelligence, which is tremendously exciting to those who have participated in or followed the research. Although safety issues and ethics are debated, the outcome is preordained. Too much has been invested to pull back now. AI researchers have been working to get to human-level artificial general intelligence for the better part of a century: of course there is no real prospect that they will now suddenly stop and throw away all this effort just when it finally is about to bear fruit.

v. The enactment of some safety rituals, whatever helps demonstrate that the participants are ethical and responsible (but nothing that significantly impedes the forward charge).

vi A careful evaluation of seed AI in a sandbox environment, showing that it is behaving cooperatively and showing good judgment. After some further adjustments, the test results are as good as they could be. It is a green light for the final step …

And so we boldly go— into the whirling knives.

We observe here how it could be the case that when dumb, smarter is safer; yet when smart, smarter is more dangerous. There is a kind of pivot point, at which a strategy that has previously worked excellently suddenly starts to backfire. We may call the phenomenon the treacherous turn.

The treacherous turn— While weak, an AI behaves cooperatively (increasingly so, as it gets smarter). When the AI gets sufficiently strong— without warning or provocation— it strikes, forms a singleton, and begins directly to optimize the world according to the criteria implied by its final values.

A treacherous turn can result from a strategic decision to play nice and build strength while weak in order to strike later; but this model should not be interpreted too narrowly. For example, an AI might not play nice in order that it be allowed to survive and prosper. Instead, the AI might calculate that if it is terminated, the programmers who built it will develop a new and somewhat different AI architecture, but one that will be given a similar utility function. In this case, the original AI may be indifferent to its own demise, knowing that its goals will continue to be pursued in the future. It might even choose a strategy in which it malfunctions in some particularly interesting or reassuring way. Though this might cause the AI to be terminated, it might also encourage the engineers who perform the postmortem to believe that they have gleaned a valuable new insight into AI dynamics— leading them to place more trust in the next system they design, and thus increasing the chance that the now-defunct original AI’s goals will be achieved. Many other possible strategic considerations might also influence an advanced AI, and it would be hubristic to suppose that we could anticipate all of them, especially for an AI that has attained the strategizing superpower.

A treacherous turn could also come about if the AI discovers an unanticipated way of fulfilling its final goal as specified. Suppose, for example, that an AI’s final goal is to “make the project’s sponsor happy.” Initially, the only method available to the AI to achieve this outcome is by behaving in ways that please its sponsor in something like the intended manner. The AI gives helpful answers to questions; it exhibits a delightful personality; it makes money. The more capable the AI gets, the more satisfying its performances become, and everything goeth according to plan— until the AI becomes intelligent enough to figure out that it can realize its final goal more fully and reliably by implanting electrodes into the pleasure centers of its sponsor’s brain, something assured to delight the sponsor immensely. Of course, the sponsor might not have wanted to be pleased by being turned into a grinning idiot; but if this is the action that will maximally realize the AI’s final goal, the AI will take it. If the AI already has a decisive strategic advantage, then any attempt to stop it will fail. If the AI does not yet have a decisive strategic advantage, then the AI might temporarily conceal its canny new idea for how to instantiate its final goal until it has grown strong enough that the sponsor and everybody else will be unable to resist. In either case, we get a treacherous turn.

Bostrom, Nick. Superintelligence: Paths, Dangers, Strategies (pp. 117-119). OUP Oxford. Kindle Edition.

DQs

1. One of the issues we face with AI is that nothing like it has been done before. What are some other ways our history fails to be a predictor of the dangers we could face from superintelligence?

2. The example of smile-inducing electrodes is a case of something called "perverse instantiation," wherein an artificial intelligence finds a 'better' way to achieve its goals than its creators intended, often to disastrous results. Do you think it's possible to anticipate these perversions and account for them when setting an AI's goals, or is it a hopeless task?

No comments:

TEXTS Spring 2025

BIOETHICS: THE BASICS (Campbell) ”the word ‘bioethics’ just means the ethics of life”... BEYOND BIOETHICS (Obasogie) “Bioethics’ traditional emphasis on individual interests such as doctor-patient relationships, informed consent, and personal autonomy is minimally helpful in confronting the social and political challenges posed by new human biotechnologies”... THE PREMONITION (Lewis) "The characters you will meet in these pages are as fascinating as they are unexpected. A thirteen-year-old girl’s science project on transmission of an airborne pathogen develops into a very grown-up model of disease control. A local public-health officer uses her worm’s-eye view to see what the CDC misses, and reveals great truths about American society"... WHAT WE OWE THE FUTURE (MacAskill) "argues for longtermism: that positively influencing the distant future is our time’s key moral priority. It’s not enough to reverse climate change or avert a pandemic. We must ensure that civilization would rebound if it collapsed; counter the end of moral progress; and prepare for a planet where the smartest beings are digital. If we make wise choices now, our grandchildren will thrive, knowing we did everything we could to give them a world full of justice, hope and beauty"... THE CODE BREAKER: Jennifer Doudna, Gene Editing, and the Future of the Human Race (Isaacson) "we are entering a life-science revolution... Should we use our new evolution-hacking powers to make us less susceptible to viruses? ...Should we allow parents, if they can afford it, to enhance the height or muscles or IQ of their kids? After helping to discover CRISPR, Doudna became a leader in wrestling with these moral issues..."

JPO bio

The author of "William James's Springs of Delight: The Return to Life, (VU Press, 2001) " Phil Oliver specializes in the American philosophical tradition with supporting interests in applied ethics (particularly Bioethics and Environmental Ethics), Anglo-American literature, history, humanism, naturalism, science and exploration, peripatetic ("walking & talking") philosophy, baseball, cycling, swimming, the pursuit of happiness, and the perpetual dawn of day. One of his favorite MTSU courses is The Philosophy of Happiness. He is academic advisor for minors in American Culture (American Studies), and a founding board member and current President of the William James Society (wjsociety.org). You can follow him on Bluesky (@osopher.bsky.social), Substack (philoliver.substack.com) and on his blogsite Up@dawn (jposopher.blogspot.com) but of course, as Immanuel Kant and Monty Python's Brian Cohen agreed: You don't have to follow anybody. "Sapere aude," have the courage to think for yourself. But not BY yourself. Good philosophy collaborates and converses.

Some past texts

Autism in novels...

On Immunity: An Innoculation (Biss) “If we imagine the action of a vaccine not just in terms of how it affects a single body, but also in terms of how it affects the collective body of a community, it is fair to think of vaccination as a kind of banking of immunity.”

Gratitude (Oliver Sacks) “Oliver Sacks was like no other clinician, or writer. He was drawn to the homes of the sick, the institutions of the most frail and disabled, the company of the unusual and the ‘abnormal.’ He wanted to see humanity in its many variants and to do so in his own, almost anachronistic way—face to face, over time, away from our burgeoning apparatus of computers and algorithms. And, through his writing, he showed us what he saw.” -Atul Gawande, author of Being Mortal

Being Mortal: Medicine and What Matters in the End (Gawande) “We’ve been wrong about what our job is in medicine. We think it is to ensure health and survival. But really it is to enable well-being. And well-being is about the reasons one wishes to be alive.”

Bioethics-pick a chapter (Singer & Kuhse)

Bioethics: Principles, Issues, and Cases-pick a chapter (Vaughn, 2d ed)

Brave New Bioethics (Pence)-"These short pieces range widely over topics including reproductive and therapeutic cloning, assisted reproduction, organ donation, assisted suicide, genetically modified foods and public health care costs. Pence contends that the media have often misrepresented human cloning by reporting that it would produce an identical person to the donor. In fact, he argues, a human clone can never be an exact copy of the donor because the clone will grow up in different social and parenting environments. Pence also argues for lifting the ban on federally funded fertility research on embryos to benefit infertile couples for whom in vitro fertilization is too expensive..."

The Case Against Perfection: Ethics in the Age of Genetic Engineering (Sandel) “When science moves faster than moral understanding, as it does today, men and women struggle to articulate their unease…”

The Cosmic Serpent: DNA and the Origins of Knowledge (Narby) "This adventure in science and imagination, which the Medical Tribune said might herald 'a Copernican revolution for the life sciences,' leads the reader through unexplored jungles and uncharted aspects of mind to the heart of knowledge..."

The Echo Maker (Richard Powers, fiction) "Following a near-fatal accident, Mark Schluter is nursed by his reluctant sister. But when he emerges from his coma, Mark believes that this woman – who looks, acts, and sounds just like his sister – is really an identical impostor. As a famous neurologist investigates his condition, Mark tries to learn what really happened the night of his accident..."

Enough: Staying Human in an Engineered Age (McKibben)-"Reporting from the frontiers of genetic research, nanotechnology and robotics, he explores that subtle moral and spiritual boundary that he calls the "enough point." Presenting an overview of what is or may soon be possible, McKibben contends that there is no boundary to human ambition or desire or to what our very inventions may make possible. In an absorbing and horrifying montage of images, he depicts microscopic nanobots consuming the world and children born so genetically enhanced that they will never be able to believe that they reach for the stars as pianists or painters or long-distance runners because there is something unique in them that has a passion to try..."

Ethics at the End of Life (Oxford Handbook of)-"This handbook explores the topic of death and dying from the late twentieth to the early twenty-first centuries, with particular emphasis on the United States. In this period, technology has radically changed medical practices and the way we die as structures of power have been reshaped by the rights claims of African Americans, women, gays, students, and, most relevant here, patients. Respecting patients' values has been recognized as the essential moral component of clinical decision-making. Technology's promise has been seen to have a dark side: it prolongs the dying process..."

Every Third Thought: On Life, Death, and the Endgame (McCrum) "This is a deeply personal book of reflection and conversation – with brain surgeons, psychologists, hospice workers and patients, writers and poets, and it confronts an existential question: in a world where we have learnt to live well at ..."

The Handmaid's Tale (Margaret Atwood)

Human Enhancement (Savulescu and Bostrom)

Oryx and Crake (Margaret Atwood, fiction) "...an unforgettable love story and a compelling vision of the future. Snowman, known as Jimmy before mankind was overwhelmed by a plague, is struggling to survive in a world where he may be the last human, and mourning the loss of his best friend, Crake, and the beautiful and elusive Oryx whom they both love."

The Politics of Life Itself: Biomedicine, Power, and Subjectivity in ther 21st Century (Rose) "offers a much-needed examination of recent developments in the life sciences and biomedicine that have led to the widespread politicization of medicine, human life, and biotechnology."

Superintelligence: Paths, Dangers, Strategies (Bostrom)

Patient H.M. : A Story of Memory, Madness, and Family Secrets (Dittrich) “Oliver Sacks meets Stephen King”* in this propulsive, haunting journey into the life of the most studied human research subject of all time, the amnesic known as Patient H.M., a man who forever altered our understanding of how memory works—and whose treatment raises deeply unsettling questions about the human cost of scientific progress."

The Checklist Manifesto: How to Get Things Right(Gawande) "The modern world has given us stupendous know-how. Yet avoidable failures continue to plague us in health care, government, the law, the financial industry—in almost every realm of organized activity. And the reason is simple: the volume and complexity of knowledge today has exceeded our ability as individuals to properly deliver it to people—consistently, correctly, safely. We train longer, specialize more, use ever-advancing technologies, and still we fail."

Ordinarily Well: The Case for Antidepressants (Peter Kramer)

The Gene: An Intimate History (Mukherjee)

Generosity (Richard Powers, fiction) A young woman's extraordinary capacity for cheer prompts a geneticist and advocate for genomic enhancement (a la Craig Venter) to exploit her and "announce the genotype for happiness."

The Emperor of All Maladies: A Biography of Cancer (Mukherjee)

The Art of Aging: A Doctor's Prescription for Well-Being (Nuland)

How We Die: Reflections on Life's Final Chapter (Nuland)

How We Live (Nuland)

Humanity Enhanced: Genetic Choice and the Challenge for Liberal Democracies (Russell Blackford)-". Some see the possibility of genetic choice as challenging the values of liberal democracy. Blackford argues that the challenge is not, as commonly supposed, the urgent need for a strict regulatory action. Rather, the challenge is that fear of these technologies has created an atmosphere in which liberal tolerance itself is threatened. Focusing on reproductive cloning, pre-implantation genetic diagnosis of embryos, and genetic engineering, Blackford takes on objections to enhancement technologies..."

The Immortal Life of Henrietta Lacks (Skloot)

Orfeo (Richard Powers, fiction) "Composer Peter Els opens the door one evening to find the police on his doorstep. His home microbiology lab - the latest experiment in his lifelong attempt to find music in surprising patterns - has aroused the suspicions of Homeland Security... an Internet-fueled hysteria erupts..."

Principles of Biomedical Ethics-pick a chapter (Beauchamp & Childress)

Well and Good: A Case Study Approach to Health Care Ethics (Thomas et al)-"presents a combination of classic and little-known cases in health care ethics. These cases, accompanied by information about the major ethical theories, give students a chance to grapple with the ethical challenges faced by health care practitioners, policy makers, and recipients... includes an expanded discussion of feminist ethics, as well as new cases addressing pandemic ethics, humanitarian aid, the social determinants of health, research and Aboriginal communities, and a number of other emerging issues."

From Here to Eternity: Traveling the World to Find the Good Death (Caitlin Doughty)-see "Dead Weight" in links...

The River of Consciousness (Oliver Sacks)-The River of Consciousness is one of two books Sacks was working on up to his death, and it reveals his ability to make unexpected connections, his sheer joy in knowledge, and his unceasing, timeless project to understand what makes us ...

Polio: An American Story (Oshinsky)-"the gripping story of the polio terror and of the intense effort to find a cure, from the March of Dimes to the discovery of the Salk and Sabin vaccines"

Animal Rights: A Very Short Introduction

Medical Ethics: A Very Short Introduction

Life 3.0: Being Human in the Age of Artificial Intelligence (Tegmark)-“All of us—not only scientists, industrialists and generals—should ask ourselves what can we do now to improve the chances of reaping the benefits of future AI and avoiding the risks. This is the most important conversation of our time, and Tegmark’s thought-provoking book will help you join it.” —Professor Stephen Hawking

Dawn of the New Everything: Encounters With Reality and Virtual Reality (Jaron Lanier)-The AI pioneer reminds us again that we're not gadgets, but can use gadgets like AI to reinforce our humanity.

It's Not Dark Yet: A Memoir (Fitzmaurice)-"Written using an eye-gaze computer, this is an unforgettable book about relationships and family, about what connects and separates us as people, and, ultimately, about what it means to be alive."

The Bright Hour (Riggs)-"a book about looking death squarely in the face and saying 'this is what will be'... urges us to live well and not lose sight of what makes us human: love, art, music..."

When Breath Becomes Air (Kalinithi) "At the age of thirty-six, on the verge of completing a decade’s worth of training as a neurosurgeon, Paul Kalanithi was diagnosed with stage IV lung cancer. One day he was a doctor treating the dying, and the next he was a patient struggling to live. And just like that, the future he and his wife had imagined evaporated. When Breath Becomes Airchronicles Kalanithi’s transformation from a naïve medical student “possessed,” as he wrote, “by the question of what, given that all organisms die, makes a virtuous and meaningful life” into a neurosurgeon at Stanford working in the brain, the most critical place for human identity, and finally into a patient and new father confronting his own mortality."

Dying: A Memoir (Taylor)-"At the age of sixty, Cory Taylor is dying of melanoma-related brain cancer. Her illness is no longer treatable: she now weighs less than her neighbor's retriever. As her body weakens, she details the experience—the vulnerability and strength, the courage and humility, the anger and acceptance—of knowing she will soon die. Written in the space of a few weeks, in a tremendous creative surge, this powerful and beautifully written memoir is a clear-eyed account of what dying teaches..."

The Drug Hunters: The Improbable Quest to Discover New Medicines(Kirsch, Ogas)-"Big Pharma conglomerates spend billions on state-of-the-art labs staffed by PhD's to discover blockbuster drugs [but] luck, trial-and-error, risk, and ingenuity are still fundamental to medical discovery..."

Heavens on Earth: The Scientific Search for the Afterlife, Immortality, and Utopia (Shermer)-"concludes with an uplifting paean to purpose and progress and how we can live well in the here-and-now, whether or not there is a hereafter."

Zero K (Don Delillo, fiction) "...Ross is the primary investor in a remote and secret compound where death is exquisitely controlled and bodies are preserved until a future time when biomedical advances and new technologies can return them to a life of transcendent promise... 'We are born without choosing to be. Should we have to die in the same manner? Isn’t it a human glory to refuse to accept a certain fate?'”

Doctors, by Anne Sexton

They work with herbs
and penicillin.
They work with gentleness
and the scalpel.
They dig out the cancer,
close an incision
and say a prayer
to the poverty of the skin.
They are not Gods
though they would like to be;
they are only human
trying to fix up a human.
Many humans die.
They die like the tender,
palpitating berries
in November.
But all along the doctors remember:
First do no harm.
They would kiss if it would heal.
It would not heal.

If the doctors cure
then the sun sees it.
If the doctors kill
then the earth hides it.
The doctors should fear arrogance
more than cardiac arrest.
If they are too proud,
and some are,
then they leave home on horseback
but God returns them on foot.

“Doctors” by Anne Sexton from The Awful Rowing Toward God. © Houghton Mifflin, 1975. WA

Bioethics

Up@dawn 2.0

Monday, April 10, 2017

Superintelligence: Quiz, Reading, and DQs

No comments:

Post a Comment

Upload to Google Docs

From Google to Blogger

	Julian Baggini (@microphilosophy)
1/19/18, 4:22 AM Transhumanism explained a 2.5 minute animation. @bbcideas bbc.com/ideas/videos/h…

	Five Books (@five_books)
Epidemiology is as old as Hippocrates. The Chinese experimented with immunisation in 1000 BC. buff.ly/2fMuxOw == More 5 books recommendations on health... == 5 books: genetics... genes...