Wootware

"Weapons of Math Destruction" by Cathy O'Neil

 tháng 1 04, 2017     No comments   

In a 1947 lecture on computing machinery, Alan Turing made a prediction: "The new machines will in no way replace thought, but rather they will increase the need for it."



Someday, he said, machines would think for themselves, but the computers of the near future would require human supervision to prevent malfunctions:

"The intention in constructing these machines in the first instance is to treat them as slaves, giving them only jobs which have been thought out in detail, jobs such that the user of the machine fully understands in principle what is going on all the time." 1
It is unclear now whether machines remain slaves, or if they are beginning to be masters. Machine-learning algorithms pervasively control the lives of Americans. We do not fully understand what they do, and when they malfunction they harm us, by reinforcing the unjust systems we already have. Usually unintentionally, they can make the lives of poor people and people of color worse.



In "Weapons of Math Destruction", Cathy O'Neil identifies such an algorithm as a "WMD" if it satisfies three criteria: it makes decisions of consequence for a large number of people, it is opaque and unaccountable, and it is destructive. I interviewed O'Neil to learn what data scientists should do to disarm these weapons.



Automated Injustice



Recidivism risk models are a striking example of algorithms that reinforce injustice. These algorithms purport to predict how likely a convict is to commit another crime in the next few years. The model described in O'Neil's book, called LSI-R, assesses offenders with 54 questions, then produces a risk score based on correlations between each offender's characteristics and the characteristics of recidivists and non-recidivists in a sample population of offenders.



Some of LSI-R's factors measure the offender's past behavior: Has she ever been expelled from school, or violated parole? But most factors probably aren't under the individual's control: Does she live in a high-crime neighborhood? Is she poor? And many factors are not under her control at all: Has a family member been convicted of any crimes? Did her parents raise her with a "rewarding" parenting style?



Studies of LSI-R show it gives worse scores to poor black people. Some of its questions directly measure poverty, and others (such as frequently changing residence) are proxies for poverty. LSI-R does not know the offender's race. It would be illegal to ask, but, O'Neil writes, "with the wealth of detail each prisoner provides, that single illegal question is almost superfluous." For example, it asks the offender's age when he was first involved with the police. O'Neil cites a 2013 New York Civil Liberties Union study that young black and Hispanic men were ten times as likely to be stopped by the New York City police, even though only a tiny fraction were doing anything criminal.



So far, the LSI-R does not automatically become destructive. If it is accurate, and used for benign choices like spending more time treating and counselling offenders with high risk scores, it could do some good. But in many states, judges use the LSI-R and models like it to decide how long the offender's sentence should be. This is not LSI-R's intended use, and it is certainly not accurate enough for it: a study this year found that LSI-R misclassified 41% of offenders. 2



Success, According to Whom?



O'Neil told me that whether an algorithm becomes a WMD depends on who defines success, and according to whom. "Over and over again, people act as if there's only one set of stakeholders."



When a recidivism risk model is used to sentence someone to a longer prison term, the sole stakeholder respected is law enforcement. "Law enforcement cares more about true positives, correctly identifying someone who will reoffend and putting them in jail for longer to keep them from committing another crime." But our society has a powerful interest in preventing false positives. Indeed, we were founded on a constitution that considered a false positive—that is, being punished for a crime you did not commit—to be extremely costly. Principles including the presumption of innocence, the requirement that guilt is proven beyond reasonable doubt, and so on, express our desire to avoid unjust punishment, even at the cost of some criminals being punished too little or going free.



However, this interest is ignored when an offender is punished for a bad LSI-R score. His total sentence accounts not only for the crime he committed, but also for future crimes he is thought likely to commit. Furthermore, he is punished for who he is: Being related to a criminal or being raised badly are circumstances of birth, but for many people facing sentencing, such circumstances are used to add years to their time behind bars.



Statistically Unsound



Cathy O'Neil says weapons of math destruction are usually caused by two failures. The first is when only one stakeholder's interests define success. LSI-R is an example of this. The other is a lack of actual science in data science. For these algorithms, she told me, "We actually don't have reasonable ways of checking to see whether something is working or not."



A New York City public school program begun in 2007 assessed teachers with a "value added model", which estimated how much a teacher affected each student's progress on standardized tests. To begin, the model forecast students' progress, given their neighborhood, family income, previous achievement, and so on. At the end of the year their actual progress was compared to the forecast, and the difference was attributed to the teacher's effectiveness. O'Neil tells the story of Tim Clifford, a public school teacher who scored only 6 out of 100 the first year he was assessed, then 96 out of 100 the next year. O'Neil writes, "Attempting to score a teacher's effectiveness by analyzing the test results of only twenty-five or thirty students is statistically unsound, even laughable." One analysis of the assessment showed that a quarter of teachers' scores swung by 40 points in a year. Another showed that, with such small samples, the margin of error made half of all teachers statistically indistinguishable.



Nevertheless, the score might determine if the teacher was given a bonus, or fired. Although its decision was probabilistic, appealing it required conclusive evidence. O'Neil points out that time and again, "the human victims of WMDs are held to a higher standard of evidence than the algorithms themselves." The model is math so it is presumed correct, and anyone who objects to its scores is suspect.



New York Governor Andrew Cuomo put a moratorium on these teacher evaluations in 2015. We are starting to see that some questions require too subtle an intelligence for our current algorithms to answer accurately. As Alan Turing said, "If a machine is expected to be infallible, it cannot also be intelligent."



Responsible Data Science



I asked Cathy O'Neil about the responsibilities of data scientists, both in their daily work and as reformers of their profession. Regarding daily work, O'Neil drew a sharp line: "I don't want data scientists to be de facto policy makers." Rather, their job is to explain to policy makers the moral tradeoffs of their choices. The same as any programmer gathers requirements before coding a solution, data scientists should gather requirements regarding the relative cost of different kinds of errors. Machine learning algorithms are always imperfect, but they can be tweaked for either more false positives or more false negatives. When the stakes are high, the choice between the two is a moral one. Data scientists must pose these questions frankly to policy makers, says O'Neil, and "translate moral decisions into code."



Tradeoffs in the private sector often pit corporate interests against human ones. This is especially dangerous to the poor because, as O'Neil writes, "The privileged are processed more by people, the masses by machines." She told me that when the boss asks for an algorithm that optimizes for profit, it is the data scientist's duty to mention that the algorithm should also consider fairness.



"Weapons of Math Destruction" tells us how to recognize a WMD once it is built. But how can we predict whether an algorithm will become a WMD? O'Neil told me, "The biggest warning sign is if you're choosing winners and losers, and if it's a big deal for losers to lose. If it's an important decision and it's a secret formula, then that's a set-up for a weapon of math destruction. The only other ingredient you need in that setup is actually making it destructive."



Reform



Cathy O'Neil says the top priority, for data scientists who want to disarm WMDs, is to develop tools for analyzing them. For example, any EU citizen harmed by an algorithmic decision may soon have the legal right to an explanation, but so far we lack the tools to provide one. We also need tools to measure disparate impact and unfairness. O'Neil says, "We need tools to decide whether an algorithm is being racist."



New data scientists should enter the field with better training in ethics. Curricula usually ignore questions of justice, as if the job of the data scientist were purely technical. Data-science contests like Kaggle also encourage this view, says O'Neil. "Kaggle has defined the success and the penalty function. The hard part of data science is everything that happens before Kaggle." O'Neil wants more case studies from the field, anonymized so students can learn from them how data science is really practiced. It would be an opportunity to ask: When an algorithm makes a mistake, who gets hurt?



If data scientists take responsibility for the effects of their work, says O'Neil, they will become activists. "I'm hoping the book, at the very least, gets people to acknowledge the power that they're wielding," she says, "and how it could be used for good or bad. The very first thing we have to realize is that well-intentioned people can make horrible mistakes."






1. Quoted in "Alan Turing: The Enigma", by Andrew Hodges. Princeton University Press. ↩



2. See also ProPublica's analysis of bias in a similar recidivism model, COMPAS. ↩
  • Share This:  
  •  Facebook
  •  Twitter
  •  Google+
  •  Stumble
  •  Digg
Gửi email bài đăng nàyBlogThis!Chia sẻ lên XChia sẻ lên Facebook
Bài đăng Mới hơn Bài đăng Cũ hơn Trang chủ

0 nhận xét:

Đăng nhận xét

Popular Posts

  • Microsoft Office 2016 Portable + Professional Plus Download Free ( 32-bit/64-bit )
    Microsoft Office 2016 Portable + Professional Plus Download Free ( 32-bit/64-bit ) for all Microsoft windows. It's an offline setup stan...
  • Autodesk ArtCAM 2017 Crack + Patch + Full Version
    Autodesk ArtCAM 2017 Crack + Patch + Full Version ArtCAM is, in fact, a design tool designed more for designers than engineers, and allows d...
  • Download PTC Creo v5.0 / v4.0 + Crack
    Download PTC Creo v5.0 / v4.0 + Crack is a full version offline installer software program for your pc and you can also download from portab...
  • Windows 7 Tiny Unattended Fully Activated CD (x86)
    Description:- This is a re-up of experience's Windows Tiny7 Rev01 Unattended Activated CD (x86). Total size after install is 1.64 Gb, an...
  • Maxon CINEMA 4D Studio R19.024 Free Download
    Download Maxon CINEMA 4D Studio R19.024 Free latest version offline setup for Microsoft Windows 7, 8, 10, XP, Vista. Maxon CINEMA 4D Studio ...
  • ARTCUT 2009 Software USB Driver Free Full Download
    Download ARTCUT 2009 Software USB Driver Full Free latest version offline setup for Microsoft Windows 7, 8,  10, XP, Vista. ARTCUT 2009 Soft...
  • Tekla Structural Designer 2019 v19.0.0.104 (32/64 Bit) Free Full Download
    Download Tekla Structural Designer 2019 v19.0.0.104 (32/64 Bit) Full Free Crack latest version offline setup for Microsoft Windows 7, 8, 10,...
  • Adobe Premiere Pro CC 2018 Download Latest Version
    Adobe Premiere Pro CC 2018 Download Latest Version Adobe Premiere Pro CC 2018 Download Latest Version for Windows and Mac 32 bit & 64-bi...
  • Adobe Acrobat Pro DC 2019 Portable (v19.010.20064)
    Adobe Acrobat Pro DC 2019 Portable (v19.010.20064) is a software for creating PDF files. With the help of this software, the user can conver...
  • SketchUp Pro 2019 v19.0.685 + Windows + Portable / macOS
    SketchUp Pro 2019 v19.0.685 + Windows + Portable / macOS is a full version offline installer software program for your pc and you can also d...

Maxon CINEMA 4D Studio R19.024 Free Download

Download Maxon CINEMA 4D Studio R19.024 Free latest version offline setup for Microsoft Windows 7, 8, 10, XP, Vista. Maxon CINEMA 4D Studio ...

Tìm kiếm Blog này

Lưu trữ Blog

  • tháng 6 2019 (4)
  • tháng 5 2019 (4)
  • tháng 4 2019 (4)
  • tháng 3 2019 (13)
  • tháng 2 2019 (15)
  • tháng 1 2019 (23)
  • tháng 12 2018 (20)
  • tháng 11 2018 (3)
  • tháng 10 2018 (6)
  • tháng 9 2018 (1)
  • tháng 8 2018 (7)
  • tháng 7 2018 (26)
  • tháng 6 2018 (31)
  • tháng 5 2018 (9)
  • tháng 4 2018 (12)
  • tháng 3 2018 (17)
  • tháng 2 2018 (14)
  • tháng 1 2018 (3)
  • tháng 12 2017 (8)
  • tháng 11 2017 (11)
  • tháng 10 2017 (22)
  • tháng 9 2017 (24)
  • tháng 8 2017 (9)
  • tháng 7 2017 (5)
  • tháng 5 2017 (4)
  • tháng 4 2017 (3)
  • tháng 3 2017 (4)
  • tháng 2 2017 (6)
  • tháng 1 2017 (7)
  • tháng 12 2016 (4)
  • tháng 11 2016 (4)
  • tháng 10 2016 (2)
  • tháng 8 2016 (4)
  • tháng 7 2016 (4)
  • tháng 6 2016 (3)
  • tháng 5 2016 (6)
  • tháng 4 2016 (5)
  • tháng 3 2016 (2)
  • tháng 1 2016 (3)
  • tháng 12 2015 (2)
  • tháng 11 2015 (11)
  • tháng 10 2015 (22)
  • tháng 9 2015 (12)
  • tháng 8 2015 (2)
  • tháng 7 2015 (16)
  • tháng 6 2015 (11)
  • tháng 5 2015 (5)
  • tháng 4 2015 (31)
  • tháng 3 2015 (33)
  • tháng 2 2015 (4)
  • tháng 1 2015 (1)
  • tháng 11 2014 (20)
  • tháng 10 2014 (5)
Được tạo bởi Blogger.

Nhãn

  • 3d Animation
  • 3d CAD
  • 3D Software
  • Action Games
  • Adobe
  • Adobe Elements
  • Adobe Illustrator
  • Adobe Photoshop
  • Adobe Premiere
  • Adventure Games
  • advocacy
  • Africa
  • Android
  • Android Application
  • Android Stuff
  • Antivirus
  • APK Games
  • Apps
  • Arcade Games
  • Asia-Pacific
  • Audio Editing
  • award
  • Bangla Software
  • BBC
  • Best and Top
  • board
  • Board of Directors
  • brochure
  • C
  • call
  • Call for Proposals
  • Caribbean
  • CD & DVD
  • children
  • coding literacy
  • community
  • community service awards
  • conference
  • conferences
  • contributions
  • Converter
  • Converters
  • CorelDraw ELEMENTS
  • CSA
  • Cuba
  • Cubase
  • Data Recovery
  • deadlines
  • Dell Desktop Drivers
  • democracy
  • Design
  • Design & Editing
  • Design Tools
  • Development
  • director
  • diversity
  • Django Girls
  • documentation
  • Dominican Republic
  • donations
  • Driver
  • Driver Software
  • DriverPack Solution
  • Drivers
  • e-vote
  • ecosystem
  • edu-sig
  • education
  • election
  • europython
  • events
  • Fighting Games
  • Folder Lock
  • Font Creator
  • foundation
  • frank-willison. Young Coders
  • Gaming
  • Google Summer of Code
  • grants
  • Graphic Design
  • Graphic Tools
  • Graphics
  • GTA
  • hardware
  • IDM
  • Illustrator
  • infrastructure
  • Inpages
  • Internet
  • Internet Security
  • Japan
  • job board
  • jobs
  • Mac
  • Memories of Lost Time
  • mentoring
  • micro:bit
  • microbit
  • MicroPython
  • microsoft
  • Microsoft Office
  • Multi-media
  • Multimedia
  • Music
  • News
  • nominate
  • nominations
  • non-profit
  • NumFocus
  • Office
  • Office Elements
  • open source
  • Operating System
  • opportunity
  • oscon
  • outreach
  • Pc Game
  • PC Games
  • PC-Games
  • Popular Tools
  • Portable
  • porting
  • PSF
  • PSF funding
  • public relations
  • Puzzle Games
  • PyCaribbean
  • pycon
  • pycon2016
  • pycon2018
  • pydotorg
  • PyLadies
  • PyOhio
  • pypi
  • pypy
  • python
  • python3
  • Racing Games
  • RPG-Games
  • Santo Domingo
  • Science
  • Scientific Computing
  • scipy
  • Screen Recorder
  • Security
  • Security Softwares
  • Shooting Games
  • Simulation Games
  • Software
  • Softwares
  • South America
  • sponsorship
  • Sports Games
  • sprints
  • Strategy Games
  • students
  • Super Copier
  • Super Copy
  • support
  • System Software
  • System Tools
  • talks
  • Tech
  • Tips
  • Total Security
  • travel
  • tutorials
  • Typing Software
  • uk
  • Utilities
  • Video Editing
  • volunteers
  • Windows
  • Windows Themes
  • Working Group
  • Young Coders

Báo cáo vi phạm

  • Trang chủ

Copyright © Wootware | Powered by Blogger
Design by Hardeep Asrani | Blogger Theme by NewBloggerThemes.com | Distributed By Gooyaabi Templates