# Data ethics in the real world
To make the ideas in the checklist more concrete, we've compiled examples of times when tradeoffs were handled well and times when things went wrong. Examples are paired with the checklist questions to help illuminate where in the process an ethics discussion might have provided a course correction. Positive examples show how the principles behind deon can be followed in the real world.
- ✅ A voiceover studio is now required to get informed consent from a performer before using their likeness in AI-generated content.
- ⛔ Facebook uses phone numbers provided for two-factor authentication to target users with ads.
- ⛔ African-American men were enrolled in the Tuskegee Study on the progression of syphilis without being told the true purpose of the study or that treatment for syphilis was being withheld.
- ⛔ OpenAI's ChatGPT memorized and regurgitated entire poems without checking for copyright permissions.
- ⛔ StreetBump, a smartphone app to passively detect potholes, may fail to direct public resources to areas where smartphone penetration is lower, such as lower income areas or areas with a larger elderly population.
- ⛔ Facial recognition cameras used for passport control register the eyes of Asian travelers as closed.
- ✅ DuckDuckGo enables users to anonymously access ChatGPT by not collecting user IP addresses along with queries.
- ⛔ Personal information on taxi drivers can be recovered from a poorly anonymized taxi trip dataset released by New York City (see the de-anonymization sketch after this list).
- ⛔ The Netflix Prize dataset of movie ratings from 500,000 customers is easily de-anonymized by cross-referencing it with other publicly available datasets.
- ⛔ In six major cities, Amazon's same day delivery service excludes many predominantly black neighborhoods.
- ⛔ Facial recognition software is significantly worse at identifying people with darker skin.
- ✅ MediCapt, which documents forensic evidence in conflict regions, effectively protects sensitive information using encryption, limited access, and security audits.
- ⛔ Personal and financial data for more than 146 million people was stolen in Equifax data breach.
- ⛔ Cambridge Analytica harvested private information from over 50 million Facebook profiles without users' permission.
- ⛔ AOL accidentally released 20 million search queries from 658,000 customers.
- ✅ Code for America programmatically cleared >140,000 eligible criminal records by collaborating with multiple relevant stakeholders like policymakers, advocacy groups, and courts.
- ⛔ When Apple's HealthKit came out in 2014, women couldn't track menstruation.
- ✅ A study by Park et al. shows how reweighting can mitigate racial bias when predicting risk of postpartum depression (a minimal reweighing sketch follows this list).
- ⛔ word2vec, trained on Google News corpus, reinforces gender stereotypes.
- ⛔ Women are more likely to be shown lower-paying jobs than men in Google ads.
- ⛔ Misleading chart shown at Planned Parenthood hearing distorts actual trends of abortions vs. cancer screenings and preventative services.
- ⛔ Georgia Dept. of Health graph of COVID-19 cases falsely suggests a steeper decline when dates are ordered by total cases rather than chronologically.
- ✅ NASA's Transform to Open Science initiative is working to make research more reproducible and accessible.
- ✅ Medic's Community Health Toolkit supports health workers in hard-to-reach areas. The toolkit is fully open source on GitHub for anyone to view or contribute to.
- ⛔ Excel error in well-known economics paper undermines justification of austerity measures.
- ✅ Amazon developed an experimental AI recruiting tool, but did not deploy it because it learned to perpetuate bias against women.
- ⛔ In hypothetical trials, language models assign the death penalty more frequently to defendants who use African American dialects.
- ⛔ Variables used to predict child abuse and neglect are direct measurements of poverty, unfairly targeting low-income families for child welfare scrutiny.
- ⛔ Criminal sentencing risk assessments don't ask directly about race or income, but other demographic factors can end up serving as proxies.
- ⛔ Creditworthiness algorithms based on nontraditional criteria such as grammatical habits, preferred grocery stores, and friends' credit scores can perpetuate systemic bias.
- ✅ A study by Garriga et al. uses ML best practices to test for and communicate fairness across racial groups for a model that predicts mental health crises.
- ⛔ Apple credit card offers smaller lines of credit to women than men.
- ⛔ With COMPAS, a risk-assessment algorithm used in criminal sentencing, black defendants are almost twice as likely as white defendants to be mislabeled as likely to reoffend (a per-group error-rate audit is sketched after this list).
    - Northpointe's rebuttal to the ProPublica article.
    - Related academic study.
- ⛔ Google's speech recognition software doesn't recognize women's voices as well as men's.
- ⛔ Google searches involving black-sounding names are more likely to serve up ads suggestive of a criminal record than white-sounding names.
- ⛔ OpenAI's GPT models show racial bias in ranking job applications based on candidate names.
- ✅ Facebook seeks to optimize "time well spent", prioritizing interaction over popularity.
- ⛔ YouTube's search autofill suggests pedophiliac phrases due to high viewership of related videos.
- ⛔ A widely used commercial algorithm in the healthcare industry underestimates the care needs of black patients because it optimizes for spending as a proxy for need, introducing racial bias due to unequal access to care.
- ✅ GDPR includes a "right to explanation," i.e. meaningful information on the logic underlying automated decisions.
- ⛔ Patients with pneumonia and a history of asthma are usually admitted to the intensive care unit because they have a high risk of dying from pneumonia. Because that intensive care works so well, a neural network trained on the outcome data predicted that asthmatics had a low risk of dying and could therefore be sent home. Without explanatory models to surface this issue, patients might have been sent home to die.
- ✅ OpenAI posted an explanation of how ChatGPT is trained to behave, its limitations, and future directions for improvement.
- ⛔ Google Flu Trends claims to accurately predict weekly influenza activity and then misses the 2009 swine flu pandemic.
- ✅ RobotsMali uses AI to create children's books in Mali's native languages, and incorporates human review to ensure that all AI-generated content is accurate and culturally sensitive.
- ⛔ Dutch Prime Minister and entire cabinet resign after investigations reveal that 26,000 innocent families were wrongly accused of social benefits fraud partially due to a discriminatory algorithm.
- ⛔ Sending police officers to areas of high predicted crime skews future training data collection, since police are repeatedly sent back to the same neighborhoods regardless of the true crime rate (a short feedback-loop simulation follows this list).
- ✅ Healing ARC uses a targeted, race-conscious algorithm to counteract documented inequities in access to heart failure care for Black and Latinx patients.
- ⛔ Software mistakes result in healthcare cuts for people with diabetes or cerebral palsy.
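
A few of the examples above hinge on mechanisms that are easier to see in code than in prose. The sketches below are illustrations under stated assumptions, not reconstructions of the actual systems involved.

In the New York City taxi release, medallion numbers were "anonymized" with an unsalted MD5 hash; because the space of valid medallion numbers is small, every hash can be reversed with a precomputed lookup table. A minimal sketch, using a made-up ID format for illustration:

```python
import hashlib

def md5_hex(s: str) -> str:
    """Unsalted MD5, the scheme used to 'anonymize' the released field."""
    return hashlib.md5(s.encode()).hexdigest()

# Hypothetical ID format for illustration: digit, letter, digit, digit
# (e.g. "5X55"). Any similarly constrained format yields a space small
# enough to enumerate in seconds.
candidates = [
    f"{d1}{c}{d2}{d3}"
    for d1 in "0123456789"
    for c in "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    for d2 in "0123456789"
    for d3 in "0123456789"
]

# Precompute hash -> ID for the entire space (a tiny rainbow table).
lookup = {md5_hex(m): m for m in candidates}

# Any "anonymized" value in the dataset now reverses instantly.
leaked = md5_hex("5X55")   # value as it would appear in the release
print(lookup[leaked])      # -> "5X55"
```

The lesson is that hashing is not anonymization when the input space is enumerable; safer options are random identifiers or dropping the field entirely.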
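
The reweighting mentioned in the Park et al. example is commonly a variant of Kamiran and Calders' "reweighing," which assigns each training example the weight P(group) × P(label) / P(group, label) so that the protected attribute and the outcome look statistically independent to the learner. A minimal sketch on synthetic data (the column names, model, and data are all assumptions, not the study's actual setup):

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5000

# Synthetic stand-in data: 'group' is the protected attribute,
# 'label' the outcome being predicted.
df = pd.DataFrame({
    "group": rng.integers(0, 2, n),
    "x1": rng.normal(size=n),
    "label": rng.integers(0, 2, n),
})

# Reweighing (Kamiran & Calders): w(g, y) = P(g) * P(y) / P(g, y).
p_group = df["group"].value_counts(normalize=True)
p_label = df["label"].value_counts(normalize=True)
p_joint = df.groupby(["group", "label"]).size() / n

weights = df.apply(
    lambda row: p_group[row["group"]] * p_label[row["label"]]
    / p_joint[(row["group"], row["label"])],
    axis=1,
)

# Most scikit-learn estimators accept the weights directly.
model = LogisticRegression().fit(df[["x1"]], df["label"], sample_weight=weights)
```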
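
The COMPAS disparity ProPublica reported is a gap in error rates between groups, which overall accuracy can completely hide. Auditing for it only requires confusion-matrix rates computed per group. A minimal sketch over a hypothetical audit table (all values invented):

```python
import pandas as pd

# One row per defendant: did they reoffend, and were they flagged high risk?
df = pd.DataFrame({
    "group":  ["a", "a", "a", "a", "b", "b", "b", "b"],
    "y_true": [0, 0, 1, 1, 0, 0, 1, 1],
    "y_pred": [1, 0, 1, 0, 0, 0, 1, 1],
})

def error_rates(g: pd.DataFrame) -> pd.Series:
    """False positive and false negative rates within one group."""
    neg, pos = g[g.y_true == 0], g[g.y_true == 1]
    return pd.Series({
        "fpr": (neg.y_pred == 1).mean(),  # didn't reoffend, flagged high risk
        "fnr": (pos.y_pred == 0).mean(),  # reoffended, flagged low risk
    })

# Compare mistake patterns across groups, not just overall accuracy.
print(df.groupby("group")[["y_true", "y_pred"]].apply(error_rates))
```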
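
The predictive-policing feedback loop can be reproduced in a few lines: if crime is only observed where officers patrol, and patrols are dispatched to wherever past data shows the most crime, an early random fluctuation locks in permanently even when the true crime rates are identical. A small simulation (all numbers invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

true_rate = np.array([0.5, 0.5])  # two areas, identical true crime rates
patrols = np.array([0.5, 0.5])    # initial patrol shares
observed = np.zeros(2)            # cumulative incidents seen by police

for _ in range(200):
    # Crime is only *observed* where officers are present, so the
    # observation rate scales with each area's patrol share.
    observed += rng.poisson(true_rate * patrols * 10)
    # "Predictive" dispatch: most patrols go wherever the data
    # collected so far shows the higher count.
    hot = observed.argmax()
    patrols = np.where(np.arange(2) == hot, 0.8, 0.2)

# One area now dominates the dataset even though the underlying rates
# were equal: the dispatch policy manufactured its own evidence.
print(observed, patrols)
```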