Digital Citizen
Fastmail
Categories: Technology
Listen to the latest episode:
Join us on a journey to learn more about the intersection of linguistics and AI with special guest Emily M. Bender. Come with us as we learn how linguistics functions in modern language models like ChatGPT.
Episode Notes
Discover the origins of language models, the negative implications of sourcing data to train these technologies, and the value of authenticity.
▶️ Guest Interview - Emily M. Bender
- Learn more about Emily M. Bender
- Read On the Dangers of Stochastic Parrots (2021) by Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Margaret Mitchell.
- Check out the publications by cognitive scientist Abeba Birhane.
- See work from AI research scientist Meg Mitchell.
- Emily M. Bender is a Professor of Linguistics at the University of Washington. Her work focuses on grammar engineering and the societal impacts of language technology. She has spoken and written about what it means to make informed decisions about AI and large language models such as ChatGPT.
- Artificial Intelligence (AI) is a marketing term coined in the 1950s by John McCarthy. It refers to an area of computer science. AI is a technology built using natural language processing and linguistics, the science of how language works. Understanding how language works is necessary to comprehend the potential misuse and limitations of large language models.
- A language model is a type of technology designed to model the distribution of word forms in text. While early language models simply determined the relative frequency of words in a text, today’s language models are larger, both in the data they store and in the amount of language they are trained on. As a society, we must continue reminding ourselves that synthetic text is not a credible information source. Before sharing information, it’s smart to verify that it was written by a human rather than a machine. Valuing authenticity and citations is one of the most important things we can do.
- Distributional biases in the training data used for large language models are reproduced in their output. The less care we put into curating training data, the more patterns and systems of oppression will be reproduced, regardless of whether the end result is presented as fact or fiction.
- Being a good digital citizen means avoiding using products built on data theft and labor exploitation. On an individual level, we should insist on transparency regarding synthetic media. Part of the problem is that there is currently no watermarking at the source. There is a major need for regulation and accountability around synthetic text nationally. We can also continue to increase the value of authenticity.
- Digital Citizen Website: fastmail.com/digitalcitizen.
- Check out our blog.
- Tweet us @Fastmail.
- Follow us on Mastodon: @fastmail@mastodon.social.
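For readers curious about what "the relative frequency of words in a text" means in practice, here is a minimal Python sketch of that early, counting-based idea. It is an illustration only, not something discussed in the episode: the function name `relative_frequencies`, the simple tokenization rule, and the sample sentence are all assumptions made for this example, and modern large language models work very differently.

```python
import re
from collections import Counter


def relative_frequencies(text: str) -> dict[str, float]:
    """Return each word form's share of all word tokens in `text`."""
    # Assumption: a "word" is any run of letters or apostrophes, lowercased.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}  # no words found, so there is no distribution to report
    return {word: count / total for word, count in counts.items()}


# Hypothetical sample text, used only to show the function in action.
sample = "the cat sat on the mat and the dog sat too"
freqs = relative_frequencies(sample)

# Print the word forms from most to least frequent.
for word, share in sorted(freqs.items(), key=lambda item: -item[1]):
    print(f"{word}: {share:.2f}")
```

The result is nothing more than a table of how often each word form appears. It records patterns in the text it was given and says nothing about whether that text is true, which is the point the notes above make about synthetic text.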
If you love this show, please leave us a review on Apple Podcasts or wherever you listen to podcasts. Take our survey to tell us what you think at digitalcitizenshow.com/survey.
Previous episodes
- 24 - Exploring AI with Emily M. Bender (Tue, 25 Jun 2024)
- 23 - From Players to Creators: Diving into the Video Game Industry (Tue, 11 Jun 2024)
- 22 - You Can Thrive Here: Local Leaders on Philly’s Move Into Ethical Tech (Tue, 28 May 2024)
- 21 - Repairing Our Right To Fix it with Aaron Perzanowski (Tue, 14 May 2024)
- 20 - Getting Things Done Using Your Calendar with David Tedaldi from Morgen (Tue, 30 Apr 2024)
- 19 - Finding the Balance Between Productivity and Grind Culture with Abha from The Werk Life (Tue, 16 Apr 2024)
- 18 - Avoiding Procrastination with Adam Conover (Tue, 02 Apr 2024)
- 17 - Digital Citizen - Season 3 Trailer (Tue, 19 Mar 2024)
- 16 - The Future of AI with WGA Strike Leader Adam Conover (Wed, 13 Sep 2023)
- 15 - The Future of Hybrid Digital Communities (Wed, 17 May 2023)
- 14 - Experiencing Art in a Digital World with JiaJia Fei (Tue, 20 Dec 2022)
- 13 - How to Prioritize Connection in a Remote Workforce with Recess Part 2 (Tue, 06 Dec 2022)
- 12 - Why Play is so Powerful in Adulthood with Recess Part 1 (Tue, 22 Nov 2022)
- 11 - Building a More Accessible Tech World with Dan from Hopeworks (Tue, 08 Nov 2022)
- 10 - Uplifting Community Through Good Digital Citizenship with Kayondra Garrison (Tue, 08 Nov 2022)
- 9 - The Future of Online Community with L.X. Beckett (Tue, 25 Oct 2022)
- 8 - Upgrade Your Cyber Security with Troy Hunt Part 2 (Tue, 11 Oct 2022)
- 7 - Everything You Need to Know About Data Breaches with Troy Hunt (Mon, 26 Sep 2022)
- 6 - How to Stay Safe Online with Michael Fey from 1Password (Wed, 29 Sep 2021)
- 5 - Why Open Internet Standards Are So Important To Your Future with Bron Gondwana (Mon, 23 Aug 2021)
- 4 - Understanding Your Digital Rights with Lucie Krahulcova (Tue, 10 Aug 2021)
- 3 - How To Have a Healthier Social Media Diet with Tom Webster (Tue, 27 Jul 2021)
- 2 - How To Improve Your Digital Life with BJ Fogg (Mon, 12 Jul 2021)
- 1 - Digital Citizen - Trailer #1 (Fri, 16 Apr 2021)