Emily M. Bender

Emily M. Bender
Born1973 (age 50–51)
Known forResearch on the risks of large language models and ethics of NLP; coining the term 'Stochastic parrot'; research on the use of Head-driven phrase structure grammar in computational linguistics
SpouseVijay Menon[4]
MotherSheila Bender[3]
Academic background
Alma materUC Berkeley and Stanford University[1][2]
ThesisSyntactic variation and linguistic competence: The case of AAVE copula absence (2000[1][2])
Doctoral advisorTom Wasow
Penelope Eckert[2]
Academic work
DisciplineLinguistics
Sub-disciplineSyntax, computational linguistics
InstitutionsUniversity of Washington

Emily Menon Bender (born 1973) is an American linguist who is a professor at the University of Washington. She specializes in computational linguistics and natural language processing. She is also the director of the University of Washington's Computational Linguistics Laboratory.[5][6] She has published several papers on the risks of large language models and on ethics in natural language processing.[7]

Education

Bender earned an AB in Linguistics from UC Berkeley in 1995. She received her MA from Stanford University in 1997 and her PhD from Stanford in 2000 for her research on syntactic variation and linguistic competence in African American Vernacular English (AAVE).[8][1] She was supervised by Tom Wasow and Penelope Eckert.[2]

Career

Before working at University of Washington, Bender held positions at Stanford University, UC Berkeley and worked in industry at YY Technologies.[9] She currently holds several positions at the University of Washington, where she has been faculty since 2003, including professor in the Department of Linguistics, adjunct professor in the Department of Computer Science and Engineering, faculty director of the Master of Science in Computational Linguistics,[10] and director of the Computational Linguistics Laboratory.[11] Bender is the current holder of the Howard and Frances Nostrand Endowed Professorship.[12][13]

Bender was elected VP-elect of the Association for Computational Linguistics in 2021.[14] Bender served as VP-elect in 2022, moving to Vice-President in 2023. She is serving as President through 2024,[15][16] and will serve as Past President in 2025. Bender was elected a Fellow of the American Association for the Advancement of Science in 2022.[17]

Contributions

Bender has published research papers on the linguistic structures of Japanese, Chintang, Mandarin, Wambaya, American Sign Language and English.[18]

Bender has constructed the LinGO Grammar Matrix, an open-source starter kit for the development of broad-coverage precision HPSG grammars.[19][20] In 2013, she published Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax, and in 2019, she published Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics with Alex Lascarides, which both explain basic linguistic principles in a way that makes them accessible to NLP practitioners.[citation needed]

In 2021, Bender presented a paper, "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜" co-authored with Google researcher Timnit Gebru and others at the ACM Conference on Fairness, Accountability, and Transparency[21] that Google tried to block from publication, part of a sequence of events leading to Gebru departing from Google, the details of which are disputed.[22] The paper concerned ethical issues in building natural language processing systems using machine learning from large text corpora.[23] Since then, she has invested efforts to popularize AI ethics and has taken a stand against hype over large language models.[24][25]

The Bender Rule, which originated from the question Bender repeatedly asked at the research talks, is research advice for computational scholars to "always name the language you're working with".[4]

She draws a distinction between linguistic form versus linguistic meaning.[4] Form refers to the structure of language (e.g. syntax), whereas meaning refers to the ideas that language represents. In a 2020 paper, she argued that machine learning models for natural language processing which are trained only on form, without connection to meaning, cannot meaningfully understand language.[26] Therefore, she has argued that tools like ChatGPT have no way to meaningfully understand the text that they process, nor the text that they generate.[citation needed]

Selected publications

Books

  • Bender, Emily M. (2000). Syntactic Variation and Linguistic Competence: The Case of AAVE Copula Absence. Stanford University. ISBN 978-0493085425.
  • Sag, Ivan; Wasow, Thomas; Bender, Emily M. (2003). Syntactic theory: A formal introduction. Center for the Study of Language and Information. ISBN 978-1575864006.
  • Bender, Emily M. (2013). Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax. Synthesis Lectures on Human Language Technologies. Springer. ISBN 978-3031010224.
  • Bender, Emily M.; Lascarides, Alex (2019). Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics. Synthesis Lectures on Human Language Technologies. Springer. ISBN 978-3031010446.

Articles

  • Bender, Emily (2000). "The Syntax of Mandarin Bă: Reconsidering the Verbal Analysis". Journal of East Asian Linguistics. 9 (2): 105–145. doi:10.1023/A:1008348224800. S2CID 115999663 – via Academia.edu.
  • Bender, Emily M.; Flickinger, Dan; Oepen, Stephan (2002). The Grammar Matrix: An open-source starter-kit for the rapid development of cross-linguistically consistent broad-coverage precision grammars. Proceedings of the 2002 workshop on Grammar engineering and evaluation. Vol. 15.
  • Siegel, Melanie; Bender, Emily M. (2002). Efficient deep processing of Japanese. Proceedings of the 3rd workshop on Asian language resources and international standardization. Vol. 12.
  • Goodman, M. W.; Crowgey, J.; Xia, F; Bender, E. M. (2015). "Xigt: Extensible interlinear glossed text for natural language processing". Lang Resources & Evaluation. 49 (2): 455–485. doi:10.1007/s10579-014-9276-1. S2CID 254372685.
  • Xia, Fei; Lewis, William D.; Goodman, Michael Wayne; Slayden, Glenn; Georgi, Ryan; Crowgey, Joshua; Bender, Emily M. (2016). "Enriching A Massively Multilingual database of interlinear glossed text". Lang Resources & Evaluation. 50 (2): 321–349. doi:10.1007/s10579-015-9325-4. S2CID 254379828.
  • Bender, Emily M.; Gebru, Timnit; McMillan-Major and, Angelina; Shmitchell, Shmargaret (2021). On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜. FAccT '21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. doi:10.1145/3442188.3445922.

See also

References

  1. ^ a b c Baković, Eric (2006-10-04). "Language Log: Speaking of missing words in American history". Language Log. Archived from the original on 2023-05-27. Retrieved 2024-02-12.
  2. ^ a b c d "Emily M. Bender". OpenReview. Retrieved 2023-09-11.
  3. ^ "In Conversation with Emily Menon Bender - Sheila Bender's Writing It Real". 2023-09-07. Retrieved 2023-09-11.
  4. ^ a b c Weil, Elizabeth (2023-03-01). "You Are Not a Parrot". Intelligencer. Retrieved 2023-09-11.
  5. ^ "Emily M. Bender | Department of Linguistics | University of Washington". linguistics.washington.edu. Retrieved 2021-11-09.
  6. ^ "Emily M. Bender". University of Washington faculty website. Retrieved February 4, 2023.
  7. ^ Bender, Emily M. (2022-06-14). "Human-like programs abuse our empathy – even Google engineers aren't immune". The Guardian. ISSN 0261-3077. Retrieved 2023-02-04.
  8. ^ Bender, Emily. "Emily Bender CV" (PDF).
  9. ^ "Emily M. Bender". University of Washington. 2021-11-10. Retrieved 2021-11-10.
  10. ^ "UW Computational Linguistics Master's Degree – Online & Seattle". www.compling.uw.edu. Retrieved 2017-07-19.
  11. ^ "UW Computational Linguistics Lab".
  12. ^ Parvi, Joyce (2019-08-21). "Emily M. Bender is awarded Howard and Frances Nostrand Endowed Professorship for 2019–2021". linguistics.washington.edu. Retrieved 2019-12-08.
  13. ^ "Emily M Bender". The Alan Turing Institute. Retrieved 2021-10-31.
  14. ^ "ACL 2021 Election Results: Congratulations to Emily M. Bender and Mohit Bansal". 2021-11-09. Retrieved 2021-11-10.
  15. ^ "About the ACL". 2024. Retrieved 2024-02-23.
  16. ^ "ACL Officers". 2024-02-05. Retrieved 2024-02-23.
  17. ^ "2022 AAAS Fellows". American Association for the Advancement of Science. Retrieved 2023-08-03.
  18. ^ "Emily M. Bender: Publications". University of Washington faculty website. Retrieved 2021-11-18.
  19. ^ "LinGO Grammar Matrix | Department of Linguistics | University of Washington". linguistics.washington.edu. Retrieved 2017-07-19.
  20. ^ "An open source grammar development environment and broad-coverage English grammar using HPSG" (PDF). LREC. 2000. Archived from the original (PDF) on 2017-08-09. Retrieved 2017-07-19.
  21. ^ Bender, Emily M.; Gebru, Timnit; McMillan-Major, Angelina; Shmitchell, Shmargaret (2021-03-03). "On the Dangers of Stochastic Parrots: Can Language Models be Too Big? 🦜". Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. FAccT '21. New York, NY, USA: Association for Computing Machinery. pp. 610–623. doi:10.1145/3442188.3445922. ISBN 978-1-4503-8309-7.
  22. ^ Simonite, Tom. "What Really Happened When Google Ousted Timnit Gebru". Wired. ISSN 1059-1028. Retrieved 2024-04-02.
  23. ^ Hao, Karen (December 4, 2020). "We read the paper that forced Timnit Gebru out of Google. Here's what it says". MIT Technology Review.
  24. ^ "Inside a Hot-Button Research Paper: Dr. Emily M. Bender Talks Large Language Models and the Future of AI Ethics". Emerging Tech Brew. Retrieved 2022-09-26.
  25. ^ Bender, Emily M. (2022-05-02). "On NYT Magazine on AI: Resist the Urge to be Impressed". Medium. Retrieved 2022-09-26.
  26. ^ Bender, Emily M.; Koller, Alexander (2020-07-05). "Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data". Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Online: Association for Computational Linguistics: 5185–5198. doi:10.18653/v1/2020.acl-main.463. S2CID 211029226.