AI Ethics Learning Toolkit

Is AI Theft?

“This harvesting of our work, scraping and impersonating our styles, it’s the same as a parasite that sucks the light out of its host without invitation.”

– Tim Flach, wildlife photographer, one of the most “scraped” artists in the world

To many users, AI seems like magic. It can generate images, write papers, and summarize complex topics into tidy bullet points. But what sources are under AI’s hood? How does it know so much? AI models are trained on vast amounts of human-generated content from the internet, including everything from transcribed YouTube videos and Reddit threads to image libraries and the entire text of Wikipedia.

This raises a critical legal and ethical question: Are AI companies committing copyright infringement by scraping and using this content without permission? The New York Times and numerous authors, artists, and musicians think so. They’ve filed lawsuits against companies like OpenAI and Meta, arguing that their work fuels these models, yet they receive no credit, attribution, or compensation. AI companies defend their practices by citing the Fair Use doctrine, claiming that AI outputs are “transformative”: computational mashups that are fundamentally different from the original works. But many legal questions remain unresolved, and frustration is building across the creative industries affected.

As students increasingly turn to AI, it’s important for them to consider not just what the tools can do, but also the human contributions and intellectual property that made them possible. Understanding the legal and ethical issues is key to becoming a responsible and informed user.

Learning Activities

🗣️ Conversation Starters: A Few Questions to Get the Discussion Going


  • Have you heard or read any stories about artists, musicians, or authors speaking out about their work being impacted by AI? If so, are you sympathetic?
  • How do you see the long-term impacts of generative AI on creative industries (e.g., writing, art, music, movies/TV)?
  • Do you think artists, musicians, and writers should be credited and/or compensated by AI companies that have trained models on their work? What would a fair system look like?
  • Is it legal to upload copyrighted content to an AI tool to get summaries for personal use? How aware are you of the copyright restrictions that apply to the AI tools you use? Who might be harmed by copyright violations involving AI?
  • What responsibilities should students or researchers have when giving credit to AI-generated content they relied on in their own work?

💡 Active Learning with AI: Fun Ways to Explore AI’s Strengths and Limitations


  • Prompt ChatGPT for the full text of something you know to be copyrighted (e.g., a chapter from your Chem 101 textbook). How does it respond?
  • Prompt ChatGPT to write lyrics “in the style of” your favorite artist, musician, or writer (e.g., “In the style of Taylor Swift, write me a song about getting ghosted by a friend”). What do you think about the output? Could you see this kind of mimicry as potentially harmful?
  • Try a music generator (e.g., Suno) and create a song. What was your prompt? What do you think about the output? Could you see this as potentially harmful?

🎓 Disciplinary Extensions: Ideas for Exploring AI’s Impact in Specific Fields


  • Journalism/Public Policy: How does AI harm journalism? Use a case study, like the New York Times lawsuit, to have students debate the legal arguments.
  • Art/Visual Studies/Film: Discuss the impact of AI on the art and film industries, using the OpenAI Studio Ghibli AI as a case study (background article).
  • I&E/Engineering: Students can explore the intellectual property issues behind engineering design involving AI. For example, if an AI tool is used to design a product, who should own the result: the user, the developer, or the company behind the AI?
  • Literature: Whether and how digital humanists can conduct quantitative work has long been debated. Some publishers (like JSTOR and Wiley) already license their copyrighted materials for scholars to conduct “distant reading.” Is this use case different from licensing copyrighted materials for training AI models? Why or why not?

Resources

Scholarly

Recommendations


  1.  Sellman, M. (2025, February 7). Photographer says AI being used to copy his work. The Times.