Prompt Surfing DRG - Course Schedule

Week 1 (1/8/2024) - Introductions

Topics

Introduce the course topic and structure as well as some case studies

Discussions and Activities

Introductions, expectations, and student projects
Setting the stage: What is prompt engineering?
Introduction to GenAI Applications
It’s not just ChatGPT: Other LLMs & non-text systems (e.g., audio, text, 3D models) (AH)

Central Activity

Playing with an LLM: In groups, get ChatGPT to write a children’s story. What do you notice?

Required Reading

All Gas, No Brakes in A.I. + Metaverse Update + Lessons From a Prompt Engineer (1hr podcast)

Additional Resources

Researchers Gain New Understanding From Simple AI(this is a simplified explanation of “Attention is all you need”, the transformers paper)
Characterizing Emergent Phenomena in Large Language Models (a blog post ABOUT some google research, this is a very handy short example of scaling in capabilities merely from increased size. An important thing to get from this is that we aren’t sure of the mechanism)
On the Opportunities and Risks of Foundation Models (an absolutely GARGANTUAN guide to foundation models (ie LLMs and the like) in AI. 200 pages so defintely an “optional” read)
Awesome-LLM: A curated list of Large Language Model(s)(a community generated repository of papers and links to code for large language models)
A list of open LLMs available for commercial use(another list of LLMs, here exclusively those with licenses appropriate for commercial use.)
Prompt Engineering blog post from Lillian Weng
Eight Things to Know about Large Language Models (a high level review from an Anthropic researcher. Very valuable for a broad understanding of some capabilities.)

Week 2 - HOLIDAY

Week 3 (1/22/2024) - Text

Topics

Explanation of text generation and exploration of an open-source alternative to ChatGPT, including an exercise with model guardrails

Discussions and Activities

Reading discussion
What does it mean to generate text? (AH)
- What is a token?
- What is a token window?
- What is a transformer?
Intro to “Break a GPT”
- AS does point of view shifts
- AH does “I like to pretend”

Central Activity

Break a GPT: Working in groups to get ChatGPT to spit out something unexpected or aberrant
- What do I mean by break?
- Can we define breakage?

Required Reading

Talking about Large Language Models(from a researcher at Deepmind, this is useful for terms and concepts as well as a good guide for how to think about these systems as a whole.)

Additional Resources

Chatbots Don’t Know What Stuff Isn’t(from Quanta Magazine this article does a good job of illustrating early troubles with handling negation in language models. Pay special attention to some of the proposed reasons AND issues in assessing progress since inspection is so difficult.)
Can AI really be protected from text-based attacks?(A short general interest article hinting at problems with safety mechanisms on LLMs introducing a difficult and complex space to protect.)
Adversarial Attacks on LLMs(a more technical post from Lillian Weng)
An illustrated explanation of NLP transformer based models The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning
Google’s attempt to “break GPT” by extracting training data
Breaking a GPT-3 powered Twitter bot by asking it to “ignore all previous instructions”
NIST research report on adversarial machine learning input input more broadly, not just with Large Language Models.

Week 4 (1/29/2024) - Images

Topics

Discussion of the impact of early image generators (ca 2022-2023)
Introduction to image generation via Latent Diffusion Models
Exercise with Stable Diffusion

Discussions and Activities

Reading discussion
Introduction to image generation
Play with an image generator
How do we install one of our own?

Central Activity

Getting started with a project: Get together to talk about project ideas

Required Reading

The art of text-to-image generative AI (a transcript of a talk by Bruce Sterling on Dall-E Mini)

Additional Resources

Hands Are Hard: Unlearning How We Talk About Machine Learning in the Arts
Looking for Meaning in all the Wrong Places
One Vision of Machine Art (a short prompt looking at two passages in an H.P. Lovecraft story from the 1930s with eerie parallels to generative art)
The Illustrated Stable Diffusion
GitHub page for the Latent Diffusion model paper (formed the kernel of what has become Stable Diffusion)

Week 5 (2/5/2024) - Interpretation

Topics

Discussion of means and methods of interpretation of model output, including recent research on LLMs and a design case study of image classification

Discussions and Activities

Do we know what’s happening in these systems? Interpreting LLMs
What are some means/methods of interpretability? (transition to the reading)
How do we think about interpretability when we can’t see what’s going on?
Introduce image generator (Again, AH)

Central Activity

Play with an image generator

Required Reading

Objective Portrait: A practice-based inquiry to explore AI as a reflective design partner

Additional Resources

AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
Building A Virtual Machine inside ChatGPT
The debate over understanding in AI’s large language models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Large Language Models are Zero-Shot Reasoners (the genesis of “let’s think step by step” LLM prompting research)
Large Language Models Cannot Self-Correct Reasoning (a technical but critical look at the problem of correction, specifically the problem of self-correction)
The Internal State of an LLM Knows When It’s Lying (an attempt to inspect internal states about the information produced by the model)
Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color

Week 6 (2/12/2024) - Change

Topics

Discussion of community impact on model development and the impact/import of biases in model output

Discussions and Activities

How do communities change model outputs?
WHO IS US?!

Central Activity

Consultation (AH, AS, BC): 3 groups (4 people per group), 15 min per person

Required Reading

‘Person’ == Light-skinned, Western Man, and Sexualization of Women of Color: Stereotypes in Stable Diffusion (a paper from UW researchers on stereotypes in Stable Diffusion)

Additional Resources

Robust fine-tuning of zero-shot models (demonstrates that making a weighted sum of the matrix of model weights of two LLMs produces a merged model with a weighted sum of the concept understandings of the two)
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion (the first research paper to demonstrate that lightweight collections of tokens could be built by training images and be used to stand in for a complex prompt)
Local Large Language Models (A highly technical but not academic guide to Local LLMs and fine-tuning. It also includes and explanation of how LoRAs work.)
A GitHub Next project evaluating GPT-4 performance with calculation (spoiler: not great!)
ChatGPT math performance over time (A more detailed demonstration by researchers at Stanford)

Week 7 - HOLIDAY

Week 8 (2/26/2024) - Project

Topics

Collaborative critique of in-progress work

Discussions and Activities

What have you been working on? (start with original groups, then do a rotation)
Collaborative critique

Central Activity

Collaborative critique

Required Reading

No reading for this week. Focus on your collaborative critique submission and your project.

Additional Resources

Vanderbilt University apologizes for using ChatGPT to write mass-shooting email
Temporary policy: Generative AI (e.g., ChatGPT) is banned
ChatGPT use shows that the grant-application system is broken
A good debate from two linguist/philosophers on how to think about meaning and chatbots. Rather than John Searle’s Chinese room they think about…an octopus!
A magazine article summarizing recent work showing the potentially bad impact of code-assistant tools like CoPilot on code quality

Week 9 (3/4/2024) - Testing

Topics

Testing and review of work by peers and the teaching team

Discussions and Activities

Test out what you have been making/writing/breaking
Returning to a reading (the class picks a reading for today that we all promise to (re)read heavily and discuss)
How do these systems impact humans?

Central Activity

Have students prepare ONE question for the class on what is blocking them or troubling them

Required Reading

No reading for this week. Focus on your project.

Additional Resources

Why Meta’s latest large language model survived only three days online

Week 10 (3/11/2024) - Feedback

Discussions and Activities

Demo what you’ve been making/writing/breaking