Syllabus Week in the Age of AI: Part I

August 22, 2023

by Betsy Barre

Midjourney's rendering of "Illustration of Wake Forest Campus covered in computer code." It's an image of a campus with Georgian architecture covered in pixelated dots and dashes. — Midjourney’s rendering of “an illustration of the Wake Forest University campus covered in computer code.”

For teaching centers, the weeks before the start of the fall semester are the most wonderful time of the year. We welcome new faculty into our community of teacher-scholars, and faculty of all ranks begin to reach out as they design their syllabi and reflect on changes they would like to make. This year, almost everyone is considering how they want to approach artificial intelligence in their classrooms. Although ChatGPT was released in November, it took some time for the news to spread, and many of us took a wait-and-see approach in the spring. We’re now hoping to develop a more intentional approach. Yet, as is often the case in the life of a teacher, there is too much to read and not enough time.

This post is the first of a four-part series that aims to ease that burden by summarizing and curating the relevant literature. Given the need for just-in-time guidance, I’ve structured each post as a series of FAQs with the most time-sensitive questions at the top. Today’s post will address AI policy, syllabus statements, and strategies for preserving academic integrity. In later posts, I’ll share strategies for using AI to support student learning, discuss ways AI might support our work as teachers, and recommend readings, podcasts, and videos for those who want to learn more about the nature of artificial intelligence and its broader implications for society.

Although I am introducing these FAQs in blog posts, they will also live together on our new AI Resources page. We will continue to update those resources throughout the year, so we encourage you to bookmark that page for future reference.

As always, the CAT is available for consultations if you have any questions or would like a conversation partner as you think through your approach!

Just-In-Time Guidance

Cheating has always existed, often at depressingly high levels.¹ And for almost as long, we have sought to limit the harm it can cause through various forms of punishment (or, as we like to say now, “accountability”). In the case of cheating, punishment serves at least two functions. It stops the cheater from doing wrong (receiving an unfair advantage over fellow students) and deters other students from attempting something similar. But in both cases, detection is an essential piece of the puzzle. If we don’t know students are cheating, we can neither stop the wrongdoing nor deter others from doing the same.

There has typically been an inverse relationship between the cost of cheating and its ability to be detected. So while it has always been possible to pay another person to write an undetectable essay for you, the cost of doing so has been prohibitive for most students.² AI presents a unique threat to the accountability approach because it changes the slope of this relationship between cost and detectability. While it’s true students must put forth some effort (they cannot, as some students have done, submit work that begins, “As a generative AI model …”), it no longer takes much work to cheat in undetectable ways.

Given this reality, it makes sense to think of the challenge before us as one of detection. If we could find a way to detect AI-generated output like we detect plagiarized papers, students would be no more likely to use AI than they are to plagiarize, and we could return to business as usual. So we seek tips to improve our ability to spot AI-generated text or, failing that, software that will do this work for us.

Unfortunately, AI detection is both technically and ethically complex, and this complexity is only going to increase with time. But even if we decide AI detection is neither realistic nor ethical, we need not despair. And that’s because accountability is not the only way to shape student behavior. Yes, punishment can be a powerful motivator. But we also know it can also have unintended and unpredictable effects. As a result, experts on Academic Integrity have long sought to expand our toolkit beyond accountability alone. By turning our attention to these approaches, which aim to cultivate students’ positive, intrinsic motivations, it may be possible to escape the ruin of the spring semester without solving the detection problem.

https://academicintegrity.org/resources/facts-and-statistics
This is, of course, another example of privileged students using their privilege to extend their advantages.

Although we all want to believe we can spot AI-generated text when we encounter it, researchers have known for quite some time that humans struggle to distinguish between human- and ai-generated prose.³ It is, then, unsurprising that numerous start-ups (and OpenAI itself) were prepared to launch AI detection tools within weeks of ChatGPT’s release. And in April, Turnitin released its own secure, LMS-based tool to the faculty of over 10,000 institutions.

Since then, debates about their reliability have raged, OpenAI has quietly removed its detector from its site, Turnitin has updated its reported false-positive rates, and both Vanderbilt and The University of Pittsburgh have decided to disable Turnitin’s AI detection tools, as a result. Despite these criticisms, thousands of instructors–including many Wake Forest faculty–continue to find these tools valuable.

As with most debates, the details are more complicated than the public discourse suggests. Turnitin still maintains a 1% false positive rate for paper-level scores higher than 20%, and they are the only company able to test its tool on a 20-year archive of papers written by college students before AI came on the scene. Nevertheless, they acknowledge they will miss at least 15% of AI-generated text to maintain a false-positive rate of 1%. And this performance only applies to essays written with GPT-3. Students who can pay $20 a month for GPT-4 are far less likely to be detected.

These rates may seem encouraging if we imagine 1 false accusation for every 100 suspicious cases. Yet this false-positive rate is based on all papers submitted to Turnitin, including those we would have never investigated without the software. Assuming 25,000 papers are submitted to Turnitin each academic year, and 75% of those papers are human-generated, 188 of those human-generated papers would be inaccurately flagged as more than 20% AI-generated each year. To make this more concrete, any instructor who assigns three papers to 50 students would likely encounter 1-2 false positives each term.

Finally, it is worth remembering that AI detectors are themselves AI tools, trained in much the same way. To the extent you find AI problematic because of its propensity to”bullshit” (in Harry Frankfurt’s technical sense of that term), you must also acknowledge that these detectors are just as likely to speak confidently about things they don’t actually “know.” If you’re worried about your students trusting a machine that can hallucinate facts, remember that the same could be true of the reports you receive about student papers.

Clark, Elizabeth, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, and Noah A. Smith. 2021. “All That’s `Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 7282–96. Online: Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-long.565; Kreps, Sarah, R. Miles McCain, and Miles Brundage. 2022. “All the News That’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation.” Journal of Experimental Political Science 9 (1): 104–17. https://doi.org/10.1017/XPS.2020.37.

In an ideal world, our students would be intrinsically motivated to adhere to our guidelines, and for the right reasons. Yet they enter our classrooms with a variety of motivations, and not all of them are aligned with our goals. One might reasonably ask how much we can shape these motivations in the course of a single semester. If students don’t want to learn and care little about academic integrity, is there much we can do?

The primary reason I am optimistic about the future of AI in our classrooms is that I believe in the power of teachers. While we may not be able to win over every student, I believe most students want to learn and will do so with integrity if the conditions are right. And thanks to the fabulous work of many brilliant social scientists, we happen to know a thing or two about what those conditions look like.

For starters, we can involve them in the process of thinking through our collective approach to AI. We know that motivation increases when students feel the environment is supportive and aligned with their goals. Giving them a say in the process gives them some ownership over their environment while helping them better understand the reasons for taking a particular approach.

We also know that moral reminders can be powerful tools to motivate students to align their behavior with their values and commitments. So if we ask students to sign on to a co-constructed set of principles, and remind them of the importance of those principles before each assignment is submitted, they may be more likely to give us their best.

If you think back to the times in your life you were learning the most, what was your primary driver? Chances are it was not a desire for an A or a desire to comply with an externally imposed policy. It was, most likely, the joy of participating in activities you found personally or socially meaningful. Likewise, our courses become more meaningful when we connect our material to the interests of our students and develop relevant, authentic assignments.

Finally, it is worth noting that even the most highly motivated students, committed to learning for its own sake, can also be deeply concerned about grades. And insofar as they perceive a threat to those grades, their intrinsic motivation to learn may take a back seat. So it may not be enough to make learning meaningful in the age of AI. We may also need to reduce the power of extrinsic motivators like grades.

Center for the Advancement of Teaching

Just-In-Time Guidance

Recent Posts

Archives

Where to start

Get to know WFU

Resources

Support Wake Forest

Wake Forest Giving Societies

Center for the Advancement of Teaching

Just-In-Time Guidance

I’m feeling overwhelmed. Where should I begin?

Why do I need to craft my own policy?

What should I consider when crafting a syllabus statement?

Do you have any examples of syllabus statements I might consider?

Is there a way to detect whether our students have used AI?

What should I know about AI detection and AI detectors?

What if AI detection is verified with additional evidence?

Can I redesign my assignments to make AI use impossible?

Can I use grades to foster integrity?

Can I really cultivate an intrinsic motivation to learn?

Recent Posts

Archives