When Startup asked me to cover this week’s newsletter, my first instinct was to ask ChatGPT—OpenAI’s viral chatbot—to see what it came up with. It’s what I’ve been doing with emails, recipes, and LinkedIn posts all week. Productivity is way down, but sassy limericks about Elon Musk are up 1,000 percent.
I asked the bot to write a column about itself in the style of Steven Levy, but the results weren’t great. ChatGPT served up generic commentary about the promise and pitfalls of AI, but didn’t really capture Steven’s voice or say anything new. As I wrote last week, it was fluent, but not entirely convincing. But it did get me thinking: Would I have gotten away with it? And what systems could catch people using AI for things they really shouldn’t, whether that’s work emails or college essays?
To find out, I spoke to Sandra Wachter, a professor of technology and regulation at the Oxford Internet Institute who speaks eloquently about how to build transparency and accountability into algorithms. I asked her what that might look like for a system like ChatGPT.
Amit Katwala: ChatGPT can pen everything from classical poetry to bland marketing copy, but one big talking point this week has been whether it could help students cheat. Do you think you could tell if one of your students had used it to write a paper?
Sandra Wachter: This will start to be a cat-and-mouse game. The tech is maybe not yet good enough to fool me as a person who teaches law, but it may be good enough to convince somebody who is not in that area. I wonder if technology will get better over time to where it can trick me too. We might need technical tools to make sure that what we’re seeing is created by a human being, the same way we have tools for deepfakes and detecting edited photos.
That seems inherently harder to do for text than for deepfaked imagery, because there are fewer artifacts and telltale signs. Any reliable solution may need to be built by the company that’s generating the text in the first place.
You do need to have buy-in from whoever is creating that tool. But if I’m offering services to students, I might not be the type of company that is going to submit to that. And there might be a situation where even if you do put watermarks on, they’re removable. Very tech-savvy groups will probably find a way. But there is an actual tech tool [built with OpenAI’s input] that allows you to detect whether output is artificially created.
What would a version of ChatGPT that had been designed with harm reduction in mind look like?
A couple of things. First, I would really argue that whoever is creating those tools put watermarks in place. And maybe the EU’s proposed AI Act can help, because it deals with transparency around bots, saying you should always be aware when something isn’t real. But companies might not want to do that, and maybe the watermarks can be removed. So then it’s about fostering research into independent tools that look at AI output. And in education, we have to be more creative about how we assess students and how we set papers: What kind of questions can we ask that are less easily fakeable? It has to be a combination of tech and human oversight that helps us curb the disruption.