We’re not very good at telling when text has been written by AI

Adi Gaskell24 Oct 2023

381 2 minutes read

Originally posted on The Horizons Tracker.

One of the key selling points of tools like ChatGPT is its ability to rapidly create content. The launch earlier this year coincided with widespread concerns about the impact of such tools on the workplace. Research¹ from Stanford explores whether readers are able to identify whether text has been written by AI or humans.

The researchers embarked on an exploration of this quandary by examining the degree to which we are able to differentiate between human-generated and AI-generated text on platforms such as OKCupid, Airbnb, and Guru.com.

Unable to distinguish

The team’s revelations were eye-opening: study participants could only distinguish between human and AI text with an accuracy rate of 50-52%, which is roughly equivalent to a coin flip.

The real cause for concern is that we can fashion AI that appears more human than actual humans, as we can optimize the AI’s language to leverage the same kind of presumptions that humans possess. This is worrying since it poses a risk that these machines can impersonate humans to a greater degree than us, with the potential to deceive.

“One thing we already knew is that people are generally bad at detecting deception because we are trust-default,” the researchers explain. “For this research, we were curious, what happens when we take this idea of deception detection and apply it to generative-AI, to see if there are parallels with other deception and trust literature?”

Upon administering text samples from the three social media platforms to participants, the researchers found that although we are unable to differentiate between AI and human-generated text with any significant degree of accuracy, we do not arrive at random conclusions either.

Our incorrect assessments are founded on similar assumptions, based on reasonable intuition and shared language cues. In other words, we frequently arrive at the wrong conclusion, whether it is AI or human-generated text, but we do so for similar reasons.

For instance, participants erroneously attributed high grammatical accuracy and the use of first-person pronouns to human-generated text. Similarly, referencing family life and utilizing informal, conversational language was also wrongly attributed to human-generated text.

A rise in misinformation

The researchers believe that the poor heuristics we tend to use to determine the authenticity of text combined with the ease of producing automated content will inevitably result in a rise in misinformation.

“The volume of AI-generated content could overtake human-generated content on the order of years, and that could really disrupt our information ecosystem,” they explain. “When that happens, the trust-default is undermined, and it can decrease trust in each other.”

Solutions are far from straightforward, but the researchers believe things like AI watermarking or even providing AI with a particular “accent” could help. We also need to do more to teach young people about the various risks involved in the online world.

Article source: We’re Not Very Good At Telling When Text Has Been Written By AI.

Header image source: Alexandra Koch on Pixabay.

Reference:

Jakesch, M., Hancock, J. T., & Naaman, M. (2023). Human heuristics for AI-generated language are flawed. Proceedings of the National Academy of Sciences, 120(11), e2208839120. ↩

Rate this post

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Unable to distinguish

A rise in misinformation

Adi Gaskell

Related Articles

AI-based credit risk tools can be ruined by noisy data

Paper highlights the bias inherent in legal AI

ChatGPT is great – you’re just using it wrong

Introduction to knowledge graphs (part 3): Data graphs