‘A perfect storm’: The Wall Street Journal has 21 people detecting ‘deepfakes’

By Digiday • July 1, 2019 •

To combat the growing threat of spreading misinformation ahead of the U.S. 2020 general election, The Wall Street Journal has formed a committee to help reporters navigate fake content.

Last September, the publisher assigned 21 of its staff from across its newsroom to form the committee. Each of them is on-call to answer reporters’ queries about whether a piece of content has been manipulated. The publisher has issued criteria to committee members which help them determine whether the content is fake or not. After each query from a reporter, members write up a report with details of what they learned.

This is part of the Journal’s plan to root out so-called “deepfakes,” content that has been manipulated by artificial intelligence, which has been a growing concern for publishers like Reuters and The Washington Post over the last year.

Heightened political tensions, increasingly sophisticated technology and the speed at which fake content can spread, has elevated the importance in spotting these fakes.

“We hear there will be a massive proliferation in technology in a couple of years which will coincide with the U.S. general election,” said Francesco Marconi, the Journal’s research and development chief. “That’s a perfect storm, technology is evolving so quickly, it’s an important inflection point. We want to be proactive.”

Like most underbelly areas of the internet, Marconi first came across the term “deepfake” on Reddit after seeing this academic paper on manipulating a video of Barack Obama in 2018. This, combined with a lot of activity in universities and research centers into the growth of deepfakes, spurred the R&D team to form the WSJ Media Forensics committee, a joint effort between the Standards & Ethics team and R&D.

The Journal also invites academics and researchers to give regular talks on new detection technology. It has written newsroom guides and holds training sessions — Marconi estimates between 120 and 150 WSJ journalists have participated. It also monitors the different detection tools being developed by major tech companies and startups. For now, training is not mandatory. Although the technology and tools are improving to spot fakes, standard journalistic process, like thoroughly checking sources, still apply.

Understanding the scale of the problem is tricky, but there are some indications. Cybersecurity startup DeepTrace has detected over 8,000 fake pornographic videos on adult entertainment sites. Like a lot of new technology, AI-manipulated videos have roots in the porn industry. In 2018, Google searches for “deepfakes” were 1,000 times higher than in 2017. A study from the Massachusetts Institute of Technology in January into the makeup of all academic papers about AI, found that neural networks (the main approach used for these doctored videos) were mentioned in 25% of the papers, more than any other machine-learning method.

What makes it even trickier is a lot of doctored videos are not malicious but done for comedic or satirical effect. The Salvador Dalí Museum in St Petersburg, Florida, for instance, has used AI to bring the artist back to life to greet visitors.

“There are good intentions, but there will always be bad actors,” said Marconi. “We saw a proliferation of types of manipulated media has negative consequences to journalism and society. We decided it was a threat to our news-gathering process.”

Marconi wasn’t able to share how many fake videos the Journal’s committee has come across or details of the process it has developed to spot them due to newsroom policy.

The next battlefield is audio. Marconi cited an example of manipulated audio of Donald Trump and Barack Obama speaking Mandarin. “It’s a different monster, and it’s scary; there’s no way the human ear can tell.”

Like with a lot of offshoots from AI, headlines have over-hyped the phenomenon, but Marconi attributes this more to artificial intelligence itself. “At least from the deep-fake standpoint, it’s better for there to be a lot of awareness than to miss it completely,” he said.

Headlines have bubbled up about a doctored video of a speaker of the U.S. House of Representatives, Nancy Pelosi. The video, which surfaced in May, tried to make the politician appear drunk. Facebook, keen to avoid editorial responsibility, was criticized for not removing the video, clips of which had 2.5 million views, admitting an “execution mistake.” The video now seems to have disappeared. In retaliation, fake videos of Mark Zuckerberg spread around the internet.

As the proliferation of deepfakes grows, Facebook and other platforms’ stance on balancing freedom of speech with restricting the spread of misinformation will again come under pressure. But avoiding censorship is an increasingly thorny issue, he said: “How do you balance censorship with freedom of speech in the age where machines can create this content?”

https://digiday.com/?p=339445

More in Media

Member Exclusive

Media Briefing: ‘Cloudflare is locking the door’: Publishers celebrate victory against AI bot crawlers

July 3, 2025

After years of miserably watching their content get ransacked for free by millions of unidentified AI bot crawlers, publishers were finally thrown a viable lifeline.

Publishing in the Platform Era

How Vogue could navigate potential industry headwinds as Anna Wintour — who agency execs say made ad dollars flow — brings on new edit lead

July 3, 2025

Anna Wintour’s successor at Vogue will have to overcome the myriad of challenges facing fashion media and the digital publishing ecosystem.

Generative AI

Here are the biggest misconceptions about AI content scraping

July 2, 2025

An increase in bots scraping content from publishers’ sites represents a huge threat to their businesses. But scraping for AI training and scraping for real-time outputs present different challenges and opportunities.